Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Wenbo Zhang
Wenboz
Follow
https://onepounchman.github.io/
AI & ML interests
Causal Inference, Out-of-distribuition Robustness, NLP
Recent Activity
updated
a dataset
5 days ago
Wenboz/llama3-instruct-reward-logps-ultrafeedback
updated
a dataset
5 days ago
Wenboz/mistral-instruct-reward-logps-ultrafeedback
updated
a dataset
5 days ago
Wenboz/llama3-base-reward-logps-ultrafeedback
View all activity
Organizations
None yet
models
16
Sort: Recently updated
Wenboz/mistral-7b-base-p3o
Updated
12 days ago
Wenboz/zephyr-7b-dpo-full
Text Generation
•
Updated
16 days ago
•
249
Wenboz/zephyr-7b-dpo-lora
Updated
Oct 20, 2024
Wenboz/llama3-wpo-lora
Updated
Sep 22, 2024
Wenboz/llama3-dpo-lora
Updated
Sep 20, 2024
Wenboz/zephyr-7b-wpo-lora
Updated
Sep 18, 2024
Wenboz/llama3-dpo-full
Updated
Sep 10, 2024
Wenboz/FsfairX-LLaMA3-RM-clone
Updated
Sep 2, 2024
•
3
Wenboz/aromarm_clone
Updated
Sep 1, 2024
•
6
Wenboz/phi3-offline-dpo-lora-noise-0.0-5e-7-thre-1.5-42
Updated
Jul 9, 2024
Expand 16 models
datasets
10
Sort: Recently updated
Wenboz/llama3-instruct-reward-logps-ultrafeedback
Viewer
•
Updated
5 days ago
•
61.8k
•
4
Wenboz/mistral-instruct-reward-logps-ultrafeedback
Viewer
•
Updated
5 days ago
•
62.7k
•
10
Wenboz/llama3-base-reward-logps-ultrafeedback
Viewer
•
Updated
5 days ago
•
63.1k
•
14
Wenboz/mistral-base-reward-logps-ultrafeedback
Viewer
•
Updated
5 days ago
•
63.1k
•
5
Wenboz/mistral-base-proxy-reward-ultrafeedback
Viewer
•
Updated
15 days ago
•
63.1k
•
36
Wenboz/hh_clean_test_messages
Updated
Jul 13, 2024
•
2
Wenboz/SELM-Phi-3-mini-4k-instruct-dataset
Viewer
•
Updated
Jul 1, 2024
•
6
•
29
Wenboz/hh_sft_messages
Viewer
•
Updated
Jun 20, 2024
•
48.4k
•
29
Wenboz/hh_clean
Viewer
•
Updated
Jun 19, 2024
•
48.4k
•
30
Wenboz/hh_sft
Viewer
•
Updated
Jun 18, 2024
•
65.5k
•
31