arxiv:2412.13670
Mingzhe Du
Elfsong
AI & ML interests
Code Generation / Preference Alignment / Bias Mitigation
Recent Activity
updated
a dataset
about 17 hours ago
Elfsong/Venus_t
updated
a dataset
about 17 hours ago
Elfsong/Venus_t
updated
a dataset
about 17 hours ago
Elfsong/Venus_t
Organizations
Papers
2
spaces
5
models
17
Elfsong/Phi-4-14B-Instruct-sft
Text Generation
•
Updated
•
2
Elfsong/Llama-3.1-8B-Instruct-sft
Text Generation
•
Updated
•
238
•
1
Elfsong/Phi-3.5-4B-instruct-sft
Text Generation
•
Updated
•
5
•
1
Elfsong/Llama-3.3-70B-Instruct-dpo
Text Generation
•
Updated
•
15
Elfsong/Llama-3.3-70B-Instruct-stf
Text Generation
•
Updated
•
67
Elfsong/Llama-3.1-8B-Instruct-dpo
Text Generation
•
Updated
•
39
Elfsong/mouadsfilter
Text2Text Generation
•
Updated
•
4
Elfsong/dpo
Updated
Elfsong/debias_model
Updated
Elfsong/my_awesome_model
Updated
datasets
66
Elfsong/Venus_t
Viewer
•
Updated
•
2.08k
•
630
Elfsong/Venus_KTO
Viewer
•
Updated
•
631k
•
19
Elfsong/Venus_SFT
Viewer
•
Updated
•
276k
•
31
Elfsong/Venus_DPO
Viewer
•
Updated
•
127k
•
15
Elfsong/Llama-3.3-70B-Instruct-sft-response
Viewer
•
Updated
•
256
•
29
Elfsong/Llama-3.3-70B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
27
Elfsong/Llama-3.3-70B-Instruct-response
Viewer
•
Updated
•
256
•
29
Elfsong/Llama-3.1-8B-Instruct-dpo-response
Viewer
•
Updated
•
256
•
28
Elfsong/Llama-3.1-8B-Instruct-response
Viewer
•
Updated
•
256
•
29
Elfsong/gpt-4o-response
Viewer
•
Updated
•
256
•
29