arxiv:2410.04612
Jonathan Chang
jdchang
Recent Activity
updated a model 5 days ago: jdchang/same-step-llama31-ba888
updated a model 5 days ago: jdchang/same-step-llama31-ba666
updated a model 5 days ago: jdchang/same-step-llama31-ba444
Papers: 1
Models: 35
jdchang/same-step-llama31-ba888 • Text Generation • 8
jdchang/same-step-llama31-ba666 • Text Generation • 8
jdchang/same-step-llama31-ba444 • Text Generation • 8
jdchang/same-step-llama31-ba222 • Text Generation • 8
jdchang/same-step-llama31-ba1110 • Text Generation • 8
jdchang/reinforce-llama31-ba227 • Text Generation • 4
jdchang/same-step-sft-ba888 • Text Generation • 5
jdchang/same-step-sft-ba666 • Text Generation • 8
jdchang/same-step-sft-ba444 • Text Generation • 8
jdchang/same-step-sft-ba222 • Text Generation • 7
Datasets: 18
jdchang/evol_instruct • 78.3k • 57
jdchang/ultrafeedback-llama-3.1-70b-general-armo-preference • 60.8k • 36
jdchang/ultrafeedback-llama-3.1-70b-specific-armo-preference • 60.8k • 50
jdchang/ultrafeedback-llama-3.1-70b-specific-armo • 60.8k • 36
jdchang/ultrafeedback-llama-3.1-70b-general-armo • 60.8k • 41
jdchang/ultrafeedback-llama-3.1-8b-specific-armo-preference • 60.4k • 49
jdchang/ultrafeedback-llama-3.1-8b-specific-armo • 60.4k • 73
jdchang/ultrafeedback-llama-3.1-8b-general-armo-preference • 60.5k • 44
jdchang/ultrafeedback-llama-3.1-8b-general-armo • 60.5k • 63
jdchang/ultrafeedback-llama-3.1-70b-specific • 60.8k • 36