See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
AI & ML interests
None yet
Recent Activity
updated
a model
1 day ago
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.3-m-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
published
a model
1 day ago
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.3-m-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
updated
a dataset
1 day ago
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.3-iter1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Organizations
Collections
3
-
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
Text Generation • Updated • 100 • 5 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
Text Generation • Updated • 20 -
ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
Text Generation • Updated • 22 -
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
Paper • 2405.19332 • Published • 15
models
281
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.3-m-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.3-e-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
ZhangShenao/math_math-Mistral-7B-Instruct-v0.3-m-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
ZhangShenao/math_math-Mistral-7B-Instruct-v0.3-e-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-m-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-e-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
ZhangShenao/math_math-gemma-1.1-7b-it-m-iter-1_sample_7500_nsk_ml512_mlr5e-5
Updated
•
2
ZhangShenao/math_math-gemma-1.1-7b-it-e-iter-1_sample_7500_nsk_ml512_mlr5e-5
Updated
•
3
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-m-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
•
2
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-e-iter-1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Updated
•
2
datasets
161
ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.3-iter1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Viewer
•
Updated
•
7.47k
•
4
ZhangShenao/math_math-Mistral-7B-Instruct-v0.3-iter1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Viewer
•
Updated
•
7.5k
•
4
ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-iter1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Viewer
•
Updated
•
7.5k
•
4
ZhangShenao/math_math-gemma-1.1-7b-it-iter1_sample_7500_nsk_ml512
Viewer
•
Updated
•
7.5k
•
4
ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-iter1_sample_7500_nsk_ml512_mlr5e-5_ent0.05
Viewer
•
Updated
•
7.47k
•
4
ZhangShenao/pure-sft-code_opencoder_edu-deepseek-coder-6.7b-instruct-iter_sample_120000_tp
Viewer
•
Updated
•
118k
•
23
ZhangShenao/math_gsm-gemma-2-9b-it-iter1_sample_7500_nsk_ml512
Viewer
•
Updated
•
7.47k
•
24
ZhangShenao/math_gsm-gemma-1.1-7b-it-iter1_sample_7500_nsk_ml512
Viewer
•
Updated
•
7.47k
•
8
ZhangShenao/sft-math_gsm-gemma-2-9b-it-iter_sample_7500_tp_lr5e-6
Viewer
•
Updated
•
7.47k
•
7
ZhangShenao/sft-math_math-gemma-2-9b-it-iter_sample_7500_tp
Viewer
•
Updated
•
7.5k
•
8