Nguyễn Minh Phúc
DatPySci
AI & ML interests
Reinforcement learning, NLP
Recent Activity
updated
a dataset
2 days ago
DatPySci/hh_gpt2_large_w2s_feedback
updated
a dataset
2 days ago
DatPySci/hh_gpt2_medium_w2s_feedback
updated
a dataset
2 days ago
DatPySci/hh_gpt2_w2s_feedback
Organizations
Collections
1
models
95
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_72000__tldr
Updated
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_72000__tldr
Updated
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr
Updated
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr
Updated
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr
Updated
DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_32400__tldr
Updated
DatPySci/llama3-1b_reward_tldr
Text Classification
•
Updated
•
5
DatPySci/EleutherAI_pythia-2.8b-deduped__ipo_pythia-2.8b_beta-0.1__tldr
Updated
DatPySci/EleutherAI_pythia-2.8b-deduped__dpo_pythia-2.8b_beta-0.05__tldr
Updated
DatPySci/EleutherAI_pythia-2.8b-deduped__length_IS_pythia-2.8b_beta-0.05__tldr
Updated
datasets
43
DatPySci/hh_gpt2_large_w2s_feedback
Viewer
•
Updated
•
53.8k
•
2
DatPySci/hh_gpt2_medium_w2s_feedback
Viewer
•
Updated
•
53.8k
•
3
DatPySci/hh_gpt2_w2s_feedback
Viewer
•
Updated
•
53.8k
•
4
DatPySci/tldr_gpt2_large_w2s_feedback
Viewer
•
Updated
•
46.4k
•
2
DatPySci/tldr_gpt2_medium_w2s_feedback
Viewer
•
Updated
•
46.4k
•
2
DatPySci/tldr_gpt2_w2s_feedback
Viewer
•
Updated
•
46.4k
•
10
DatPySci/gpt2-medium_dpo_tldr_temp_1_2
Viewer
•
Updated
•
8k
•
3
DatPySci/gpt2_dpo_tldr_temp_1_0
Viewer
•
Updated
•
3.88k
•
3
DatPySci/gpt2-large_dpo_tldr_temp_1_0
Viewer
•
Updated
•
3.88k
•
3
DatPySci/gpt2-medium_dpo_tldr_temp_1_0
Viewer
•
Updated
•
3.88k
•
3