Nguyễn Minh Phúc

DatPySci

AI & ML interests

Reinforcement learning, NLP

Recent Activity

updated a dataset 2 days ago

DatPySci/hh_gpt2_large_w2s_feedback

updated a dataset 2 days ago

DatPySci/hh_gpt2_medium_w2s_feedback

updated a dataset 2 days ago

DatPySci/hh_gpt2_w2s_feedback

Collections 1

Models 95

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_72000__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.1_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.05_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/EleutherAI_pythia-1b-deduped__dpo_shift_beta_0.01_steps_32400__tldr

Updated Nov 18, 2024

DatPySci/llama3-1b_reward_tldr

Text Classification • Updated Nov 11, 2024 • 5

DatPySci/EleutherAI_pythia-2.8b-deduped__ipo_pythia-2.8b_beta-0.1__tldr

Updated Nov 9, 2024

DatPySci/EleutherAI_pythia-2.8b-deduped__dpo_pythia-2.8b_beta-0.05__tldr

Updated Nov 8, 2024

DatPySci/EleutherAI_pythia-2.8b-deduped__length_IS_pythia-2.8b_beta-0.05__tldr

Updated Oct 26, 2024

Datasets 43

DatPySci/hh_gpt2_large_w2s_feedback

Viewer • Updated 2 days ago • 53.8k • 2

DatPySci/hh_gpt2_medium_w2s_feedback

Viewer • Updated 2 days ago • 53.8k • 3

DatPySci/hh_gpt2_w2s_feedback

Viewer • Updated 2 days ago • 53.8k • 4

DatPySci/tldr_gpt2_large_w2s_feedback

Viewer • Updated 2 days ago • 46.4k • 2

DatPySci/tldr_gpt2_medium_w2s_feedback

Viewer • Updated 2 days ago • 46.4k • 2

DatPySci/tldr_gpt2_w2s_feedback

Viewer • Updated 2 days ago • 46.4k • 10

DatPySci/gpt2-medium_dpo_tldr_temp_1_2

Viewer • Updated 4 days ago • 8k • 3

DatPySci/gpt2_dpo_tldr_temp_1_0

Viewer • Updated 4 days ago • 3.88k • 3

DatPySci/gpt2-large_dpo_tldr_temp_1_0

Viewer • Updated 4 days ago • 3.88k • 3

DatPySci/gpt2-medium_dpo_tldr_temp_1_0

Viewer • Updated 4 days ago • 3.88k • 3