Jian Hu

chuyi777

https://hujian.website

hijkzzz

AI & ML interests

Reinforcement Learning

Recent Activity

liked a model 7 days ago

CohereForAI/c4ai-command-r7b-12-2024

upvoted a paper 14 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

commented on a paper 14 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Organizations

chuyi777's activity

commented on a paper 14 days ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 19 days ago • 86

New activity in OpenRLHF/Mistral-7b-PRM-Math-Shepherd 3 months ago

How do I download the model?

#1 opened 3 months ago by

New activity in mustafaaljadery/gemma-2B-10M 9 months ago

OOM on A100

#3 opened 9 months ago by

New activity in ai21labs/Jamba-v0.1 9 months ago

Is there any SFT or Chat model?

#41 opened 9 months ago by