arxiv:2412.17256
Yuzhen Huang
yuzhen17
AI & ML interests
None yet
Recent Activity
authored a paper about 12 hours ago
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
upvoted a paper 11 days ago
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
upvoted a paper 11 days ago
Diving into Self-Evolving Training for Multimodal Reasoning