Aditya Kothari

AdityaKothari

AI & ML interests

None yet

Recent Activity

reacted to qq8933's post with 👍 about 1 month ago

LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace. Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dual policy paradigm and Large Language Models! https://github.com/SimpleBerry/LLaMA-O1/ What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤ RLHF? Just a little bite of strawberry! 🍓 Past related works: https://huggingface.co/papers/2410.02884 https://huggingface.co/papers/2406.07394

liked a model about 1 month ago

black-forest-labs/FLUX.1-Fill-dev

AdityaKothari's activity

reacted to qq8933's post with 👍 about 1 month ago

Post

6358

LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dual policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/

What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤ RLHF?
Just a little bite of strawberry!🍓

Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)

2 replies

liked a model about 1 month ago

black-forest-labs/FLUX.1-Fill-dev

Updated Nov 25, 2024 • 42.9k • 440

updated a collection 2 months ago

Wellness AI

5 items • Updated Nov 7, 2024 • 1

upvoted a collection 2 months ago

Wellness AI

5 items • Updated Nov 7, 2024 • 1

updated a collection 2 months ago

Wellness AI

5 items • Updated Nov 7, 2024 • 1

liked 2 models 3 months ago

AdityaKothari/WellnessAI-1B

Updated Sep 29, 2024 • 18 • 1

AdityaKothari/WellnessAI-3B

Updated Sep 28, 2024 • 7 • 1

updated 5 models 3 months ago

AdityaKothari/WellnessAI-1B

Updated Sep 29, 2024 • 18 • 1

AdityaKothari/WellnessAI-7B-5-bit

Updated Sep 28, 2024 • 1

AdityaKothari/WellnessAI-3B

Updated Sep 28, 2024 • 7 • 1

AdityaKothari/WellnessAI-8B

Updated Sep 28, 2024 • 8 • 1

AdityaKothari/WellnessAI-7B-F16

Updated Sep 28, 2024 • 5 • 1

liked 4 models 4 months ago

AdityaKothari/WellnessAI-8B

Updated Sep 28, 2024 • 8 • 1

m42-health/Llama3-Med42-8B

Text Generation • Updated Aug 20, 2024 • 1.87k • 53

AdityaKothari/WellnessAI-7B-5-bit

Updated Sep 28, 2024 • 1

AdityaKothari/WellnessAI-7B-F16

Updated Sep 28, 2024 • 5 • 1

New activity in Writer/Palmyra-Med-70B-32K 4 months ago

How about a scaled down version like 7B/8B?

#2 opened 5 months ago by