Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Misc
Reset Misc
Inference Endpoints
grpo
AutoTrain Compatible
text-generation-inference
4-bit precision
Misc with no match
Eval Results
Merge
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
9
Full-text search
Edit filters
Sort: Trending
Active filters:
grpo
Clear all
philschmid/qwen-2.5-3b-r1-countdown
Text Generation
•
Updated
1 day ago
•
31
•
3
Novaciano/ESP-NSFW-GRPO-1B-Sin_Censura-GGUF
Updated
3 days ago
•
59
•
2
nbd22/Llama-3.1-8B-Instruct-GRPO-gsm8k-ft-lora
Updated
3 days ago
sergiopaniego/Qwen2-0.5B-GRPO
Updated
about 11 hours ago
spinech/qwen-2.5-3b-r1-countdown
Text Generation
•
Updated
about 20 hours ago
riddickz/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
Updated
about 3 hours ago
yooneo/qwen-0.5b-r1-aha
Updated
about 3 hours ago
justinj92/qwen-r1-aha-moment
Updated
about 3 hours ago
yooneo/qwen-1.5b-r1-aha
Updated
about 2 hours ago