Ji-Xiang's Collections
RLVR Datasets
Updated about 24 hours ago
Reinforcement Learning from Verifiable Rewards (RLVR) Datasets
allenai/RLVR-MATH • Updated Nov 20, 2024 • 7.5k • 190 • 6
allenai/RLVR-GSM • Updated Nov 21, 2024 • 8.79k • 147 • 2
allenai/RLVR-IFeval • Updated Nov 21, 2024 • 15k • 217 • 7