Textoon: Generating Vivid 2D Cartoon Characters from Text Descriptions Paper β’ 2501.10020 β’ Published 4 days ago β’ 16
Multiple Choice Questions: Reasoning Makes Large Language Models (LLMs) More Self-Confident Even When They Are Wrong Paper β’ 2501.09775 β’ Published 5 days ago β’ 16
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper β’ 2501.10120 β’ Published 4 days ago β’ 23
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla β’ about 7 hours ago β’ 11
Do generative video models learn physical principles from watching videos? Paper β’ 2501.09038 β’ Published 6 days ago β’ 26
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper β’ 2501.09732 β’ Published 5 days ago β’ 60
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper β’ 2501.09503 β’ Published 5 days ago β’ 10
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper β’ 2501.09686 β’ Published 5 days ago β’ 29
XMusic: Towards a Generalized and Controllable Symbolic Music Generation Framework Paper β’ 2501.08809 β’ Published 6 days ago β’ 9
Towards Best Practices for Open Datasets for LLM Training Paper β’ 2501.08365 β’ Published 7 days ago β’ 44
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Paper β’ 2501.08292 β’ Published 7 days ago β’ 16
MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents Paper β’ 2501.08828 β’ Published 6 days ago β’ 26
MangaNinja: Line Art Colorization with Precise Reference Following Paper β’ 2501.08332 β’ Published 6 days ago β’ 53
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper β’ 2501.08313 β’ Published 6 days ago β’ 263
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper β’ 2501.06458 β’ Published 10 days ago β’ 29
VideoAuteur: Towards Long Narrative Video Generation Paper β’ 2501.06173 β’ Published 10 days ago β’ 31