-
DocGraphLM: Documental Graph Language Model for Information Extraction
Paper • 2401.02823 • Published • 35 -
Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters
Paper • 2403.02677 • Published • 16 -
FlashSpeech: Efficient Zero-Shot Speech Synthesis
Paper • 2404.14700 • Published • 29 -
TextGrad: Automatic "Differentiation" via Text
Paper • 2406.07496 • Published • 28
Collections
Discover the best community collections!
Collections including paper arxiv:2406.07496