DavidML29
's Collections
Papers
updated
A Comparative Study on Automatic Coding of Medical Letters with
Explainability
Paper
•
2407.13638
•
Published
•
5
Internet of Agents: Weaving a Web of Heterogeneous Agents for
Collaborative Intelligence
Paper
•
2407.07061
•
Published
•
27
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
•
2407.03502
•
Published
•
51
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting
Region Captions
Paper
•
2407.06723
•
Published
•
11
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in
Large Language Models Using Only Attention Maps
Paper
•
2407.07071
•
Published
•
12
HoloDreamer: Holistic 3D Panoramic World Generation from Text
Descriptions
Paper
•
2407.15187
•
Published
•
12
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in
Long-Horizon Tasks
Paper
•
2408.03615
•
Published
•
30
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular
Annotations for Medicine
Paper
•
2408.02900
•
Published
•
25
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards
General Medical AI
Paper
•
2408.03361
•
Published
•
85
The AI Scientist: Towards Fully Automated Open-Ended Scientific
Discovery
Paper
•
2408.06292
•
Published
•
117
OpenResearcher: Unleashing AI for Accelerated Scientific Research
Paper
•
2408.06941
•
Published
•
30
LLM-3D Print: Large Language Models To Monitor and Control 3D Printing
Paper
•
2408.14307
•
Published
•
3
Writing in the Margins: Better Inference Pattern for Long Context
Retrieval
Paper
•
2408.14906
•
Published
•
138
OLMoE: Open Mixture-of-Experts Language Models
Paper
•
2409.02060
•
Published
•
77
LongCite: Enabling LLMs to Generate Fine-grained Citations in
Long-context QA
Paper
•
2409.02897
•
Published
•
44
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized
Academic Assistance
Paper
•
2409.04593
•
Published
•
23
Minstrel: Structural Prompt Generation with Multi-Agents Coordination
for Non-AI Experts
Paper
•
2409.13449
•
Published
•
10
Imagine yourself: Tuning-Free Personalized Image Generation
Paper
•
2409.13346
•
Published
•
68
LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks
Paper
•
2410.01744
•
Published
•
26
Large Language Models as Markov Chains
Paper
•
2410.02724
•
Published
•
30
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language
Models
Paper
•
2410.13085
•
Published
•
21
Florence-VL: Enhancing Vision-Language Models with Generative Vision
Encoder and Depth-Breadth Fusion
Paper
•
2412.04424
•
Published
•
58
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for
Fast, Memory Efficient, and Long Context Finetuning and Inference
Paper
•
2412.13663
•
Published
•
116