- Learning to Decode Collaboratively with Multiple Language Models
  Paper • 2403.03870 • Published • 18
- OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
  Paper • 2402.07456 • Published • 42
- VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models
  Paper • 2403.05438 • Published • 18
Bikram Mondal
notBik
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
liked a model 14 days ago
FastVideo/FastHunyuan
liked a model 14 days ago
answerdotai/ModernBERT-base
Organizations
None yet
Collections
1
Models
None public yet
Datasets
None public yet