Victor Mustar's picture

Victor Mustar PRO

victor

·

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

upvoted an article about 15 hours ago

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

upvoted a paper 2 days ago

1.58-bit FLUX

liked a dataset 2 days ago

kenhktsui/longtalk-cot-v0.1

View all activity

Articles

Inference for PROs

Organizations

victor's activity

upvoted an article about 15 hours ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

By

•

1 day ago

• 24

upvoted a paper 2 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 10 days ago • 63

upvoted 4 papers 4 days ago

How Well Do LLMs Generate Code for Different Application Domains? Benchmark and Evaluation

Paper • 2412.18573 • Published 10 days ago • 1

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Paper • 2310.03714 • Published Oct 5, 2023 • 32

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Paper • 2412.14922 • Published 16 days ago • 82

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published 10 days ago • 82

upvoted a paper 8 days ago

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published 11 days ago • 59

upvoted a collection 8 days ago

DeepSeek-V3

2 items • Updated 9 days ago • 91

upvoted a paper 11 days ago

Parallelized Autoregressive Visual Generation

Paper • 2412.15119 • Published 15 days ago • 49

upvoted a collection 12 days ago

Vision Language Models

Grounding, chat • 5 items • Updated 4 days ago • 10

upvoted a paper 14 days ago

AniDoc: Animation Creation Made Easier

Paper • 2412.14173 • Published 16 days ago • 49

upvoted 2 papers 15 days ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

upvoted 3 papers 18 days ago

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 22 days ago • 87

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published 18 days ago • 33

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 21 days ago • 136

upvoted 4 papers 21 days ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 26 days ago • 71

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published 24 days ago • 70

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published 24 days ago • 50

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 22 days ago • 92