Le Huy Hoang's picture

34 10

Le Huy Hoang

splendor1811

·

huyhoang18112k2

AI & ML interests

Computer Vision

Organizations

None yet

splendor1811's activity

upvoted a collection 2 months ago

MIT Talk 31/10 Papers

14 items • Updated Oct 28, 2024 • 31

upvoted a paper 8 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

upvoted 2 articles 8 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 243

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By

•

Jun 23, 2024

• 34

upvoted 4 papers 8 months ago

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 25

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2, 2024 • 53

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 119

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 120

upvoted an article 8 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 171

upvoted a collection 8 months ago

LLaVa-NeXT

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 27

upvoted a paper 8 months ago

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 52

upvoted 3 collections 8 months ago

Knowledge distillation

88 items • Updated Feb 7, 2024 • 7

RAG

122 items • Updated Sep 13, 2024 • 19

LLMs

355 items • Updated 5 days ago • 25

upvoted 2 collections 9 months ago

Multimodal

251 items • Updated Sep 23, 2024 • 16

MoE

137 items • Updated Jul 9, 2024 • 20

upvoted a paper 9 months ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23, 2024 • 59

upvoted an article 9 months ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

Jun 23, 2024

• 69

upvoted a paper 9 months ago

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 35

upvoted an article 9 months ago

Article

Fine-tune Llama 3 with ORPO

By

•

Apr 22, 2024

• 230