Efficiently Programming Large Language Models using SGLang Paper • 2312.07104 • Published Dec 12, 2023 • 7
MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale Paper • 2412.05237 • Published Dec 6, 2024 • 47
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models Paper • 2411.14982 • Published Nov 22, 2024 • 16
Large Language Models are Visual Reasoning Coordinators Paper • 2310.15166 • Published Oct 23, 2023 • 2
MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures Paper • 2410.13754 • Published Oct 17, 2024 • 75
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Paper • 2310.08588 • Published Oct 12, 2023 • 34
MMBench: Is Your Multi-modal Model an All-around Player? Paper • 2307.06281 • Published Jul 12, 2023 • 5
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 33
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models Paper • 2407.07895 • Published Jul 10, 2024 • 40
CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs Paper • 2406.18521 • Published Jun 26, 2024 • 28
Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models Paper • 2406.04271 • Published Jun 6, 2024 • 29
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper • 2311.05437 • Published Nov 9, 2023 • 48