Li Dong

unilm

AI & ML interests

Language Model Pre-Training

Recent Activity

liked a dataset 5 days ago

MAmmoTH-VL/MAmmoTH-VL-Instruct-12M

liked a dataset 5 days ago

Yuanshi/Subjects200K

liked a dataset 5 days ago

jackyhate/text-to-image-2M

unilm's activity

liked 2 datasets 6 days ago

HuggingFaceTB/smoltalk

Viewer • Updated Nov 26, 2024 • 2.2M • 10.5k • 261

qingy2024/QwQ-LongCoT-Verified-130K

Viewer • Updated 13 days ago • 467k • 563 • 20

liked a dataset 9 days ago

HuggingFaceTB/finemath

Viewer • Updated 9 days ago • 48.3M • 24.2k • 205

liked a model 13 days ago

microsoft/VidTok

Updated 7 days ago • 24

liked a dataset 14 days ago

TIGER-Lab/OmniEdit-Filtered-1.2M

Viewer • Updated 26 days ago • 1.2M • 21.7k • 40

liked a dataset 15 days ago

OpenGVLab/OmniCorpus-CC-210M

Viewer • Updated Nov 17, 2024 • 208M • 696 • 19

liked 2 datasets 16 days ago

PixArt-alpha/SAM-LLaVA-Captions10M

Viewer • Updated Jan 12, 2024 • 11.5M • 170 • 55

apple/DataCompDR-1B

Viewer • Updated Jul 30, 2024 • 1.28B • 170k • 18

upvoted a paper 16 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 19 days ago • 132

liked 2 datasets 16 days ago

JourneyDB/JourneyDB

Updated Aug 10, 2023 • 2.05k • 66

mlfoundations/datacomp_1b

Viewer • Updated Aug 21, 2023 • 1.39B • 2.36k • 30

liked a dataset 17 days ago

Koala-36M/Koala-36M-v1

Viewer • Updated Oct 12, 2024 • 36M • 380 • 25

authored 4 papers 19 days ago

Language Models as Inductive Reasoners

Paper • 2212.10923 • Published Dec 21, 2022 • 2

Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks

Paper • 2208.10442 • Published Aug 22, 2022

RedStone: Curating General, Code, Math, and QA Data for Large Language Models

Paper • 2412.03398 • Published 28 days ago • 1

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published 21 days ago • 41

upvoted a paper 19 days ago

Multimodal Latent Language Modeling with Next-Token Diffusion

Paper • 2412.08635 • Published 21 days ago • 41