KW's picture

60 1046

KW

kevineen

·

AI & ML interests

None yet

Recent Activity

liked a model less than a minute ago

RUC-AIBOX/Virgo-72B

upvoted a paper 5 minutes ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

liked a model 44 minutes ago

qingy2024/UwU-7B-Instruct

View all activity

Organizations

kevineen's activity

upvoted a paper 5 minutes ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published 3 days ago • 5

upvoted a paper 1 day ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 59

upvoted a paper 3 days ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 24 days ago • 136

upvoted a paper 5 days ago

1.58-bit FLUX

Paper • 2412.18653 • Published 13 days ago • 66

upvoted a collection 6 days ago

YuLan-Mini

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated 8 days ago • 10

upvoted a paper 11 days ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published 14 days ago • 23

upvoted a paper 13 days ago

Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Paper • 2412.04454 • Published Dec 5, 2024 • 57

upvoted 2 papers 17 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 18 days ago • 335

AniDoc: Animation Creation Made Easier

Paper • 2412.14173 • Published 19 days ago • 49

upvoted a collection 30 days ago

InternVL2.5

Better than InternVL 2.0 • 18 items • Updated 6 days ago • 78

upvoted 2 collections about 1 month ago

LLM-jp-3 Pre-trained Models

Pre-trained models in the LLM-jp-3 model series • 4 items • Updated 13 days ago • 5

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 68

upvoted a paper about 2 months ago

xLSTM: Extended Long Short-Term Memory

Paper • 2405.04517 • Published May 7, 2024 • 12

upvoted an article 3 months ago

Article

🧨 Diffusers welcomes Stable Diffusion 3.5 Large

Oct 22, 2024

• 49

upvoted a collection 3 months ago

Gemma-APS Release

Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 24 days ago • 20

upvoted 2 papers 3 months ago

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 36

upvoted a paper 4 months ago

MuCodec: Ultra Low-Bitrate Music Codec

Paper • 2409.13216 • Published Sep 20, 2024 • 23

upvoted a collection 4 months ago

Kurage

Multipurpose RAG models for many languages • 13 items • Updated Oct 10, 2024 • 2

upvoted a paper 4 months ago

InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning

Paper • 2409.12568 • Published Sep 19, 2024 • 48