Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
3
7
Straughter Guthrie
PRO
jmanhype
Follow
21world's profile picture
todosi's profile picture
fullstack's profile picture
3 followers
ยท
4 following
AI & ML interests
None yet
Recent Activity
liked
a model
17 days ago
GoodiesHere/Apollo-LMMs-Apollo-3B-t32
replied
to
merve
's
post
17 days ago
Apollo is a new family of open-source video language models by Meta, where 3B model outperforms most 7B models and 7B outperforms most 30B models ๐งถ โจ the models come in 1.5B https://huggingface.co/Apollo-LMMs/Apollo-1_5B-t32, 3B https://huggingface.co/Apollo-LMMs/Apollo-3B-t32 and 7B https://huggingface.co/Apollo-LMMs/Apollo-7B-t32 with A2.0 license, based on Qwen1.5 & Qwen2 โจ the authors also release a benchmark dataset https://huggingface.co/spaces/Apollo-LMMs/ApolloBench The paper has a lot of experiments (they trained 84 models!) about what makes the video LMs work โฏ๏ธ Try the demo for best setup here https://huggingface.co/spaces/Apollo-LMMs/Apollo-3B they evaluate sampling strategies, scaling laws for models and datasets, video representation and more! > The authors find out that whatever design decision was applied to small models also scale properly when the model and dataset are scaled ๐ scaling dataset has diminishing returns for smaller models > They evaluate frame sampling strategies, and find that FPS sampling is better than uniform sampling, and they find 8-32 tokens per frame optimal > They also compare image encoders, they try a variation of models from shape optimized SigLIP to DINOv2 they find https://huggingface.co/google/siglip-so400m-patch14-384 to be most powerful ๐ฅ > they also compare freezing different parts of models, training all stages with some frozen parts give the best yield They eventually release three models, where Apollo-3B outperforms most 7B models and Apollo 7B outperforms 30B models ๐ฅ
liked
a model
27 days ago
Qwen/Qwen2.5-Coder-0.5B-Instruct
View all activity
Organizations
jmanhype
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
17 days ago
GoodiesHere/Apollo-LMMs-Apollo-3B-t32
Text Generation
โข
Updated
21 days ago
โข
305
โข
16
liked
a model
27 days ago
Qwen/Qwen2.5-Coder-0.5B-Instruct
Text Generation
โข
Updated
Nov 18, 2024
โข
19.4k
โข
23
liked
a Space
27 days ago
Running
on
Zero
588
๐
OminiControl
liked
a model
5 months ago
openbmb/MiniCPM-V-2_6
Image-Text-to-Text
โข
Updated
Nov 15, 2024
โข
36.9k
โข
884
liked
a Space
9 months ago
Running
on
Zero
187
๐
AniPortrait Official
liked
a Space
12 months ago
Runtime error
29
๐
Scepter Studio
liked
a Space
over 1 year ago
Running
on
L4
2.25k
๐ป
HuggingGPT