Jia-Ying Lin
linekin
1 follower · 2 following
AI & ML interests
None yet
Recent Activity
liked a model about 2 hours ago
AdaptLLM/Adapt-MLLM-to-Domains
reacted to m-ric's post 12 days ago
After 6 years, BERT, the workhorse of encoder models, finally gets a replacement: Welcome ModernBERT!
We talk a lot about ✨Generative AI✨, meaning "Decoder version of the Transformers architecture", but this is only one of the ways to build LLMs: encoder models, which turn a sentence into a vector, are maybe even more widely used in industry than generative models.
The workhorse for this category has been BERT since its release in 2018 (that's prehistory for LLMs). It's not a fancy 100B-parameter supermodel (just a few hundred million), but it's an excellent workhorse, kind of a Honda Civic for LLMs.
Many applications use BERT-family models: the top models in this category have accumulated millions of downloads on the Hub.
➡️ Now a collaboration between Answer.AI and LightOn has just introduced BERT's replacement: ModernBERT.
TL;DR:
Architecture changes:
First, standard modernizations:
- Rotary positional embeddings (RoPE)
- Replace GeLU with GeGLU
- Use Flash Attention 2
✨ The team also introduced innovative techniques like alternating attention instead of full attention, and sequence packing to get rid of padding overhead.
As a result, the model tops the game of encoder models: it beats the previous standard, DeBERTaV3, with 1/5th the memory footprint, and runs 4x faster!
Read the blog post: https://huggingface.co/blog/modernbert
reacted to m-ric's post with ❤️ 12 days ago
Since I published it on GitHub a few days ago, Hugging Face's new agentic library smolagents has gathered nearly 4k stars 🤯
➡️ But we are just getting started on agents, so we are hiring an ML Engineer to join me and double down on this effort!
The plan is to build GUI agents: agents that can act on your computer with mouse and keyboard, like Claude Computer Use. We will make it work better, and fully open.
✨ Sounds like something you'd like to do? Apply here: https://apply.workable.com/huggingface/j/AF1D4E3FEB/
Organizations
None yet
linekin's activity
liked a model about 2 hours ago
AdaptLLM/Adapt-MLLM-to-Domains • Updated Dec 14, 2024 • 10
liked 2 models 12 days ago
NousResearch/Hermes-3-Llama-3.2-3B-GGUF • Updated Dec 18, 2024 • 21.2k • 32
NousResearch/Hermes-3-Llama-3.1-8B • Text Generation • Updated Sep 8, 2024 • 49.4k • 280
liked a model 13 days ago
microsoft/phi-4 • Text Generation • Updated 12 days ago • 134k • 1.48k
liked a model 23 days ago
deepseek-ai/DeepSeek-V3 • Updated 22 days ago • 162k • 2.1k
liked a dataset about 1 month ago
taide/taide-bench • Viewer • Updated Apr 12, 2024 • 500 • 98 • 13
liked a model about 1 month ago
google/gemma-2-9b • Text Generation • Updated Aug 7, 2024 • 90.8k • 628
liked a model about 2 months ago
HuggingFaceTB/SmolVLM-Instruct • Image-Text-to-Text • Updated Dec 2, 2024 • 42.9k • 329
liked a model 3 months ago
vikhyatk/moondream2 • Image-Text-to-Text • Updated 12 days ago • 143k • 992
liked a dataset 3 months ago
aigrant/awesome-taiwan-knowledge • Preview • Updated Nov 16, 2024 • 88 • 16
liked 2 models 3 months ago
thenlper/gte-large • Sentence Similarity • Updated Nov 15, 2024 • 871k • 262
neulab/Pangea-7B • Updated Oct 24, 2024 • 8.16k • 122
liked 4 models 4 months ago
facebook/bart-large-mnli • Zero-Shot Classification • Updated Sep 5, 2023 • 2.78M • 1.27k
meetkai/functionary-small-v3.2 • Updated Sep 25, 2024 • 788 • 33
meetkai/functionary-medium-v3.1 • Updated Sep 25, 2024 • 104 • 55
Team-ACE/ToolACE-8B • Updated Oct 22, 2024 • 12.4k • 46
liked 2 models 5 months ago
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • Updated 9 days ago • 1.7M • 1.07k
poloclub/UniTable • Updated Apr 2, 2024 • 23
liked a dataset 5 months ago
gyr66/privacy_detection • Viewer • Updated Oct 17, 2023 • 2.52k • 37 • 3
liked a model 5 months ago
Alibaba-NLP/gte-multilingual-base • Sentence Similarity • Updated 12 days ago • 580k • 177