Tom Aarsen's picture

Tom Aarsen

tomaarsen

·

https://linkedin.com/in/tomaarsen

AI & ML interests

NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification

Recent Activity

new activity about 12 hours ago

Salesforce/SFR-Embedding-Code-2B_R:Add Sentence Transformers integration

updated a model about 12 hours ago

Salesforce/SFR-Embedding-Code-2B_R

upvoted an article about 12 hours ago

Fine-tune ModernBERT for RAG with Synthetic Data

View all activity

Articles

Train 400x faster Static Embedding Models with Sentence Transformers

Finally, a Replacement for BERT: Introducing ModernBERT

Welcome Gemma 2 - Google's new open LLM

Training and Finetuning Embedding Models with Sentence Transformers v3

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

🪆 Introduction to Matryoshka Embedding Models

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

🕳️ Attention Sinks in LLMs for endless fluency

Organizations

tomaarsen's activity

upvoted an article about 12 hours ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

By

•

about 14 hours ago

• 10

upvoted a paper about 13 hours ago

CodeXEmbed: A Generalist Embedding Model Family for Multiligual and Multi-task Code Retrieval

Paper • 2411.12644 • Published Nov 19, 2024 • 3

upvoted an article 4 days ago

Article

Timm ❤️ Transformers: Use any timm model with transformers

5 days ago

• 30

upvoted an article 6 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

6 days ago

• 113

upvoted an article 7 days ago

Article

Python Is All You Need? Introducing Dria-Agent-α

By

•

10 days ago

• 22

upvoted a paper 9 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 58

upvoted 2 papers 13 days ago

Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP

Paper • 2408.04303 • Published Aug 8, 2024 • 15

Fietje: An open, efficient LLM for Dutch

Paper • 2412.15450 • Published Dec 19, 2024 • 4

upvoted an article 14 days ago

Article

Announcing NVIDIA Cosmos World Foundation Models

By

•

14 days ago

• 23

upvoted a paper 14 days ago

KaLM-Embedding: Superior Training Data Brings A Stronger Embedding Model

Paper • 2501.01028 • Published 19 days ago • 12

upvoted a collection 14 days ago

KaLM-embedding

6 items • Updated 6 days ago • 21

upvoted a collection 16 days ago

OLMo 2

Artifacts for the second set of OLMo models. • 22 items • Updated 14 days ago • 74

upvoted a collection 17 days ago

PubMedBERT Embeddings M2V

Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants. • 5 items • Updated 13 days ago • 3

upvoted a collection 18 days ago

ModernGLiNER

GLiNER models based on modern encoder architectures • 2 items • Updated 28 days ago • 6

upvoted an article 22 days ago

Article

Fine-tune ModernBERT for text classification using synthetic data

By

•

22 days ago

• 23

upvoted a collection 24 days ago

Granite 3.1 Language Models

A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. • 8 items • Updated Dec 18, 2024 • 48

upvoted 2 papers about 1 month ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 13

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 340

upvoted an article about 1 month ago

Article

Use Models from the Hugging Face Hub in LM Studio

By

•

Nov 28, 2024

• 132

upvoted a paper about 1 month ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125