Jens van Holland

jvh

jvhgit

AI & ML interests

Deep Learning, NLP, applications and Data Science

Organizations

None yet

jvh's activity

upvoted a collection 8 months ago

GLM-4

GLM-4 Open Models • 13 items • Updated Nov 27, 2024 • 116

upvoted a paper 8 months ago

DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence

Paper • 2401.14196 • Published Jan 25, 2024 • 51

upvoted an article 9 months ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 232

upvoted 3 papers 10 months ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28, 2024 • 107

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 607

Model Stock: All we need is just a few fine-tuned models

Paper • 2403.19522 • Published Mar 28, 2024 • 10

upvoted a collection 10 months ago

INT4/8 Quantized Whisper CT2

Int4/8 quantized Whisper models built with the quanto and CTranslate2 packages. They require (much) fewer GPU resources while keeping performance. • 4 items • Updated Mar 19, 2024 • 2

upvoted a paper 11 months ago

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 78