Hieu Ngo's picture

Hieu Ngo

hiieu

·

AI & ML interests

Applied, Post-Training LLM

Recent Activity

updated a model 23 days ago

hiieu/gemma-2-2b-it-lora-vi-en

liked a model about 1 month ago

AIDC-AI/Marco-o1

liked a dataset about 1 month ago

VTSNLP/vietnamese_curated_dataset

View all activity

Organizations

hiieu's activity

upvoted a paper about 1 month ago

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Paper • 2411.14405 • Published Nov 21, 2024 • 58

upvoted a paper about 2 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 48

upvoted 3 papers 2 months ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 23

GPT-4o System Card

Paper • 2410.21276 • Published Oct 25, 2024 • 82

Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Paper • 2410.15999 • Published Oct 21, 2024 • 19

upvoted 2 articles 3 months ago

Article

MedEmbed: Fine-Tuned Embedding Models for Medical / Clinical IR

By

•

Oct 20, 2024

• 34

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

By

•

Oct 14, 2024

• 61

upvoted a paper 4 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 46

upvoted a collection 4 months ago

Gemma 2 ChatQA RAG finetuned

1 item • Updated Sep 2, 2024 • 1

upvoted an article 5 months ago

Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

Aug 21, 2024

• 25

upvoted 2 papers 5 months ago

Synthesizing Text-to-SQL Data from Weak and Strong LLMs

Paper • 2408.03256 • Published Aug 6, 2024 • 11

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 24

upvoted a collection 5 months ago

ShieldGemma Release

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated 24 days ago • 11

upvoted a paper 6 months ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

upvoted a collection 6 months ago

Minitron

A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated about 11 hours ago • 60

upvoted a paper 6 months ago

ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities

Paper • 2407.14482 • Published Jul 19, 2024 • 26

upvoted a collection 6 months ago

NuminaMath

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21, 2024 • 69

upvoted 2 articles 6 months ago

Article

How we leveraged distilabel to create an Argilla 2.0 Chatbot

Jul 16, 2024

• 32

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 295

upvoted a collection 6 months ago

H2O Danube3

7 items • Updated Nov 30, 2024 • 56