Victor Gallego's picture

Victor Gallego

vicgalle

·

https://github.com/vicgalle

AI & ML interests

Preference fine-tuning, alignment & synthetic data. Building LLMs in general!

Recent Activity

liked a model 12 days ago

Qwen/QVQ-72B-Preview

updated a model 18 days ago

KomorebiAI/nllb-200-1.3B-ct2

updated a model 18 days ago

KomorebiAI/nllb-200-1.3B-float16-ct2

View all activity

Organizations

vicgalle's activity

upvoted an article 2 months ago

Article

VLM Art Analysis

By

•

Oct 4, 2024

• 11

upvoted a collection 3 months ago

steiner-preview

Reasoning models trained on synthetic data using reinforcement learning. • 3 items • Updated Oct 20, 2024 • 27

upvoted 2 papers 3 months ago

Do LLMs Have Political Correctness? Analyzing Ethical Biases and Jailbreak Vulnerabilities in AI Systems

Paper • 2410.13334 • Published Oct 17, 2024 • 12

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 169

upvoted a collection 3 months ago

Llama 3.2 Re-upload

10 items • Updated Sep 25, 2024 • 11

upvoted 2 papers 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 43

upvoted an article 5 months ago

Article

Tensor Parallelism

By

•

Aug 20, 2024

• 11

upvoted a collection 5 months ago

Hermes 3

The Hermes 3 Series of Models • 10 items • Updated 25 days ago • 100

upvoted a paper 5 months ago

WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models

Paper • 2408.03837 • Published Aug 7, 2024 • 17

upvoted a collection 6 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated about 1 month ago • 637

upvoted 3 articles 6 months ago

Article

Announcing Finance Commons and the Bad Data Toolbox: Pioneering Open Data and Advanced Document Processing

By

•

Jul 19, 2024

• 18

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 295

Article

The Rise of Agentic Data Generation

By

•

Jul 15, 2024

• 80

upvoted 3 papers 6 months ago

BM25S: Orders of magnitude faster lexical search via eager sparse scoring

Paper • 2407.03618 • Published Jul 4, 2024 • 11

Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 96

Symbolic Learning Enables Self-Evolving Agents

Paper • 2406.18532 • Published Jun 26, 2024 • 11

upvoted a collection 6 months ago

Probably DPO datasets

A collection of datasets that probably support DPO • 146 items • Updated Jun 26, 2024 • 12

upvoted a paper 6 months ago

TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings

Paper • 2406.15586 • Published Jun 21, 2024 • 2

upvoted a paper 7 months ago

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20, 2024 • 29