Matricardi Fabio's picture

Matricardi Fabio

FM-1976

·

https://medium.com/@fabio.matricardi

AI & ML interests

control system engineering, AI, LLM with python. ThePoorGPUguy on substack

Recent Activity

liked a Space 30 minutes ago

freddyaboulton/gradio_pdf

liked a model 4 days ago

madebyollin/sdxl-vae-fp16-fix

new activity 4 days ago

OuteAI/OuteTTS-0.3-500M-GGUF:llama.cpp binary usage?

View all activity

Organizations

None yet

FM-1976's activity

upvoted a collection 5 days ago

Aira

Aira is a series of chatbots developed as an experimentation playground for value alignment. • 27 items • Updated Jun 20, 2024 • 1

upvoted a collection 7 days ago

Loxa

a Loxa family models are best models to running on CPU and GPU with high quality(=>92% accuracy) • 4 items • Updated 5 days ago • 2

upvoted a collection 8 days ago

Quadrifoglio 🍀

Small text2text models finetuned on Italian machine translation tasks. • 6 items • Updated 9 days ago • 1

upvoted 2 papers about 1 month ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 125

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 44

upvoted 5 papers about 2 months ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published Nov 19, 2024 • 48

FluidML: Fast and Memory Efficient Inference Optimization

Paper • 2411.09242 • Published Nov 14, 2024 • 1

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 28

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 58

Ultra-Sparse Memory Network

Paper • 2411.12364 • Published Nov 19, 2024 • 19

upvoted a collection 3 months ago

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 29 days ago • 204

upvoted a paper 3 months ago

Scalable MatMul-free Language Modeling

Paper • 2406.02528 • Published Jun 4, 2024 • 11

upvoted a collection 4 months ago

LLM

Collection of OpenVINO optimized LLMs • 135 items • Updated 29 days ago • 21

upvoted an article 4 months ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 216

upvoted 2 collections 4 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 562

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated Dec 13, 2024 • 82

upvoted 2 collections 6 months ago

LLM

Multimodal LLM • 238 items • Updated Sep 26, 2024 • 11

Trained Models 🏋️

They may be small, but they're training like giants! • 8 items • Updated Dec 3, 2024 • 17

upvoted a collection 9 months ago

Minerva LLMs

The first family of LLMs pretrained from scratch on Italian. • 6 items • Updated Dec 7, 2024 • 33