Clem 🤗's picture

Clem 🤗 PRO

clem

·

http://huggingface.co

AI & ML interests

multi-modal, time-series, biology and chemistry

Recent Activity

reacted to cfahlgren1's post with 🚀 about 15 hours ago

The https://huggingface.co/deepseek-ai/DeepSeek-V3 is very good! I have been playing with it and found it is really good at one-shotting a pretty good landing page. You can play with it here: https://deepseek-artifacts.vercel.app All the responses get saved in the https://huggingface.co/datasets/cfahlgren1/react-code-instructions dataset. Hopefully we can build one of the biggest, highest quality frontend datasets on the hub 💪

reacted to csabakecskemeti's post with 🚀 about 15 hours ago

The https://huggingface.co/deepseek-ai/DeepSeek-V3-Base model has featured today on CNBC tech news. The whale made a splash by using FP8 and shrink the cost of training significantly! https://youtu.be/NJljq429cGk?si=kgk-ogPTMfJKsaA2

reacted to csabakecskemeti's post with 😎 about 15 hours ago

The https://huggingface.co/deepseek-ai/DeepSeek-V3-Base model has featured today on CNBC tech news. The whale made a splash by using FP8 and shrink the cost of training significantly! https://youtu.be/NJljq429cGk?si=kgk-ogPTMfJKsaA2

View all activity

Organizations

clem's activity

upvoted a paper 15 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published 17 days ago • 116

upvoted a collection 15 days ago

ModernBERT

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 15 days ago • 112

upvoted a paper 16 days ago

The Open Source Advantage in Large Language Models (LLMs)

Paper • 2412.12004 • Published 18 days ago • 9

upvoted 17 papers 18 days ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published 22 days ago • 35

GenEx: Generating an Explorable World

Paper • 2412.09624 • Published 22 days ago • 87

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published 21 days ago • 136

Euclid: Supercharging Multimodal LLMs with Synthetic High-Fidelity Visual Descriptions

Paper • 2412.08737 • Published 23 days ago • 52

Phi-4 Technical Report

Paper • 2412.08905 • Published 23 days ago • 95

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published 22 days ago • 92

POINTS1.5: Building a Vision-Language Model towards Real World Applications

Paper • 2412.08443 • Published 23 days ago • 38

LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations

Paper • 2412.08580 • Published 23 days ago • 45

SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Paper • 2412.07760 • Published 24 days ago • 50

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published 24 days ago • 46

Evaluating and Aligning CodeLLMs on Human Preference

Paper • 2412.05210 • Published 28 days ago • 47

STIV: Scalable Text and Image Conditioned Video Generation

Paper • 2412.07730 • Published 24 days ago • 70

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published 25 days ago • 64

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published 25 days ago • 72

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published 26 days ago • 71

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published 28 days ago • 46

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published 29 days ago • 48