MoonRide

AI & ML interests

None yet

Recent Activity

liked a model about 9 hours ago

bartowski/c4ai-command-r7b-12-2024-GGUF

liked a model about 9 hours ago

CohereForAI/c4ai-command-r7b-12-2024

liked a model about 10 hours ago

bartowski/Qwen2.5-32B-AGI-GGUF

MoonRide's activity

upvoted a paper 7 days ago

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published 20 days ago • 14

upvoted 3 papers 11 days ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published 13 days ago • 40

LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token

Paper • 2501.03895 • Published 14 days ago • 48

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published 11 days ago • 80

upvoted a paper 16 days ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published 19 days ago • 97

upvoted a paper about 1 month ago

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 91

upvoted a collection about 2 months ago

Unsloth 4-bit Dynamic Quants

Collection

Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while using only <10% more VRAM than BnB 4-bit • 11 items • Updated about 6 hours ago • 23

upvoted a paper 3 months ago

AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs

Paper • 2410.05295 • Published Oct 3, 2024 • 12

upvoted 3 papers 4 months ago

HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

Paper • 2409.16191 • Published Sep 24, 2024 • 42

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 109

upvoted a paper 5 months ago

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Paper • 2408.06266 • Published Aug 12, 2024 • 10

upvoted 4 papers 6 months ago

Scaling Exponents Across Parameterizations and Optimizers

Paper • 2407.05872 • Published Jul 8, 2024 • 1

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22, 2024 • 10

Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies

Paper • 2407.13623 • Published Jul 18, 2024 • 54

NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?

Paper • 2407.11963 • Published Jul 16, 2024 • 44

upvoted a collection 6 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 640

upvoted a paper 6 months ago

Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis

Paper • 2407.12173 • Published Jul 16, 2024 • 2

upvoted 2 papers 7 months ago

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models

Paper • 2407.02687 • Published Jul 2, 2024 • 22

Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion

Paper • 2407.01392 • Published Jul 1, 2024 • 40