Krishna Kaasyap

KrishnaKaasyap

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

osanseviero/gemini-coder

liked a dataset 3 days ago

PowerInfer/QWQ-LONGCOT-500K

liked a model 3 days ago

PowerInfer/SmallThinker-3B-Preview

KrishnaKaasyap's activity

upvoted an article 9 days ago

Article

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

about 1 month ago

• 75

upvoted a collection 11 days ago

QwQ

Qwen with Questions • 2 items • Updated Nov 28, 2024 • 53

upvoted an article about 1 month ago

Article

Bridging the Gap Between Physical Numerical Simulations and Machine Learning: Introducing The Well

Dec 2, 2024

• 17

upvoted 2 collections about 1 month ago

🎬 Video models

text-to-video & image-to-video models released by the Chinese community • 22 items • Updated 11 days ago • 4

🧠 Reasoning Models

7 items • Updated 4 days ago • 36

upvoted a collection 3 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 32 minutes ago • 149

upvoted a paper 4 months ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 50

upvoted a collection 4 months ago

Jamba-1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated Aug 22, 2024 • 83

upvoted 2 collections 5 months ago

Magnum v2 123b

3 items • Updated Aug 21, 2024 • 6

DeepSeek-V2

8 items • Updated about 22 hours ago • 18

upvoted an article 5 months ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Jul 23, 2024

• 225

upvoted a paper 5 months ago

VITA: Towards Open-Source Interactive Omni Multimodal LLM

Paper • 2408.05211 • Published Aug 9, 2024 • 47

upvoted a collection 5 months ago

Llama-3.1 Quantization

Neural Magic quantized Llama-3.1 models • 22 items • Updated Nov 22, 2024 • 42

upvoted 2 articles 5 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

Jul 29, 2024

• 260

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Jul 27, 2024

• 27

upvoted a collection 5 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated 29 days ago • 637

upvoted a paper 6 months ago

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

Paper • 2406.16860 • Published Jun 24, 2024 • 59

upvoted 3 collections 7 months ago

SSMs

A collection of Mamba-2-based research models with 8B parameters trained on 3.5T tokens for comparison with Transformers. • 5 items • Updated 32 minutes ago • 26

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 32 minutes ago • 161

Yi-1.5 (2024/05)

10 items • Updated May 20, 2024 • 92