Junxiong Wang

JunxiongWang

https://www.cs.cornell.edu/~junxiong/

jxiw

AI & ML interests

Attention Free Model / Subquadratic Language Models

Recent Activity

updated a dataset 1 day ago

JunxiongWang/model_revision_max_4_closest_and_random

updated a collection 9 days ago

Mamba-In-Zephyr

updated a model 10 days ago

JunxiongWang/mamba_0_5_distill

Collections 7

Papers 3

arxiv:2408.15237

arxiv:2401.13660

arxiv:2212.10544

Models 37

JunxiongWang/mamba_0_5_distill

Updated 10 days ago • 9

JunxiongWang/Llama3.2-Mamba-3B-dpo

Updated Nov 17, 2024 • 7

JunxiongWang/Llama3.2-Mamba-3B-distill

Updated Nov 17, 2024 • 54

JunxiongWang/Llama3.2-Mamba2-3B-distill

Updated Nov 17, 2024 • 169

JunxiongWang/Llama3.2-Mamba2-3B-dpo

Updated Nov 17, 2024 • 14

JunxiongWang/Llama3.1-Mamba2-8B-dpo

Updated Nov 17, 2024 • 2

JunxiongWang/Llama3.1-Mamba-8B-dpo

Updated Nov 17, 2024 • 5

JunxiongWang/Llama3.1-Mamba2-8B-distill

Updated Nov 17, 2024 • 14

JunxiongWang/Llama3.1-Mamba-8B-distill

Updated Nov 17, 2024 • 13

JunxiongWang/MambaByte_Stories

Text Generation • Updated Sep 9, 2024 • 8 • 1

Datasets 5

JunxiongWang/model_revision_max_4_closest_and_random

Viewer • Updated 1 day ago • 170k • 18

JunxiongWang/sftdatasetv3

Viewer • Updated Oct 7, 2024 • 12.4M • 70

JunxiongWang/sftdataset

Viewer • Updated Aug 28, 2024 • 11M • 199 • 2

JunxiongWang/llama3-ultrafeedback-armorm

Viewer • Updated Aug 27, 2024 • 61.8k • 108 • 1

JunxiongWang/testdataset

Viewer • Updated Jun 23, 2024 • 1M • 107