Distilled directly from Llama, then fine-tuned with DPO
Junxiong Wang
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
Updated a dataset 1 day ago: JunxiongWang/model_revision_max_4_closest_and_random
Updated a collection 9 days ago: Mamba-In-Zephyr
Updated a model 10 days ago: JunxiongWang/mamba_0_5_distill
Collections 7
Models 37
JunxiongWang/mamba_0_5_distill • Updated • 9
JunxiongWang/Llama3.2-Mamba-3B-dpo • Updated • 7
JunxiongWang/Llama3.2-Mamba-3B-distill • Updated • 54
JunxiongWang/Llama3.2-Mamba2-3B-distill • Updated • 169
JunxiongWang/Llama3.2-Mamba2-3B-dpo • Updated • 14
JunxiongWang/Llama3.1-Mamba2-8B-dpo • Updated • 2
JunxiongWang/Llama3.1-Mamba-8B-dpo • Updated • 5
JunxiongWang/Llama3.1-Mamba2-8B-distill • Updated • 14
JunxiongWang/Llama3.1-Mamba-8B-distill • Updated • 13
JunxiongWang/MambaByte_Stories • Text Generation • Updated • 8 • 1
Datasets 5
JunxiongWang/model_revision_max_4_closest_and_random • Viewer • Updated • 170k • 18
JunxiongWang/sftdatasetv3 • Viewer • Updated • 12.4M • 70
JunxiongWang/sftdataset • Viewer • Updated • 11M • 199 • 2
JunxiongWang/llama3-ultrafeedback-armorm • Viewer • Updated • 61.8k • 108 • 1
JunxiongWang/testdataset • Viewer • Updated • 1M • 107