Yuhui Xu

yuhuixu

https://yuhuixu1993.github.io/

yuhuixu1993

AI & ML interests

None yet

Recent Activity

updated a model 9 days ago

yuhuixu/merged_model_linear_0.6_0.4

published a model 9 days ago

yuhuixu/merged_model_linear_0.6_0.4

updated a model 9 days ago

yuhuixu/merged_model_linear_0.5_0.5

View all activity

Organizations

None yet

yuhuixu's activity

updated a model 9 days ago

yuhuixu/merged_model_linear_0.6_0.4

Text Generation • Updated 9 days ago • 6

published a model 9 days ago

yuhuixu/merged_model_linear_0.6_0.4

Text Generation • Updated 9 days ago • 6

updated a model 9 days ago

yuhuixu/merged_model_linear_0.5_0.5

Text Generation • Updated 9 days ago • 4

published a model 9 days ago

yuhuixu/merged_model_linear_0.5_0.5

Text Generation • Updated 9 days ago • 4

updated a model 9 days ago

yuhuixu/merged_model_linear_0.4_0.6

Text Generation • Updated 9 days ago • 6

published a model 9 days ago

yuhuixu/merged_model_linear_0.4_0.6

Text Generation • Updated 9 days ago • 6

upvoted an article 9 days ago

Article

Mastering Long Contexts in LLMs with KVPress

•

9 days ago

• 58

updated 3 models 19 days ago

upvoted 2 papers 4 months ago

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10, 2024 • 46

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

authored a paper 4 months ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

authored 7 papers 6 months ago

Latency-Aware Differentiable Neural Architecture Search

Paper • 2001.06392 • Published Jan 17, 2020

PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search

Paper • 1907.05737 • Published Jul 12, 2019

Trained Rank Pruning for Efficient Deep Neural Networks

Paper • 1812.02402 • Published Dec 6, 2018 • 1

TRP: Trained Rank Pruning for Efficient Deep Neural Networks

Paper • 2004.14566 • Published Apr 30, 2020 • 1

Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models

Paper • 2402.14800 • Published Feb 22, 2024 • 3

TerDiT: Ternary Diffusion Models with Transformers

Paper • 2405.14854 • Published May 23, 2024 • 2

SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models

Paper • 2405.16057 • Published May 25, 2024