arxiv:2406.12832
Armin Azizi
arminazizi59
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
upvoted a paper about 2 months ago
Hymba: A Hybrid-head Architecture for Small Language Models
authored a paper 7 months ago
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation
Organizations
None yet
Papers
1
Datasets
None public yet