arxiv:2411.13676
Ameya Sunil Mahabaleshwarkar
ameyasunilm
AI & ML interests
Deep Learning, NLP, LLM
Recent Activity
authored
a paper
about 2 months ago
Hymba: A Hybrid-head Architecture for Small Language Models
liked
a model
4 months ago
nvidia/Mistral-NeMo-Minitron-8B-Instruct
new activity
4 months ago
nvidia/Nemotron-Mini-4B-Instruct:Minor issues with the chat template during fine-tuning
Organizations
Papers
1
models
None public yet
datasets
None public yet