LoRA: Low-Rank Adaptation of Large Language Models Paper • 2106.09685 • Published Jun 17, 2021 • 31
Sparsh Collection Models and datasets for Sparsh: Self-supervised touch representations for vision-based tactile sensing • 15 items • Updated Oct 24, 2024 • 12
belyakoff/llama-3.2-3b-instruct-fine-tuned-gptq-8bit Text Generation • Updated Sep 30, 2024 • 47 • 2
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning Paper • 2412.15797 • Published 17 days ago • 16
Progressive Multimodal Reasoning via Active Retrieval Paper • 2412.14835 • Published 18 days ago • 70
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces Paper • 2412.14171 • Published 19 days ago • 23
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 58
VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation Paper • 2412.10704 • Published 23 days ago • 15
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Paper • 1810.04805 • Published Oct 11, 2018 • 16
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • Updated Nov 1, 2024 • 66.2M • 2.77k
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 19 days ago • 17