MaziyarPanahi/Mistral-11B-Instruct-v0.2-Mistral-7B-Instruct-v0.2-slerp • Text Generation • Updated Jan 10, 2024 • 38 • 2
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences • Paper • 2404.03715 • Published Apr 4, 2024 • 60
tomaarsen/span-marker-roberta-large-ontonotes5 • Token Classification • Updated Sep 22, 2023 • 1.27k • 12
McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised • Sentence Similarity • Updated Apr 30, 2024 • 18.6k • 48
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 • Text Generation • Updated Oct 10, 2024 • 7.81k • 18
Personalized Multimodal Large Language Models: A Survey • Paper • 2412.02142 • Published Dec 3, 2024 • 12
MixLLM: LLM Quantization with Global Mixed-precision between Output-features and Highly-efficient System Design • Paper • 2412.14590 • Published 16 days ago • 13
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning • Paper • 2412.15797 • Published 15 days ago • 15