Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2402.09906

ibm/AttaQ

Viewer • Updated Jan 26, 2024 • 1.4k • 1.1k • 13
snorkelai/snorkel-curated-instruction-tuning

Preview • Updated Mar 11, 2024 • 36 • 8
corbyrosset/researchy_questions

Viewer • Updated Feb 29, 2024 • 96.4k • 44 • 25
argilla/ultrafeedback-binarized-preferences

Viewer • Updated Nov 30, 2023 • 63.6k • 288 • 70

LLM Augmented LLMs: Expanding Capabilities through Composition

Paper • 2401.02412 • Published Jan 4, 2024 • 36
Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53
Distilling Step-by-Step! Outperforming Larger Language Models with Less Training Data and Smaller Model Sizes

Paper • 2305.02301 • Published May 3, 2023 • 2
Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19, 2024 • 50

Instruction Tuning

A Survey on Data Selection for LLM Instruction Tuning

Paper • 2402.05123 • Published Feb 4, 2024 • 3
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 49
Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53
Instruction-tuned Language Models are Better Knowledge Learners

Paper • 2402.12847 • Published Feb 20, 2024 • 25

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53

Speculative Streaming: Fast LLM Inference without Auxiliary Models

Paper • 2402.11131 • Published Feb 16, 2024 • 42
Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 104
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 19

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53

OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models

Paper • 2402.01739 • Published Jan 29, 2024 • 26
Rethinking Interpretability in the Era of Large Language Models

Paper • 2402.01761 • Published Jan 30, 2024 • 22
Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6, 2024 • 114
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model

Paper • 2402.07827 • Published Feb 12, 2024 • 45

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 104
How to Train Data-Efficient LLMs

Paper • 2402.09668 • Published Feb 15, 2024 • 40
BitDelta: Your Fine-Tune May Only Be Worth One Bit

Paper • 2402.10193 • Published Feb 15, 2024 • 19
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15, 2024 • 36

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs

Paper • 2406.15319 • Published Jun 21, 2024 • 62
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Paper • 2407.12883 • Published Jul 16, 2024 • 8
mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval

Paper • 2407.19669 • Published Jul 29, 2024 • 23

Generative Representational Instruction Tuning (GRIT)

Generative Representational Instruction Tuning

Paper • 2402.09906 • Published Feb 15, 2024 • 53
GritLM/GritLM-7B

Text Generation • Updated Feb 16, 2024 • 34.8k • 89
GritLM/GritLM-8x7B

Text Generation • Updated Feb 16, 2024 • 4.07k • 35
GritLM/GritLM-7B-KTO

Text Generation • Updated Jun 14, 2024 • 7.13k • 4

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs