Collections
Collections including paper arxiv:2309.12307
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 88
- CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages
  Paper • 2309.09400 • Published • 84
- Language Modeling Is Compression
  Paper • 2309.10668 • Published • 83

- Efficient Streaming Language Models with Attention Sinks
  Paper • 2309.17453 • Published • 13
- Effective Long-Context Scaling of Foundation Models
  Paper • 2309.16039 • Published • 30
- allenai/longformer-base-4096
  Updated • 2.57M • 185
- google/bigbird-roberta-base
  Updated • 30.2k • 51

- Contrastive Decoding Improves Reasoning in Large Language Models
  Paper • 2309.09117 • Published • 37
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 88
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 243
- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25