-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 62 -
Learning To Teach Large Language Models Logical Reasoning
Paper ā¢ 2310.09158 ā¢ Published ā¢ 1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 8 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ā¢ 2308.09583 ā¢ Published ā¢ 7
Collections
Discover the best community collections!
Collections including paper arxiv:2401.02038
-
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper ā¢ 2310.09199 ā¢ Published ā¢ 26 -
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper ā¢ 2310.08740 ā¢ Published ā¢ 14 -
Personality Traits in Large Language Models
Paper ā¢ 2307.00184 ā¢ Published ā¢ 20 -
An Emulator for Fine-Tuning Large Language Models using Small Language Models
Paper ā¢ 2310.12962 ā¢ Published ā¢ 14
-
meta-llama/Llama-2-7b-chat-hf
Text Generation ā¢ Updated ā¢ 1.1M ā¢ ā¢ 4.12k -
mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration
Paper ā¢ 2311.04257 ā¢ Published ā¢ 20 -
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing
Paper ā¢ 2311.00571 ā¢ Published ā¢ 41 -
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 62
-
XGen-7B Technical Report
Paper ā¢ 2309.03450 ā¢ Published ā¢ 8 -
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper ā¢ 2309.03852 ā¢ Published ā¢ 44 -
Robotic Table Tennis: A Case Study into a High Speed Learning System
Paper ā¢ 2309.03315 ā¢ Published ā¢ 6 -
Improving Text Embeddings with Large Language Models
Paper ā¢ 2401.00368 ā¢ Published ā¢ 79