Hackiey
's Collections
Reward-Augmented Decoding: Efficient Controlled Text Generation With a
Unidirectional Reward Model
Paper
•
2310.09520
•
Published
•
10
When can transformers reason with abstract symbols?
Paper
•
2310.09753
•
Published
•
2
Improving Large Language Model Fine-tuning for Solving Math Problems
Paper
•
2310.10047
•
Published
•
5
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation,
Generation and Editing
Paper
•
2311.00571
•
Published
•
41
Fine-tuning Language Models for Factuality
Paper
•
2311.08401
•
Published
•
28
Exponentially Faster Language Modelling
Paper
•
2311.10770
•
Published
•
117
SmartPlay : A Benchmark for LLMs as Intelligent Agents
Paper
•
2310.01557
•
Published
•
12
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
•
2402.17764
•
Published
•
605