Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published about 1 month ago • 45
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 57
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024 • 29
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback Paper • 2410.19133 • Published Oct 24, 2024 • 11
Pangea: A Fully Open Multilingual Multimodal LLM for 39 Languages Paper • 2410.16153 • Published Oct 21, 2024 • 44
Scaling Retrieval-Based Language Models with a Trillion-Token Datastore Paper • 2407.12854 • Published Jul 9, 2024 • 29
Fine-grained Hallucination Detection and Editing for Language Models Paper • 2401.06855 • Published Jan 12, 2024 • 3
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories Paper • 2212.10511 • Published Dec 20, 2022 • 1
Do NLP Models Know Numbers? Probing Numeracy in Embeddings Paper • 1909.07940 • Published Sep 17, 2019
DROP: A Reading Comprehension Benchmark Requiring Discrete Reasoning Over Paragraphs Paper • 1903.00161 • Published Mar 1, 2019
One Embedder, Any Task: Instruction-Finetuned Text Embeddings Paper • 2212.09741 • Published Dec 19, 2022 • 3
HINT: Hypernetwork Instruction Tuning for Efficient Zero-Shot Generalisation Paper • 2212.10315 • Published Dec 20, 2022 • 1
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering Paper • 2303.11897 • Published Mar 21, 2023
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics Paper • 2009.10795 • Published Sep 22, 2020
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging Paper • 2310.11564 • Published Oct 17, 2023 • 2
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models Paper • 2310.01329 • Published Oct 2, 2023