O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper โข 2411.16489 โข Published Nov 25, 2024 โข 41
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper โข 2411.16489 โข Published Nov 25, 2024 โข 41
Adaptive Decoding via Latent Preference Optimization Paper โข 2411.09661 โข Published Nov 14, 2024 โข 10
Thinking LLMs: General Instruction Following with Thought Generation Paper โข 2410.10630 โข Published Oct 14, 2024 โข 18
BARTScore: Evaluating Generated Text as Text Generation Paper โข 2106.11520 โข Published Jun 22, 2021 โข 1
FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios Paper โข 2307.13528 โข Published Jul 25, 2023
Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing Paper โข 2107.13586 โข Published Jul 28, 2021
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge Paper โข 2407.19594 โข Published Jul 28, 2024 โข 20
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge Paper โข 2407.19594 โข Published Jul 28, 2024 โข 20