Weizhe Yuan

weizhey

AI & ML interests

NLP

weizhey's activity

upvoted a paper about 1 month ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 41

authored a paper about 1 month ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 41

upvoted a paper about 2 months ago

Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published Nov 14, 2024 • 10

authored a paper 3 months ago

Thinking LLMs: General Instruction Following with Thought Generation

Paper • 2410.10630 • Published Oct 14, 2024 • 18

authored 8 papers 5 months ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5, 2024 • 27

BARTScore: Evaluating Generated Text as Text Generation

Paper • 2106.11520 • Published Jun 22, 2021 • 1

FacTool: Factuality Detection in Generative AI -- A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios

Paper • 2307.13528 • Published Jul 25, 2023

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

Paper • 2107.13586 • Published Jul 28, 2021

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

upvoted a paper 5 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 20

liked a Space 6 months ago

Tile Upscaler

Space • Running on Zero • 639

authored a paper 8 months ago

Iterative Reasoning Preference Optimization

Paper • 2404.19733 • Published Apr 30, 2024 • 47

reacted to xiaohk's post with 🤗 10 months ago

Post

Hello world!

reacted to mrm8488's post with 🤗 10 months ago

Post

Hello world! 🔥

upvoted a paper 12 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 145

authored a paper 12 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 145

upvoted a paper over 1 year ago

System-Level Natural Language Feedback

Paper • 2306.13588 • Published Jun 23, 2023 • 10