@akhaliq on Hugging Face: "Self-Rewarding Language Models paper page:…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

akhaliq

posted an update Jan 19, 2024

Post

Self-Rewarding Language Models

paper page: Self-Rewarding Language Models (2401.10020)

Fine-tuning Llama 2 70B on three iterations of our approach yields a model that outperforms many existing systems on the AlpacaEval 2.0 leaderboard, including Claude 2, Gemini Pro, and GPT-4 0613

WbjuSrceu

Jan 21, 2024

Standardized Vocabulary: Beyond Chat GTP4

In this post

akhaliq AK
WbjuSrceu vhjghvy uyfyfuyfy