Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
akhaliq 
posted an update Jan 19, 2024
Post
Self-Rewarding Language Models

paper page: Self-Rewarding Language Models (2401.10020)

Fine-tuning Llama 2 70B on three iterations of our approach yields a model that outperforms many existing systems on the AlpacaEval 2.0 leaderboard, including Claude 2, Gemini Pro, and GPT-4 0613

Standardized Vocabulary: Beyond Chat GTP4

In this post