- Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
  Paper • 2306.05685 • 32
- prometheus-eval/Feedback-Collection
  Viewer • 100k • 405 • 107
- prometheus-eval/prometheus-13b-v1.0
  Text2Text Generation • 4.69k • 130
- HuggingFaceH4/ultrafeedback_binarized
  Viewer • 187k • 6.33k • 263
Krzysztof Sopyla
ksopyla
AI & ML interests
NLP, knowledge extraction, knowledge graphs, semantic similarity, model factfulness
Recent Activity
liked a model 12 days ago
meta-llama/Llama-3.2-1B
liked a model about 2 months ago
fixie-ai/ultravox-v0_4_1-llama-3_1-8b
liked a dataset about 2 months ago
microsoft/orca-agentinstruct-1M-v1
Organizations
Collections
2
models
None public yet
datasets
None public yet