arxiv:2407.01449
Hugues Sibille
HugSib
AI & ML interests
None yet
Recent Activity
updated a dataset 4 months ago
illuin/vdsid_filtered_test
upvoted a paper 4 months ago
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering
Organizations
Papers
1
Models
None public yet
Datasets
None public yet