Why does the model rank differently with the exact same openness?

#2
by zhiminy - opened

The benchmarking protocol seems to be unclear:
image.png

image.png

image.png

Sign up or log in to comment