Commit History
Update README.md
1af6a60
verified
Update README.md
118b4a8
verified
minor fix
3cf286c
minor fix
2aa9a75
update gitignore
6632750
minor updates in publishing and logging results
2b9835a
minor update and extend to support different APIs
150bb15
Update src/display/about.py
8a6bfdc
verified
Update src/display/about.py
02cd86f
verified
Update src/display/about.py
56492c3
verified
Update src/display/about.py
6472dd8
verified
Update src/display/about.py
2a8e044
verified
Update src/display/about.py
b92e0da
verified
Updated bibtex
418a002
verified
Updated bibtex
31b8757
verified
Added bibtex
5ead597
verified
Updated bibtex citation
bac5383
verified
Update src/display/about.py
e2aca33
verified
Update src/display/about.py
3c0cb66
verified
Added tags metadata to make the leaderboard more discoverable.
1ad00dd
verified
Update README.md
2a16b2e
verified
fixed typo
fa4eaec
Minseok Bae
modified about.py
818ee3d
Minseok Bae
Modified about.py so that it displays (%) in columns.
5bcc476
Minseok Bae
Fixed the leaderboard filtering functionality. Modified filter_models() function in app.py/
1f26f6c
Minseok Bae
modified the evaluation pipelines.
2c24f05
Minseok Bae
Added citations
b46b972
Minseok Bae
Updated about.py
dbcffd4
Minseok Bae
Edited README and added reproducibility functionality in main_backend.py
f0b90cf
Minseok Bae
modified read_evals.py
c3e9147
Minseok Bae
Refine the code style
156ef43
Minseok Bae
Implemented litellm pipeline
2864204
Minseok Bae
Edited README and removed error-rate metric
404587d
Minseok Bae
modified is_model_on_hub()
3b66490
Minseok Bae
changed back to TOKEN
0c85a8e
Minseok Bae
changed to HF_TOKEN
a9a1c18
Minseok Bae
modified check_validity.py and added sample dataset to test functionality
099e4e2
Minseok Bae
Integrated backend pipelines - error occurs during model submission. (Debugging needed).
58b9de9
Minseok Bae
Modified for hallucination evaluation task
d7b7dc6
Minseok Bae
Update README.md
767187a
Update src/display/about.py
0baf5c4
update read
943f952
Clémentine
fixs
314f91a
Clémentine
fix
1257fc3
Clémentine
updated leaderboard
efeee6d
Clémentine
Simplified leaderboard v0
9833cdb
Clémentine
adding pull back
d084b26
Clémentine
simplified some parts of the code + updated requirements
9d22eee
Clémentine
Added check on tokenizer to prevent submissions which won't run
7302987
Clémentine