Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance Paper • 2410.18889 • Published Oct 24, 2024 • 15
GLEE: A Unified Framework and Benchmark for Language-based Economic Environments Paper • 2410.05254 • Published Oct 7, 2024 • 81
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs Paper • 2407.19200 • Published Jul 27, 2024 • 1