Minerva LLMs Collection The first family of LLMs pretrained from scratch on Italian. • 6 items • Updated Dec 7, 2024 • 32
Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-OASIS Paper • 2411.19655 • Published Nov 29, 2024 • 20
ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering Paper • 2410.05077 • Published Oct 7, 2024 • 2
ZEBRA Collection Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering • 12 items • Updated Dec 4, 2024 • 9
Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In! Paper • 2408.13831 • Published Aug 25, 2024 • 5
MT Sentinel Metrics Collection Machine Translation (MT) metrics designed explicitly to scrutinize the accuracy, robustness, and fairness of the MT meta-evaluation process. • 7 items • Updated Dec 4, 2024 • 6
Maverick Coreference Resolution Collection Efficient and Accurate Coreference Resolution models. • 3 items • Updated Dec 4, 2024 • 9
Word Sense Linking Collection Word Sense Linking is the task of identifying spans of text and disambiguating them to their most suitable senses from a reference inventory. • 6 items • Updated 25 days ago • 6
FENICE Collection FENICE is a metric for summarization factuality, with a focus on interpretability. FENICE leverages NLI and claim extraction to assess factuality. • 4 items • Updated Dec 5, 2024 • 4