Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions Paper • 2312.12299 • Published Dec 19, 2023
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators Paper • 2403.16950 • Published Mar 25, 2024 • 4
MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models Paper • 2406.13975 • Published Jun 20, 2024
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation Paper • 2406.16678 • Published Jun 24, 2024 • 16
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation Paper • 2406.16678 • Published Jun 24, 2024 • 16
Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation Paper • 2305.18893 • Published May 30, 2023 • 2
CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models Paper • 2305.14214 • Published May 23, 2023
HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response Paper • 2210.04573 • Published Oct 10, 2022
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29, 2024 • 69
Best Practices and Lessons Learned on Synthetic Data for Language Models Paper • 2404.07503 • Published Apr 11, 2024 • 30
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs Paper • 2403.12596 • Published Mar 19, 2024 • 10
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning Paper • 2301.12132 • Published Jan 28, 2023 • 1
Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering Paper • 2309.17249 • Published Sep 29, 2023
Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning Paper • 2310.12774 • Published Oct 19, 2023
Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems Paper • 2307.14031 • Published Jul 26, 2023
XQA-DST: Multi-Domain and Multi-Lingual Dialogue State Tracking Paper • 2204.05895 • Published Apr 12, 2022