Language Technology Lab @University of Cambridge

university

http://ltl.mml.cam.ac.uk/

cambridgeltl

AI & ML interests

Representation Learning, Multilingual NLP, Multimodal NLP, BioNLP, Self-Supervised Learning, Explainable AI

cambridgeltl's activity

hSterz

updated a dataset 3 months ago

cambridgeltl/DARE

Viewer • Updated Oct 21, 2024 • 8.73k • 60 • 3

yinhongliu

authored 3 papers 5 months ago

Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions

Paper • 2312.12299 • Published Dec 19, 2023

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

Paper • 2403.16950 • Published Mar 25, 2024 • 4

MR-BEN: A Comprehensive Meta-Reasoning Benchmark for Large Language Models

Paper • 2406.13975 • Published Jun 20, 2024

fl399

authored a paper 7 months ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 68

benjamin

authored a paper 7 months ago

Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation

Paper • 2406.16678 • Published Jun 24, 2024 • 16

ivulic

authored a paper 7 months ago

Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation

Paper • 2406.16678 • Published Jun 24, 2024 • 16

benjamin

authored 4 papers 8 months ago

Zero-Shot Tokenizer Transfer

Paper • 2405.07883 • Published May 13, 2024 • 5

Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

Paper • 2305.18893 • Published May 30, 2023 • 2

CompoundPiece: Evaluating and Improving Decompounding Performance of Language Models

Paper • 2305.14214 • Published May 23, 2023

HumSet: Dataset of Multilingual Information Extraction and Classification for Humanitarian Crisis Response

Paper • 2210.04573 • Published Oct 10, 2022

pangpang666

authored a paper 9 months ago

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29, 2024 • 69

fl399

authored a paper 9 months ago

Best Practices and Lessons Learned on Synthetic Data for Language Models

Paper • 2404.07503 • Published Apr 11, 2024 • 30

fl399

authored a paper 10 months ago

Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs

Paper • 2403.12596 • Published Mar 19, 2024 • 10

pangpang666

authored a paper 11 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 137

hzhouml

authored 5 papers 11 months ago

AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning

Paper • 2301.12132 • Published Jan 28, 2023 • 1

Batch Calibration: Rethinking Calibration for In-Context Learning and Prompt Engineering

Paper • 2309.17249 • Published Sep 29, 2023

Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning

Paper • 2310.12774 • Published Oct 19, 2023

Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems

Paper • 2307.14031 • Published Jul 26, 2023

XQA-DST: Multi-Domain and Multi-Lingual Dialogue State Tracking

Paper • 2204.05895 • Published Apr 12, 2022

Team members 17
