-
Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey
Paper • 2308.08234 • Published • 1 -
Understanding and Improving Information Transfer in Multi-Task Learning
Paper • 2005.00944 • Published • 1 -
Improving Multi-task Learning via Seeking Task-based Flat Regions
Paper • 2211.13723 • Published • 1 -
Improvable Gap Balancing for Multi-Task Learning
Paper • 2307.15429 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2305.07230
-
Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering
Paper • 2204.04581 • Published • 1 -
Large Language Models (GPT) Struggle to Answer Multiple-Choice Questions about Code
Paper • 2303.08033 • Published • 1 -
CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
Paper • 2305.14869 • Published • 1 -
Multi-hop Commonsense Knowledge Injection Framework for Zero-Shot Commonsense Question Answering
Paper • 2305.05936 • Published • 1
-
CLIN: A Continually Learning Language Agent for Rapid Task Adaptation and Generalization
Paper • 2310.10134 • Published • 1 -
TiC-CLIP: Continual Training of CLIP Models
Paper • 2310.16226 • Published • 8 -
In-Context Pretraining: Language Modeling Beyond Document Boundaries
Paper • 2310.10638 • Published • 29 -
Controlled Decoding from Language Models
Paper • 2310.17022 • Published • 14
-
Ada-Instruct: Adapting Instruction Generators for Complex Reasoning
Paper • 2310.04484 • Published • 5 -
Diversity of Thought Improves Reasoning Abilities of Large Language Models
Paper • 2310.07088 • Published • 5 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 77 -
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
Paper • 2310.13332 • Published • 14
-
Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Paper • 2310.13961 • Published • 4 -
Fabricator: An Open Source Toolkit for Generating Labeled Training Data with Teacher LLMs
Paper • 2309.09582 • Published • 4 -
Auto-Instruct: Automatic Instruction Generation and Ranking for Black-Box Language Models
Paper • 2310.13127 • Published • 11 -
Evaluating the Robustness to Instructions of Large Language Models
Paper • 2308.14306 • Published • 1