EAustino
's Collections
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper
•
2401.02038
•
Published
•
62
DocLLM: A layout-aware generative language model for multimodal document
understanding
Paper
•
2401.00908
•
Published
•
181
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper
•
2401.01055
•
Published
•
54
LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
Paper
•
2401.01325
•
Published
•
27
A Comprehensive Study of Knowledge Editing for Large Language Models
Paper
•
2401.01286
•
Published
•
16
Improving Text Embeddings with Large Language Models
Paper
•
2401.00368
•
Published
•
79
Astraios: Parameter-Efficient Instruction Tuning Code Large Language
Models
Paper
•
2401.00788
•
Published
•
21
PanGu-π: Enhancing Language Model Architectures via Nonlinearity
Compensation
Paper
•
2312.17276
•
Published
•
15
Unicron: Economizing Self-Healing LLM Training at Scale
Paper
•
2401.00134
•
Published
•
9
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper
•
2401.01854
•
Published
•
10
TinyLlama: An Open-Source Small Language Model
Paper
•
2401.02385
•
Published
•
90
LLaMA Pro: Progressive LLaMA with Block Expansion
Paper
•
2401.02415
•
Published
•
53
LLM Augmented LLMs: Expanding Capabilities through Composition
Paper
•
2401.02412
•
Published
•
36
ICE-GRT: Instruction Context Enhancement by Generative Reinforcement
based Transformers
Paper
•
2401.02072
•
Published
•
9
DocGraphLM: Documental Graph Language Model for Information Extraction
Paper
•
2401.02823
•
Published
•
35
TrustLLM: Trustworthiness in Large Language Models
Paper
•
2401.05561
•
Published
•
66
Transformers are Multi-State RNNs
Paper
•
2401.06104
•
Published
•
36
TOFU: A Task of Fictitious Unlearning for LLMs
Paper
•
2401.06121
•
Published
•
15
Patchscope: A Unifying Framework for Inspecting Hidden Representations
of Language Models
Paper
•
2401.06102
•
Published
•
20
Secrets of RLHF in Large Language Models Part II: Reward Modeling
Paper
•
2401.06080
•
Published
•
26
Efficient LLM inference solution on Intel GPU
Paper
•
2401.05391
•
Published
•
9
Tuning LLMs with Contrastive Alignment Instructions for Machine
Translation in Unseen, Low-resource Languages
Paper
•
2401.05811
•
Published
•
6
A Shocking Amount of the Web is Machine Translated: Insights from
Multi-Way Parallelism
Paper
•
2401.05749
•
Published
•
7
The Impact of Reasoning Step Length on Large Language Models
Paper
•
2401.04925
•
Published
•
16
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
•
2401.05033
•
Published
•
16