leonardlin's Collections
TOREAD
A Survey on Data Selection for Language Models
Paper
•
2402.16827
•
Published
•
4
Instruction Tuning with Human Curriculum
Paper
•
2310.09518
•
Published
•
3
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
Paper
•
2312.05934
•
Published
•
1
Language Models as Agent Models
Paper
•
2212.01681
•
Published
Beyond Language Models: Byte Models are Digital World Simulators
Paper
•
2402.19155
•
Published
•
49
StarCoder 2 and The Stack v2: The Next Generation
Paper
•
2402.19173
•
Published
•
136
Polaris: A Safety-focused LLM Constellation Architecture for Healthcare
Paper
•
2403.13313
•
Published
•
2
Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
Paper
•
2401.02731
•
Published
•
2
On the Measure of Intelligence
Paper
•
1911.01547
•
Published
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?
Paper
•
2405.05904
•
Published
•
6
A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers
Paper
•
2405.10936
•
Published
•
1
Human-like Episodic Memory for Infinite Context LLMs
Paper
•
2407.09450
•
Published
•
60
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Paper
•
2407.09435
•
Published
•
21
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines
Paper
•
2408.01050
•
Published
•
8
OpenResearcher: Unleashing AI for Accelerated Scientific Research
Paper
•
2408.06941
•
Published
•
30
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper
•
2408.06292
•
Published
•
117
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
Paper
•
2408.02085
•
Published
•
17
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper
•
2408.03314
•
Published
•
53