blizzard-neel
's Collections
Meta-Learning a Dynamical Language Model
Paper
•
1803.10631
•
Published
TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance
Generation
Paper
•
2003.11963
•
Published
BigScience: A Case Study in the Social Construction of a Multilingual
Large Language Model
Paper
•
2212.04960
•
Published
•
1
Continuous Learning in a Hierarchical Multiscale Neural Network
Paper
•
1805.05758
•
Published
•
1
HuggingFace's Transformers: State-of-the-art Natural Language Processing
Paper
•
1910.03771
•
Published
•
16
Evaluate & Evaluation on the Hub: Better Best Practices for Data and
Model Measurements
Paper
•
2210.01970
•
Published
•
11
TransferTransfo: A Transfer Learning Approach for Neural Network Based
Conversational Agents
Paper
•
1901.08149
•
Published
•
2
Datasets: A Community Library for Natural Language Processing
Paper
•
2109.02846
•
Published
•
11
Large Language Models Can Self-Improve in Long-context Reasoning
Paper
•
2411.08147
•
Published
•
63
Model soups: averaging weights of multiple fine-tuned models improves
accuracy without increasing inference time
Paper
•
2203.05482
•
Published
•
6
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Paper
•
2410.19168
•
Published
•
19
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding
Benchmark
Paper
•
2409.02813
•
Published
•
28
JuStRank: Benchmarking LLM Judges for System Ranking
Paper
•
2412.09569
•
Published
•
19
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained
Evidence within Generation
Paper
•
2412.11919
•
Published
•
33
Are Your LLMs Capable of Stable Reasoning?
Paper
•
2412.13147
•
Published
•
91
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse
Task Synthesis
Paper
•
2412.19723
•
Published
•
69
Large Language Model-Brained GUI Agents: A Survey
Paper
•
2411.18279
•
Published
•
29
Molar: Multimodal LLMs with Collaborative Filtering Alignment for
Enhanced Sequential Recommendation
Paper
•
2412.18176
•
Published
•
15
Token-Budget-Aware LLM Reasoning
Paper
•
2412.18547
•
Published
•
42
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via
Collective Monte Carlo Tree Search
Paper
•
2412.18319
•
Published
•
34
RobustFT: Robust Supervised Fine-tuning for Large Language Models under
Noisy Response
Paper
•
2412.14922
•
Published
•
83
Learning to Reason via Self-Iterative Process Feedback for Small
Language Models
Paper
•
2412.08393
•
Published
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal
Perturbation and Learning Stabilization
Paper
•
2501.01245
•
Published
•
5
Xmodel-2 Technical Report
Paper
•
2412.19638
•
Published
•
23
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper
•
2412.21187
•
Published
•
25