papers - a blizzard-neel Collection

papers

updated 1 day ago

Meta-Learning a Dynamical Language Model
Paper • 1803.10631 • Published Mar 28, 2018

TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation
Paper • 2003.11963 • Published Mar 26, 2020

BigScience: A Case Study in the Social Construction of a Multilingual Large Language Model
Paper • 2212.04960 • Published Dec 9, 2022 • 1

Continuous Learning in a Hierarchical Multiscale Neural Network
Paper • 1805.05758 • Published May 15, 2018 • 1

HuggingFace's Transformers: State-of-the-art Natural Language Processing
Paper • 1910.03771 • Published Oct 9, 2019 • 16

Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
Paper • 2210.01970 • Published Sep 30, 2022 • 11

TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents
Paper • 1901.08149 • Published Jan 23, 2019 • 2

Datasets: A Community Library for Natural Language Processing
Paper • 2109.02846 • Published Sep 7, 2021 • 11

Large Language Models Can Self-Improve in Long-context Reasoning
Paper • 2411.08147 • Published Nov 12, 2024 • 63

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time
Paper • 2203.05482 • Published Mar 10, 2022 • 6

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
Paper • 2410.19168 • Published Oct 24, 2024 • 19

MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper • 2409.02813 • Published Sep 4, 2024 • 28

JuStRank: Benchmarking LLM Judges for System Ranking
Paper • 2412.09569 • Published 24 days ago • 19

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation
Paper • 2412.11919 • Published 21 days ago • 33

Are Your LLMs Capable of Stable Reasoning?
Paper • 2412.13147 • Published 19 days ago • 91

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Paper • 2412.19723 • Published 10 days ago • 69

Large Language Model-Brained GUI Agents: A Survey
Paper • 2411.18279 • Published Nov 27, 2024 • 29

Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation
Paper • 2412.18176 • Published 13 days ago • 15

Token-Budget-Aware LLM Reasoning
Paper • 2412.18547 • Published 12 days ago • 42

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
Paper • 2412.18319 • Published 13 days ago • 34

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
Paper • 2412.14922 • Published 18 days ago • 83

Learning to Reason via Self-Iterative Process Feedback for Small Language Models
Paper • 2412.08393 • Published 26 days ago

SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
Paper • 2501.01245 • Published 4 days ago • 5

Xmodel-2 Technical Report
Paper • 2412.19638 • Published 10 days ago • 23

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper • 2412.21187 • Published 6 days ago • 25