OnePiece123
's Collections
Unlocking Continual Learning Abilities in Language Models
Paper
•
2406.17245
•
Published
•
28
A Closer Look into Mixture-of-Experts in Large Language Models
Paper
•
2406.18219
•
Published
•
15
Symbolic Learning Enables Self-Evolving Agents
Paper
•
2406.18532
•
Published
•
11
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of
LLMs
Paper
•
2406.18629
•
Published
•
41
AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for
Retrieval-Augmented Generation
Paper
•
2406.19251
•
Published
•
8
LiteSearch: Efficacious Tree Search for LLM
Paper
•
2407.00320
•
Published
•
37
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
•
2407.00653
•
Published
•
11
We-Math: Does Your Large Multimodal Model Achieve Human-like
Mathematical Reasoning?
Paper
•
2407.01284
•
Published
•
75
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
•
2407.01489
•
Published
•
42
Planetarium: A Rigorous Benchmark for Translating Text to Structured
Planning Languages
Paper
•
2407.03321
•
Published
•
15
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for
LLM Agents
Paper
•
2407.04363
•
Published
•
27
DotaMath: Decomposition of Thought with Code Assistance and
Self-correction for Mathematical Reasoning
Paper
•
2407.04078
•
Published
•
17
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
•
2407.03502
•
Published
•
51
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large
Language Models -- The Story Goes On
Paper
•
2407.08348
•
Published
•
51
Towards Building Specialized Generalist AI with System 1 and System 2
Fusion
Paper
•
2407.08642
•
Published
•
9
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Paper
•
2407.09435
•
Published
•
21
Paper
•
2407.10671
•
Published
•
160
Sibyl: Simple yet Effective Agent Framework for Complex Real-world
Reasoning
Paper
•
2407.10718
•
Published
•
17