Stream of Search (SoS): Learning to Search in Language Paper • 2404.03683 • Published Apr 1, 2024 • 29 • 1
Streaming Diffusion Policy: Fast Policy Synthesis with Variable Noise Diffusion Models Paper • 2406.04806 • Published Jun 7, 2024 • 1 • 1
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance Paper • 2405.06682 • Published May 5, 2024 • 3 • 1
Probabilistic Programming with Programmable Variational Inference Paper • 2406.15742 • Published Jun 22, 2024 • 2 • 1
Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows Paper • 2406.16218 • Published Jun 23, 2024 • 2 • 1
TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON Paper • 2407.15734 • Published Jul 22, 2024 • 1 • 1
Grokfast: Accelerated Grokking by Amplifying Slow Gradients Paper • 2405.20233 • Published May 30, 2024 • 6 • 1
HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction Paper • 2401.17948 • Published Jan 31, 2024 • 2 • 1
Extreme Compression of Large Language Models via Additive Quantization Paper • 2401.06118 • Published Jan 11, 2024 • 12 • 1
Spectrum: Targeted Training on Signal to Noise Ratio Paper • 2406.06623 • Published Jun 7, 2024 • 12 • 2
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20, 2024 • 34 • 3