Cross-Entropy Loss Functions: Theoretical Analysis and Applications Paper β’ 2304.07288 β’ Published Apr 14, 2023 β’ 1
Training Language Models on Synthetic Edit Sequences Improves Code Synthesis Paper β’ 2410.02749 β’ Published Oct 3, 2024 β’ 12
Batch Prompting: Efficient Inference with Large Language Model APIs Paper β’ 2301.08721 β’ Published Jan 19, 2023 β’ 1
An Explanation of In-context Learning as Implicit Bayesian Inference Paper β’ 2111.02080 β’ Published Nov 3, 2021 β’ 1
Explaining NonLinear Classification Decisions with Deep Taylor Decomposition Paper β’ 1512.02479 β’ Published Dec 8, 2015 β’ 1
From Instructions to Constraints: Language Model Alignment with Automatic Constraint Verification Paper β’ 2403.06326 β’ Published Mar 10, 2024 β’ 1
Is Prompt All You Need? No. A Comprehensive and Broader View of Instruction Learning Paper β’ 2303.10475 β’ Published Mar 18, 2023 β’ 2
Eliciting Instruction-tuned Code Language Models' Capabilities to Utilize Auxiliary Function for Code Generation Paper β’ 2409.13928 β’ Published Sep 20, 2024 β’ 1
WebApp1K: A Practical Code-Generation Benchmark for Web App Development Paper β’ 2408.00019 β’ Published Jul 30, 2024 β’ 1
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning Paper β’ 2312.01552 β’ Published Dec 4, 2023 β’ 30
Instruction Following without Instruction Tuning Paper β’ 2409.14254 β’ Published Sep 21, 2024 β’ 27
A Case Study of Web App Coding with OpenAI Reasoning Models Paper β’ 2409.13773 β’ Published Sep 19, 2024 β’ 5
The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning Paper β’ 2304.05366 β’ Published Apr 11, 2023 β’ 1