BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks Jun 18, 2024 • 43
StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation Apr 29, 2024 • 76
Evaluating Language Models for Efficient Code Generation Paper • 2408.06450 • Published Aug 12, 2024 • 1