CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks Paper • 2507.05269 • Published Jul 3, 2025 • 1 • 1
TENET: Leveraging Tests Beyond Validation for Code Generation Paper • 2509.24148 • Published Sep 29, 2025 • 3 • 2