arXiv:2606.07412
Socratic-SWE: Self-Evolving Coding Agents via Trace-Derived Agent Skills
Authors pending
Abstract
Socratic-SWE is a closed-loop self-evolution method: it distills solving traces into structured skills, then uses those skills to generate targeted repair tasks. It achieves 50.40% on SWE-bench Verified after three iterations.
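The abstract gives only the outline of the loop (solve, distill traces into skills, generate targeted repair tasks, repeat), so the following is a toy sketch of that control flow, not the paper's implementation. Every name here (`Trace`, `Skill`, `toy_solve`, `distill`, `repair_tasks`, `evolve`) is hypothetical, and the "solver" is a stand-in that succeeds only when a stored skill targets the task.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Trace:
    """Hypothetical record of one solving attempt."""
    task: str
    solved: bool
    note: str


@dataclass(frozen=True)
class Skill:
    """Hypothetical structured skill distilled from a trace."""
    target: str
    guidance: str


def toy_solve(task, skills):
    # Stand-in for a real coding agent: the task counts as solved
    # only if some stored skill targets it.
    solved = task in {s.target for s in skills}
    note = "applied stored skill" if solved else "no applicable skill"
    return Trace(task, solved, note)


def distill(traces):
    # Assumption: one skill is distilled per failed trace.
    return [Skill(t.task, f"revisit: {t.note}") for t in traces if not t.solved]


def repair_tasks(skills):
    # Assumption: each new skill yields one repair task aimed at its target.
    return [s.target for s in skills]


def evolve(tasks, rounds=3):
    skills, history = [], []
    queue = list(tasks)
    for _ in range(rounds):
        traces = [toy_solve(t, skills) for t in queue]
        # Fraction of the current queue solved in this round.
        history.append(sum(t.solved for t in traces) / len(traces) if traces else 0.0)
        new_skills = distill(traces)
        skills.extend(new_skills)
        # Next round works on the repair tasks; fall back to the full
        # task list when nothing failed.
        queue = repair_tasks(new_skills) or list(tasks)
    return skills, history
```

Calling `evolve(["a", "b"])` runs three rounds on two placeholder tasks. The per-round solve rates it returns describe only this toy solver and say nothing about the reported SWE-bench Verified score.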
Tasks
Results
No benchmark results recorded yet.
Benchmark results referencing this paper haven't been added to the registry yet.
CodeSOTA extraction
Benchmark evidence
- Socratic-SWE: exact SWE-bench Verified score per iteration (abstract reports 50.40% after 3 rounds)