Language model (LM) agents are increasingly used to automate complex digital tasks. [5] Just as human developers rely on Integrated Development Environments (IDEs), the paper argues that LM agents need purpose-built interfaces of their own to tackle complex work such as software engineering. [5, 8]
To meet this need, the paper introduces SWE-agent, a system that gives the agent a tailored agent-computer interface (ACI). [2, 3] The interface is designed to improve the agent's ability to perform key software development actions, including:
- Creating and editing code files [2]
- Navigating entire code repositories [2]
- Executing tests and other programs [2]
With this specialized ACI, SWE-agent achieved state-of-the-art performance, with a pass@1 rate of 12.5% on the SWE-bench benchmark and 87.7% on HumanEvalFix, significantly outperforming previous non-interactive models. [2, 5, 9]
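To make the three action types listed above concrete, the following is a minimal sketch of what an agent-facing interface could look like. It is an illustration only: the class `ToyACI`, its method names (`create`, `edit`, `find_file`, `run`), and the feedback strings are hypothetical and are not SWE-agent's actual command set. The sketch assumes each command should return a short text observation that an LM can read on its next turn.

```python
from __future__ import annotations

import subprocess
import sys
import tempfile
from pathlib import Path


class ToyACI:
    """Hypothetical agent-computer interface (not SWE-agent's real commands).

    Every method returns compact feedback meant to be shown to the agent.
    """

    def __init__(self, root: str) -> None:
        self.root = Path(root)

    def create(self, rel_path: str, text: str = "") -> str:
        """Create a file (and any missing parent directories)."""
        path = self.root / rel_path
        path.parent.mkdir(parents=True, exist_ok=True)
        path.write_text(text)
        return f"created {rel_path} ({len(text.splitlines())} lines)"

    def edit(self, rel_path: str, start: int, end: int, replacement: str) -> str:
        """Replace the 1-indexed, inclusive line range start..end."""
        path = self.root / rel_path
        lines = path.read_text().splitlines()
        lines[start - 1:end] = replacement.splitlines()
        path.write_text("\n".join(lines) + "\n")
        return f"edited {rel_path}: lines {start}-{end} replaced"

    def find_file(self, pattern: str) -> list[str]:
        """Navigate the repository: list files matching a glob pattern."""
        return sorted(
            p.relative_to(self.root).as_posix()
            for p in self.root.rglob(pattern)
            if p.is_file()
        )

    def run(self, rel_path: str, timeout: float = 30.0) -> str:
        """Execute a Python script inside the repository and report the result."""
        result = subprocess.run(
            [sys.executable, rel_path],
            cwd=self.root,
            capture_output=True,
            text=True,
            timeout=timeout,
        )
        return f"exit code {result.returncode}\n{result.stdout}{result.stderr}"


def demo() -> tuple[list[str], str]:
    """Create a file with a typo, fix it, locate it, and run it."""
    with tempfile.TemporaryDirectory() as tmp:
        aci = ToyACI(tmp)
        aci.create("pkg/hello.py", "print('helo')\n")
        aci.edit("pkg/hello.py", 1, 1, "print('hello')")
        return aci.find_file("*.py"), aci.run("pkg/hello.py")


if __name__ == "__main__":
    files, output = demo()
    print(files)
    print(output)
```

The design choice the sketch tries to show is that each action is a small command with concise, structured feedback, rather than raw shell access; how closely this mirrors the paper's interface is an assumption.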
