LogoAIAny
Icon for item

DeepCode: Open Agentic Coding

DeepCode is an open-source multi-agent coding framework from HKU Data Intelligence Lab that automatically converts research papers and natural language into production-ready code. Key features include Paper2Code, Text2Web, Text2Backend, multi-agent orchestration, CodeRAG retrieval-augmented generation, MCP integration, and automated testing and validation.

Introduction

DeepCode: Open Agentic Coding

DeepCode is an open-source project (HKU Data Intelligence Lab) that provides an end-to-end multi-agent system for translating research papers and plain-language requirements into production-ready code. The framework focuses on three flagship capabilities: Paper2Code (automated implementation of algorithms described in papers), Text2Web (generate frontend web interfaces from text), and Text2Backend (generate backend services and APIs from text). DeepCode couples a central orchestration agent with specialized sub-agents (intent understanding, document parsing, code planning, code generation, reference mining, indexing, and QA) to perform planning, decomposition, code synthesis, testing, and delivery.

Key components and techniques
  • Multi-Agent Orchestration: a central planner dynamically coordinates specialized agents to decompose complex tasks and parallelize work.
  • Document Parsing & Paper2Code: multi-modal paper analysis extracts algorithms, mathematics and implementation details and converts them into implementable specifications.
  • CodeRAG (Retrieval-Augmented Generation): semantic embeddings + dependency analysis to retrieve relevant code patterns and libraries from large corpora, improving correctness and reuse.
  • Memory & Context Management: hierarchical memory and context-compression mechanisms allow the system to maintain long code contexts and cross-module consistency.
  • Quality Assurance Automation: static analysis, automatic unit-test generation, property-based tests and iterative debugging loops to raise implementation reliability.
  • MCP Integration: uses the Model Context Protocol (MCP) to integrate search servers, filesystem, downloader and other tooling for full-stack automation and reproducibility.
Usage & Deployment

DeepCode provides both a CLI and a web UI for interactive use. Typical workflow:

  1. Provide input: upload a PDF (paper), paste a URL, or give a textual requirement.
  2. The orchestrator runs document parsing, plans a reproduction strategy and assigns tasks to code-generation agents.
  3. Code is synthesized, tested, and packaged with documentation and test suites.

Installation options include pip installation (deepcode-hku) or cloning the repository and installing dependencies. Configuration requires API keys for LLM providers and optional search servers (Brave, Bocha-MCP).

Empirical results & claims

The project README reports strong benchmarks on the PaperBench reproduction suite (claims of state-of-the-art performance and improvements over both human experts and other code agents). Whether you accept those results depends on reproducing the evaluation; the repository provides references and an arXiv paper that documents the experiments.

Who should use it
  • Researchers who want faster paper reproduction and algorithm implementation.
  • Engineers needing rapid prototyping from natural language specifications.
  • Teams exploring multi-agent orchestration for large code synthesis tasks.
License & Attribution

DeepCode is released under the MIT License (copyright Data Intelligence Lab, The University of Hong Kong). The README cites an associated arXiv paper: "DeepCode: Open Agentic Coding" (Li et al., 2025).

(Repository metadata: created 2025-05-14; stars: 12,420 as listed in README metadata.)

Information

  • Websitegithub.com
  • AuthorsZongwei Li, Zhonghang Li, Zirui Guo, Xubin Ren, Chao Huang, Data Intelligence Lab, The University of Hong Kong
  • Published date2025/05/14

More Items