AI Agent2025

PentAGI

Runs penetration tests autonomously: a multi-agent system (researcher, developer, executor) plans attacks, writes and runs exploit code, and chains 20+ tools like nmap, metasploit and sqlmap in isolated Docker containers — for authorized testing only.

Visit Website

Introduction

Most "AI security" tools stop at scanning and flagging — they hand you a list of findings and leave the actual exploitation to a human. PentAGI closes that loop: it treats a pentest as an autonomous agent task, where separate agents reason over findings, write exploit code, run it against the target, and pick the next move from what came back.

What Sets It Apart

Multi-agent division of labor: a Researcher gathers and reasons over intel, a Developer writes exploit and automation code, and an Executor runs the tooling — so each phase is specialized instead of one prompt doing everything.
Memory built for long engagements: vector embeddings in PostgreSQL/pgvector plus a Graphiti and Neo4j knowledge graph let it recall earlier findings and track relationships across a multi-step attack chain rather than losing context mid-run.
Provider-agnostic by design: runs on OpenAI, Anthropic, Gemini, AWS Bedrock, Ollama, and Chinese models (DeepSeek, GLM, Kimi, Qwen), so you are not tied to one LLM's pricing or refusal behavior.

Who It's For

Great fit if you are a security professional or researcher who wants to automate the repetitive recon-to-exploit chain in an authorized lab or engagement and can run Docker. Look elsewhere if you want a point-and-click scanner, cannot supply your own LLM keys, or — most importantly — lack written authorization for the target. This is offensive tooling: pointing it at systems you do not own or are not permitted to test is illegal, and all activity is meant to stay inside its sandboxed Docker environment.

Back

Information

Websitegithub.com
OrganizationsVXControl
AuthorsVXControl (VXControl LLC-FZ), PentAGI Development Team
Published date2025/01/06

More Items

AI Agent2024

STORM: Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking

Yucheng Jiang, Yijia Shao +6Stanford OVAL, Stanford University

Generates Wikipedia-style articles from web search using LLMs: it researches topics, produces multi-perspective outlines via perspective-guided question asking and simulated conversations, then drafts full articles with citations. Supports human-AI collaboration (Co-STORM) and grounding with multiple retrievers.

LLM NLP retrieval RAG ai-agent+2

AI Agent2025

深入理解 AI Agent：设计原理与工程实践

李博杰 (Bojie Li)

Open-source companion to a technical book that teaches how to design, evaluate and ship LLM-based AI agents — includes the full Chinese manuscript, community translations, chapter-aligned runnable example projects, and reproducible evaluation harnesses.

ai-agent book agent-skills prompt-engineering retrieval+10

MCP Server2026

wigolo

KnockOutEZ

Provides a local-first web-intelligence layer for AI agents: search, fetch, crawl, extract, cache, find-similar and agent-style research without API keys or per-query billing, running as an MCP server, REST daemon, or SDK.

mcp-server mcp-client mcp web-search embeddings+6