文章

AI渗透测试已实现全链路自动化漏洞发现，成本下降并超越人类——从工具授权向持续AI安全订阅转型

In early 2025 an anonymous document branded "Unpatched AI" published >100 previously unknown Microsoft Access and Office 365 vulnerabilities with full technical proofs, stack traces and exploit chains, indicating an autonomous LLM‑steered vulnerability‑research pipeline that combines fuzzing, symbolic execution and generative narration. The security community recognized the findings as genuine and highly automated, prompting the realization that autonomous systems are beginning to compete—and in some cases outperform—human researchers in offensive security tasks, moving up public bug‑bounty leaderboards and scaling attack‑surface coverage without human guidance. Traditional penetration testing relies on periodic, human‑driven engagements that cannot keep pace with rapid software change; the 2025 Verizon DBIR notes >67 % of breaches involve unpatched flaws >90 days old despite recent assessments. A new class of AI‑native pentesting platforms unbundles expert‑labor constraints by coupling LLMs with exploit tooling, real‑time telemetry and proprietary data, offering fully autonomous agents or copilot‑style assistance that execute exploits in safe sandboxes, verify findings, and generate actionable reports. Early platforms promised automation but suffered shallow coverage, static detection logic, poor cloud‑native support and alert fatigue ("50 000 criticals, zero real"). Current systems excel at low‑hanging issues (XSS, SSRF, misconfigs) and business‑logic flaws that can be inferred from intent, but still lag in complex chained authorizations, race conditions and environment‑specific contexts requiring deep contextual reasoning. Regulatory frameworks (SOC 2, PCI, ISO 27001) require human‑led assessments, creating auditability and liability gaps for fully autonomous tools. The emerging shift is toward continuous, AI‑augmented testing integrated into CI/CD pipelines, blurring the line between testing, pentesting and red‑teaming.

来源信息

发布时间：2025年6月13日

英文原标题：Next-Gen Pentesting: AI Empowers the Good Guys

来源：查看 a16z 原文

核心要点

A pseudonymously released document in early 2025 (Unpatched AI) disclosed >100 previously unknown Microsoft Access and Office 365 vulnerabilities with complete technical proofs and exploit chains.
Analysis of the document indicates an autonomous pipeline that blends modern fuzzing, symbolic execution and generative AI narration, suggesting LLM‑driven vulnerability discovery.
Autonomous security systems are already outperforming human researchers on public bug‑bounty leaderboards and scaling attack‑surface coverage without human direction.
Traditional penetration testing is constrained by periodic, human‑driven engagements that lag behind rapid software delivery; the 2025 Verizon DBIR reports >67 % of breaches involve unpatched flaws >90 days old.
Next‑gen pentesting platforms combine LLMs with exploit tooling, real‑time telemetry and proprietary datasets, offering fully autonomous agents or copilot‑style assistance.

关键判断

2025年初，匿名文档「Unpatched AI」公开披露超过100个此前未知的Microsoft Access和Office 365漏洞，并附完整技术证明、栈追踪、利用链，表明存在自主的LLM驱动的漏洞研究管线。
该管线融合现代fuzzing、符号执行与生成式AI叙述，说明人工智能已能够自动发现、文档化并规模化输出漏洞研究成果。
自主安全系统已在公开漏洞赏金排行榜上超越人类研究员，在无需人工指导的情况下实现攻击面大规模覆盖。
传统渗透测试依赖周期性、人工驱动的评估，难以跟上快速迭代的软件交付；2025 Verizon DBIR显示，超过67%的 breach 涉及的漏洞在超过90天未修补，尽管近期已有安全评估。
新一代AI原生的渗透测试平台将LLM与利用工具、实时遥测和专有数据结合，提供全自主Agent或copilot式辅助，能在安全沙箱中执行利用、验证发现并生成可操作报告。

未来推演

判断：未来 3-9 个月，Agent 产品会更快从能力展示转向审批明确、可回滚、可观测的执行流程。

时间跨度：未来 3-9 个月

为什么是现在：文章对价值判断已经不再停留在对话体验，而是落在流程接入、执行闭环和控制能力上。

重点信号：产品是否增加审批节点、案例是否从演示转向生产流程、用户是否更重视可观测性

置信度：高