20% of vulnerability AI-patches that pass CI breaks production. Here’s why.

The “Successful” Patch That Broke Production This is the first entry in a series examining the structural failures of automated vulnerability remediation. A green CI pipeline is often a false comfort; in our recent benchmarking of leading AI coding assistants, including Claude Code, Gemini 3 Pro, and Codex, we found that patches frequently compile and […]