Back to stories
Tools

OpenAI Launches Codex Security, an AI Agent That Finds and Fixes Code Vulnerabilities

Michael Ouroumis2 min read
OpenAI Launches Codex Security, an AI Agent That Finds and Fixes Code Vulnerabilities

OpenAI has entered the application security market with Codex Security, an autonomous AI agent designed to discover, validate, and patch code vulnerabilities that traditional scanning tools miss. The tool launched on March 6 in research preview for enterprise customers.

How Codex Security Works

Unlike conventional static analysis tools that rely on pattern matching, Codex Security takes an agentic approach to security auditing. The system first builds a deep, project-specific threat model by analyzing the codebase's architecture, dependencies, and data flows. It then uses OpenAI's frontier reasoning models to search for complex vulnerabilities — including logic flaws and multi-step attack chains — that rule-based scanners typically overlook.

Critically, Codex Security validates its findings by attempting to exploit discovered vulnerabilities in sandboxed environments before reporting them. This dramatically reduces the false positive noise that has long plagued automated security tools.

Impressive Early Results

The numbers from beta testing tell a compelling story. False positive rates on detections fell by more than 50% across all tested repositories, while over-reported severity findings — a persistent pain point for security teams drowning in alerts — dropped by more than 90%.

In the 30 days leading up to launch, the agent scanned over 1.2 million commits, identifying 792 critical findings and 10,561 high-severity issues. Perhaps most notably, Codex Security discovered and helped report 14 CVEs across major open-source projects including OpenSSH, GnuTLS, PHP, and Chromium.

Availability and Pricing

Codex Security is available now to ChatGPT Enterprise, Business, and Edu customers. OpenAI is offering free usage for the first month, after which it will be bundled into existing enterprise pricing tiers. The research preview designation means the product is still being refined based on real-world feedback.

Market Implications

The launch positions OpenAI as a direct competitor to established application security vendors like Snyk, Veracode, and Checkmarx. Cybersecurity stocks showed mixed reactions following the announcement, with some incumbents dipping while the broader security sector remained stable.

For development teams, the promise is significant: a security tool that understands code intent rather than just code patterns, integrated directly into the workflows they already use. If Codex Security delivers on its beta results at scale, it could reshape how organizations approach application security — shifting from periodic audits and noisy scanners to continuous, context-aware vulnerability management powered by AI.

Learn AI for Free — FreeAcademy.ai

Take "Prompt Engineering Practice" — a free course with certificate to master the skills behind this story.

More in Tools

Anthropic Launches Claude Design, Turning Text Prompts Into Slides, Prototypes and One-Pagers
Tools

Anthropic Launches Claude Design, Turning Text Prompts Into Slides, Prototypes and One-Pagers

Anthropic introduced Claude Design on April 17, 2026, a research preview that converts text descriptions into shareable visuals like prototypes, slides and one-pagers using Claude Opus 4.7.

12 hours ago2 min read
Google Brings AI Mode Side-by-Side With Web Pages in Chrome
Tools

Google Brings AI Mode Side-by-Side With Web Pages in Chrome

Google's Chrome desktop now keeps AI Mode open alongside web pages, lets users query across multiple tabs and PDFs at once, and surfaces image and Canvas tools through a new plus menu.

1 day ago2 min read
Canva AI 2.0 Launches as Agentic Design Platform With Proprietary Proteus, Lucid Origin, and I2V Models
Tools

Canva AI 2.0 Launches as Agentic Design Platform With Proprietary Proteus, Lucid Origin, and I2V Models

Canva unveiled Canva AI 2.0 on April 16 as a research preview, recasting its design tool as an agentic workspace powered by three proprietary models the company claims are up to 7x faster and 30x cheaper than frontier alternatives.

1 day ago2 min read