Vibe Coding: The Security Risks of AI-Generated Code

The Night Autocomplete Shipped Risk

The time was 2:17 a.m. in a payments company’s war room. The deploy was supposed to be boring, a simple quality-of-life feature, mostly drafted by the team’s new AI assistant. The canary deployment had been green for seven minutes when the alerts began to braid together. CPU usage hit 92%. Outbound traffic spiked in patterns the fraud engine didn’t recognize.

The on-call lead, Priya, pinched the bridge of her nose and watched a number climb on the wallboard: estimated exposure, $6,500 per minute. The CFO had set that calculation months ago to capture chargeback risk when the fraud system throttles. No one liked seeing it animated in real time.

“Roll back?” someone asked.

“Wait,” said the feature’s author, still certain the change was harmless. “It’s just a serializer and a tiny utility class. Literally what the model suggested.”

“Which serializer?” Priya asked.

“Pickle,” he said, as if naming a sandwich.

The room went quiet. A security engineer’s typing grew frantic. Untrusted deserialization, exactly the kind of vulnerability that can turn routine requests into remote-code-execution exploits if reachable from the network edge. As transactions slowed, the fraud engine softened its rules to avoid false positives, just as it was designed to do. In the corner of the screen, a second number glowed with the dull neutrality of statistics: Average breach cost: $4.88M; financial services ≈ $6.08M. That banner had been added after the last tabletop exercise, a stark reminder from IBM’s breach cost report to keep hands off the hot stove.

Priya didn’t love drama. She loved control – clear interfaces, explicit contracts, and loud failures. This change had shipped without unit constraints or a threat model. It had vibes. It felt right in the IDE. The AI’s snippet looked idiomatic, the tests were “happy path” green, and it even generated docstrings.

“Roll back,” she said, already paging the fraud analyst. “And pull the diffs. Every single line the assistant touched goes through a threat-model review before dawn. We treat it like a hostile dependency.”

As the rollback propagated, the risk counter decayed in $6,500 steps. The room exhaled. But a question lingered in Priya’s mind: had they accidentally made vibe their operating model?

Why “Feels Right” Keeps Failing

The antagonist here isn’t a person. It’s a pattern: vibe coding. When a team outsources its judgment to an autocomplete that “looks right,” unfamiliar code and its transitive risks slides into production wrapped in confident syntax and friendly comments. As Lawfare notes, the sheer ease of AI-generated code encourages developers to accept its output without deep review or manual testing, especially under the pressure of a deadline.

This leads to three recurring vignettes now playing out in engineering departments everywhere:

Unsafe deserialization as a shortcut. An AI assistant suggests using a permissive object serializer like Python’s pickle for convenience, and the team copies it into a network-reachable path. Databricks’ red team highlights this exact risk class, where unchecked input paired with an insecure deserialization pattern creates a foothold for Remote Code Execution (RCE).
Hallucinated dependencies → slopsquatting. The AI model “helpfully” invents a plausible-sounding package name. The package doesn’t exist, until an attacker publishes a malicious one under that name, capturing builds or local developer runs. Trend Micro documents how these AI hallucinations can be weaponized and how simple provenance checks can prevent them.
The “looks secure” fallacy. Teams often equate idiomatic, clean-looking code with safety. Veracode’s 2025 study is a cold shower: across various models and tasks, a staggering 45% of AI-generated solutions included a known security flaw. Worse, newer, larger models didn’t reliably reduce this risk.

When success is measured by “PRs merged per day,” these patterns don’t feel reckless. They feel productive until 2 a.m.

Downtime, Breach, and Decision Debt

Incidents are expensive in the moment and corrosive over time. Industry estimates from TechTarget peg the average cost of IT downtime at around $9,000 per minute, and that doesn’t even account for reputational damage or regulatory scrutiny. Stack a weekend of hot-patches on top, and you’re into six figures before breakfast. Combine that with the economics of a data breach, a $4.88M global average, climbing to $6.08M in finance according to IBM, and the math turns ugly fast.

But the more dangerous cost is the quiet one: decision debt. Every vibey shortcut, the untyped interface, the implicit units, the permissive parser, the mystery dependency pushes risk left into design and right into operations. Reliability becomes a hope, not a plan. The change failure rate creeps up. The time-to-root-cause stretches from minutes to hours. Firefighting crowds out learning and innovation.

Executives see it as margin attrition and schedule slips. Engineers feel it as whiplash.

Leash the Model, Lead the System

After the rollback, Priya didn’t ban AI. She leashed it. The team implemented a new operating rhythm focused on control and evidence.

First, they sketched the model of their service before touching any code. They defined external inputs, trust boundaries, serialization formats, and kill-switches. They mapped exactly where data entered, transformed, and left, and which parts of that surface an AI assistant was allowed to touch. If a diff affected I/O, auth, crypto, or deserialization, it automatically triggered a threat-model checkpoint. No exceptions.

Second, they codified guardrails. Unsafe patterns, like pickling untrusted input, were set to fail the build automatically. Any new dependency had to pass provenance checks against a strict allow-list. OWASP’s guidance on GenAI framed their taxonomy, helping them block common vulnerabilities like prompt injection and insecure output handling with policy-as-code.

Third, they built telemetry that pays: a handful of critical signals keyed to the failure modes that actually cause 2 a.m. phone calls. These included mechanisms that screamed when the system attempted an unsafe deserialization or when an unrecognized package appeared in the build graph.

Finally, they aligned with external rigor. The team adopted practices consistent with NIST’s profile for secure software development with GenAI, ensuring that gating, traceability, and human oversight remained central to their process.

The results were immediate.

Before → After (One Sprint)

Critical change-failures in canary: 3 → 1 (two were blocked pre-merge by the new guardrails).
Time-to-root-cause for hot issues: ~90 min → ~35 min (signals now pointed directly to the misuse).
Unreviewed AI-touched I/O diffs: 7 → 0 (blocked by policy gates and threat-model reviews).

How to Win Without the War Room

You can implement the same changes. Here are four moves that directly improve key engineering metrics.

Model the Interface. Draw your system’s boundary. List trusted vs. untrusted inputs and declare allowed formats (e.g., JSON with schema validation). Forbid dangerous formats like pickle across trust boundaries.
- KPI moved: Change failure rate ↓; Time-to-root-cause ↓.
- Micro-example: A single deserialize_attempts_total{format="pickle"} counter caught a dangerous code path within hours of its merge.
Constrain Generation. Treat AI suggestions as drafts under policy. Require allow-listed libraries, forbid unsafe APIs with linters, and mandate unit tests for any AI-generated diff touching I/O or auth.
- KPI moved: Iteration velocity ↑ (less rework); Defects escaping review ↓.
- Micro-example: A pre-commit hook rejected a reflection-based JSON mapper; the AI model immediately suggested a safer, schema-checked alternative.
Demand Provenance. Enforce Software Bill of Materials (SBOM) checks and domain-specific allow-lists to defeat hallucinated dependencies and slopsquatting. If a package isn’t on the list, the build fails.
- KPI moved: Supply-chain incidents ↓; Hot-patch frequency ↓.
- Micro-example: A fake package name was flagged because it wasn’t in the allow-list. The build failed early, preventing a weekend firefight.
Threat-Model the Diff, Not the App. Every change affecting a risky area gets a 15-minute mini-review focused on entry points, attacker goals, misuse cases, and kill-switches.
- KPI moved: Time-to-decision ↓; Severity of post-merge incidents ↓.
- Micro-example: The team added a runtime feature flag to disable a new parser. The rollback became a simple toggle, not a war room crisis.

Five Calls That Pay for Themselves

Ready to move from firefighting to flow? Take these five steps this week.

Ban permissive deserialization at trust boundaries. Make it a hard CI fail for pickling or Java object deserialization unless it’s sandboxed and explicitly justified.
Require provenance for all code and packages. Use an SBOM, an allow-list, and a domain-owned namespace. If a new package name appears, fail the build and investigate. Assume it’s slopsquatting until proven otherwise.
Threat-model AI-touched diffs that touch I/O, auth, or crypto. A ten-minute review asking four questions: What enters? What could be injected? How do we kill it at runtime? What telemetry proves it’s behaving?
Set executive-visible stakes. Put one number on the wall. your per-minute downtime estimate, and define a threshold where you pull the plug. Don’t argue vibes against dollars at 2 a.m.
Adopt an external standard. Map your controls to a framework like NIST SP 800-218A or the OWASP LLM Top 10. Use them to audit your guardrails and keep humans in charge.

From Our Last Chapter

In Part I, we saw how worshipping speed taxes reliability. In Part II, we explored how reusing code outside its design envelope makes good components dangerous. In this chapter, we learned that AI can amplify both mistakes, unless human judgment stays in the loop.

Next time in Part IV: We follow the money and the culture. How do leadership rhythms reward vibe and compliance theater, versus demanding evidence, learning, and accountable change? The tools won’t save you; your habits will.

References

[1] Databricks, “Passing the Security Vibe Check: The Dangers of Vibe Coding,” Aug 12, 2025 — Red-team findings on insecure deserialization and prompting strategies; concrete mitigations. Databricks
[2] Lawfare, “When the Vibes Are Off: The Security Risks of AI-Generated Code,” Sep 10, 2025 — Policy and security framing of vibe coding risks and governance gaps. Default
[3] Veracode, “2025 GenAI Code Security Report,” Jul 2025 — Large-scale study: 45% of AI-generated code samples contained known vulnerabilities; security flat across model sizes. Veracod
[4] Trend Micro, “Slopsquatting: When AI Agents Hallucinate Malicious Packages,” Jun 5, 2025 — How hallucinated dependencies become compromise paths; validation strategies. www.trendmicro.com
[5] OWASP, “Top 10 for LLM Applications / GenAI Project,” 2025 — Taxonomy and mitigations: prompt injection, insecure output handling, supply-chain vulnerabilities. OWASP
[6] NIST SP 800-218A, “Secure Software Development Practices for Generative AI & Dual-Use Models,” Jul 2024 — SSDF-aligned controls that keep humans in the loop. NIST Publications
[7] IBM, “Cost of a Data Breach 2024,” Jul 2024 — Global average $4.88M; financial services ≈ $6.08M. IBM
[8] TechTarget (citing Oxford Economics), “The Cost of Downtime,” Aug 8, 2025 — $9,000 per minute average downtime estimate. TechTarget