A finding without a proof-of-concept is a finding nobody trusts. Engineers receive vulnerability reports daily, and the ones that get acted on quickly are the ones that come with a payload they can paste into a request and watch the handler fail. The ones that get deferred are the ones that describe a vulnerability in prose and ask the engineer to imagine what an attacker might do. The asymmetry is not anybody's fault. It is the natural response of a busy team to artefacts that vary in their concreteness.
The hard part of producing proof-of-concept payloads from automated discovery is that the obvious approach, asking a language model to write an exploit for a described vulnerability, produces output that is plausible but rarely correct. The exploit looks like an exploit. It does not, when you actually fire it at the running system, do what the report claimed. This is the same hallucination problem that plagues pure-LLM bug hunters, transposed to the exploit-generation step. The fix is the same. You ground the generation in the engine's evidence and you run a verification pass.
What a grounded payload requires
A proof-of-concept payload is grounded if every claim it makes corresponds to something the engine has already proven. The taint path established that attacker-controlled data from the entry point reaches the sink. The hypothesis stage established the CWE class and the exploit conditions. The payload generation stage takes the conditions and writes concrete inputs that satisfy them.
The claims a grounded payload makes are narrow. It does not claim a particular outcome in a particular environment. It claims that under the exploit conditions the hypothesis described, this input will traverse the path and reach the sink. The verification step checks whether that traversal actually happens. Whether the sink does something dangerous is a separate, downstream question that depends on the runtime environment, the database schema, the OS the service runs on, and so on.
This narrowness is what makes the payload defensible. An exploit that claims "this will produce remote code execution on Linux" is making a claim about runtime behaviour that the pipeline cannot verify in a vacuum. An exploit that claims "this input traverses these five functions and arrives at this sink as the string ';-- DROP TABLE" is making a claim that a test harness can verify mechanically. The first kind of claim is what attackers use to embarrass vendors at conferences. The second kind is what defenders use to write a regression test.
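To see the difference in machine terms: the second kind of claim fits in a record that a harness can check field by field. Here is a minimal sketch in Python; every name in it (`PayloadClaim`, the functions on the path) is invented for illustration, not taken from any real schema.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class PayloadClaim:
    """A narrow, mechanically checkable claim about one payload."""
    entry_point: str           # where the input is supplied
    expected_path: tuple       # functions the input must traverse, in order
    sink: str                  # the function the input must reach
    expected_at_sink: str      # the exact value that should arrive there

    def matches(self, observed_path, observed_value):
        """True iff an observed trace satisfies every part of the claim."""
        return (tuple(observed_path) == self.expected_path
                and observed_value == self.expected_at_sink)

# The verifiable claim from the example above: five named functions,
# one sink, one concrete string arriving at it.
claim = PayloadClaim(
    entry_point="POST /search",
    expected_path=("parse_request", "extract_query", "normalise",
                   "build_filter", "compose_sql"),
    sink="execute_query",
    expected_at_sink="';-- DROP TABLE",
)
```

Every field is either observable in a trace or already proven by the engine, which is exactly what lets a regression test fall out of the claim for free.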
How an engine-plus-Griffin AI pipeline produces them
The pipeline I have come to trust handles payload generation as a fourth stage after the disproof pass. The first three stages have produced a finding that survived disproof. The taint path is real, the exploit conditions have been articulated, and the failed disproof attempt is on record. The fourth stage takes the exploit conditions and asks: what minimum input, supplied at the entry point, would satisfy these conditions?
The model used for this stage has access to the engine's representation of each function on the path. It can read the parameter types, the constraint checks, the partial parsing the function does on its input, and the transformations the data undergoes before it is passed to the next function. From this evidence it constructs an input that, on paper, traverses every constraint along the path.
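The shape of that evidence is worth making concrete. Here is a sketch of what one per-function record might look like, with hypothetical names and a deliberately flattened representation; a real engine carries far more detail, but these are the fields the generation step consumes.

```python
from dataclasses import dataclass

@dataclass
class FunctionEvidence:
    """Engine-derived facts about one function on the taint path."""
    name: str
    param_types: dict        # parameter name -> static type
    constraint_checks: list  # predicates the input must pass to continue
    transformations: list    # how the data is rewritten before the next hop

# Two hypothetical hops. Payload generation reduces to finding an input
# that satisfies every constraint_check, in order, under the composed
# transformations.
path_evidence = [
    FunctionEvidence(
        name="extract_query",
        param_types={"body": "bytes"},
        constraint_checks=["len(body) <= 4096", "body is valid UTF-8"],
        transformations=["body.decode('utf-8')"],
    ),
    FunctionEvidence(
        name="normalise",
        param_types={"q": "str"},
        constraint_checks=["q.strip() != ''"],
        transformations=["q.strip()"],
    ),
]
```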
The constructed input then runs through a verification harness. The harness instruments the path with breakpoints at each function on the route, fires the input at the entry point, and records which functions actually execute and what values they see. If the trace matches the expected path, the payload is verified and attached to the finding. If the trace diverges, the payload is rejected and the pipeline either iterates or downgrades the finding.
This verification step is the part that distinguishes a real proof-of-concept from a plausible one. The pipeline never reports a payload it has not actually traced through the path.
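For intuition, a toy version of that harness fits in a page using Python's built-in tracing hook. This is a sketch of the idea, not the production harness, which instruments a running service rather than in-process calls; the three functions at the bottom are stand-ins for a real route.

```python
import sys

def trace_path(entry_fn, payload, expected_path):
    """Fire payload at entry_fn, recording every function call in order."""
    observed = []

    def tracer(frame, event, arg):
        if event == "call":
            observed.append(frame.f_code.co_name)
        return tracer

    sys.settrace(tracer)
    try:
        entry_fn(payload)
    finally:
        sys.settrace(None)

    # Verified only if the expected hops appear in the trace, in order.
    remaining = iter(observed)
    verified = all(hop in remaining for hop in expected_path)
    return verified, observed

# Toy three-hop path standing in for a real route through the service.
def sink(value): pass
def parse(data): sink(data.strip())
def entry(payload): parse(payload)

ok, trace = trace_path(entry, "';-- DROP TABLE", ["entry", "parse", "sink"])
assert ok, trace
```

If `entry` routed the payload somewhere else, the assertion would fail and the payload would be rejected, which is the iterate-or-downgrade decision in miniature.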
Sandboxing and safety
Generating and running payloads is not risk-free. A poorly designed verification harness could fire the payload at the production environment and trigger the very damage the report is trying to prevent. The discipline that prevents this is to run the harness in a hermetically sealed sandbox. The sandbox has a copy of the application, a copy of the relevant dependencies, and a synthetic database or stub for any side-effecting external service. No real customer data, no real network egress, no production credentials. The path traces happen in the sandbox. The payload is captured, validated against the trace, and attached to the finding.
This is operationally fiddly but conceptually simple. The cost of getting it wrong is high enough that the operational care pays for itself.
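As a sketch of the process-level half of that hygiene, assuming invented script paths and environment variables: the real isolation (no network egress, synthetic database, copied dependencies) belongs to the container or VM this process runs inside.

```python
import subprocess
import sys

def run_harness_sandboxed(harness_script, payload_file, timeout=60):
    """Run the verification harness with no inherited credentials or config."""
    clean_env = {
        "PATH": "/usr/bin:/bin",  # minimal, no user toolchains
        "SANDBOX": "1",           # hypothetical flag the harness checks for
    }
    return subprocess.run(
        [sys.executable, harness_script, payload_file],
        env=clean_env,            # production credentials never leak in
        capture_output=True,      # the trace comes back on stdout
        timeout=timeout,          # runaway payloads get killed
        check=False,
    )
```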
What the report looks like in the engineer's queue
When the engineer opens a finding from this pipeline, the proof-of-concept is right there. They see the path, the CWE class, the exploit conditions, the disproof attempt, and a payload with the verification trace attached. They can copy the payload into their own local sandbox, fire it at the application, and confirm the trace independently. The cycle from "open ticket" to "I believe the bug is real" is sometimes minutes rather than days.
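What that independent confirmation can look like, as a sketch: the file names, endpoint, and bundle layout here are all hypothetical, and the local sandbox is assumed to write out the trace it recorded.

```python
import json
import urllib.request

# Hypothetical artefact bundle attached to the finding.
with open("finding-1234/payload.json") as f:
    artefact = json.load(f)

# Fire the payload at the engineer's own local sandbox copy.
req = urllib.request.Request(
    "http://localhost:8080/search",  # the entry point from the finding
    data=artefact["payload"].encode("utf-8"),
    method="POST",
)
urllib.request.urlopen(req)

# Compare the locally recorded trace with the one attached to the finding.
with open("sandbox/trace.json") as f:
    local_trace = json.load(f)
assert local_trace["path"] == artefact["verified_trace"]["path"]
```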
The downstream effects on remediation are large. An engineer who has run the payload against their own copy of the service and watched it traverse the path is in a different position from one who is reading prose and imagining the attack. They write better fixes, they write regression tests that actually cover the bug, and they close the loop with the security team faster.
What grounded payloads do not do
It is worth being honest about what this kind of payload does not claim. It does not claim full weaponisation. It does not claim a particular impact on a particular runtime. It does not claim that an attacker with a different architecture or a different version of the runtime would see the same behaviour. The pipeline's job ends at the verification trace. Anything beyond that is judgement, and the judgement belongs to the human triager.
This is not a limitation. It is the source of the report's trustworthiness. A payload that claims only what it has verified is a payload that does not have to be hedged in conversation with engineering. The report says exactly what it can defend, and nothing more.
The interaction with disclosure
Attaching a verified payload to a responsible-disclosure report changes the conversation with upstream maintainers. A maintainer who receives a report with a payload they can run in 30 seconds and watch traverse their library is in a fundamentally different position from one who has to construct the payload themselves. The acknowledgement-and-fix timeline shortens. The maintainer's confidence in the reporter goes up. The relationship becomes the kind of cooperative interaction that makes coordinated disclosure work.
The payload, in other words, is not just a triage artefact. It is a disclosure artefact. The same evidence that helps the consuming team verify the finding internally also helps the producing team verify the finding externally.
How Safeguard Helps
Safeguard's engine-plus-Griffin AI pipeline includes a payload-generation and verification stage that operates only on findings that survived the disproof pass. The pipeline produces minimum-traversal inputs for each finding, runs them through a sandboxed verification harness, and attaches the trace to the report. Engineers see the payload, the trace, and the path together, and can reproduce the result locally with no external dependencies. When the time comes to disclose to an upstream maintainer, the same artefact bundle accompanies the disclosure email, so the maintainer can verify the bug as quickly as the consuming team did.