Proof-Attempt Based Auditing

From "Look for Bugs" to "Try to Prove"

Conventional LLM-based tools operate on vague instructions:

"Please check whether this code has any bugs"

The result is speculative and weakly grounded. FP rate of 88%.

SPECA demands a structured claim:

"Please prove whether this property holds:
 'authenticate() is always called before sensitive_data()'"

Proof Gap = Finding Candidate

A proof-attempt arrives at one of three conclusions:

Proof Success: the property holds → NO_FINDING
Proof Gap: there is a part that cannot be proven → FINDING candidate
Proof Failure: the property does not hold → CONFIRMED_VULNERABILITY

A proof gap is the core of detection. The concrete gap (code location, condition) is identified:

Claim: "authenticate() is called before sensitive_data()"

Gap at line 85 in error_handler():
  if (!cache_hit) {
    sensitive_data();  // <-- authenticate() not called
  }

Suppressing Hallucination

A structured claim suppresses speculation through:

Commitment: the model clearly judges "this code satisfies / does not satisfy property X"
Gap articulation: if it is an FP, it must explain a concrete gap
3-gate filter: among proof gaps, those outside the trust boundary / scope are judged as FPs

Implementation in Phase 03

The audit follows a fixed Map → Prove → Stress-Test flow. The branch points correspond directly to the verdicts written in 03_PARTIAL_*.json.

Proof-attempt flow

Map — locate the code that is responsible for enforcing the property (uses the pre-resolution from Phase 02c when available).
Prove — try to construct a proof that the property holds across all execution paths. Sub-claims are explicit.
Proof holds → Pass (no finding).
Gaps remain → run Stress-Test to look for a concrete counterexample.
- Attack plausible → Vulnerability (proof actively fails).
- No counterexample yet → Potential (proof gap survives but no exploit constructed).

The Phase 03 verdicts feed directly into the 3-gate review in Phase 04. See Pipeline — Phase 03 for the JSON shape and prompt-level details.

From "Look for Bugs" to "Try to Prove"​

Proof Gap = Finding Candidate​

Suppressing Hallucination​

Implementation in Phase 03​

From "Look for Bugs" to "Try to Prove"

Proof Gap = Finding Candidate

Suppressing Hallucination

Implementation in Phase 03