2026-04-21 · Pentestas Features

Payment apps ship 50 times a quarter. Your annual pentest covers 1 of those snapshots. Here's how continuous AI pentest as a service closes the 49-scan gap.

PCI DSS 4.0's seven pentest requirements, each mapped to a continuous Pentestas workflow.

Continuous ai penetration testing — specifically, a continuous pentest as a service programme run with Pentestas — closes the gap without slowing down your ship cadence. Here's how the pieces map.

💰

In Detail

Mapping PCI DSS 4.0 to Pentestas

11.4.1 — annual pentest methodology

PCI expects you to have a documented pentest methodology. You probably already do; Pentestas doesn't replace the document, it extends it. The methodology section updates from "annual engagement by Consultancy X" to "annual engagement by Consultancy X + continuous AI pentest via Pentestas for the remaining 364 days". Both reviewers we've talked to (a QSA and an internal-audit director at a top-20 EU acquirer) accept this framing.

11.4.2 / 11.4.3 — internal + external pentest

Pentestas supports both through the same pipeline. External pentest = ai pentest run from Pentestas cloud. Internal pentest = the same ai pentest dispatched through a Linux or Windows agent deployed inside your network. One scan history, two execution modes, satisfies both requirements.

11.4.4 — pentest after significant change

The trigger word is "significant change" — which your organisation defines. Most mature fintech programmes define it as "any PR that touches authentication, authorization, payment processing, or the CDE boundary". A simple CI hook satisfies the requirement:

- name: Pentestas — trigger when CDE files change
  if: ${{ contains(github.event.head_commit.message, '[cde]') ||
         contains(steps.changed-files.outputs.all_changed_files, 'src/payments/') }}
  run: pentestas start -u ${{ env.STAGING_URL }} -c .pentestas/scan.yaml -w 1h

Scan history auto-generates your change-triggered pentest evidence. Map the scan ID to the change ticket in your CAB; the auditor trail is complete.

11.4.5 — segmentation check

Run Pentestas agents on both sides of every segmentation boundary. Dispatch a network-scan job from Agent A (inside CDE) to targets on the other side; expected result is "no reachable ports". The scan output is your segmentation evidence.

11.4.6 — pentest after segmentation change

Same wiring as 11.4.4 but triggered by infrastructure-as-code file changes (Terraform plans that touch firewall or subnet config).

11.4.7 — pentest after new system introduction

When you stand up a new system, add it as a verified domain in Pentestas, run an authenticated ai pentest against it, file the report, mark the system as added to the continuous cadence going forward. Five-minute onboarding.

💵

Fintech

The pentest that catches real fintech bugs

The OWASP Top 10 catches some things. Fintech-specific bug classes often live outside the Top 10 but inside what Pentestas's five specialist agents catch:

IDOR on account endpoints

GET /api/accounts/{id} returning another user's balance because the server trusts id from the path rather than deriving it from the session. Pentestas's Authz specialist probes every endpoint with an object ID by submitting adjacent, random, and off-by-one values, comparing response bodies for other-user content. Confirmed IDORs on balance or transaction endpoints are automatically tagged CRITICAL.

Mass assignment in user update

PATCH /api/users/me accepting { "email": "...", "role": "admin" } because the server uses the whole body rather than an allow-list. Pentestas probes every mutation endpoint with role-shaped fields (is_admin, role, tenant_id, verified) and watches for silent acceptance.

JWT weakness in auth flow

JWTs signed with alg: none, with a weak symmetric secret, or with an RS-HS algorithm-confusion bug. The Auth specialist decodes header + payload in its reasoning and proposes specific forging attacks against the observed shape.

Missing rate limit on transfer endpoints

The dream finding for an attacker: a /api/transfers endpoint with no server-side rate limit. The API specialist measures burst behaviour (5×, 10×, 50× probe trains) and flags any endpoint that doesn't throttle payments, login, or reset flows.

Race condition in balance check + transfer

Classic double-spend: balance check and debit happen in separate transactions. Pro+ plans include a race-condition tester module that fires concurrent requests and measures balance-state delta. Rare-but-expensive bug; finding it once pays for the subscription for a decade.

IDOR combined with webhook-secret exposure

Chain: IDOR on /api/webhooks/{id} exposes another tenant's webhook secret → attacker forges signed webhooks to that tenant → malicious state updates. Individually two HIGHs; chained, a CRITICAL tenant-compromise. Pentestas's rule-based chain synthesiser catches this exact pattern.

📈

In Detail

TOTP + 2FA handling

Every fintech tester's worst bottleneck: "the admin account has 2FA so I couldn't log in". Pentestas solves this with TOTP support in the YAML scan config:

authentication:
  login_type: form
  login_url: "https://admin.fintech.com/login"
  credentials:
    username: "audit@fintech.com"
    password: "***"
    totp_secret: "LB2E2RX7XFHSTGCK"
  login_flow:
    - "Type $username in email"
    - "Type $password in password"
    - "Click Sign In"
    - "Type $totp in 2FA code field"
    - "Click Verify"

$totp expands to a live RFC 6238 code each login. Long scans with mid-scan re-auths handle it automatically. Your admin surface gets pentested without a human in the loop.

🔍

Scanning

Agent-based CDE scanning

The CDE (cardholder-data environment) typically has the strictest firewall policy in the org. Pentestas cloud cannot reach into it by design. The Linux or Windows .NET agent solves this: deploy inside the CDE segment, authenticate with a tenant-scoped key, scan internal targets from the inside.

CDE-internal admin panels (no external interface) → now pentested.
CDE-internal APIs used by payment processors → now pentested.
Segmentation boundaries (CDE ↔ non-CDE) → scan-verified on demand.
CDE host posture (open ports, exposed banners, CVE matches on internal services) → continuous network scan.

The agent only opens outbound WebSocket. No inbound ports. The CDE firewall stays untouched.

⚠️

In Detail

The CIS complement

Payment platforms almost always run M365 for internal email / collaboration. The CIS Microsoft 365 Foundations Benchmark — which Pentestas ships as a one-click scan (see CIS M365 benchmark) — validates M365 controls that PCI inherits from your internal comms (external forwarding blocked, DKIM, audit logging). Run it weekly alongside your ai pentest.

📊

In Detail

Evidence for the QSA

Every scan produces:

PDF report — cover page with tenant branding, executive summary, full finding list with proof-of-exploit.
JSON export — machine-readable; ingest into your GRC.
Scan history — retained 3 years (Pro) or unlimited (Enterprise). PCI-aligned.
Attack chain mindmap — renders cleanly in the PDF.
Remediation tracking — per-finding status (active / resolved / false-positive-after-review) with timestamps.

QSAs we've walked through Pentestas output ask two questions: "can I see the scan-ID-to-commit-SHA mapping?" (yes, via the scan's config.commit_sha), and "can I export every scan from Q3?" (yes, via GET /api/scans?since=2026-07-01). Both answer yes. The QSA workflow becomes a one-evening export + upload rather than a week of forensic scan-history archaeology.

🔍

The Breakdown

What it costs

Typical mid-size fintech programme:

**Pro plan*— $1K–$2K/month. Covers 200 scans/month, 5 concurrent, 3 agents, AI analysis on, custom branding.
**Anthropic costs*(BYOK) — $100–$500/month for a medium-complexity programme running 50–100 scans/month with white-box mode on.
**Engineer time*— ~2 hours/week triaging findings. Declining as Accuracy Gate learns the environment.

Total: $1.5K–$3K/month for a continuous pentest as a service programme that covers PCI 11.4's seven sub-requirements + internal agents + M365 baseline. Compare to a single consultancy engagement ($15K–$40K for a week of testing) and the economics favour continuous by ~50×.

🛡️

In Detail

The migration path

If you're currently running a traditional pentest programme:

Month 1. Wire Pentestas against staging. Get baseline findings. Triage. Fix the CRITICAL / HIGH deltas in week 2.
Month 2. Add the continuous CI hook (pentestas start on every merge). Enable Accuracy Gate + AI false-positive filter.
Month 3. Deploy the first internal agent. Scan the admin panel. Set up weekly scheduled scan of the full CDE scope.
Month 4. Brief your QSA on the new methodology. Walk them through a sample scan report. Get sign-off on the evidence model.
Quarter 2 onward. Continue continuous coverage. Schedule the annual human-led engagement to focus on business-logic and chained exploits the AI pentest misses.

No drop-in replacement is required for the annual engagement; the model is additive.

💬

Tough Questions

Objections you'll hear

"Our QSA won't accept this." Walk the QSA through a sample report. They accept evidence; the format matters less than the content. Pentestas's evidence format lines up with every PCI-adjacent review we've been through.

"It'll generate noise." It won't — the Accuracy Gate + "no exploit, no report" policy keeps FP rate under 10%. Your triage burden drops compared to any open-source scanner.

"Our scanners already do this." They don't do attack-chain synthesis, IDOR probing with response-body comparison, or white-box code-aware mode. Check the findings delta in month one — the findings Pentestas reports that your existing scanner doesn't are the entire justification.

"We can't give you our source code." White-box is optional. Black-box ai pentest is the baseline product. Most PCI-focused deployments we see start black-box and add source access after a quarter of trust-building.

Start a fintech-ready AI pentest programme

Start your AI pentest

📚

How Pentestas runs this in production

Everything above is shipped as part of Pentestas — a pentesting-as-a-service platform built around an AI penetration testing system that orchestrates dozens of deterministic detectors alongside an LLM-driven planner and reflector. Our penetration testing with Claude pipeline handles the audit-trail-grade reasoning (causal chains, evidence weighting, narrative attack paths) while our penetration testing with DeepSeek pipeline handles high-volume parallel coverage at the kind of unit cost that lets us re-run a full B2B SaaS pentest weekly without burning the customer's annual budget on a single engagement.

If you're evaluating a vendor for penetration testing with AI, the questions worth pressing on are exactly the ones this post walks through — accuracy gating, replay verification, payload safety, evidence chains, retest cadence. Those are what separate a real pipeline from a wrapper around a public LLM.

Related reading

Run it on your stack: Penetration Testing →

Pentestas for Fintech: AI Penetration Testing That Satisfies PCI DSS 4.0 Without Slowing Your Ship Cadence

Mapping PCI DSS 4.0 to Pentestas

11.4.1 — annual pentest methodology

11.4.2 / 11.4.3 — internal + external pentest

11.4.4 — pentest after significant change

11.4.5 — segmentation check

11.4.6 — pentest after segmentation change

11.4.7 — pentest after new system introduction

The pentest that catches real fintech bugs

IDOR on account endpoints

Mass assignment in user update

JWT weakness in auth flow

Missing rate limit on transfer endpoints

Race condition in balance check + transfer

IDOR combined with webhook-secret exposure

TOTP + 2FA handling

Agent-based CDE scanning

The CIS complement

Evidence for the QSA

What it costs

The migration path

Objections you'll hear

Further reading

How Pentestas runs this in production

Alexander Sverdlov