2026-04-21 · Pentestas Features

Legal SaaS holds the most sensitive data your customers will ever put in your DB. Here's why legaltech needs continuous AI penetration testing more than most.

Legal platforms chain five authz checks per request. Each one is a potential CRITICAL.

That's the case for continuous ai penetration testing in legaltech. Annual manual pentesting is compliance; continuous coverage is survival.

⚖️

Legaltech

The legaltech threat model

Four threat categories dominate the legaltech security conversation:

1. Cross-client data exposure

Every multi-tenant legal SaaS has the same latent risk: an IDOR or BFLA or cross-tenant-ID leak where client A's data flows to client B's user. This is the single worst-case incident for the platform and a near-guaranteed contract-termination event for any affected client. Pentestas's Authz specialist caps at 25 hypotheses per scan and runs cross-tenant probes explicitly.

2. Privilege escalation inside a matter

Within a single client, different users have different access: lead partner (everything), reviewer (documents only), paralegal (limited-sensitivity subset), external expert (specific document subset). A missing server-side permission check lets a paralegal see deposition transcripts they shouldn't have; an expert sees strategy documents they shouldn't; a reviewer exports data they can't legally hold. Pentestas probes every endpoint with role-shaped bodies and permission-boundary comparisons.

3. Stored XSS in document annotations

Classic legaltech-specific: stored XSS in a comment or annotation field that fires on the next reviewer's session. When the next reviewer is the lead partner whose role grants access to every document in every matter, the XSS exfils that access. The XSS specialist fires probes through real headless Chromium — a finding is only reported if a real browser executes the payload on a second user's render.

4. Supply-chain + third-party integration bugs

Legal platforms integrate with third-party services: DocuSign for signatures, Box / Dropbox for document storage, Microsoft Graph for M365 integration, Twilio for text alerts. Each integration is a potential exfiltration path. Pentestas scans the whole flow (including OAuth consent dialogs, callback endpoints, and webhook signature verification) and flags weaknesses in the integration boundary.

💡

The Problem

Why legacy scanners struggle here

Three structural reasons:

Complex authorization. Legal platforms often have 5–10 authz check layers per request (org → matter → document → annotation → user role → time-of-day → etc.). Legacy scanners probe only the first check; miss the last four. Pentestas's white-box mode reads the full middleware chain and probes each check independently.

Long document workflows. Document review is a multi-step, conditional, sometimes multi-user workflow. Crawlers follow links; they don't complete workflows. Scan-as-you-browse (via the .NET Windows agent) lets a reviewer walk the real workflow and Pentestas actively tests every endpoint along the way.

Specialised field types. Annotations aren't just text — they're rich documents with embedded references. PDF markups. HTML comments with markdown. Redactions with metadata. The XSS and injection specialists need to reason about each field's specific render context, which is something a regex-based scanner cannot do but a Claude agent with per-category prompts can.

🔍

The Breakdown

What Pentestas does for legaltech specifically

Cross-tenant probes

For every endpoint that accepts a tenant-ID-shaped field in the request body (client_id, org_id, matter_id, firm_id), the Authz specialist adds a cross-tenant probe: submit your own session token + a DIFFERENT tenant's ID and watch for a 200. Most legaltech platforms have at least one such endpoint missing the check.

Role-boundary comparisons

Ship the YAML scan config twice — once with paralegal credentials, once with partner credentials. Pentestas runs the scan for each role and cross-references: endpoints accessible to paralegal that return MORE data than should be visible at paralegal level are flagged CRITICAL (privilege-escalation via information leak). This runs on top of the standard authz probes.

Stored-XSS on document fields

The XSS specialist probes every user-writable field — comment body, annotation text, redaction reason, file description — with context-specific payloads. Each candidate is rendered in headless Chromium; payloads that execute trigger a CRITICAL. Payloads that reflect but don't execute (because the renderer escapes them) are correctly classified as non-exploitable.

File upload + processing

Legal platforms accept PDFs, Word documents, and image files from clients. Each is a processing sink. Pentestas tests:

Malicious SVG → stored XSS in the next reviewer's session.
Oversized file → DoS via memory exhaustion.
PDF with embedded JavaScript → XSS when viewed in a reader.
Filename traversal → path traversal on the storage backend.
Malformed Word doc → XXE in the XML parser.

Each finding includes reproduction steps + the payload file.

Audit-log completeness (white-box)

Source-code-aware mode reads your server code, finds every controller that touches a privileged field, and cross-references with your audit-log writes. Missing audit-log writes on privileged endpoints are flagged as compliance-relevant findings (many legal-engagement contracts require demonstrable audit coverage).

📝

Compliance

Compliance framing

Legal platforms typically operate under:

**ABA Model Rule 1.6*— confidentiality.
**NIST 800-171*— for platforms handling US government legal work.
**ISO 27001*— increasingly mandatory in RFPs.
**SOC 2 Type II*— the cross-industry baseline; most legaltech platforms need it for mid-market sales.
**Client-specific DPAs*— often stricter than the baseline regulations.

Pentestas's scan history + finding-remediation timestamps + attack-chain reports all feed these compliance programmes. The Enterprise tier's unlimited retention + BYOK encryption align with the strictest client DPAs we've seen.

⚖️

Legaltech

Engagement shape for legaltech

A typical mid-size legaltech SaaS (~30 engineers, ~250 API endpoints, multi-tenant, SaaS):

Onboarding (week 1). Verify domains. Configure two scan YAMLs — one for partner role, one for paralegal role. First scan runs; baseline findings triaged.
Baseline remediation (weeks 2–6). Fix the baseline CRITICAL / HIGH. Rescan-on-fix gives the audit trail.
Continuous CI (week 7 onward). Every PR to main runs an ai pentest against an ephemeral environment. Build fails on new HIGH/CRITICAL.
Nightly scheduled scan against staging. Slack alert on HIGH+.
Quarterly client-facing attestation. Export a Pro+ custom-branded PDF showing the last 90 days of security testing — evidence for enterprise RFPs + annual DPA reviews.
Annual human-led deep engagement. Focus on business-logic exploits, unusual legal-specific attack chains (e.g. "can a reviewer abuse the redaction-reveal feature to exfil redacted text?").

The total budget scales with firm size but rarely exceeds $3K/month for platforms with ~100–500 customers. Compare to the $25K–$75K/year for a single annual on-site pentest engagement.

⚖️

Legaltech

Sample legaltech finding

CRITICAL — Cross-Client Document Exposure (IDOR)

Endpoint: GET /api/matters/{matter_id}/documents
Evidence:
  Request 1 (session = user 101, assigned to matter 5001):
    /api/matters/5001/documents → 200, 47 documents (correct)
  Request 2 (same session, modified URL):
    /api/matters/5002/documents → 200, 32 documents
    → Documents belong to a different client entirely.

Source-code citation (white-box):
  src/controllers/documents_controller.rb:31
    @documents = Document.where(matter_id: params[:matter_id])
  Missing: authorization check for current_user's access to matter.

Reproduction:
  curl -H "Cookie: sid=..." \
       https://platform.legaltech.com/api/matters/5002/documents

Chain context:
  Combined with STORED-XSS-7 (annotation XSS exfils session cookies from
  the next reviewer), any reviewer in any matter could be phished to
  serve up cross-client document listings.

Impact:
  Cross-client document exposure. ABA Model Rule 1.6 violation.
  Breach-disclosure-adjacent under many state consumer-protection laws.
  Enterprise client termination-event.

CVSS: 8.6

Source-code pointer + proof-of-exploit HTTP trace + chain context. Your engineer has a 5-minute fix; your compliance officer has an audit-ready finding.

💡

The Problem

Why continuous + AI specifically

The continuous-coverage argument is stronger in legaltech than most industries. Three reasons:

Client-specific attack surfaces. Every enterprise client gets custom configuration: SSO, DPA-specific retention rules, custom workflow templates. Each customisation is an attack-surface change; each change is a potential bug. Continuous scan catches drift the annual pentest won't.

Reputational catastrophe. A law firm that loses a trial strategy to an IDOR doesn't just sue the platform — they switch vendors, tell every peer firm in their network, and feature the incident in trade-press investigations. The reputational tail is permanent. Continuous ai pentest is the lowest-cost way to prevent headline-worthy incidents.

RFP gate-keeping. Enterprise legal buyers increasingly require continuous-testing attestation in RFPs. "We do an annual pentest" no longer wins deals; "We run continuous AI penetration testing through Pentestas with scan history available under NDA" does.

Protect your legal platform's privilege

Start your AI pentest

📚

How Pentestas runs this in production

Everything above is shipped as part of Pentestas — a pentesting-as-a-service platform built around an AI penetration testing system that orchestrates dozens of deterministic detectors alongside an LLM-driven planner and reflector. Our penetration testing with Claude pipeline handles the audit-trail-grade reasoning (causal chains, evidence weighting, narrative attack paths) while our penetration testing with DeepSeek pipeline handles high-volume parallel coverage at the kind of unit cost that lets us re-run a full B2B SaaS pentest weekly without burning the customer's annual budget on a single engagement.

If you're evaluating a vendor for penetration testing with AI, the questions worth pressing on are exactly the ones this post walks through — accuracy gating, replay verification, payload safety, evidence chains, retest cadence. Those are what separate a real pipeline from a wrapper around a public LLM.

Related reading

Run it on your stack: Penetration Testing →

Pentestas for Legaltech: Continuous AI Pentest for Privilege-Heavy Document Platforms

The legaltech threat model

1. Cross-client data exposure

2. Privilege escalation inside a matter

3. Stored XSS in document annotations

4. Supply-chain + third-party integration bugs

Why legacy scanners struggle here

What Pentestas does for legaltech specifically

Cross-tenant probes

Role-boundary comparisons

Stored-XSS on document fields

File upload + processing

Audit-log completeness (white-box)

Compliance framing

Engagement shape for legaltech

Sample legaltech finding

Why continuous + AI specifically

Further reading

How Pentestas runs this in production

Alexander Sverdlov