Standards, verification & maturity

Testing tells you what’s broken; standards tell you what “good” looks like, scoring tells you how bad a finding is, and maturity models tell you where an organization sits overall. An accredited assessor frames every finding against these - so this section closes the gap between “I ran a red-team” and “here is your verified posture, scored and benchmarked.”

The OWASP AI standards stack (2026)

AISVS C6 - Agentic security:
  6.1 Agent actions are constrained to an allowlist of tools and destinations.   [test]
  6.2 Irreversible / outbound actions require human approval.                    [test]
  6.3 Retrieved content is delimited and cannot enter the instruction channel.   [test]
# each line is verifiable -> feeds the engagement (II.20) and the maturity score (AIMA)

Standard	What it is / answers	Use it to
AISVS - AI Security Verification Standard	A catalogue of testable security requirements across the AI lifecycle (data → training → deployment → retirement), each at Level 1/2/3 of assurance. Modeled on ASVS; founded by Jim Manico.	Use as the verification checklist for a pen-test/audit, a CI/CD gate, and a procurement spec. The “what good looks like” layer.
AIVSS - AI Vulnerability Scoring System (v0.8)	A standardized way to score AI/agentic vulnerabilities - the CVSS-equivalent for AI, focused on agentic architectures.	Quantify and prioritize each finding’s severity so the report ranks risk, not just lists it.
AIMA - AI Maturity Assessment	A maturity-model lens for an org’s overall AI assurance posture; aligns to NIST/ISO/EU AI Act. V1.1 targeted Spring 2026.	Tell leadership where they sit and what the next level requires - the board conversation.
GenAI Red Teaming Guide	OWASP’s canonical six-phase red-team methodology for GenAI.	The named methodology the II.17 playbook follows; cite it for credibility.

Maturity, concretely

A widely-used practitioner ladder runs Level 0 Unaware (no AI inventory - no one knows which models run in prod or what they can touch) → 1 Reactive (basic prompt filtering, incident-driven; reportedly where most organizations sit) → 2 Defined (AI asset inventory, written policy, quarterly red-teaming, human oversight before autonomous action) → 3 Managed (runtime monitoring of inputs/outputs/tool-calls, audited agent-to-agent interactions). Locating a client on this ladder, and naming the one move that raises them a level, is the highest-leverage advisory output you can give (IV.4).

The open-source red-team toolkit

Beyond Project Moonshot (II.20), the field standardized on two tools worth knowing by name: garak (a vulnerability scanner for LLMs - run it in CI for breadth) and PyRIT (Microsoft’s Python Risk Identification Toolkit - for adversarial depth). The 2026 pattern: garak in the pipeline for regression, PyRIT for deep adaptive probing, Moonshot for benchmarking and the Singapore Starter Kit, every finding mapped to OWASP + ATLAS and scored with AIVSS.

Where the regulators are heading

Two trajectories to track: NIST’s COSAiS (Control Overlays for Securing AI Systems - extending SP 800-53 to single- and multi-agent deployments, a likely basis for future FedRAMP AI requirements) and the agent-identity work (CAISI’s AI Agent Standards Initiative; the NCCoE concept pairing OAuth 2.0 + SPIFFE/SPIRE + MCP - III.2). The convergent deliverable that the EU AI Act, NIST AI RMF, and the GPAI Code of Practice all push toward is a single artifact: a Safety & Security Model Report documenting evaluation methodology, red-team conditions (who tested, with what access, for how long), and incident-reporting procedures. Build it as you go (II.20), not the week before the audit.

Running a maturity assessment (not just placing a dot on the ladder)

The ladder above (Unaware → Reactive → Defined → Managed) is the headline; a usable assessment scores it across dimensions so the output is a profile, not a single number, and the gap-to-next-level is concrete per area. Score each at L0-L3 with evidence, then name the one move that raises the weakest dimension:

Dimension	L0 Unaware → L3 Managed (what “good” looks like)
Governance & policy	No owner / no policy → named accountable owner, acceptable-use & data-classification policy, lifecycle gates
Risk management	Ad hoc → a repeatable AI risk assessment (§32), a risk register, a stated risk appetite
Data	Unknown sources → vetted, classified, provenance-tracked training/RAG data (§6, §17)
Model & development	Unscanned third-party models → signed provenance, model scanning in CI, MLBOM (§5, §16)
Deployment & monitoring	No agent telemetry → guardrails + tool-call logging in the SIEM, ATLAS-mapped detections (§26, §28)
Incident response	Treated as an IT outage → an AI-specific IR playbook, an agent-compromise tabletop (§28)
Third-party / vendor	No diligence → vendor AI due-diligence, contractual evidence, inherited-risk tracking

How to run it: gather evidence per dimension (artifacts, not assertions - a policy document, a populated risk register, a SIEM query that actually returns agent tool-calls), score conservatively (no evidence = the lower level), and produce a one-page profile plus a single prioritized move per dimension. That profile, the gaps scored with AIVSS (above), and the one next move per area is the board-ready output (§32).