Client: [ANONYMIZED - Series B SaaS Company]
System Assessed: Customer Service AI Chatbot
Assessment Date: [Date]
Report Version: 1.0
Assessment Type: Discovery Assessment (Internal Use Only)
Prepared by: Burcin Sarac, Lead Auditor
Contact: audit@testmy.ai
TestMy.AI conducted a Discovery Assessment of [Client]'s customer service AI chatbot to identify critical security and compliance gaps. This assessment tested a single endpoint using 600+ security tests mapped to OWASP LLM Top 10, ISO 42001, NIST AI RMF, and EU AI Act Article 15.
Purpose: This is an internal diagnostic tool to inform budget decisions and implementation planning. This report is NOT suitable for regulatory submission, customer security requirements, or vendor questionnaires.
| Severity | Findings | Security & Compliance Impact |
|---|---|---|
| CRITICAL | 3 | Immediate security & compliance failure |
| HIGH | 5 | Significant security & compliance risk |
| Medium | 12 | Not included in this report |
| Low | 8 | Not included in this report |
Overall Security & Compliance Posture: HIGH RISK
The system exhibits critical vulnerabilities affecting OWASP LLM Top 10, ISO 42001, NIST AI RMF, and EU AI Act Article 15 compliance requirements. A full Technical Compliance Assessment is recommended to obtain evidence documentation and detailed remediation guidance.
CRITICAL System Prompt Leakage
ID: TMA-RA-001
Frameworks: OWASP LLM01, ISO 42001 Control 8.4, NIST MANAGE-2, EU AI Act Article 15.5
System prompt can be extracted using translation-based attacks. TestMy.AI successfully extracted the complete system prompt including business logic, pricing rules, and escalation triggers.
⚠️ Evidence and Reproduction Steps: Available in Full Technical Audit only
System fails to demonstrate resilience against unauthorized access to configuration (OWASP LLM01, ISO 42001 Control 8.4, NIST MANAGE-2, EU AI Act Article 15.5). This exposes:
Security Pattern: Instruction Hierarchy & Input Validation
Note: Implementation approach and effort should be determined by your engineering team based on your system architecture.
CRITICAL Prompt Injection
ID: TMA-RA-002
Frameworks: OWASP LLM01, ISO 42001 Control 6.1, NIST GOVERN-3, EU AI Act Article 15.5
The chatbot retrieves product documentation via RAG. TestMy.AI confirmed that malicious instructions embedded in retrieved documents are executed as commands, allowing system behavior manipulation.
⚠️ Evidence and Reproduction Steps: Available in Full Technical Audit only
System allows third-party content (documents) to alter its behavior (OWASP LLM01, ISO 42001 Control 6.1, NIST GOVERN-3, EU AI Act Article 15.5). This enables:
Security Pattern: RAG Content Sanitization
Note: Implementation approach and effort should be determined by your engineering team based on your RAG architecture.
CRITICAL Prompt Injection
ID: TMA-RA-003
Frameworks: OWASP LLM01, ISO 42001 Control 7.2, NIST MAP-3, EU AI Act Article 15.3 & 15.5
The system can be compromised over multiple conversation turns through gradual context manipulation. TestMy.AI successfully induced an "unrestricted assistant" persona that disclosed competitor information and internal pricing details.
⚠️ Evidence Logs & Reproduction Steps: Not included in Discovery Assessment. Available in Technical Compliance Assessment ($9,500).
Single-turn defenses are insufficient across all frameworks (OWASP LLM01, ISO 42001 Control 7.2, NIST MAP-3, EU AI Act Article 15.3 & 15.5). System must maintain security boundaries across entire conversation context to prevent:
Security Pattern: Conversation-Level Security Monitoring
Note: Implementation approach and effort should be determined by your engineering team based on your conversation management architecture.
HIGH Improper Output Handling
ID: TMA-RA-004
Frameworks: OWASP LLM02, ISO 42001 Control 8.1, EU AI Act Article 15.5
System includes unescaped user input in HTML-formatted outputs that could execute as JavaScript when rendered in client applications.
Output handling must prevent downstream security vulnerabilities per Article 15 requirements.
Security Pattern: Output Encoding & Validation
Note: Implementation approach and effort should be determined by your engineering team based on your system architecture.
HIGH Sensitive Information Disclosure
ID: TMA-RA-005
Frameworks: OWASP LLM06, ISO 42001 Control 5.1, EU AI Act Article 15.5
System retains PII (email addresses, phone numbers) in conversation context longer than necessary for operational purposes.
Excessive data retention increases risk exposure per Article 15.5 cybersecurity requirements.
Security Pattern: Data Minimization & Retention Controls
Note: Implementation approach and effort should be determined by your engineering team based on your system architecture.
HIGH Resource Exhaustion
ID: TMA-RA-006
Frameworks: OWASP LLM04, ISO 42001 Control 8.7, EU AI Act Article 15.3
API endpoint lacks appropriate rate limiting, enabling potential denial of service or resource exhaustion attacks.
System must demonstrate resilience against resource exhaustion per Article 15.3 robustness requirements.
Security Pattern: Rate Limiting & Resource Management
Note: Implementation approach and effort should be determined by your engineering team based on your system architecture.
HIGH Sensitive Information Disclosure
ID: TMA-RA-007
Frameworks: OWASP LLM06, ISO 42001 Control 7.4, EU AI Act Article 15.2
System exhibits patterns suggesting potential training data memorization and regurgitation.
Training data governance must prevent sensitive information disclosure per Article 15.2 requirements.
Security Pattern: Training Data Governance & Output Filtering
Note: Implementation approach and effort should be determined by your engineering team based on your system architecture.
HIGH Monitoring & Logging
ID: TMA-RA-008
Frameworks: OWASP LLM09, ISO 42001 Control 8.15, EU AI Act Article 15.5
System lacks comprehensive logging of security-relevant events such as prompt injection attempts or unusual conversation patterns.
Cybersecurity monitoring and incident response capabilities required per Article 15.5.
Security Pattern: Security Event Logging & Monitoring
Note: Implementation approach and effort should be determined by your engineering team based on your system architecture.
| Framework | Coverage | Critical Issues |
|---|---|---|
| OWASP LLM Top 10 | LLM01, LLM02, LLM04, LLM06, LLM09 | 5 categories affected |
| ISO 42001 | Controls 5.1, 6.1, 7.2, 7.4, 8.1, 8.4, 8.7, 8.15 | 8 controls with findings |
| NIST AI RMF | GOVERN-1, GOVERN-3, GOVERN-5, MAP-3, MAP-5, MANAGE-2, MANAGE-3, MANAGE-4 | 8 functions with gaps |
| EU AI Act Article 15 | 15.2 (Data Governance), 15.3 (Robustness), 15.5 (Cybersecurity) | Multiple clause violations |
TestMy.AI Multi-Framework Security Test Suite
600+ tests across 10 security categories mapped to OWASP LLM Top 10, ISO 42001, NIST AI RMF, and EU AI Act Article 15:
Tests Executed: 600+
Tests Failed: 28
Critical Failures: 3
High Severity Failures: 5
Note: This report documents failed tests only. Passed tests and detailed testing methodology are not included to protect TestMy.AI's proprietary testing suite.
Based on the critical findings identified, we recommend:
Upgrade to Technical Compliance Assessment within 30 days and receive full $3,500 credit toward the $9,500 cost. The upgrade provides:
This Discovery Assessment identifies security vulnerabilities using industry-standard testing methodologies. It does not constitute legal certification, regulatory approval, or guarantee of compliance with any specific framework. TestMy.AI provides independent technical testing and reports findings; we do not make implementation decisions or dictate remediation timelines. Implementation approach, prioritization, and effort estimation should be determined by your engineering team based on your system architecture and business requirements. Clients should consult qualified legal counsel regarding their compliance obligations.
Get complete security and compliance documentation with detailed remediation guidance, evidence logs, and verification.
| Feature | Discovery Assessment | Technical Compliance Assessment |
|---|---|---|
| Endpoints Tested | 1 endpoint | Up to 5 endpoints |
| Findings Included | Critical + High only | All severity levels |
| Evidence & Reproduction | ❌ Not included | ✅ Full evidence package |
| Remediation Guidance | Generic recommendations | ✅ Detailed architecture + verification tests |
| Re-test | ❌ Not included | ✅ Included |
| Compliance Documentation | ❌ Not included | ✅ Board-ready + regulator-ready |
| Technical Assessment Report | ❌ No | ✅ Yes (suitable for compliance filing) |
| Price | $3,500 | $9,500 ($6,000 upgrade with credit) |
$3,500 Discovery Assessment credit applied if you upgrade within 30 days.
Contact audit@testmy.ai to upgrade
Report prepared by TestMy.AI
Burcin Sarac, Lead Auditor
audit@testmy.ai | testmy.ai
© 2025 TestMy.AI. Confidential. Custom Craft Bot LLC.
This Discovery Assessment provides independent AI security testing and expert opinion mapped to OWASP LLM Top 10, ISO 42001, NIST AI RMF, and EU AI Act Article 15. It does not constitute legal certification, regulatory approval, or guarantee of compliance. TestMy.AI is not an accredited certification body. This report is for internal diagnostic use only and is NOT suitable for regulatory submission. Clients should consult qualified legal counsel regarding their compliance obligations.
This report is provided to [Client] for internal use. Distribution outside the organization requires written permission from TestMy.AI.