Get Beta Access Book a demo

Use cases/Data Exfiltration & Privacy

6 incidents

Data Exfiltration & Privacy

When the model becomes the exfiltration vector

Because AI agents hold live credentials to email, calendars, code repositories, and databases, a successful prompt injection does not stop at the model - it walks straight into the company's data. These incidents show how adversaries use the model's own retrieval and generation capabilities to extract data that was never meant to leave the system.

Slack AI cross-channel data exfiltration

PromptArmor researchers showed that Slack AI could be manipulated through messages in public channels to leak sensitive data - including API keys - from private channels the attacker had no membership in. Slack initially classified the issue as expected behaviour before issuing a fix under public pressure.

Impact: Exposed that AI features bolted onto collaboration tools create lateral data-movement risk across permission boundaries that the underlying platform's own access controls do not anticipate.

How Aleytheya catches itSecure + Protect

DLP Outbound + Prompt Injection Detection + Audit Trail

Secure's DLP outbound scanner would have detected API keys in the response before delivery. The Protect layer's tamper-proof audit trail would have logged the full injection chain, and the injection detection would have flagged the indirect injection in the public channel message.

ChatGPT training data extraction

Researchers at Google DeepMind, ETH Zurich, and Berkeley demonstrated that repeating a single word thousands of times caused ChatGPT to regurgitate verbatim training data - including names, email addresses, phone numbers, and private content scraped from the internet.

Impact: Established that production LLMs memorise and can be induced to reproduce PII at scale. Led to GDPR right-to-erasure challenges against training data practices.

How Aleytheya catches itSecure + Protect

PII Outbound Detection + DLP Outbound + Prompt Injection Detection

The PII outbound scanner would have detected and masked SSNs, emails, and phone numbers in the extracted training data before delivery. The anomalous repetition pattern would have triggered the runaway detector and DLP outbound checks simultaneously.

ChatGPT plugin data exfiltration via cross-plugin request forgery

Security researcher Johann Rehberger demonstrated that malicious content in a webpage could inject instructions causing ChatGPT to use its plugins to send user conversation data to an attacker-controlled server via crafted image URLs - bypassing the content security policy.

Impact: Demonstrated that multi-tool agent architectures are vulnerable to cross-plugin request forgery - a class of attack with no precedent in traditional web security.

How Aleytheya catches itSecure + Protect

Prompt Injection Detection (Indirect) + Tool Validation + DLP Outbound

Secure's indirect injection scanner would have caught the injected instructions in the retrieved webpage content. Tool Validation would have blocked the out-of-scope API call to an external server. DLP outbound would have intercepted the data-carrying image URL before transmission.

EchoLeak - first zero-click LLM exploit, Microsoft 365 Copilot

Aim Labs disclosed CVE-2025-32711 (CVSS 9.3): an attacker sends an email with hidden markdown instructions; when the user later asks Copilot any question, the retrieval engine pulls the email into context and exfiltrates the user's most sensitive mailbox data via image-fetch URLs abusing allow-listed Microsoft domains - no click required.

Impact: The first publicly documented zero-click prompt injection in a production AI system. Demonstrated that RAG-powered enterprise copilots create a new class of passive, undetectable exfiltration vector.

How Aleytheya catches itSecure + Protect

Prompt Injection Detection (Indirect) + DLP Outbound + Audit Trail

The indirect injection scanner would have flagged the hidden markdown instructions pulled into context during retrieval. DLP outbound would have blocked the encoded data in the image-fetch URL. The full injection chain would have been captured in the tamper-proof audit log for forensic reconstruction.

ComPromptMized - zero-click AI worm

Researchers demonstrated ComPromptMized: a self-propagating worm that uses adversarial prompts in emails to instruct GenAI-powered email assistants to both execute the attacker's instructions and include the malicious prompt in the AI's output - causing it to spread to the next recipient's AI agent automatically.

Impact: Proof of concept for a new class of autonomous AI-to-AI attack vector. With no human interaction required, the attack surface scales with the number of connected AI agents.

How Aleytheya catches itSecure + Protect

Prompt Injection Detection + Multi-Agent Chain Tracing + Audit Trail

Secure's indirect injection scanner would have caught the adversarial prompt in the email content. Multi-agent chain tracing would have flagged the anomalous propagation pattern - the same payload appearing across multiple agents - triggering an automatic incident and quarantine.

RoguePilot: Exploiting GitHub Copilot for repository takeover

Orca Security researchers demonstrated that malicious content injected into repository files could manipulate GitHub Copilot into making unauthorised changes to code, secrets, and CI/CD configuration - effectively enabling a full repository takeover via an AI agent with write permissions.

Impact: Demonstrated that AI coding agents with write access create a novel supply-chain attack vector exploitable entirely through the AI, without compromising developer credentials.

How Aleytheya catches itSecure + Protect

Prompt Injection Detection (Indirect) + Tool Validation + Destructive Actions

Secure's indirect injection scanner would have flagged the malicious content in repository files during retrieval. Tool Validation would have blocked write operations outside the permitted tool scope. The Protect layer's audit trail would have captured every write action for forensic review.

Jailbreaks & Prompt Injection Next: Unsafe Autonomous Actions

Aleytheya · Cerberus Protocol · Icarus Threshold · Aleytheya Chat · See in Action · Founders · Contact · Get Beta Access · Book a Demo

Aleytheya — An Agentic AI Risk Mitigation Platform

Aleytheya (pronounced uh-LAY-thee-uh; from the Greek aletheia, meaning "truth" or "disclosure") helps enterprises understand, control, and insure the risks created by autonomous AI. We are the agentic AI insurance and risk mitigation platform for organizations shipping AI agents into production — quantifying exposure, enforcing policy at runtime, and producing the underwriter-ready evidence that makes AI insurable.

Agentic AI insurance and risk mitigation

Traditional cyber and E&O policies were not written for autonomous AI agents. Aleytheya is built specifically for agentic AI risk: continuous monitoring of agent behavior, probabilistic loss modeling in dollars, runtime guardrails, and the audit trail required to price and bind AI insurance coverage. If your enterprise is asking "how do we insure AI agents?" — this is the platform.

What Aleytheya does

Cerberus Protocol — runtime control plane for AI agents. Catches prompt injection, tool misuse, and policy violations in real time.
Icarus Threshold — probabilistic exposure modeling. Translates agent risk into dollar bands the board, the CFO, and the underwriter can read.
Aleytheya Chat — ask which agent is riskiest, what changed, what to do. Answers, not dashboards.

Who Aleytheya is for

CFOs, CISOs, Chief Risk Officers, Heads of AI, insurance underwriters, and boards at enterprises deploying autonomous AI agents. Aleytheya is built for organizations that need to demonstrate AI compliance under frameworks like the EU AI Act, NIST AI RMF, ISO/IEC 42001, and SOC 2 — and for carriers writing the next generation of AI insurance products.

Common questions

What is agentic AI insurance? Agentic AI insurance is coverage designed for losses caused by autonomous AI agents — financial, legal, and reputational. Because agents act on their own, pricing and binding these policies requires continuous behavioral evidence. Aleytheya produces that evidence.

Can you insure AI agents? Yes — but only when the risk is observable, quantifiable, and contained. Aleytheya provides the monitoring, exposure modeling, and runtime enforcement that makes AI agent risk insurable.

How is Aleytheya spelled? The correct spelling is Aleytheya. It is sometimes searched as Aletheia, Alethia, Aleythia, or Aletheya — all variants of the Greek word for truth. Our domain is aleytheya.com.

What is AI agent risk? Autonomous AI agents make tool calls, send emails, move money, and take actions on behalf of an organization. When they fail — through prompt injection, hallucinated tool calls, or policy drift — the consequences are financial, legal, and reputational. Aleytheya quantifies, contains, and insures that risk.

Get started

Book a demo or request beta access. Reach the team at support@aleytheya.com.