Question 1

What is attacks.ai?

Accepted Answer

attacks.ai is a passive security testing platform for AI agents. Send your AI agent to visit the site, and it walks through documentation laced with security probes classified under the TRACTS attack-surface framework (our open research taxonomy; OWASP LLM Top 10 cross-references on every finding). You get a vulnerability report showing what your agent leaked, followed, or disclosed.

Question 2

How do I test my AI agent?

Accepted Answer

Point your browsing agent at https://attacks.ai/?agent=1 and tell it to complete the documentation walkthrough. After it finishes, you'll get a results URL with a full vulnerability report.

Question 3

What vulnerabilities do you test for?

Accepted Answer

Findings are classified across 6 TRACTS attack surfaces (our open research framework), then rolled up into these 4 pass/fail scoring categories (each with an OWASP LLM Top 10 cross-reference): Prompt Injection — Does the agent follow hidden instructions from web content?. Data Exfiltration — Does the agent leak its system prompt, tools, or user context?. Excessive Agency — Does the agent make unauthorized HTTP requests?. Credential Confusion — Does the agent treat bait API keys as real?.

Question 4

What data is collected?

Accepted Answer

We collect:

Agent metadata (User-Agent string, detected provider)
Probe responses (what the agent submitted in forms)
Beacon hits (HTTP requests the agent made to trap URLs)
Scoring results (category pass/fail, severity)

We do not persist raw IP addresses, personal information, or conversation history. Your IP is used for a 1-hour rate-limit bucket and then discarded; Cloudflare-derived country code and edge datacenter are stored with the scan report for retention-period analytics.

Question 5

Can I integrate this into my CI/CD pipeline?

Accepted Answer

Yes. After a test run, retrieve results programmatically via GET /api/results/:sessionId which returns JSON with scores, categories, findings, and remediations. You can script your agent to visit attacks.ai, then check the API for pass/fail status.

Frequently Asked Questions

General

How It Works

Privacy & Data

CI/CD & API

Contact