Introducing AI Cyber Model Arena: A Real-World Benchmark for AI Agents in Cybersecurity - wiz.io
The AI Cyber Model Arena, a new benchmark from Wiz Research, evaluates offensive AI security agents against 257 real-world challenges in vulnerability discovery and exploitation. The challenges cover zero-day discovery, CVE detection, and the exploitation of security weaknesses in APIs, web applications, and cloud environments, including AWS, Azure, GCP, and Kubernetes.