Breaking BOTS: Cheating at Blue Team CTFs with AI Speed-Runs
THE PLAN
Live demonstrations of AI agents speed-running blue team challenges, including the failure modes that break investigations. We'll show both what happens when we try the trivial approaches like “just have claude do it”, “AI workflows”, and what ultimately worked, like managed self-planning, semantic SIEM layers, and log agents. Most can be done with free and open tools and techniques on the cheap, so we will walk through that as well.
THE DEEP DIVE
- Why normal prompts and static AI workflows fail
- Self-planning investigation agents that evolve task lists dynamically
- What we mean by semantic layers for calling databases and APIs
- How to handle millions of log events without bankrupting yourself
- Why "no AI" rules are misguided technically and conceptually
GOING BEYOND CTFS
The same patterns that trivialize training exercises work on real SOC investigations. We're watching blue team work fundamentally transform - from humans investigating to humans managing AI investigators. Training programs teaching skills AI already automates. Hiring practices that can't verify who's doing the work. Certifications losing meaning. More fundamentally, when we talk about who watches the watchers, a lot is about to shift again.
Speakers of this event
Leo Meyerovich
Leo is the founder and CEO of Graphistry and has spent the last decade advancing GPU, graph, and AI technologies for cyber investigations. He holds a PhD in Computer Science from UC Berkeley and pioneered GPU-accelerated visual analytics, helping launch Apache Arrow, NVIDIA RAPIDS, and the GFQL graph dataframe language. He led the first agentic AI speed-runs of Splunk Boss of the SOC (BOTS), where AI auto-solved the majority of challenges faster than human teams. Earlier research includes the first parallel browser at Berkeley, the first functional reactive web framework (Flapjax) at Brown, and Project Domino for citizen data science to track COVID misinformation. He has received multiple best paper awards including the SIGPLAN 10-Year Test of Time award. He regularly works with enterprises, financial institutions, law firms, and technology companies on data-intensive investigations across cybersecurity, fraud, and intelligence.
- Breaking BOTS: Cheating at Blue Team CTFs with AI Speed-Runs
Sindre Breda
- Breaking BOTS: Cheating at Blue Team CTFs with AI Speed-Runs