AI Security
CyberSecEval Reviewed: What It Measures
A working engineer's review of CyberSecEval, the Meta-originated benchmark that has quietly become the default sniff test for AI-for-security claims. What it actually measures, what it misses, and how to read its scores without fooling yourself.
Jan 9, 20266 min read