AI Security
AI Safety Eval Datasets as Supply Chain
The datasets you use to evaluate model safety are themselves a supply chain, and almost nobody is treating them that way. A senior engineer's audit of how eval corpora get poisoned, contaminated, and silently drifted.
Jan 18, 20267 min read