The Failure Rate Is Still Too High

Frontier models are failing one in three attempts on structured benchmarks. "This uneven, unpredictable performance is what the AI Index calls the 'jagged frontier,' a term coined by AI researcher Ethan Mollick to describe the boundary where AI excels and then suddenly fails." This is also why we're not rushing headlong to implement them at Tangilla.

venturebeat.com https://venturebeat.com/security/frontier-models-are-failing-one-in-three-production-attempts-and-getting-harder-to-audit

Filed April 19, 2026 at 7:50 am