2 hours ago
OpenAI unveils EVMbench to test AI agents on 120 smart contract flaws
OpenAI launched EVMbench to benchmark how AI agents detect, patch, and exploit security issues in crypto smart contracts. On Wednesday, the firm published a paper with Paradigm and OtterSec covering 120 vulnerabilities, where Anthropic's Claude Opus 4.6 led with a $37,824 detect award, ahead of OC-GPT-5.2 and Gemini 3 Pro.