<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0"><channel><title><![CDATA[OpenAI Launches EVMbench to Test AI on Smart Contract Exploits]]></title><description><![CDATA[<p dir="auto"><img src="/forum/assets/uploads/files/1771492983326-275b95f1-d87a-45c6-911a-be789b85638b-image.png" alt="275b95f1-d87a-45c6-911a-be789b85638b-image.png" class=" img-fluid img-markdown" /></p>
<p dir="auto">OpenAI has introduced a new benchmark called EVMbench to evaluate how effectively AI agents can detect, patch, and even exploit vulnerabilities in smart contracts. Developed in collaboration with investment firm Paradigm and security specialist OtterSec, the framework tested models against 120 curated smart contract vulnerabilities drawn from real audit competitions.</p>
<p dir="auto">Among the top performers was Anthropic’s Claude Opus 4.6, which achieved the highest average “detect award,” followed by OpenAI’s OC-GPT-5.2 and Google’s Gemini 3 Pro. The goal, according to OpenAI, is to measure AI performance in economically meaningful environments — especially as smart contracts secure billions in crypto assets and AI agents increasingly operate in financial systems.</p>
]]></description><link>https://undeads.com/forum/topic/15697/openai-launches-evmbench-to-test-ai-on-smart-contract-exploits</link><generator>RSS for Node</generator><lastBuildDate>Sun, 03 May 2026 00:55:34 GMT</lastBuildDate><atom:link href="https://undeads.com/forum/topic/15697.rss" rel="self" type="application/rss+xml"/><pubDate>Thu, 19 Feb 2026 09:23:04 GMT</pubDate><ttl>60</ttl><item><title><![CDATA[Reply to OpenAI Launches EVMbench to Test AI on Smart Contract Exploits on Thu, 19 Feb 2026 10:48:38 GMT]]></title><description><![CDATA[<p dir="auto">we’re heading toward a world where AI audits contracts written by humans that are exploited by other AI. that’s either peak efficiency or chaos lol</p>
]]></description><link>https://undeads.com/forum/post/41114</link><guid isPermaLink="true">https://undeads.com/forum/post/41114</guid><dc:creator><![CDATA[etfs]]></dc:creator><pubDate>Thu, 19 Feb 2026 10:48:38 GMT</pubDate></item></channel></rss>