AIs can exploit one-day vulnerabilities
"We show that LLM agents can autonomously exploit one-day vulnerabilities in real-world systems.
"To show this, we collected a dataset of 15 one-day vulnerabilities that include ones categorized as critical severity in the CVE description.
"When given the CVE description, GPT-4 is capable of exploiting 87% of these vulnerabilities compared to 0% for every other model we test (GPT-3.5, open-source LLMs) and open-source vulnerability scanners (ZAP and Metasploit).
"Fortunately, our GPT-4 agent requires the CVE description for high performance: without the description, GPT-4 can exploit only 7% of the vulnerabilities.
"Our findings raise questions around the widespread deployment of highly capable LLM agents."