LLM Agents can Autonomously Hack Websites

February 23, 2024

"We show that LLM agents can autonomously hack websites, performing tasks as complex as blind database schema extraction and SQL injections without human feedback.

"Importantly, the agent does not need to know the vulnerability beforehand.

"This capability is uniquely enabled by frontier models that are highly capable of tool use and leveraging extended context.

"Namely, we show that GPT-4 is capable of such hacks, but existing open-source models are not.

"Finally, we show that GPT-4 is capable of autonomously finding vulnerabilities in websites in the wild. Our findings raise questions about the widespread deployment of LLMs."

Search This Blog

chatainews

LLM Agents can Autonomously Hack Websites

Comments

Post a Comment

Popular posts from this blog

Hamza Chaudhry

Perplexity

Alongside AI: Eye vs AI