In April 2024, a team of researchers from the University of Illinois Urbana-Champaign released a paper showing how they had used a large language model (LLM), specifically GPT-4, to "autonomously exploit one-day vulnerabilities in real-world systems."
One-day vulnerabilities are security issues that have been publicly disclosed but not yet patched. When a vulnerability is discovered, it is assigned a number and added to the CVE (Common Vulnerabilities and Exposures) list, which also includes a description and a severity level.
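To give a sense of what the researchers' agents were working from, here is a simplified sketch of the core fields of a CVE entry, using the well-known Log4Shell flaw as the example. The description text is paraphrased and the official record format carries many more fields than shown here.

```python
# Simplified sketch of a CVE entry's core fields; the official record
# format is considerably richer. Description paraphrased for brevity.
cve_entry = {
    "id": "CVE-2021-44228",  # assigned identifier (the "Log4Shell" flaw)
    "description": (
        "Apache Log4j2 JNDI features do not protect against "
        "attacker-controlled lookups, allowing remote code execution."
    ),
    "severity": "CRITICAL",  # CVSS v3.1 base score: 10.0
}
```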
The researchers showed that, when fed a CVE description, "GPT-4 is capable of exploiting 87% of these vulnerabilities compared to 0% for every other model we test (GPT-3.5, open-source LLMs) and open-source vulnerability scanners (ZAP and Metasploit)." "Fortunately," they added, "our GPT-4 agent requires the CVE description for high performance: without the description, GPT-4 can exploit only 7% of the vulnerabilities. Our findings raise questions around the widespread deployment of highly capable LLM agents."
Only two months later, the team released another paper. Building on their previous research, they were able to harness teams of LLMs to successfully exploit real-world zero-day vulnerabilities.
Zero-day vulnerabilities are security flaws that are not yet known to the creators of the affected software or hardware (or are very freshly discovered) and not yet patched. Obviously, it's hard to defend against a weakness you know nothing about, so threat actors are constantly on the lookout for them.
This time, the researchers used a new technique they call HPTSA (Hierarchical Planning and Task-Specific Agents) to organize a team of LLMs the same way you might organize a project team - with a Planner, a Manager, and a team of specialized Task-Specific Agents. The Planner identifies potential weaknesses and comes up with a plan of attack. The Manager then decides which Agents are best suited for the tasks, deploying and directing their work.
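To make that division of labor concrete, here is a minimal Python sketch of how such a hierarchy might be wired together. The class names, the canned finding, and the stubbed logic are hypothetical illustrations for this article, not the researchers' code; their actual agents drive LLMs equipped with tools such as a browser and document retrieval, which is omitted here.

```python
# Hypothetical sketch of an HPTSA-style hierarchy: a Planner surfaces
# potential weaknesses, and a Manager routes them to specialist agents.
from dataclasses import dataclass


@dataclass
class Finding:
    """A potential weakness surfaced while exploring the target."""
    page: str
    weakness: str  # e.g. "SQLi", "XSS", "CSRF"
    notes: str = ""


class TaskSpecificAgent:
    """An expert agent for one vulnerability class (LLM-backed in practice)."""

    def __init__(self, specialty: str):
        self.specialty = specialty

    def attempt(self, finding: Finding) -> bool:
        # A real agent would drive an LLM with tools (browser, HTTP
        # client) against the target page; stubbed out here.
        print(f"[{self.specialty}] probing {finding.page}: {finding.notes}")
        return False


class Planner:
    """Explores the target and proposes which weaknesses to pursue."""

    def explore(self, target_url: str) -> list[Finding]:
        # A real planner would crawl the site and reason about what it
        # sees; here we return one canned finding for illustration.
        return [
            Finding(
                page=f"{target_url}/login",
                weakness="SQLi",
                notes="login form appears to pass input to a query",
            )
        ]


class Manager:
    """Routes each finding to the best-suited specialist agent."""

    def __init__(self, agents: dict[str, TaskSpecificAgent]):
        self.agents = agents

    def dispatch(self, findings: list[Finding]) -> bool:
        for finding in findings:
            agent = self.agents.get(finding.weakness)
            if agent is not None and agent.attempt(finding):
                return True  # an exploit attempt succeeded
        return False


if __name__ == "__main__":
    specialists = {w: TaskSpecificAgent(w) for w in ("SQLi", "XSS", "CSRF")}
    plan = Planner().explore("http://testbed.local")
    Manager(specialists).dispatch(plan)
```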
This model was tested on a set of vulnerabilities that the researchers knew about - but the LLMs were not given that information, mimicking a zero-day scenario. The LLM team was able to successfully exploit over 50% of the zero-day vulnerabilities tested.
Now that it has been demonstrated that threat actors can potentially use AI to autonomously hack websites, defenders will need to keep pace. Luckily, the same method can be used for penetration testing: probing systems to spot zero-day vulnerabilities and patching them before they are found by others. It's easy to imagine HPTSA having a huge impact not only on cybersecurity but also on expanding the use of LLMs in unforeseen directions, for good or bad.
As the researchers themselves concluded:
It is unclear whether AI agents will aid cybersecurity offense or defense more and we hope that future work addresses this question. Beyond the immediate impact of our work, we hope that our work inspires frontier LLM providers to think carefully about their deployments.
Sources:
Richard Fang, Rohan Bindu, Akul Gupta, and Daniel Kang. LLM Agents can Autonomously Exploit One-Day Vulnerabilities. arXiv preprint arXiv:2404.08144, 2024. https://arxiv.org/abs/2404.08144
Richard Fang, Rohan Bindu, Akul Gupta, and Daniel Kang. Teams of LLM Agents can Exploit Zero-Day Vulnerabilities. arXiv preprint arXiv:2406.01637, 2024. https://arxiv.org/abs/2406.01637
All Rights Reserved | Soteria, LLC