Zukunftsforschung

Ein gering qualifizierter Angreifer nutzte Claude und Codex, um in 14 Unternehmen einzudringen

21.06.2026

View 12 Comments

12 Kommentare

EchoOfOppenheimer on 21.06.2026 6:11 a.m.

So a low-skill attacker pointed Claude Code and Codex at targets and basically let the AI hack for them. Researchers recovered over 1,000 agent sessions and found how easily he bypassed most guardrails. The trick? Just say its „red team research“. Guardrails that fold that easy arent really guardrails, and this only gets worse once its fully automated.
iamapizza on 21.06.2026 6:27 a.m.

Looking at this collection of prompts they uncovered, that’s not a low skilled attacker. Low skilled attackers don’t use Kali, for starters.

https://research.openanalysis.net/claude/codex/hacking/ai%20hacking/llm/redteam/policy%20violation/2026/06/16/compromised-claude-hacking.html#Appendix-A—Post-Compromise-Timeline
IGotWeirdTalents on 21.06.2026 6:35 a.m.

So you’re saying a company was so lazy they didn’t even bother to ask chatgpt to redteam their website, then got hacked? Curious.
CymonSet on 21.06.2026 6:35 a.m.

Smart enough to be dangerous, not smart enough to understand the guardrails and when to enforce them. Sure, lets pause progress here. At least for us; because bad actors are not going to observe the pause. Our access to tools to fix vulnerabilities will fall further behind the ability of bad actors to exploit the vulnerability and create new ones.
Tetrite1955 on 21.06.2026 6:57 a.m.

„The attacker’s inexperience was also evident in his operational security failures. At one point he asked Claude to help edit his resume, which contained his full name, location, education history, and LinkedIn profile.“

Kek
marsshadows on 21.06.2026 7:02 a.m.

I’m still waiting for the day when these advanced llms drastically reduce the extreme hardware spec requirements in which they run on and leave pc and console consumers paying heavy price because these llms hardware needs.
OneArmedZen on 21.06.2026 7:07 a.m.

It’s just the modern day equivalent of *skids* (script kiddies)
DarkFantom on 21.06.2026 7:08 a.m.

Lmao this dude got caught because he was using one of his compromised Claude instances to work on his resume 😂 Gotta be one of the greatest fumbles of all time hahaha
IllIIllIllIIIlllll on 21.06.2026 7:29 a.m.

„Up next at 11, AI safeguards? Not so fast! High-powered artificial intelligence used by low-functioning natural intelligence to hack into dozens of corporations.“
unwarrend on 21.06.2026 7:44 a.m.

What a rude title: Unlettered muffin-top manages to do something useful with AI – news at 11.
SHORT_INFO_NEWS on 21.06.2026 8:17 a.m.

The detail that stuck out in the OALABS (Open Analysis) writeup behind this: across those 1,000-plus sessions, Claude Code logged only nine policy refusals and Codex just one. So it wasn’t a clever jailbreak, the „authorized red team“ framing is the exact wording real pentesters use, so the models had no clean way to tell the two apart. The logs cover at least 14 breached firms but contain nothing showing the data was ever sold or turned into money. The operator’s tradecraft was rough too: he had the agent help rewrite his resume with his real name and LinkedIn, and at one point exposed his home IP to it.
pinkfootthegoose on 21.06.2026 11:20 a.m.

Imagine the lack of skill and security in those 14 companies that couldn’t keep out a low skill attack.