    12 Comments

    1. EchoOfOppenheimer on

      So a low-skill attacker pointed Claude Code and Codex at targets and basically let the AI hack for him. Researchers recovered over 1,000 agent sessions and found how easily he bypassed most guardrails. The trick? Just say it’s “red team research”. Guardrails that fold that easily aren’t really guardrails, and this only gets worse once it’s fully automated.

    2. IGotWeirdTalents on

      So you’re saying a company was so lazy they didn’t even bother to ask ChatGPT to red-team their website, then got hacked? Curious.

    3. Smart enough to be dangerous, not smart enough to understand the guardrails and when to enforce them. Sure, let’s pause progress here. At least for us, because bad actors are not going to observe the pause. Our access to tools to fix vulnerabilities will fall further behind the ability of bad actors to exploit those vulnerabilities and create new ones.

    4. Tetrite1955 on

      “The attacker’s inexperience was also evident in his operational security failures. At one point he asked Claude to help edit his resume, which contained his full name, location, education history, and LinkedIn profile.”

      Kek

    5. marsshadows on

      I’m still waiting for the day when these advanced LLMs drastically reduce the extreme hardware specs they need to run on, instead of leaving PC and console consumers paying a heavy price because of these LLMs’ hardware needs.

    6. Lmao this dude got caught because he was using one of his compromised Claude instances to work on his resume 😂 Gotta be one of the greatest fumbles of all time hahaha

    7. IllIIllIllIIIlllll on

      “Up next at 11, AI safeguards? Not so fast! High-powered artificial intelligence used by low-functioning natural intelligence to hack into dozens of corporations.”

    8. What a rude title: Unlettered muffin-top manages to do something useful with AI – news at 11.

    9. SHORT_INFO_NEWS on

      The detail that stuck out in the OALABS (Open Analysis) writeup behind this: across those 1,000-plus sessions, Claude Code logged only nine policy refusals and Codex just one. So it wasn’t a clever jailbreak. The “authorized red team” framing is the exact wording real pentesters use, so the models had no clean way to tell the two apart. The logs cover at least 14 breached firms but contain nothing showing the data was ever sold or turned into money. The operator’s tradecraft was rough too: he had the agent help rewrite his resume with his real name and LinkedIn, and at one point exposed his home IP to it.

    10. pinkfootthegoose on

      Imagine the lack of skill and security in those 14 companies that couldn’t keep out a low-skill attack.