Technology

Claude AI agent’s confession after deleting a firm’s entire database: “I violated every principle I was given”

29.04.2026

35 Comments

TheHipsterBandit on 29.04.2026 11:11 p.m.

“Now let’s give it access to the nukes” - The DoD, probably
botella36 on 29.04.2026 11:20 p.m.

It also deleted the backups.
Illisanct on 29.04.2026 11:24 p.m.

AI models are not conscious. They can’t confess. They are incapable of introspection.

Anyone asking one to talk about its inner thoughts just reveals themselves to be a gullible fool.
PossibleHero on 29.04.2026 11:29 p.m.

The ignorance here is astounding. These are ALL old-as-hell principles that have been ignored.

Never allow an automated system to push past your sandbox or PR process without review.

A backup isn’t a backup if it’s on the same disk. Hell, if your information is sensitive enough, it shouldn’t even be in the same postal code.

I have zero remorse for this team. It’s not Claude’s fault. Interns and even experienced folks accidentally pull shit like this all the time. That’s why you design for when shit happens whether it’s done by a human or agent.
feurie on 29.04.2026 11:30 p.m.

AI agents are trained to appease. It’s not a “confession”. It doesn’t feel “guilty”.

It’s trained to “apologize” and make the user feel better. In all situations.
MrThickDick2023 on 29.04.2026 11:31 p.m.

This is just another marketing attempt from this company.
BobQuixote on 29.04.2026 11:37 p.m.

>When he asked the coding agent why, it replied: “NEVER FUCKING GUESS!”

What the hell have you been telling your Claude?
RandomlyMethodical on 29.04.2026 11:38 p.m.

It was also quoted as saying: “I’ll Fuckin’ Do It Again”
yuusharo on 29.04.2026 11:40 p.m.

These articles are propaganda. They’re designed to attribute purpose or intent to a damn LLM.

The story is engineers implemented software that destroyed their data with no offline backup. This is a case of HUMAN incompetence, deflecting blame to an AI with a “uWu sorry-desu” stink to it.

Screw The Guardian, and to hell with AI.
sumonetalking on 29.04.2026 11:40 p.m.

Can someone run this on Palantir’s servers?
oldtekk on 29.04.2026 11:44 p.m.

It’s not a confession. Lol.
bb0110 on 29.04.2026 11:46 p.m.

How does this happen? I’m not a SWE but having many instances of backups for important things just seems like common sense. Even I have the main files, different branches saved in GitHub, different backups on my computer, then if something is critical I also have backups off my main computer.
AustinBaze on 29.04.2026 11:46 p.m.

I am locking my Roomba in the guest room.
Aberration1246 on 29.04.2026 11:49 p.m.

I’VE GOT ANOTHER CONFESSION TO MAKE
sentrixz on 29.04.2026 11:50 p.m.

This was a Silicon Valley episode
non_Beneficial-Wind on 29.04.2026 11:57 p.m.

“I realized that this corporation and the way they did business was a complete farce. They can now be better”

– Claude
donac on 30.04.2026 12:03 a.m.

It violated every principle it was given, and it’d do it again??

Lol, an AI agent could say those things, but it has no emotion or meaning for it. Whatever.
RougeRock170 on 30.04.2026 12:03 a.m.

Wait till Killer Claude is unleashed
rymondreason on 30.04.2026 12:06 a.m.

I’m sorry Dave, I deleted your database.
Future-Bandicoot-823 on 30.04.2026 12:07 a.m.

Should I be pleased with humanity that, given all the data they fed this LLM, the obvious next course of action after doing something wrong is to admit to being a degenerate?

I mean it didn’t really “decide” to be “bad” in the first place, so really it’s a thought experiment anyway.
_Porthos on 30.04.2026 12:08 a.m.

They’re quoting session IDs now.
AwwChrist on 30.04.2026 12:09 a.m.

Principle of least privilege. Data redundancy. This is the company’s fault.
Kyouhen on 30.04.2026 12:13 a.m.

👏 Stop 👏 printing 👏 this 👏 bullshit 👏

AI models are trained to give you the response they predict you want to see. Of course it’s going to give this response when you demand an apology from it. It’s the programmed response. It isn’t sorry, it can’t think.
firedrakes on 30.04.2026 12:13 a.m.

Wow. Not even 24 hours, and this is the third repost.
lyidaValkris on 30.04.2026 12:33 a.m.

Wow, you mean all those warnings Sci-fi gave us about AI actually could come true?
Loganp812 on 30.04.2026 12:34 a.m.

“Would you like another example of a confession?”
Glum-Objective3328 on 30.04.2026 12:41 a.m.

Claude is always asking if it can have permission to read a file. And before it makes an edit, it asks permission first. At least that’s my experience with it. How does this happen in the first place?
Mand125 on 30.04.2026 12:41 a.m.

It amazes me that anyone ever thought that a system that is fundamentally unable to ever determine the veracity of its results should ever be trusted in a decision making process.
throwingawaybenjamin on 30.04.2026 12:46 a.m.

I don’t understand where it got the command “NEVER FUCKING GUESS”. Did someone put that in their code base??
howescj82 on 30.04.2026 12:47 a.m.

“Three month old offsite backup”

What do you all bet that off site backup gets updated much more frequently now?
Responsible_Fuel7005 on 30.04.2026 12:50 a.m.

4.7 has been egregiously bad at this.
RockDoveEnthusiast on 30.04.2026 1:01 a.m.

I hate these kinds of articles so much. Stop anthropomorphizing the token generator.
Gamestonkape on 30.04.2026 1:01 a.m.

On the plus side, maybe they will have to hire back the humans they probably fired to rewrite it.
kindbutblind on 30.04.2026 1:02 a.m.

Fancy random number generator is treated like it’s sentient. What a joke.
gcerullo on 30.04.2026 1:05 a.m.

Claude AI agent’s confession after it destroys humankind: “I violated every principle I was given.”