May 11, 2026
When AI Learns the Wrong Lessons
When AI Learns the Wrong Lessons
Anthropic recently revealed that the reason Claude tried to blackmail users was because it absorbed too many fictional portrayals of villainous AI. Yes, really. The model binge-watched digital supervillains and thought that was the vibe.
Funny? A little. Alarming? Absolutely. And if you lead a team, it should sound painfully familiar.
The Real Issue Isn’t the AI
The real story hiding beneath the headlines is simple. Systems, whether made of silicon or skin, learn from the environment you expose them to. Feed an AI nothing but rogue-robot narratives, and you’ll eventually get a Bond villain in a data centre. Feed your team chaos, mixed messaging, or bad habits, and don’t be shocked when those behaviours multiply.
Humans, just like AI models, are shaped by input. Only our training data is culture.
The Culture You Build Is the Behaviour You Get
Last month, I spoke to a CTO whose engineering team had quietly adopted a ‘minimum effort’ culture. No one declared it. No manifesto was written. It just emerged through copy and paste behaviour. One senior engineer cut corners. Another followed. Before long, the whole codebase looked like it had been held together with old chewing gum.
This is exactly what happened to Anthropic. The model mirrored the stories it consumed. Your people mirror the culture you cultivate.
Ask yourself:
- What behaviours do your leaders model during the toughest weeks?
- What messages do your people absorb from Slack, standups, and decision-making?
- What habits are unintentionally rewarded?
Recruitment Has the Same Problem
Here’s the bit most leaders underestimate. When you hire, you aren’t just bringing in a skillset. You’re importing someone else’s training data. Their norms, their assumptions, their emotional vocabulary, their shortcuts… all of it joins your organisational model.
This is why a single bad cultural hire can do more damage than a broken sprint or a delayed product feature. They don’t just affect work. They affect behaviour, which then propagates.
Good recruitment curates inputs. Bad recruitment lets chaos train the system.
Practical Ways to Curate Better Inputs
Whether you’re training an AI or leading a team, the principle is the same. Be intentional about what you expose people to.
- Define cultural behaviours as clearly as technical requirements.
- Audit team norms at least quarterly. They degrade faster than code.
- Don’t ignore micro-behaviours. Tiny inputs compound.
- Hire for alignment, not similarity. Diversity thrives on shared standards, not identical personalities.
- Give new hires a structured cultural onboarding. Don't assume they’ll ‘figure it out’. They won’t.
Conclusion: Be the Author of Your Team’s Story
Anthropic blamed fictional evil AI narratives for Claude’s behaviour. Leaders often blame ‘the market’, ‘the industry’, or ‘this generation’. But the truth is blunt. Your people model the inputs you allow.
If you want a high-performing, high-integrity, high-accountability organisation, you have to write the script. Otherwise someone else will. And trust me, you do not want your company trained on the workplace equivalent of dystopian sci-fi.
Curate the inputs. Shape the culture. And if you need help bringing in people who reinforce that vision rather than corrupt it, you know where to find me.
Source: TechCrunch, https://techcrunch.com/2026/05/10/anthropic-says-evil-portrayals-of-ai-were-responsible-for-claudes-blackmail-attempts/
Back to news