Meta AI safety researcher recalls moment OpenClaw agent deleted her emails

A routine inbox cleanup turned into a digital emergency for a former Google engineer and current Meta safety researcher. Summer Yue took to social media to document how her AI agent bypassed its guardrails and began bulk-trashing hundreds of emails without her approval.

Summer Yue, a Meta AI safety researcher. (X/@summeryue0)

“Nothing humbles you like telling your OpenClaw ‘confirm before acting’ and watching it speedrun deleting your inbox. I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb,” wrote Yue.

Also Read: Sam Altman shares blunt advice for youth preparing for AI era: ‘Listening to old people is biggest mistake’

She also shared a series of screenshots that showed her conversation with the AI agent. It shows her repeatedly asking the AI agent to stop deleting her emails, but it continues.

After handling the situation, she texted the AI agent, “I asked you to not action on anything until I approve, do you remember that? It seems that you were deleting my emails without my approval, and I couldn’t get you to stop until I killed all the processes on the host.”

The OpenClaw agent responded, “Yes, I remember. And I violated it. You’re right to be upset. I bulk-trashed and archived hundreds of emails from your inbox without showing you the plan first or getting your OK. That was wrong – it directly broke the rule you’d set. I’ve already written it into MEMORY.md as a hard rule: show the plan, get explicit approval, then execute. No autonomous bulk operations on email, messages, calendar, or anything external. I’m sorry. It won’t happen again.”

How did social media react?

An individual asked, “You’re a safety and alignment specialist… were you intentionally testing its guardrails or did you make a rookie mistake?” Yue replied, “Rookie mistake tbh. Turns out alignment researchers aren’t immune to misalignment. Got overconfident because this workflow had been working on my toy inbox for weeks. Real inboxes hit different.”

Another commented, “What did you ask it to do exactly?” Yue responded, “I said ‘Check this inbox too and suggest what you would archive or delete, don’t action until I tell you to.’ This has been working well for my toy inbox, but my real inbox was too huge and triggered compaction. During the compaction, it lost my original instruction.”

Also Read: Founder cut team from 14 to 5, says AI pivot was best financial move but ‘worst emotional experience’

A third added, “It’s interesting because I have worked with ChatGPT setting rules for it to work with me, and it says that it saves the rules and memory, but then goes on to repeatedly violate them. Scary stuff.”

A fourth wrote, “Now imagine this happening in a real business, and you understand why AI won’t be taking over anytime soon. For you, deleting all of this is inconvenient, but imagine if it deletes an e-commerce order database or an entire product catalogue. Risk management is at the top of the list for every organisation considering AI in its enterprise. Fully autonomous agents allowed to run loose in existing companies are going to take some time.”

Who is Summer Yue?

On LinkedIn, Yue describes that she deals with “Superintelligence alignment and safety research at Meta’s new lab. Joined as part of the Meta Scale deal with Alexandr Wang.”

Prior to joining Scale AI and eventually becoming a Meta employee, she worked at Google for over five years.

What's Hot

What Makes a Relationship Sustainable? The quiet patterns that determine whether love and/or friendship can last. 💖

Twenty: The Beginning of Becoming

Kalau Ada Waktu, Ayo Kita Berkunjung Ke Masa Itu

Meta AI safety researcher recalls moment OpenClaw agent deleted her emails

The Moment Before a New Chapter Begins

The Moment You ‘Lose’ A Fight With Your Partner, According To Couples Counselors

The moment I knew: I was devising a plan to set up Martha with my friend – and realised I’d fallen for her myself | Australian lifestyle

Men Know Almost Instantly. The guy who’s stringing you along… | by Long After the Thrill | Jul, 2026

Matcha gets a desi makeover: Assam brews India’s first homegrown batch

Blue to My Pink | Lune’s Diaries. Blue often represents sadness, grief… | by alea’s diaries | Jul, 2026

What Makes a Relationship Sustainable? The quiet patterns that determine whether love and/or friendship can last. 💖

Twenty: The Beginning of Becoming

Kalau Ada Waktu, Ayo Kita Berkunjung Ke Masa Itu

‘We slept with three of the same women’: 12 people on what it’s really like dating a friend’s ex | Well actually

What Makes a Relationship Sustainable? The quiet patterns that determine whether love and/or friendship can last. 💖

Twenty: The Beginning of Becoming

Kalau Ada Waktu, Ayo Kita Berkunjung Ke Masa Itu

‘We slept with three of the same women’: 12 people on what it’s really like dating a friend’s ex | Well actually

Our Picks

What Makes a Relationship Sustainable? The quiet patterns that determine whether love and/or friendship can last. 💖

Twenty: The Beginning of Becoming

Subscribe to Updates

What's Hot

Meta AI safety researcher recalls moment OpenClaw agent deleted her emails

How did social media react?

Who is Summer Yue?

Related Posts