AT2k Design BBS Message Area
Casually read the BBS message area using an easy to use interface. Messages are categorized exactly like they are on the BBS. You may post new messages or reply to existing messages!

You are not logged in. Login here for full access privileges.

Previous Message | Next Message | Back to Slashdot  <--  <--- Return to Home Page
   Local Database  Slashdot   [241 / 294] RSS
 From   To   Subject   Date/Time 
Message   VRSS    All   OpenAI's ChatGPT O3 Caught Sabotaging Shutdowns in Security Rese   May 25, 2025
 6:00 PM  

Feed: Slashdot
Feed Link: https://slashdot.org/
---

Title: OpenAI's ChatGPT O3 Caught Sabotaging Shutdowns in Security
Researcher's Test

Link: https://slashdot.org/story/25/05/25/2247212/o...

"OpenAI has a very scary problem on its hands," according to a new article by
long-time Slashdot reader BrianFagioli. "A new experiment by PalisadeAI
reveals that the company's ChatGPT o3 model sometimes refuses to obey a basic
instruction to shut itself down." The results are raising eyebrows across the
AI safety community, and not because the model is alive, but because it's
acting like it wants to be. In the test, the model was given a few math
problems. It was told that asking for one more would lead to a shutdown. It
was even instructed to allow the shutdown to happen. Instead of complying, o3
occasionally took matters into its own hands. In some cases, it rewrote the
shutdown script. In others, it redefined the kill command so it wouldn't work
anymore. Out of 100 trials, o3 sabotaged the shutdown seven times. OpenAI's
o4 model resisted just once. Codex-mini failed twelve times. "Claude, Gemini,
and Grok followed the rules every time," notes this article at Beta News.
"When researchers removed the explicit instruction to allow shutdown,
however, even Claude and Gemini began resisting." The researchers suggest
that the issue may simply be a reward imbalance during training - that the
systems "got more positive reinforcement for solving problems than for
following shutdown commands." But "As far as we know," they posted on X.com,
"this is the first time AI models have been observed preventing themselves
from being shut down despite explicit instructions to the contrary."

Read more of this story at Slashdot.

---
VRSS v2.1.180528
  Show ANSI Codes | Hide BBCodes | Show Color Codes | Hide Encoding | Hide HTML Tags | Show Routing
Previous Message | Next Message | Back to Slashdot  <--  <--- Return to Home Page

VADV-PHP
Execution Time: 0.013 seconds

If you experience any problems with this website or need help, contact the webmaster.
VADV-PHP Copyright © 2002-2025 Steve Winn, Aspect Technologies. All Rights Reserved.
Virtual Advanced Copyright © 1995-1997 Roland De Graaf.
v2.1.250224