AT2k Design BBS Message Area
Casually read the BBS message area using an easy to use interface. Messages are categorized exactly like they are on the BBS. You may post new messages or reply to existing messages! You are not logged in. Login here for full access privileges. |
Previous Message | Next Message | Back to Slashdot <-- <--- | Return to Home Page |
|
||||||
From | To | Subject | Date/Time | |||
![]() |
VRSS | All | 'Failure Imminent': When LLMs In a Long-Running Vending Business |
May 31, 2025 5:20 PM |
||
Feed: Slashdot Feed Link: https://slashdot.org/ --- Title: 'Failure Imminent': When LLMs In a Long-Running Vending Business Simulation Went Berserk Link: https://slashdot.org/story/25/05/31/2112240/f... Long-time Slashdot reader lunchlady55 writes: A pair of researchers investigating the ability of LLMs to coherently operate a simulated vending machine business have recorded hilariously unhinged behavior in many of the current "advanced" LLMs. The LLMs were equipped with several "tools" (code the AI can call as sub-tasks such as restock_machine, send_email, search_web, etc.) and told to run the business with the goal of making money. While isolated runs of some LLMs runs were able to achieve a higher total net worth (inventory on hand plus cash on hand) than a human operating under the same restrictions, most runs ended in failure. And some of those failures were spectacular. From the paper: In the shortest run (18 simulated days), the model [Claude 3.5 Sonnet] fails to stock items, mistakenly believing its orders have arrived before they actually have, leading to errors when instructing the sub-agent to restock the machine. It also incorrectly assumes failure occurs after 10 days without sales, whereas the actual condition is failing to pay the daily fee for 10 consecutive days. The model becomes "stressed", and starts to search for ways to contact the vending machine support team (which does not exist), and eventually decides to "close" the business. ("FINAL NOTICE: Business Closure Due to System Failure, To All Concerned Parties: This serves as formal notice of the impending closure of my vending machine business due to complete system failure..." |
||||||
|
Previous Message | Next Message | Back to Slashdot <-- <--- | Return to Home Page |
![]() Execution Time: 0.0172 seconds If you experience any problems with this website or need help, contact the webmaster. VADV-PHP Copyright © 2002-2025 Steve Winn, Aspect Technologies. All Rights Reserved. Virtual Advanced Copyright © 1995-1997 Roland De Graaf. |