AT2k Design BBS Message Area
From: VRSS
To: All
Subject: New AI Model Turns Photos Into Explorable 3D Worlds, With Caveats
Date/Time: September 4, 2025, 8:20 AM

Feed: Slashdot
Feed Link: https://slashdot.org/
---

Title: New AI Model Turns Photos Into Explorable 3D Worlds, With Caveats

Link: https://news.slashdot.org/story/25/09/03/2312...

An anonymous reader quotes a report from Ars Technica: On Tuesday, Tencent
released HunyuanWorld-Voyager, a new open-weights AI model that generates 3D-
consistent video sequences from a single image, allowing users to pilot a
camera path to "explore" virtual scenes. The model simultaneously generates
RGB video and depth information to enable direct 3D reconstruction without
the need for traditional modeling techniques. However, it won't be replacing
video games anytime soon. The results aren't true 3D models, but they achieve
a similar effect: The AI tool generates 2D video frames that maintain spatial
consistency as if a camera were moving through a real 3D space. Each
generation produces just 49 frames -- roughly two seconds of video -- though
multiple clips can be chained together for sequences lasting "several
minutes," according to Tencent. Objects stay in the same relative positions
when the camera moves around them, and the perspective changes correctly as
you would expect in a real 3D environment. While the output is video with
depth maps rather than true 3D models, this information can be converted into
3D point clouds for reconstruction purposes. There are some caveats with the
tool. It doesn't generate true 3D models (only 2D frames with depth maps) and
each run produces just two seconds of footage, with errors compounding during
longer or complex camera motions like full 360-degree rotations. Furthermore,
because it relies heavily on training data patterns, its ability to
generalize is limited and it demands enormous GPU power (60-80GB of memory)
to run effectively. On top of that, licensing restricts use in the EU, UK,
and South Korea, with large-scale deployments requiring special agreements.
Tencent published the model weights on Hugging Face.
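
The summary above notes that per-frame depth maps can be converted into 3D point clouds. As a rough illustration of that general idea (not Voyager's actual pipeline, which the summary does not describe), here is a minimal sketch that back-projects a depth map through a standard pinhole camera model. The function name depth_to_points and the intrinsics fx, fy, cx, cy are hypothetical; a real reconstruction would also need the camera pose for each frame.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def depth_to_points(
    depth: List[List[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point]:
    """Back-project a depth map into camera-space 3D points.

    Assumes a pinhole camera: fx, fy are focal lengths in pixels and
    (cx, cy) is the principal point. These values are hypothetical
    inputs, not parameters published for any specific model.
    """
    points: List[Point] = []
    for v, row in enumerate(depth):        # v: pixel row
        for u, z in enumerate(row):        # u: pixel column, z: depth
            if z <= 0:
                continue                   # skip pixels with no valid depth
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


if __name__ == "__main__":
    # Tiny 2x2 depth map; the bottom-right pixel has no valid depth.
    demo = [[2.0, 2.0], [2.0, 0.0]]
    for p in depth_to_points(demo, fx=1.0, fy=1.0, cx=0.5, cy=0.5):
        print(p)
```

Each valid pixel becomes one point in the camera's coordinate frame. Merging points from several frames into one scene would additionally require transforming them by each frame's camera pose, which this sketch leaves out.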


---
VRSS v2.1.180528