New AI model turns photos into explorable 3D worlds, with caveats
TL;DR
Tencent has unveiled HunyuanWorld-Voyager, an AI model that transforms photos into 3D-like video sequences that users can navigate as virtual scenes. The model generates RGB video and depth data together, which allows consistent 3D reconstruction without traditional 3D reconstruction methods, though it is not a substitute for video games. The output is not true 3D: it simulates camera movement through a 3D space, producing 49-frame video clips that can be linked into longer sequences. Users can define camera paths for exploration, and the system uses a "world cache" to blend image and depth data for realistic video output.
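To make the idea of user-defined camera paths and linked 49-frame clips concrete, here is a minimal Python sketch. It is not Voyager's actual interface: `CameraPosition`, `interpolate_clip`, and `chain_clips` are hypothetical names, and straight-line movement between waypoints is an assumption made purely for illustration. Only the 49-frame clip length comes from the article.

```python
from dataclasses import dataclass

FRAMES_PER_CLIP = 49  # clip length reported in the article


@dataclass(frozen=True)
class CameraPosition:
    """Hypothetical camera position in scene coordinates (rotation omitted)."""
    x: float
    y: float
    z: float


def interpolate_clip(start, end, frames=FRAMES_PER_CLIP):
    """Return `frames` evenly spaced positions from start to end, inclusive."""
    positions = []
    for i in range(frames):
        t = i / (frames - 1)  # runs from 0.0 on the first frame to 1.0 on the last
        positions.append(CameraPosition(
            start.x + t * (end.x - start.x),
            start.y + t * (end.y - start.y),
            start.z + t * (end.z - start.z),
        ))
    return positions


def chain_clips(waypoints):
    """Build one clip per consecutive waypoint pair.

    Each clip starts at the position where the previous clip ended,
    which is one simple way to link clips into a longer sequence.
    """
    return [interpolate_clip(a, b) for a, b in zip(waypoints, waypoints[1:])]


# Assumed example path: move forward, then sideways.
waypoints = [
    CameraPosition(0.0, 0.0, 0.0),
    CameraPosition(0.0, 0.0, 2.0),
    CameraPosition(1.0, 0.0, 2.0),
]
clips = chain_clips(waypoints)
total_frames = sum(len(clip) for clip in clips)
```

With three waypoints the sketch yields two clips of 49 camera positions each. A real system would also need camera orientation and would condition each new clip on previously generated image and depth data, the role the article attributes to the "world cache".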