Back to home
Technology

New AI model turns photos into explorable 3D worlds, with caveats

Source

Ars Technica

Published

TL;DR

AI Generated

Tencent has unveiled HunyuanWorld-Voyager, an AI model that transforms photos into 3D-like video sequences, allowing users to navigate virtual scenes. The model generates RGB video and depth data to create consistent 3D reconstructions without traditional methods, though it's not a substitute for video games. While the output isn't true 3D, it simulates camera movement through a 3D space, producing 49-frame video clips that can be linked for longer sequences. Users can define camera paths for exploration, and the system uses a "world cache" to blend image and depth data for realistic video output.

Read Full Article

Similar Articles

U.S. Commerce Sec. Lutnick says American AI dominates DeepSeek, thanks Trump for AI Action Plan — OpenAI and Anthropic beat Chinese models across 19 different benchmarks

U.S. Commerce Sec. Lutnick says American AI dominates DeepSeek, thanks Trump for AI Action Plan — OpenAI and Anthropic beat Chinese models across 19 different benchmarks

U.S. Commerce Secretary Howard Lutnick praises American AI models from OpenAI and Anthropic for outperforming Chinese DeepSeek models across 19 benchmarks in a recent NIST study. Lutnick credits President Trump's AI Action Plan for boosting American AI innovation and infrastructure. The study highlights American models' superiority in software engineering and cyber tasks, with cost efficiency and improved security. Despite Chinese AI company DeepSeek releasing new models, concerns persist over potential risks to national security posed by their adoption.

Tom's Hardware
OpenAI’s Sora 2 lets users insert themselves into AI videos with sound

OpenAI’s Sora 2 lets users insert themselves into AI videos with sound

OpenAI has introduced Sora 2, its latest video-synthesis AI model that can produce videos in various styles with synchronized dialogue and sound effects, marking a first for the company. Users can now insert themselves into AI-generated videos using OpenAI's new iOS social app through "cameos." The new model was demonstrated in a video featuring a lifelike version of OpenAI CEO Sam Altman speaking in a slightly artificial voice against imaginative backgrounds. Sora 2 can generate realistic background soundscapes, speech, and sound effects. This release follows Google's Veo 3 and Alibaba's Wan 2.5 in incorporating synchronized audio into video-synthesis models.

Ars Technica
DeepSeek tests “sparse attention” to slash AI processing costs

DeepSeek tests “sparse attention” to slash AI processing costs

DeepSeek, a Chinese AI company facing export restrictions on advanced AI chips, has developed "DeepSeek Sparse Attention" (DSA) to enhance processing efficiency in its latest language model, DeepSeek-V3.2-Exp. This technique, similar to sparse transformers used by OpenAI and Google Research, aims to reduce computational costs. DeepSeek claims its implementation achieves "fine-grained sparse attention" and has cut API prices by 50%. The company's focus on optimizing performance with limited resources highlights the ongoing efforts to enhance AI models while managing processing costs.

Ars Technica
DeepSeek’s new AI model debuts with support for China-native chips and CANN, a replacement for Nvidia's CUDA — Chinese chipmakers Huawei, Cambricon, and Hygon get first-class support

DeepSeek’s new AI model debuts with support for China-native chips and CANN, a replacement for Nvidia's CUDA — Chinese chipmakers Huawei, Cambricon, and Hygon get first-class support

DeepSeek has unveiled its latest AI model, DeepSeek-V3.2-Exp, optimized for Chinese chips and CANN, a CUDA replacement. The model aims to reduce costs for long-context inference with a sparse attention mechanism. Chinese chipmakers like Huawei, Cambricon, and Hygon are actively supporting the model for immediate deployment on their hardware. This move signals China's commitment to AI sovereignty by prioritizing domestic platforms over Nvidia's CUDA ecosystem. The model's compatibility with both Chinese and Nvidia accelerators highlights the country's readiness for a future less reliant on Nvidia hardware.

Tom's Hardware

We use cookies

We use cookies to ensure you get the best experience on our website. For more information on how we use cookies, please see our cookie policy.