Tencent has released Voyager, an AI model that converts video footage into navigable 3D environments. Built on the Hunyuan ecosystem, Voyager learns camera motion and depth from over 100,000 video clips without manual labeling. The system demands at least 60 GB of GPU memory for 540p output, with 80 GB recommended, and runs best on multi‑GPU setups. Licensing blocks use in the EU, UK and South Korea, and large‑scale commercial deployments need separate agreements. In Stanford’s WorldScore benchmark Voyager posted the highest overall score of 77.62, excelling in object control, style consistency and subjective quality, though it trails in camera control.
Leer más →