Voltar

OpenAI Unveils Sora 2, a Video‑Synthesis Model With Synchronized Audio and New iOS Cameo App

OpenAI Unveils Sora 2, a Video‑Synthesis Model With Synchronized Audio and New iOS Cameo App
Ars Technica2

OpenAI Announces Sora 2

OpenAI unveiled Sora 2, a second‑generation video‑synthesis model capable of generating videos that include synchronized dialogue and sound effects. This marks the first time OpenAI’s video models have incorporated realistic audio, joining other major AI labs that have recently added sound capabilities.

New iOS Cameo App

Alongside the model, OpenAI launched a new iOS social app that allows users to place themselves into AI‑generated videos using a feature the company calls “cameos.” The app lets users create personalized videos where they appear alongside AI‑crafted scenes.

Demonstrated Capabilities

OpenAI showcased Sora 2 with a demo video featuring a photorealistic version of its CEO speaking in a slightly unnatural voice while surrounded by fantastical backdrops such as a competitive duck‑race and a glowing mushroom garden. The model can produce “sophisticated background soundscapes, speech, and sound effects with a high degree of realism.”

Technical Improvements

Compared with the original Sora model released earlier, Sora 2 offers notable visual consistency improvements, better handling of complex multi‑shot instructions, and more realistic physics. The model can simulate intricate physical movements like Olympic gymnastics routines and triple axels while maintaining realistic motion. OpenAI notes that prior video models were “overoptimistic” and sometimes produced physically impossible results, such as objects teleporting to achieve a prompt. In Sora 2, a missed basketball shot will rebound off the backboard, reflecting more accurate physics.

Industry Context

OpenAI frames the release as a “GPT‑3.5 moment for video,” likening it to the breakthrough that ChatGPT represented for text generation. The addition of audio aligns OpenAI with recent developments from other AI labs that have introduced synchronized audio in video generation.

Future Outlook

The launch of Sora 2 and the cameo app signals OpenAI’s intent to expand the creative possibilities of AI‑generated media, offering users both higher‑quality video output and new ways to personalize content.

Usado: News Factory APP - descoberta e automação de notícias - ChatGPT para Empresas

Source: Ars Technica2

Também disponível em: