What is new on Article Factory and latest in generative AI world

ByteDance Unveils Seedance 2.0, Multimodal AI Video Generator

ByteDance Unveils Seedance 2.0, Multimodal AI Video Generator
ByteDance announced Seedance 2.0, a next‑generation AI model that can create short video clips from combined text, image, audio, and video prompts. The system supports up to nine images, three video clips, and three audio clips per request and can produce 15‑second videos that respect camera movement, visual effects, and physical laws. Demonstrations include synchronized figure‑skating routines, anime‑style scenes, and celebrity‑lookalike cinematic fights. Seedance 2.0 is currently available through ByteDance’s Dreamina AI platform and the Doubao assistant, with no clear plan for TikTok integration. Read more →

Google Enhances Veo AI Video Model with Better Image Reference, Vertical Output, and 4K Upscaling

Google Enhances Veo AI Video Model with Better Image Reference, Vertical Output, and 4K Upscaling
Google has upgraded its Veo 3.1 AI video model to improve how it uses reference images, allowing users to generate more consistent and expressive clips. The update adds native vertical video support for a 9:16 aspect ratio, making content ready for platforms like TikTok and YouTube Shorts without extra editing. Users can now upscale videos to 4K resolution, while 1080p generation receives sharper quality. These features are being rolled out through the Gemini app and integrated into YouTube Shorts and the YouTube Create app, expanding creative options for developers and creators. Read more →

Google Gemini’s New Ad Shows AI Crafting Adventures for a Lost Stuffed Toy

Google Gemini’s New Ad Shows AI Crafting Adventures for a Lost Stuffed Toy
Google’s latest advertisement for its Gemini AI model imagines parents using the technology to locate a missing child’s favorite stuffed animal and to create whimsical images and videos of the toy traveling the world. A hands‑on test of Gemini’s image‑search and generation features shows the system can produce plausible results, though it requires careful prompting and has built‑in safeguards that prevent certain uses. The piece also explores the ethical questions around using AI to fabricate comforting narratives for children. Read more →

Runway Unveils GWM-1 World Model Claiming Minute-Long Coherence

Runway Unveils GWM-1 World Model Claiming Minute-Long Coherence
Runway announced its new GWM-1 "world model" technology, asserting it can maintain coherent output for minutes at a time. The company framed the model as a step toward unifying diverse domains and action spaces under a single base system. While highlighting potential uses in film, television, advertising, robotics, physics and life‑science research, Runway also disclosed a partnership with CoreWeave that will employ Nvidia GB300 NVL72 racks for training and inference. The move positions Runway in a competitive AI arena where larger tech firms also pursue similar capabilities. Read more →

AI Image and Video Models Develop Distinct “Personalities,” Shaping Creator Workflows

AI Image and Video Models Develop Distinct “Personalities,” Shaping Creator Workflows
Creators are increasingly describing generative AI image and video models as having distinct “personalities,” referring to each model’s unique style, strengths, and preferred tasks. This emerging view helps artists select the right tool for specific projects, much like choosing a camera lens. By combining multiple models—such as Google’s Veo 3, Adobe’s Firefly, and Runway—creators achieve greater creative range and precision. The concept of model personalities is fluid, evolving with updates that improve performance and reduce errors. Overall, the shift underscores a growing reliance on AI tools while emphasizing the human creative vision that guides them. Read more →

OpenAI Introduces Paid Packs for Sora After Free Video Limit Reached

OpenAI Introduces Paid Packs for Sora After Free Video Limit Reached
OpenAI has begun monetizing its AI video app Sora by offering paid "video generation packs" once users hit the daily free limit. Previously, users could create up to 30 videos per day for free, or up to 100 with a Pro subscription. When the cap is reached, the app prompts users to buy additional generations through the App Store, with a bundle of ten extra videos costing roughly $4. Bill Peebles explained on X that the change reflects growing demand and the need to manage GPU resources, noting that free quotas are likely temporary. Read more →

Google Introduces Nano Banana AI to Upgrade Search, Photos, and NotebookLM

Google Introduces Nano Banana AI to Upgrade Search, Photos, and NotebookLM
Google is rolling out its new AI model, Nano Banana, across several of its products. The model powers a major upgrade to image editing in Google Photos and adds a richer set of video‑generation styles to NotebookLM, including whiteboard, anime, retro print, and the original Classic mode. Users can now choose between Brief and Explainer video formats and steer the output with prompts. While a firm timeline isn’t set, Google says Nano Banana will appear in the Photos app within weeks, promising smoother conversational edits and a more consistent generative experience. Read more →

AI Leaders Accelerate Development of World Models Amid Slower LLM Progress

AI Leaders Accelerate Development of World Models Amid Slower LLM Progress
Major AI companies are channeling resources into world models as large language model advances plateau. Runway introduced a video‑generation product that uses world models for real‑time gaming scenes. Niantic leverages data from its long‑running games, including Pokémon Go, to map millions of locations for its spatial AI platform. Nvidia’s Omniverse platform underpins physical AI efforts, aiming to boost robotics and simulation capabilities. Executives from Runway, Niantic, and Nvidia emphasize the strategic importance of these models for diverse industries, despite predictions that fully human‑level AI may still be years away. Read more →

Google Clarifies Gemini Prompt and Media Limits Across Tiers

Google Clarifies Gemini Prompt and Media Limits Across Tiers
Google has updated its Gemini support page to specify how many prompts, tokens, and media outputs users can access on each tier. Free users receive a limited number of daily prompts and media generations, while AI Pro and AI Ultra subscribers enjoy substantially higher caps. The clarification also details context‑window sizes and the availability of Deep Think reports, image creation, video generation, and audio overviews for each plan. Read more →

Google Clarifies Gemini Prompt and Media Limits Across Tiers

Google Clarifies Gemini Prompt and Media Limits Across Tiers
Google has updated its Gemini support page to specify how many prompts, tokens, and media outputs users can access on each tier. Free users receive a limited number of daily prompts and media generations, while AI Pro and AI Ultra subscribers enjoy substantially higher caps. The clarification also details context‑window sizes and the availability of Deep Think reports, image creation, video generation, and audio overviews for each plan. Read more →

Google Clarifies Gemini Prompt and Media Limits Across Tiers

Google Clarifies Gemini Prompt and Media Limits Across Tiers
Google has updated its Gemini support page to specify how many prompts, tokens, and media outputs users can access on each tier. Free users receive a limited number of daily prompts and media generations, while AI Pro and AI Ultra subscribers enjoy substantially higher caps. The clarification also details context‑window sizes and the availability of Deep Think reports, image creation, video generation, and audio overviews for each plan. Read more →

Google Clarifies Gemini Prompt and Media Limits Across Tiers

Google Clarifies Gemini Prompt and Media Limits Across Tiers
Google has updated its Gemini support page to specify how many prompts, tokens, and media outputs users can access on each tier. Free users receive a limited number of daily prompts and media generations, while AI Pro and AI Ultra subscribers enjoy substantially higher caps. The clarification also details context‑window sizes and the availability of Deep Think reports, image creation, video generation, and audio overviews for each plan. Read more →

Google Clarifies Gemini Prompt and Media Limits Across Tiers

Google Clarifies Gemini Prompt and Media Limits Across Tiers
Google has updated its Gemini support page to specify how many prompts, tokens, and media outputs users can access on each tier. Free users receive a limited number of daily prompts and media generations, while AI Pro and AI Ultra subscribers enjoy substantially higher caps. The clarification also details context‑window sizes and the availability of Deep Think reports, image creation, video generation, and audio overviews for each plan. Read more →