What is new on Article Factory and latest in generative AI world

OpenAI Unveils Three Real‑Time Voice Models, Expanding AI to Live Conversation, Translation and Streaming Transcription

OpenAI Unveils Three Real‑Time Voice Models, Expanding AI to Live Conversation, Translation and Streaming Transcription Digital Trends
OpenAI announced three new audio models for its Realtime API—GPT‑Realtime‑2, GPT‑Realtime‑Translate and GPT‑Realtime‑Whisper. The suite pushes voice AI beyond simple back‑and‑forth exchanges, offering live reasoning, on‑the‑fly translation across 70+ languages and streaming transcription. Developers can now build assistants that schedule home tours, manage travel bookings or provide real‑time captions, while pricing starts at $0.017 per minute for Whisper and $0.034 per minute for Translate, with GPT‑Realtime‑2 billed at $32 per million audio tokens. Read more →

OpenAI adds real‑time voice, translation and transcription to its API

OpenAI adds real‑time voice, translation and transcription to its API TechCrunch
OpenAI announced Thursday that its API now supports three new voice‑focused models—GPT‑Realtime‑2, GPT‑Realtime‑Translate and GPT‑Realtime‑Whisper. The suite lets developers build applications that can converse, translate and transcribe speech on the fly, with support for more than 70 input languages and 13 output languages. Billing is split between per‑minute rates for translation and transcription and token‑based pricing for the conversational model. OpenAI says the tools target customer‑service, education, media and creator platforms, and includes guardrails to curb misuse. Read more →