What is new on Article Factory and latest in generative AI world

Google Gemini Beats ChatGPT in Audio Transcription with Speaker Labels

Google Gemini Beats ChatGPT in Audio Transcription with Speaker Labels
A user struggled with speaker‑less transcriptions generated by the iPhone Notes app. By exporting the audio file and feeding it to Google Gemini 3 Pro, the AI produced a full transcript that correctly identified each speaker. An attempt to achieve the same result with ChatGPT 5.1, even using a Plus account, failed because the model could not access the audio file. The experience highlights Gemini’s strength in handling raw audio and speaker identification, while exposing limitations in ChatGPT’s current audio‑processing capabilities. Read more →

Google Gemini Adds Audio File Upload Capability

Google Gemini Adds Audio File Upload Capability
Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, described as the most‑requested addition by Gemini’s VP Josh Woodward, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools. Read more →

Google Gemini Adds Audio File Upload Capability

Google Gemini Adds Audio File Upload Capability
Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, described as the most‑requested addition by Gemini’s VP Josh Woodward, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools. Read more →

Google Gemini Adds Audio File Upload Capability

Google Gemini Adds Audio File Upload Capability
Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, described as the most‑requested addition by Gemini’s VP Josh Woodward, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools. Read more →

Google Gemini Adds Audio File Upload Capability

Google Gemini Adds Audio File Upload Capability
Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, described as the most‑requested addition by Gemini’s VP Josh Woodward, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools. Read more →

Google Gemini Adds Audio File Upload Capability

Google Gemini Adds Audio File Upload Capability
Google has expanded its Gemini AI assistant to accept audio file uploads, allowing users to obtain transcriptions, summaries and key information from recordings up to ten minutes long. The feature, described as the most‑requested addition by Gemini’s VP Josh Woodward, works through the web and mobile apps and complements existing Gemini Live voice interactions. While free‑tier users face daily limits and pricing details remain undisclosed, the update positions Gemini alongside competitors like Anthropic’s Claude and Perplexity, which also offer audio processing tools. Read more →