ChatGPT Expands Hands‑Free Interaction with Voice Mode

CNET

Voice Mode Overview

OpenAI's ChatGPT now includes a Voice Mode that lets users converse with the AI through spoken input and audio output. A voice button in the bottom-right corner of any conversation in the app lets users switch between typing and speaking. Two tiers are available: a standard voice option, which transcribes speech to text before processing it with the GPT-4 model, and an advanced voice option, which uses multimodal models to listen and speak in real time. The standard version is free for all users, while the advanced version is part of the paid subscription.

Benefits and Use Cases

The hands-free experience is described as more natural and conversational: users can talk as they normally would, pauses and filler words included. It is particularly useful when multitasking, such as brainstorming ideas while commuting or cooking. The feature also helps language learners, who can practice speaking and receive spoken translations. Accessibility is a major advantage, since voice offers an alternative for people with low vision, dyslexia or motor-skill challenges. In addition, the advanced mode's multimodal capabilities let users point the camera at real-world objects and receive spoken information about them. Overall, Voice Mode expands how users can interact with ChatGPT, making the tool faster, more inclusive and adaptable to everyday scenarios.

Source: CNET
