OpenAI is enhancing ChatGPT by rolling out Advanced Voice Mode with Vision for paid subscribers. This feature integrates real-time video, allowing the AI to process visuals from a smartphone camera.
The feature, built on OpenAI’s GPT-4o model, was first previewed at the company’s Spring Update event in May 2024. It lets ChatGPT interact visually with its surroundings, creating a more dynamic and engaging experience. Access is limited to ChatGPT Plus, Team, and Pro subscribers.
ChatGPT Voice Functionality and Applications:
Users can engage the camera by tapping the Advanced Voice icon in the ChatGPT mobile app and selecting the video option, letting the AI see the user’s surroundings and respond to visual cues. Practical uses include getting recipe ideas from the contents of a fridge, choosing an outfit from a wardrobe, or learning about a visible landmark.
Just in time for the holidays, video and screensharing are now starting to roll out in Advanced Voice in the ChatGPT mobile app.
— OpenAI (@OpenAI) December 12, 2024
The feature also pairs vision with emotive, low-latency voice responses, making conversations feel more natural. Because the AI retains a temporary memory of visual details within a session, it can refer back to what it has already seen, which supports continuous learning and engagement.
An integrated Screenshare option, accessible via the app’s three-dot menu, lets the AI interact with other apps on the user’s device. This aids with smartphone-related queries and tasks.
The rollout is underway, with Team subscribers due to gain access shortly. Regulatory issues, however, have kept the feature unavailable in certain regions, including the EU. Enterprise and Edu users can expect access in early 2025.
OpenAI’s Advanced Voice Mode with Vision significantly advances AI interaction by merging visual and auditory data. This expands ChatGPT’s capabilities and sets a new standard for AI communication.