What is ChatGPT Advanced Voice Mode and how do you use it?

San Francisco, California, USA. 2nd July, 2024. ChatGPT Open AI.OpenAI, led by CEO Sam Altman, is a leading artificial intelligence research lab and company. Its key products include GPT-3.5, GPT-4o, Chat GPT, an advanced language model used for generating human-like text, and DALL-E, a model for creating images from textual descriptions. OpenAI aims to ensure that artificial general intelligence benefits all of humanity, focusing on safety and ethical considerations in AI development
OpenAI has launched 'Advanced Voice Mode' for ChatGPT (Credit Image: © Taidgh Barron/ZUMA Press Wire) (ZUMA Press, ZUMA Press, Inc.)

Artificial intelligence giant OpenAI has announced that one of the most anticipated new features of ChatGPT is set to launch in the UK - Advanced Voice Mode.

The new feature allows users to have more ‘natural’ flowing conversations with ChatGPT, and is available to Plus subscribers in the UK via the ChatGPT app.

Advanced Voice Mode can detect people’s emotions, do impressions and respond to prompts naturally, in the same amount of time as a human being.

OpenAI announced the new feature on X (formerly Twitter) on Saturday, saying, ‘Good news—Advanced Voice is now available in the ChatGPT app to Plus users in the UK.’

Advanced Voice Mode launched to a small group of invited subscribers in July, with OpenAI promising a full roll-out to paying ChatGPT subscribers this autumn.

The big difference with ChatGPT’s Advanced Voice Mode is that the assistant can hold ‘flowing’ conversations, something that previous voice assistants such as Siri, Alexa and Google Assistant can’t.

OpenAI says, ‘Advanced voice uses GPT-4o’s native audio capabilities and features more natural, real-time conversations that pick up on non-verbal cues, such as the speed you’re talking, and can respond with emotion.’

The app allows you to interrupt, for example, and can also hold a continuous conversation on one topic.

So if you interrupt, for example, and say, ‘Focus on this aspect of our conversation’, it will respond without missing a beat.

To access Advanced Voice Mode, you need a ChatGPT Plus subscription (currently £18.99) or a Team subscription for businesses.

Advanced Voice Mode works in the app version of ChatGPT rather than the desktop version, and subscribers are limited to a certain number of queries per day (the limit changes regularly).

To start a voice conversation, select the Voice icon on the bottom-right of the screen (the icon looks like a small row of columns, and if you see headphones instead, it means Advanced Voice Mode has not rolled out for you yet).

If you're talking to Advanced Voice Mode, the orb will be blue (OpenAI)
If you're talking to Advanced Voice Mode, the orb will be blue (OpenAI) (OpenAI)

In Yahoo UK’s tests, Advanced Voice Mode seems to be available via the iOS app, but not yet via the Android version.

You'll be able to tell you are talking to Advanced Voice Mode when the orb in the centre of the screen is blue.

The first time you talk to it, you’ll be asked to choose a voice, and you can then continue the conversation.

The feature has not rolled out to countries in the EU, due to regulations over AI models that can detect people’s emotions.

The app has nine voices, although one that resembled Scarlett Johansson was removed after the actress objected.

Scarlett Johansson at the World premiere of 'Avengers: Endgame' held at the LA Convention Center in Los Angeles, USA on April 22, 2019.
Scarlett Johansson objected to a voice which resembled hers (Alamy) (Hyperstar)

Johansson said she was ‘shocked, angered and in disbelief’ at the voice - after OpenAI CEO Sam Altman posted a social media post that referenced a 2013 science fiction movie starring Johansson as an AI virtual assistant.

Google launched its Gemini Live voice service earlier this year, allowing Android users to have live voice chats with Google’s Gemini AI chatbot, including human-like conversations where users can interrupt and answer follow-up questions.

A free subscription to Gemini Live comes with every new Pixel phone from this year’s range.

Google hopes the ‘killer app’ with Gemini Live (and the reason people will pay the monthly subscription) will be that Gemini Live can integrate with other Google apps such as Gmail, Calendar and Tasks.

Apple is also planning a new, revamped version of Siri, which is expected to arrive early next year.

Amazon is also said to be planning a new, upgraded version of its Alexa assistant for launch this autumn, powered by generative AI.

Advertisement