OpenAI on Monday said it plans to halt the use of one of its ChatGPT voices that "Her" actor Scarlett Johansson says sounds "eerily similar" to her own.
In a post on the social media platform X, OpenAI said it is "working to pause" Sky, the name of one of five voices that ChatGPT users can choose to speak with. The company said it had "heard questions" about how it selects the lifelike audio options available for its flagship artificial intelligence chatbot, particularly Sky, and wanted to address them.
Among those raising questions was Johansson, who famously voiced a fictional, and at the time futuristic, AI assistant in the 2013 film "Her."
Johansson issued a statement saying that OpenAI CEO Sam Altman had approached her in September asking her if she would lend her voice to the system, saying he felt it would be "comforting to people" not at ease with the technology. She said she declined the offer.
"When I heard the released demo, I was shocked, angered and in disbelief that Mr. Altman would pursue a voice that sounded so eerily similar to mine that my closest friends and news outlets could not tell the difference," Johansson said.
She said OpenAI "reluctantly" agreed to take down the Sky voice after she hired lawyers who wrote Altman letters asking about the process by which the company came up with the voice.
OpenAI had moved to debunk the internet's theories about Johansson in a blog post accompanying its earlier announcement aimed at detailing how ChatGPT's voices were chosen. The company said that it believed AI voices "should not deliberately mimic a celebrity's distinctive voice" and that the voice of Sky belongs to a "different professional actress." But it added that it could not share the name of that professional for privacy reasons.
In a statement sent to The Associated Press following Johansson's response late Monday, Altman said that OpenAI cast the voice actor behind Sky "before any outreach" to Johansson.
"The voice of Sky is not Scarlett Johansson's, and it was never intended to resemble hers," Altman said. "Out of respect for Ms. Johansson, we have paused using Sky's voice in our products. We are sorry to Ms. Johansson that we didn't communicate better."
San Francisco-based OpenAI first rolled out voice capabilities for ChatGPT, which included the five different voices, in September, allowing users to engage in back-and-forth conversation with the AI assistant. "Voice Mode" was originally just available to paid subscribers, but in November, OpenAI announced that the feature would become free for all users with the mobile app.
And ChatGPT's interactions are becoming more and more sophisticated. Last week, OpenAI said its latest model can mimic human cadences in its verbal responses and can even try to detect people's moods.
OpenAI says the newest model, dubbed GPT-4o, works faster than previous versions and can reason across text, audio and video in real time. In a demonstration during OpenAI's May 13 announcement, the AI bot chatted in real time, adding emotion (specifically "more drama") to its voice as requested. It also took a stab at extrapolating a person's emotional state by looking at a selfie video of their face, aided in language translations, step-by-step math problems and more.
GPT-4o, short for "omni," isn't widely available yet. It will progressively make its way to select users in the coming weeks and months. The model's text and image capabilities have already begun rolling out, and are set to reach even some of those who use ChatGPT's free tier, but the new voice mode will just be available for paid subscribers of ChatGPT Plus.
While most have yet to get their hands on these newly announced features, the capabilities have conjured up even more comparisons to Spike Jonze's dystopian romance "Her," which follows an introverted man (Joaquin Phoenix) who falls in love with an AI operating system (Johansson), leading to many complications.
Altman appeared to tap into this, too, simply posting the word "her" on the social media platform X the day of GPT-4o's unveiling.
Many reacting to the model's demos last week also found some of the interactions struck a strangely flirtatious tone. In one posted by OpenAI, a female-voiced ChatGPT compliments a company employee on "rocking an OpenAI hoodie," for example, and in another the chatbot says "oh stop it, you're making me blush" after being told that it's amazing.
That's sparked some conversation about the gendered ways in which critics say tech companies have long developed and engaged voice assistants, dating back far before the latest wave of generative AI advanced the capabilities of AI chatbots. In 2019, the United Nations' culture and science organization criticized the subservience built into default female-voiced assistants (from Apple's Siri to Amazon's Alexa), even when confronted with sexist insults and harassment.
"This is clearly programmed to feed dudes' egos," The Daily Show senior correspondent Desi Lydic said of GPT-4o in a segment last week. "You can really tell that a man built this tech."
Wyatte Grantham-Philips, The Associated Press