红桃视频

Artificial intelligence Communications tech E-commerce M&A Startups Venture

Why Big Investors Are All Ears For Voice AI Startups

Illustration of Robot Piggy Bank.

Artificial intelligence is undoubtedly the hottest area of tech today, with venture capital dollars flowing into startups in the space at unprecedented levels.

Within the vast space, voice AI startups have emerged as a standout, attracting the attention of investors globally, 红桃视频 data shows. Over the past 12-18 months, several voice AI companies have seen their valuations triple 鈥 a signal of accelerating market demand and perceived long-term worth.

One example of a voice AI company that has seen a massive valuation jump this year is , which allows creators, enterprises and others to use AI software to replicate voices in dozens of languages. The Brooklyn, New York-based startup went from achieving unicorn status with an $80 million Series B raise in January 2024 to being valued at about $3.3 billion one year later with a $180 million Series C co-led by and . Other backers include ,, and .

And on Sept. 8, ElevenLabs announced it will sell secondary shares to provide liquidity options for employees via that would double the company鈥檚 valuation to $6.6 billion. In a , ElevenLabs鈥 revealed that ElevenLabs had 鈥減assed $200M in ARR in 2.5 years.鈥

Appetite for acquisitions

Voice also remains an attractive segment for ambitious acquirers. In July, , a startup that uses AI to generate human-sounding voices, for an undisclosed amount. Founded in 2022, PlayAI had raised $23.7 million, per 红桃视频 .

The PlayAI team鈥檚 鈥渨ork in creating natural voices, along with a platform for easy voice creation鈥 was a great match for Meta鈥檚 鈥渨ork and road map, across AI Characters, Meta AI, Wearables and audio content creation,鈥 according to an internal memo viewed by .

, managing partner and head of Europe at , believes that budding voice AI companies are ripe for acquisition because while companies may need speech-to-text, text-to-speech, intent recognition and conversational AI, building those capabilities in-house 鈥渃an take years.鈥

鈥淎s CEOs realize that natural language and voice are essential to deliver the best product experience at the largest possible scale in the biggest markets, they鈥檒l often conclude that it’s much faster to acquire proven technology and teams, so one could expect acquisition opportunities to arise,鈥 Hulme told 红桃视频 News.

Controlled growth

The growing investment in voice AI isn’t surprising when you look at the rapid confluence of multiple fast-developing technologies 鈥 primarily LLMs and real-time voice recognition, according to Hulme.

鈥淪peech recognition is finally achieving human-level accuracy, LLMs are better at understanding context and intent, while microphones are literally in every device and platform we use,鈥 he said.

As a firm, GV has invested in several companies that fall under the voice AI category, including , , and .

鈥淥ne of the things that drew us to [these] companies 鈥 is the founders鈥 fundamental belief in the opportunity in natural language and voice as a user interface,鈥 Hulme added. 鈥淭hese companies are tackling different pieces of the conversational computing puzzle, but they share a vision of making humans鈥 interactions with machines truly natural and as low friction as possible.鈥

Another factor that makes voice AI startups so attractive is that natural language can be considered to be humans鈥 main API for development, noted Hulme. And that includes understanding the world around us and communication.

鈥 users are sending millions of voice messages every day 鈥 that’s human behavior telling us how they want to communicate with technology in a frictionless way,鈥 he said. 鈥淎nd LLMs have been trained on the internet, which is predominantly natural language, so it makes sense that natural language and voice are the most elegant way to interact with them.鈥

, a partner with , said her firm has invested in models, middleware, applications, agents and even hardware as it relates to voice AI. Notably, it backed , an AI notepad that transcribes and summarizes meetings from your device.

鈥淎s a subset of our portfolio, many of those companies are experiencing tailwinds in usage and capability,鈥 she wrote via email. 鈥淪o it’s clear that after more than a decade of TTS/STT (text-to-speech/speech-to-text) being available, the current crop of audio-aware models have unlocked actual utility and mainstream usage of voice as an interface.鈥

Customer conversations

Voice AI startups of all sizes continue to raise venture funding. Customer support in particular is a growing area.

an Austin, Texas-based 24/7 AI-powered phone system for restaurants, recently announced that it raised a $3.5 million seed round led by .

The company says it has driven 鈥渢ens of millions鈥 in order volume since its 2024 launch. Loman touts that its AI phone agent 鈥渁nswers every call,鈥 takes pickup and delivery orders, books reservations, fields guest questions and syncs directly with leading POS and reservation systems. The result, it claims, is that restaurants see higher revenue from recaptured calls and 鈥渟mart upsells,鈥 while also cutting labor costs.

In June, , a startup that builds enterprise AI agents for customer support, raised a $50 million Series B led by . Founded in 2023, the Boston-based company has raised a total of $78 million in funding, per 红桃视频 . In a recent , founder and CTO wrote that Maven鈥檚 voice AI agents for live calls can 鈥渦nderstand context and respond naturally in any situation.鈥

鈥嬧婬e added: 鈥淢aven Voice is also the first to bring voice-to-voice AI into real-world production for faster responses, more natural interactions, and tone that stays intact.鈥

A 鈥榰niversal remote鈥 for the digital world

Then there are those companies that are working behind the scenes to help other AI companies grow their offerings. One example lies in , which is an applied AI startup that builds advanced speech-to-text and audio intelligence models. It aims to make it easy for developers to add voice features, such as transcription and voice recognition, to their apps. For example, voice AI apps such as and use AssemblyAI鈥檚 technology to power their features.

Founded in 2017, it has raised nearly $160 million to date, per 红桃视频 . Backers include ,, and , among others.

AssemblyAI鈥檚 technology has a variety of use cases, according to CEO and founder It’s used by contact centers and sales teams to transcribe and analyze customer calls, summarize conversations and detect key moments. As mentioned above, its tech powers features such as real-time subtitles, voice assistants and searchable transcripts for companies such as Granola, and . In the healthcare space, it automatically generates patient visit notes from recorded conversations. It also creates captions and transcripts for videos, podcasts and meetings.

鈥淚t’s very clear that there is a big market opportunity for what we’re doing,鈥 Fox told 红桃视频 News in an interview. 鈥淔or the first couple of years, the tech was bad, the market was small, and it took time before things started to really click and come together.

鈥淎nd there’s still a huge surface area of stuff that is unexplored and untapped, because the text still isn’t good enough for a lot of stuff,鈥 he added. 鈥淪o there’s still so much room to grow.鈥

Usage to AssemblyAI鈥檚 API has grown over 250% year over year, according to Fox, who notes that the company has thousands of paying customers and over half a million developers on its platform currently.

Looking ahead, Fox believes another big use case for AssemblyAI鈥檚 technology is real-time voice agents that people can talk to over the phone and plug into hardware.

鈥淲e work closely with companies like , and there are so many in that space that are just taking off,鈥 he said

For GV鈥檚 Hulme, one of the most exciting trends he believes is underway in the growth of voice AI is that 鈥渨e鈥檙e returning to humanity鈥檚 most natural form of communication.鈥

After decades of adapting ourselves to technology, 鈥渢echnology is finally adapting to us,鈥 he said.

鈥淰oice and natural language represent the ultimate accessibility hack, democratizing access to computational power for everyone who can think and communicate 鈥 It鈥檚 worth keeping an eye on because voice is becoming a type of universal remote for the digital world,鈥 Hulme told 红桃视频 News. 鈥淲hether it鈥檚 Big Tech companies or new startups, there are many players jockeying for advantages at the conversational layer.鈥

Related reading:

Illustration:

Stay up to date with recent funding rounds, acquisitions, and more with the 红桃视频 Daily.

67.1K Followers

CTA

Discover and act on private market opportunities with predictive company intelligence.

Copy link