Cash Flows in to Speech Company Coffers
Mordor Intelligence projects the global market for voice recognition technology to be worth $22 billion in 2026 and to nearly triple in size over the next five years. It valued the market at $18.4 billion last year and expects it to hit $61.8 billion by 2031, growing at a compound annual rate of 22.4 percent.
Venture investors have seen the market's potential and poured more than $7 billion into voice AI startups in the first quarter of this year, far more than in any previous period.
ElevenLabs, Synthesia, and Runway have all raised sizable new rounds since the start of the year.
ElevenLabs in February closed a $500 million Series D funding round led by Sequoia Capital, Andreessen Horowitz, ICONIQ, Lightspeed Venture Partners, Evantic Capital, BOND, BroadLight, NFDG, Valor Capital, AMP Coalition, and Smash Capital. This latest funding values ElevenLabs at $11 billion, more than tripling its valuation from one year prior, and brings total funding to $781 million across five rounds since its founding in 2022.
Synthesia in January raised $200 million in Series E from Google Ventures, Evantic, Hedosophia. NVentures (NVIDIA's venture capital arm), Accel, Kleiner Perkins, New Enterprise Associates (NEA), PSP Growth, Air Street Capital, FirstMark, and MMC Ventures.
Runway, a AI video and voice generation company, in February raised $315 million in a Series E round of funding from General Atlantic, Nvidia, Adobe Ventures, AMD Ventures, Fidelity Management & Research Co., and Felicis Ventures. In total, the New York-based startup has raised $860 million since its founding in 2018.
Other speech technology companies that have benefited from the influx of capital included Bland, which recently raked in $50 million in series C funding from Dell Technologies Capital, HubSpot Ventures, Archerman Capital, Tribeca Venture Partners, Emergence Capital, Upfront Ventures, Scale Venture Partners, Y Combinator, Max Levchin, Piotr Dabkowski, and Jeff Lawson. With this round, Bland's total capital raise totaled more than $100 million.
Kotoba Technologies, a San Francisco- and Tokyo-based startup developing real-time speech models for East Asian languages, scored an additional $10 million in funding from Kindred Ventures, Salesforce Ventures, and Sony Innovation Fund, bringing its total funding to $23 million.
And earlier this year Sandbar, a voice ring and conversational interface creator, raised $23 million in a Series A funding round from Adjacent and Kindred Ventures, following a $10 million seed round last year and a $3 million pre-seed round in early 2024, bringing total funding to $36 million.
The latest company to collect venture capitalist cash is Coval, which produces an evaluation platform for voice AI, having just closed a $28 million Series A round of financing by Norwest, Base10 Partners, Twilio Ventures, and Y Combinator. This brings the total capital raised to $31 million since its launch in 2024.
The new funding will enable Coval to address the growing reliability and compliance challenges companies face as they deploy autonomous voice agents. Coval runs tens of millions of evaluations, and with this new capital, the company will expand its sales and solutions engineering teams and advance product capabilities with deeper simulation, new integrations, and enhanced human review and monitoring features.
"Every company is going to have a voice agent just like they have a mobile app or a web app. But today, most enterprises don't have the infrastructure to deploy these systems with confidence," said Brooke Hopkins, founder and CEO of Coval, in a statement. "Coval gives teams the ability to simulate, monitor, and continuously improve voice agents, so they can move from experimentation to reliable production at scale."
"Voice is going to be the number one interface for how humans interact with AI, and that shift creates an entirely new infrastructure layer for enterprises," said Scott Beechuk, a partner at Norwest.
"Twilio's open, flexible, and model-agnostic infrastructure for the agentic era is driving the shift toward human-like voice AI agent experiences becoming the norm in customer engagement," said Andy O'Dower, vice president and field chief technology officer at Twilio. "Trust is critical to scaling these experiences, and our investment in Coval reflects our conviction that comprehensive evaluation and testing tools, combined with a strong observability and reliability layer, are foundational to maintaining momentum in today's voice AI renaissance."