Embracing the Voice Revolution: AI Startup ElevenLabs Raises $19 Million to Advance Text-to-Speech and Voice Cloning Technology

ElevenLabs, a one-year-old AI startup focused on creating new text-to-speech and voice cloning tools, has raised $19 million in a Series A round co-led by Andreessen Horowitz (a16z), marking another milestone in the increasingly competitive field of generative AI voice synthesis. The company aims to revolutionize audio content creation through ultra-realistic text-to-speech models for education, audiobooks, gaming, movies, business, and other industries.

The Founding Story behind ElevenLabs

The story of ElevenLabs began with the two founders, who grew up in Poland, watching poorly dubbed films from the US. They were inspired to create a company that could revolutionize audio content creation, using cutting-edge AI to improve the quality and realism of synthesized speech. They founded ElevenLabs with the goal of developing tools that could bridge the gap between human speech and computer-generated speech, making it possible to create more immersive and engaging audio content in various contexts.

ElevenLabs’ objective

The startup aims to develop a range of ultra-realistic text-to-speech models that can be used in a variety of contexts, from education and audiobooks to gaming and movies. The company’s cutting-edge AI technology can create voice actors that sound like real people, making it possible to produce high-quality audio content quickly and easily. The technology is designed to be highly flexible, allowing it to be used in a range of applications across multiple industries.

New Partnership with Andreessen Horowitz

The Series A funding was co-led by Andreessen Horowitz, a prominent venture capital firm that has backed a wide range of successful startups in recent years. The firm will also join ElevenLabs’ board, bringing valuable expertise and resources to the company as it continues to grow and expand its offerings.

Products from Eleven Labs

Currently available products from ElevenLabs include its Speech Synthesis, VoiceLab, and the newly unveiled AI Speech Classifier with an API. These tools make it possible to generate high-quality synthesized speech quickly and easily, enabling companies and individuals to create engaging audio  content with minimal effort.  ElevenLabs allows and supports the use of voice cloning for “caricature, parody, and satire” as well as “artistic and political speech contributing to public debates.” This stance has generated controversy in some circles, with critics suggesting that the technology could be used to create fake audio content that could deceive the public or spread disinformation. However, ElevenLabs maintains that voice cloning is a useful and valuable tool for a range of creative and artistic purposes.

The funding round and the release of new tools come right after Meta Platforms introduced its own generative AI voice synthesis tool called Voicebox. As the field of generative AI voice synthesis continues to heat up, ElevenLabs is well-positioned to capitalize on this growing market and revolutionize the world of audio content creation. With its cutting-edge technology, growing customer base, and experienced team of experts and investors, ElevenLabs is poised for a bright future in the years to come.

Explore more

How Does Martech Orchestration Align Customer Journeys?

A consumer who completes a high-value transaction only to be bombarded by discount advertisements for that exact same item moments later experiences the digital equivalent of a salesperson following them out of a store and shouting through a megaphone. This friction point is not merely a minor annoyance for the user; it is a glaring indicator of a systemic failure

AMD Launches Ryzen PRO 9000 Series for AI Workstations

Modern high-performance computing has reached a definitive turning point where raw clock speeds alone no longer satisfy the insatiable hunger of local machine learning models. This roundup explores how the Zen 5 architecture addresses the shift from general productivity to AI-centric workstation requirements. By repositioning the Ryzen PRO brand, the industry is witnessing a focused effort to eliminate the data

Will the Radeon RX 9050 Redefine Mid-Range Efficiency?

The pursuit of graphical fidelity has often come at the expense of power consumption, yet the upcoming release of the Radeon RX 9050 suggests a calculated shift toward energy efficiency in the mainstream market. Leaked specifications from an anonymous board partner indicate that this new entry-level or mid-range card utilizes the Navi 44 GPU architecture, a cornerstone of the RDNA

Can the AMD Instinct MI350P Unlock Enterprise AI Scaling?

The relentless surge of agentic artificial intelligence has forced modern corporations to confront a harsh reality: the traditional cloud-centric computing model is rapidly becoming an unsustainable drain on capital and operational flexibility. Many enterprises today find themselves trapped in a costly paradox where scaling their internal AI capabilities threatens to erase the very profit margins those technologies were intended to

How Does OpenAI Symphony Scale AI Engineering Teams?

Scaling a software team once meant navigating a sea of resumes and conducting endless technical interviews, but the emergence of automated orchestration has redefined the very nature of human-led productivity. The traditional model of human-AI collaboration hit a hard limit where a single engineer could typically only supervise three to five concurrent AI sessions before the cognitive load of context