I’m extremely excited about Sonic-3. I’ve been using it to create my own voice clone, and it’s the best one I’ve ever made.
We've raised $100M from Kleiner Perkins, Index Ventures, Lightspeed, and NVIDIA.
Today we're introducing Sonic-3 - the state-of-the-art model for realtime conversation.
What makes Sonic-3 great:
- Breakthrough naturalness - laughter and full emotional range
- Lightning fast - 90ms model latency, 190ms end-to-end (fastest on the market)
- Supports 42 languages
The difference: We build on State Space Models (SSMs) instead of Transformers.
Transformers (what everyone else uses) are like rewatching the entire conversation from the start before saying each new word. Every word requires reviewing everything.
SSMs (what Sonic-3 uses) are like humans, remembering the topic and vibe of the conversation. Enough context to speak naturally without replaying everything.
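To make the analogy concrete, here is a toy sketch of the per-word work in each style. It is not Sonic-3's implementation: the function names, the scalar "tokens", and the constants `a`, `b`, `c` are all assumptions chosen for illustration. The attention-style step looks at every earlier token, while the SSM-style step only updates one fixed-size state.

```python
import math

def attention_step(history, query):
    """Toy single-head attention over scalar tokens (illustrative only).
    Every call scores the query against ALL tokens seen so far."""
    scores = [query * k for k in history]
    m = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    output = sum(w * v for w, v in zip(weights, history)) / total
    return output, len(history)  # work grows with the history length

def ssm_step(state, x, a=0.9, b=0.1, c=1.0):
    """Toy scalar linear recurrence (a, b, c are made-up constants):
    h_t = a * h_{t-1} + b * x_t,  y_t = c * h_t."""
    new_state = a * state + b * x
    return new_state, c * new_state, 1  # constant work per token

def compare(tokens):
    """Count how many past items each approach touches over a sequence."""
    history, state = [], 0.0
    attn_work, ssm_work = 0, 0
    for x in tokens:
        history.append(x)
        _, touched = attention_step(history, x)
        attn_work += touched
        state, _, ops = ssm_step(state, x)
        ssm_work += ops
    return attn_work, ssm_work

if __name__ == "__main__":
    print(compare([0.5] * 100))  # prints (5050, 100)
```

Over 100 tokens the attention-style loop touches 1 + 2 + ... + 100 = 5050 past items, while the recurrence does 100 constant-size updates. Real systems are far more elaborate on both sides, so treat this only as an illustration of the scaling difference.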
My co-founder, Albert, and I pioneered the SSM paradigm at Stanford AI Lab (S4, Mamba), and it is now being adopted industry-wide.
Thousands of businesses like ServiceNow, Cresta, and Decagon power millions of conversations monthly with Sonic.
Try for free or book a demo here:
If you're qualified and we can't make your voice AI better than what you're using now, I'll donate $5K to your chosen charity.
As part of this launch, we cooked something super cool for you 👇🏻


