-->

Speech Technology News

Nari Labs Launches Dia TTS Model

Nari Labs' Dia is an open-source model for real-time voice cloning and expressive speech synthesis on consumer devices.

RateGain Launches UNO VIVA, an AI Voice Agent for Hotels

RateGain's UNO VIVA is a hotel-specific voice agent for handling reservation requests.

SoFast Partners with Deepdub

The SoFast and Deepdub partnership enables FAST channels in Spanish, French, German, Italian, Portuguese, Arabic, Japanese, Korean, and more.

Scriptor Software Deconstructs Radiology Dictation with rScriptor

rScriptor separates transcription and reporting to help radiologists create better reports.

Altus Nova Launches Multi-User Voice Platform

Altus Nova's new voice platform enables multi-user memorization and learning.

IntelePeer Introduces Next Generation Voice AI Capabilities

IntelePeer enhanced its speech processing solutions with advanced customization and controls.

SoundHound AI Partners with Tencent to Bring Conversational AI to Auto Brands

SoundHound will collaborate with Tencent Intelligent Mobility to provide an enhanced in-vehicle experience with hands-free access to apps, entertainment, and car controls.

IBM Releases Granite 3.3 8B Speech Recognition Model

IBM's Granite 3.3 8B speech recognition model uses refined reasoning, RAG, and LoRAs for improved results.

Mango AI Offers Free Voice Cloning

Mango AI's Free Voice Cloning empowers users to create realistic audio.

Reality Defender and PlayAI Partner to Combat Voice Deepfakes

Reality Defender bolsters AI detection capabilities by integrating PlayAI audio data and voice-generating technology into real-time audio.

Deepgram Launches Aura-2 Text-to-Speech Model

Deepgram's Aura-2 TTS model delivers natural, context-aware speech synthesis for real-time interactions.

Donatos Pizza Selects Revmo AI as Voice Ordering Partner

Voice ordering technology is expected to improve customer experience and drive higher order conversions for the 174-store Donatos chain.

Presto Launches Phone Ordering Business Unit

Presto is expanding its suite of voice AI solutions for quick-service restaurants.

Wistia Becomes First Video Marketing Platform with End-to-End AI Translation and Voice Dubbing

Wistia's new video localization features help companies connect with global audiences in native languages.

Amazon Launches Nova Sonic, a Gen AI Model for Building Voice Applications and Agents

Amazon Nova Sonic is a single model that unifies speech understanding and speech generation.

Phonic Launches End-to-End Speech-to-Speech Platform for Building Voice Agents

Phonic's intelligent decision system and hyper-realistic voices form the basis of its voice AI.

Krikey AI Launches Talking Avatars with ElevenLabs

Krikey AI users can create talking avatars with ElevenLabs' voice generator and text-to-speech.

SyncWords Introduces Ultra-Low Latency AI Captions with Kobe Muxer

SyncWords' Kobe Muxer is a video captioning solution with near-real-time availability.

Deepdub Launches Deepdub Live for Global Events

Deepdub Live brings expressive, multilingual voice localization to live sports, esports, and news events.

Gladia Launches Solaria, a Multilingual Speech-to-Text Model

Gladia's Solaria delivers native-level transcription in 100 languages.