-->
Development Tools and APIs

Development tools and APIs designed to let users create custom speech technology applications are at the foundation of the speech technology industry. See below for the latest development tools and API news, trends, and solutions.

Features

Industry-Standard Speech App Building Blocks Take Shape

Interface interoperability is becoming closer to reality, but more work is needed.

The Top Speech Technologies and Vendors: The 2023 Speech Industry Awards

AI, AI, and more AI: The technology is disrupting everything, and it's found everywhere in our speech industry achievements for 2023.

2023 Speech Industry Award Winner: D-ID Gives a Human Face and Voice to AI

D-ID, an Israeli company founded in 2017, is providing superpowers to individual creators and businesses alike, uniquely enabling them to transform any picture into an interactive video in seconds.

2023 Speech Industry Award Winner: NVIDIA Is Making Voice AI Better for Almost Everyone

NVIDIA saw blowout second-quarter results, surging margins, and incredible demand, which prompted one analyst from Constellation Insights to conclude that "it's clear the company has little competition and a lot of pricing power."

Industry Voices

Why Speech Researchers Need Better Benchmarks

Long-form speech recognition is here and growing. With updated datasets, we can accurately train and test ASR models for real-world use cases.

Four Pitfalls to Avoid When Building Compelling Voice Experiences

As voice experiences grow in popularity, here are some pitfalls developers can avoid when creating voice-focused products. 

Mitigating TDMA Noise in Microphone Lines

Here are a few countermeasures that designers can incorporate to mitigate TDMA noise without affecting the signals.

Protecting User Data: How Close is the US to its Own GDPR?

GDPR has already had wide-ranging consequences for companies collecting data, and now some are calling for federal regulations in the U.S. Voice-data isn't exempt from the regulations, and vendors need to be ready.

Columns

New Trends in Speech Technology: A Report from the Cutting Edge

Here's the research on which dramatic new capabilities are based.

Standards for Evaluating Generative AI

Assessing the output of genAI systems is easier said than done.

Automatic Dialogue Replacement Will Translate to Big Profits

Voice-to-voice translation is one of the most potentially lucrative uses for voice cloning technology.

Interoperability Benefits Everyone, So Everyone Should Get Behind It

While the need to improve interoperability is clear, a lot of work must be done to forge a comprehensive set of standards.

Development Tools and APIs Companies and Suppliers