Speech Technology - comprehensive, independent coverage of information impacting speech technologies

SpeechVerse integrates textual LLMs with speech encoders in one supervised training setup for a more comprehensive understanding of both speech and text.

Apple Proposes Acoustic Model Fusion to Improve Speech Recognition

01 Apr 2024

Apple has proposed a new approach that integrates external acoustic models into end-to-end ASR systems, aiming to refine speech recognition by enriching systems with broader acoustic knowledge.

Voice’s New Frontier: Diabetes Detection

06 Nov 2023

Research has found significant vocal differences between people with and without type 2 diabetes, suggesting that voice technologies could be used to detect the condition.

NYC Mayor Uses Voice Tech for Multilingual Robocalls

06 Nov 2023

The robocalls, delivered via voice clone tech, present information on city job fairs and other events in Spanish, Yiddish, Haitian Creole, Mandarin and Cantonese Chinese, and several other languages.

Meta, Google Develop Their Own AI Speech Models

14 Jul 2023

Facebook parent company Meta last month unveiled Voicebox, an advanced AI tool for generating speech from text, while Google introduced AudioPaLM, its own large language model that can tackle speech understanding and generation tasks.

Lawmakers Propose a Ban on Government Use of Biometrics

06 Apr 2023

Carnegie Mellon ASR Pipeline Seeks to Recognize 1,900 Languages Without Audio

16 Nov 2022

A research team from Carnegie Mellon University in Pittsburgh created a voice recognition pipeline that does not need audio to model low-resource languages.

Stanford Model Teaches Turn-Taking to Virtual Assistants

16 Nov 2022

With the goal of creating a more natural conversational flow, a team of researchers at Stanford University replaced the classification approach traditionally used with a more continuous approach.

European Union Reins in Big Tech

19 Jul 2022

The Digital Services Package will prohibit so-called gatekeepers from restricting consumers' access to third-party voice technologies and other applications and digital services.

Microsoft Reins in Its Voice Capabilities

19 Jul 2022

The tech giant said its measures would amount to a significant update to the "Responsible AI Standard" it first released in 2019.

Speech Can Help Students Read, Experts Conclude

13 Oct 2021

Voice-driven tools have had a direct impact on fluency and literacy, unlocking far more natural and nuanced ways of gauging reading fluency and comprehension than traditional quizzes and tests.

Voice Soars as Vehicle Interface of Choice

12 Apr 2021

In-car voice controls must be context-based to offer the best user experience.

Voicebots Outperformed Humans in COVID Detection

16 Nov 2020

Using VOIQ's artificial intelligence-powered voice software to hold phone conversations with citizens, Colombia became the first country to use voicebots at this scale, diagnosing millions of individuals within a matter of hours rather than the weeks or months a solely human-powered team would need.

Overheard/Underheard: A New Collar Gives Dogs a Voice

01 Aug 2020

An AI-powered dog collar uses voice recognition technology to detect and analyze five emotional states in dogs (happy, anxious, angry, sad, and relaxed) and tracks their physical activity.
