Sensory has boosted the performance and accuracy of its embedded wake word and speech recognition with shallow learning.
Sensory has updated the embedded artificial intelligence in its TrulyHandsfree technology to dramatically boost performance and accuracy.
TrulyHandsfree is an embedded small vocabulary speech recognition system that features an always-listening wake word. In version 5.0, Sensory's latest deep neural network models for embedded AI have allowed the company to deliver a five times reduction in false accepts compared to version 4.0. A new shallow learning approach takes the biggest speech models and compresses them down by a factor of 5-10 with no decrease in accuracy. Additionally, the latest neural network models offer greater reliability for user-defined triggers, providing the option for users to select the wake word they prefer.
TrulyHandsfree 5.0 includes a layer of security in the voice interface combining voice biometrics recognition and deep neural nets to authenticate users, who can enroll their voices and their own custom triggers or passphrases.
TrulyHandsfree 5.0 also features a new voice barge-in feature, enabled with Sensory's proprietary Acoustic Echo Cancellation (AEC) technology. Users can interrupt devices while playing voice prompts, music or other sounds by saying the trigger phrase to control music playback by voice, or provide any other kind of supported speech commands.
With version 5.0, Sensory incorporates new deep learning noise suppression algorithms that reduce the level of ambient noise provided to the speech recognizer.
For ultra-low power devices that have limited battery capacity such as wearables, Sensory offers its Low Power Sound Detector (LPSD) hardware component for DSPs and smart microphones that can reduce low-power configurations of TrulyHandsfree to operate at an average battery draw of less than a 1mA.
"The demand for voice user interfaces continues to grow rapidly, and TrulyHandsfree 5.0 will allow more manufacturers to incorporate low-cost, low-power voice user interfaces on device without sacrificing the cloud accuracy," said Todd Mozer, CEO of Sensory, in a statement. "TrulyHandsfree 5.0 offers the most advanced and efficient embedded AI technologies we've ever created. Additionally, we've set the bar higher than ever before for speech recognition accuracy by applying our new proprietary echo cancellation and noise reduction algorithms that we are confident will boost far-field voice performance for IoT devices of all kinds."
TrulyHandsfree is deployed on more than 2 billion devices worldwide. Additionally, Sensory can deliver voice triggers for all major IoT cloud services, including Amazon AVS, Apple Siri, Google Assistant, and Microsoft Cortana, provide developer support for cloud service interfaces on Linux, Android, iOS and Windows and dozens of proprietary digital signal processors, microcontrollers, smart microphones, and other low-power embedded devices.