Speechmatics Launches Global English, an Accent-Agnostic Language Pack for Speech-to-Text Transcription

Speechmatics announced the launch of Global English, a single English language pack supporting all major English accents for use in speech-to-text transcription. Global English (GE) was trained on thousands of hours of spoken data from over 40 countries and tens of billions of words drawn from global sources, making it one of the most comprehensive and accurate accent-agnostic transcription solutions on the market.

When tested against providers of similar solutions, Speechmatics says GE consistently produced more accurate transcriptions. Compared directly, GE was between 3% and 55% more accurate than all Google’s Cloud Speech API accent-specific language packs and between 5% and 23% more accurate than IBM’s Cloud US English language pack.

Traditionally, speech recognition has dealt with variations in language by producing a different language pack for every distinct accent or region. However, this meant a whole new set of models trained on data from that particular subset of speakers of the languages. With the launch of GE, Speechmatics is aiming to democratize speech-to-text transcription to overcome industry-wide issues where there are multiple English accents in one recording. The company says it provides a far more accurate, consistent, and cost-effective solution.

The team has been gathering data from a wide range of sources and taking advantage of the astonishing rise in computer power, allowing them to train bigger models, based on more data, capable of supporting more variations. Speechmatics has now built 72 unique languages. With the modern neural network architectures capable of generalizing across variations in speech by using representation learning, Speechmatics were able to generate the accuracy of multiple specialized models all in one language pack.

Free

for qualified subscribers

Subscribe Now Current Issue Past Issues

Speechmatics Launches Global English, an Accent-Agnostic Language Pack for Speech-to-Text Transcription

Voice Deepfake Fraud Surged 1,300 Percent

Sanas Unveils Simultaneous Real-Time Speech-to-Speech Translation

ESTsoft Partners with ElevenLabs

Deepgram Launches Voice Agent API