Speech Applications Security: Protecting Your Business and Your Customers From Hackers
Posted Mar 1, 2006

As speech applications have become more complex, they will become increasingly attractive targets for hackers. Yet, when the topic of security comes up today with regards to speech applications, the conversation generally gravitates toward speaker verification as if access by voice to an application were the only area of possible compromise. Wrong!

Speech and text-based systems are not identical when it comes to the issue of security. With personal information collected (e.g., names, account numbers, social security numbers) and transactions completed with speech automation, new opportunities abound for security breach within a speech recognition system's infrastructure. Those considering or currently managing a speech system must be aware of the security nuances associated with speech. VPN's for remote connectivity, database access protocols, and other fairly standard IT security technologies and procedures will simply not address a number of speech specific points of risk. Three common risk points are:

Some Protection Guidelines

Computer telephony integration (CTI) can make a transfer from the speech system to an agent more seamless by passing information retrieved from the caller to the agent. However, transition points between the speech system and the agent desktop present the opportunity for unauthorized access to sensitive information. It's not simply technology that either illuminates or provides a solution to a possible risk. Transition points create questions about who's responsible for security; managers of the speech system or the agent pool? Encrypting data transfers between a speech solution and agent desktop is a good solution. Finger pointing between managers of each group make it irrelevant if it never gets done until a breach is discovered. A comprehensive approach to securing the entire caller experience, not just a technology component, is required.

Government and Business Regulations

It certainly makes sense to be aware of areas where a speech recognition solution might present a security risk. Working with a speech hosting provider with known security expertise or with in-house security experts managing a premise-based solution will help to protect customer data. However, in a growing list of cases this is not simply good business practice, it's becoming the law.

Some examples include the Gramm-Leach-Bliley Act, which applies to financial institutions and requires a comprehensive information security program. The internal control requirements of Sarbanes-Oxley are being applied to security issues. In the private sector, the Payment Card Industry (PCI) Data Security Standard has direct applicability for organizations taking credit card information. There are a number of others for specific industry segments that insist on maintaining security, and could certainly be interpreted to apply to any speech system.

Securing Sensitive Information Within a Speech System

Securing a speech system involves much more than simply being concerned with network connections. Awareness of some key components of a speech solution is the first step to take for IT/Security Managers who previously have not been exposed to DTMF input, concatenated audio confirmations, and data collected for "tuning." Organizations deploying a speech system either on-premise or as a hosted speech solution must be aware of the risks that speech introduces.

Securing sensitive data within a speech system includes considering the management team, the established processes and the audio and text data being stored, logged, or transmitted. At the highest level there are three broad categories to consider:

People - Is there a dedicated security manager in place that is familiar with speech recognition? Is there a process for background checks on all necessary personnel? Is there separation of duties according to security best practices? Etc.

Process - Is a written security policy in place? Are plans established for the event of a security breach? Are security audit and patch standards established? Etc.

Technology - Is speech logging suppression available? Is LAN and WAN encrypted and mutually authenticated? Do security standards consider the link between speech and agents' systems? Etc.

Although there are many similarities between speech and text-only systems from the perspective of network security, there are also significant differences. An IT or security professional may be quite familiar with the physical connections of the network and telephony hardware, yet not know how audio is maintained or converted or transcribed to text within a speech system. Overlooking components that are fundamental to speech can create opportunities for hackers who suspect that the speech system is vulnerable.

A combination of technology and procedures must be employed to support speech solutions within a secure environment. These high-level goals must be balanced with applicable security regulations and best practices to ensure a secure speech application environment.

Securing a speech system requires knowledge of where the potential for a security breach exists and implementing solutions to thwart unauthorized access by either external or internal threats. Organizations considering a speech solution must evaluate security as a top priority with the understanding that there are issues specific to speech alone that must be addressed.