March 29, 2019
Q & A

Q&A: Bruce Balentine on Discoverability and VUI

At the 2019 SpeechTEK Conference (April 29-May 1), Bruce Balentine, a design consultant specializing in speech, audio, and multimodal user interfaces, will present “Discoverability in Spoken User Interfaces.” Conference Chair Jim Larson interviewed Balentine to get a sneak peek at the session and talk about discoverability.

Q: Why is it difficult for users to discover functions and operations that they can perform using voice applications?

A: Users discover functions and operations in a GUI by freely exploring, because a GUI utilizes the sense of sight and exists within the three dimensions of space. This is less effective in a VUI, because a VUI utilizes the sense of hearing and exists within the single dimension of time. Users therefore easily become lost, and the passage of time exacts a higher penalty in terms of thinking, confusion, inability to return to known starting places, loss of context, and risk of sudden dialogue terminations.

Q: How can users apply what they know about current voice applications when using new voice applications?

A: Users generally cannot apply what they know about current voice applications when using new voice applications; that is, there is little of what is known as transfer of learning. This is partly because of a lack of standards, which product designers eschew in favor of differentiation for the sake of "branding." It is also because the industry has prioritized very-large-vocabulary freeform "natural language" over such ergonomic issues as error detection and recovery, fixed and learnable methods for backing up or skipping forward, consistent turn-taking rules, and user-machine-environment modeling for situated awareness, all of which are user interface subsets that lend themselves to standardization.

Q: Are frequently asked questions, user guides, and YouTube videos enough?

A: FAQs, user guides, and YouTube videos are not enough. External collateral and observation do have their place, but the most effective discovery technique is user exploration. This method of user learning is dissonant with today's variant, opaque, and ill-considered surface designs, which unknowingly send misleading and inconsistent cues that prevent users from forming an effective theory of the machine's mind.

Q: What are the big takeaways from your SpeechTEK presentation?

A: The big takeaways from my presentation include a better understanding of the importance of timing, eye-opening detail about grounding junctures, the importance of user-initiated backup, and an interesting and subtle heuristic-development theory for empathic learning—all features that contribute directly to discoverability in voice applications of all kinds.



