Q&A: Bruce Balentine on Discoverability and VUI
At the 2019 SpeechTEK Conference (April 29-May 1), Bruce Balentine, design consultant specializing in speech, audio, and multimodal user interfaces, will be presenting “Discoverability in Spoken user Interfaces.” Conference Chair Jim Larson interviewed Balentine to get a sneak peek at the session and talk about discoverability.
Q: Why is it difficult for users to discover functions and operations that they can perform using voice applications?
A: Users discover functions and operations in a GUI interface by freely exploring, because a GUI utilizes the sense of sight and exists within the three dimensions of space. This is less effective in a VUI, because a VUI utilizes the sense of hearing and exists within the single dimension of time. Users therefore easily become lost, and the passage of time extracts a higher penalty in terms of thinking, confusion, inability to return to known starting places, loss of context, and risk of sudden dialogue terminations.
Q: How can users apply what they know about current voice applications when using new voice applications?
A: Users generally cannot apply what they know about current voice applications when using new voice applications—a phenomenon known as transfer of learning. This is partly because of a lack of standards, which product designers eschew in favor of differentiation for the sake of "branding." It is also because the industry has ranked very-large-vocabulary freeform "natural language" over such ergonomic issues as error-detection and recovery, fixed and learnable methods for backing up or skipping forward, consistent turn-taking rules, and user-machine-environment modeling for situated awareness—all user interface subsets that lend themselves to standardization.
Q: Are frequently asked questions, user guides, and youtube videos enough?
A: FAQs, user guides, and YouTube videos are not enough. External collateral and observation do have their place, but the most effective discovery technique is user exploration. This method of user learning is dissonant with today's variant, opaque, and ill-considered surface designs, which unknowingly send misleading and inconsistent cues that prevent users from forming an effective theory of the machine's mind.
Q: What are the big takeaways from your SpeechTEK presentation?
A: The big takeaways from my presentation include a better understanding of the importance of timing, eye-opening detail about grounding junctures, the importance of user-initiated backup, and an interesting and subtle heuristic-development theory for empathic learning—all features that contribute directly to discoverability in voice applications of all kinds.
At the 2019 SpeechTEK conference Yves Normandin of Nu Echo, Inc. and Deborah Dahl of Conversational Technologies, will present "A Comprehensive Guide to Technologies for Conversational Systems." Conference chair Jim Larson talked to Normandin and Dahl to get a sneak peek of the session, and learn about conversational system technologies.
At the 2019 SpeechTEK Conference, Wolf Paulus, Principal Engineer, Technology Futures, Intuit and University of California, Irvine will be exploring "The Engineering of Emotion." Conference Chair Jim Larson interviewed Paulus to get a sneak peek at the session and explore the world of sentiment analysis.