University of Cambridge > Talks.cam > Inference Group > Speech Dasher: Fast Writing using Speech and Gaze

Speech Dasher: Fast Writing using Speech and Gaze

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Keith Vertanen.

Speech Dasher allows writing using a combination of speech and a zooming interface. Users first speak what they want to write and then they navigate through the space of recognition hypotheses to correct any errors. Speech Dasher’s model combines information from a speech recognizer, from the user, and from a letter-based language model. This allows fast writing of anything predicted by the recognizer while also providing seamless fallback to letter-by-letter spelling for words not in the recognizer’s predictions. In a formative user study, expert users wrote at 40 (corrected) words per minute. They did this despite a recognition word error rate of 22%. Furthermore, they did this using only speech and the direction of their gaze (obtained via an eye tracker).

For more information, see the 4-page CHI note or chapter 4 of my thesis

This will be a short talk.

This talk is part of the Inference Group series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity