|
Goals
To develop a voice-operated procedure browserand astronaut assistant for the International Space Station.
Objectives
As the project has progressed, the followingkey objectives have emerged:
Application
The current experimental version of Clarissa
runs on a
standard ISS laptop, and browses procedures written in an XML format.
The markup language includes constructs for branch points, conditional
steps, querying the user for numerical values, and linking to
subprocedures. For our initial experiments, five procedures have been
converted into Clarissa-compatible XML form: three for sample
collection and testing of the Station's potable water supply, and two
for space suit checkout.
The system has a vocabulary of about 260 words, and supports about 75 different commands. The user can scroll forwards or backwards, move to an arbitrary new step, preview or read out a non-current step without losing their place in the procedure, open a sub-procedure, and read safety-critical portions of the procedure in a mode which checks aggressively that steps have not been skipped. Other commands include support for recording, playing and deleting voice notes, setting timers and alarms, and querying status. Any misrecognition can be undone or corrected, using a command like "undo" or "no, I said go to step fourteen". Speech recognition is in "open mic" mode. In the deployed system, the error rate for distinguishing between voice commands and non-system-directed speech is about 10%; together with Jean-Michel Renders from Xerox Research Center Europe, we have developed experimental methods using Support Vector Machines, which reduce the error rate to about 5%. This work may be integrated into a future version of Clarissa. Spoken output is performed using a recorded voice. Clarissa has been implemented mainly using SICStus Prolog and a speech recognition toolkit provided by Nuance Communications. Application-specific spoken command grammars were constructed using the Regulus platform. Other programming languages and software packages used include Java, C, and SRI International's Gemini and Open Agent Architecture.
Milestones
To the best of our knowledge, Clarissa is the first spoken dialogue system in space. |
Team
Project Lead: Links
|