- Print
- DarkLight
The Vuzix Speech Command engine is a fully embedded, fast, phrase-matching recognition system designed to interpret and respond to voice commands. A platform based vocabulary is available to all apps; it is intended to facilitate default navigation and selection without direction from the client app. That is, a client app can benefit from navigation provided by a base vocabulary with no setup or explicit awareness of the speech command engine. This capability is implemented by mapping phrases to Android key events.
The Vuzix Speech Command engine is implemented as an Android service that runs locally on the device. No cloud servers are used, and the audio data never leaves the device.
Default Behavior
By default, the Vuzix software will scan for any clickable items in your user interface and populate a numeric speech bubble to allow the user to select it after saying “Hello Vuzix”. This allows the application to be easily navigated with voice commands with no modification, as long as the android:clickable attribute is correctly set for each UI element.
Low-Code Customization Approach
Customized Vuzix Speech commands can be leveraged without deploying the Vuzix Speech SDK into your application by adding XML tags in your application’s views. These tags customize the speech bubble behavior. Please see the Low Code Approach article for more information.
Custom Vocabulary Architecture
For the greatest flexibility over the user interface, and the richest feature set, the Vuzix Speech SDK can be directly used by any 3rd party application running on Vuzix hardware.
For many applications, it is desirable to implement a custom vocabulary which performs application-specific actions when an application-specific phrase is spoken (e.g. to capture a still image when “take a picture” is spoken.) The Vuzix Speech Command engine provides two mechanisms by which this can be achieved: Android key events and Android intents.
Each Activity can have its own vocabulary. The system will automatically switch to the proper vocabulary as Activities are paused and resumed. If no vocabulary is provided, the system will use the default navigation commands.
A 3rd party Application may create a custom vocabulary for the Vuzix Speech Command engine by utilizing the Vuzix Speech SDK as described throughout this section of the knowledge base.
Supported Languages:
The speech command engine supports the following languages:
Arabic | Bulgarian | Cantonese | Czech | Danish |
Dutch | English | Farsi | Finnish | French |
German | Greek | Hebrew | Hindi | Hungarian |
Indonesian | Italian | Japanese | Korean | Mandarin |
Norwegian | Polish | Portuguese | Russian | Spanish |
Swedish | Thai | Turkish |