Overview
  • 06 Aug 2025

The Vuzix Speech Command engine is a fully embedded, fast, phrase-matching recognition system designed to interpret and respond to voice commands. A platform-based vocabulary is available to all apps; it provides default navigation and selection without any direction from the client app. That is, a client app benefits from the navigation provided by this base vocabulary with no setup and no explicit awareness of the speech command engine. This capability is implemented by mapping phrases to Android key events.
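
The phrase-to-key-event mapping can be pictured as a simple lookup table. The sketch below is plain Java and does not use the Vuzix Speech SDK; the `BaseVocabulary` class and the phrases in it are hypothetical, chosen only for illustration, and are not the actual base vocabulary. The numeric values mirror the corresponding `android.view.KeyEvent` constants.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.OptionalInt;

// Illustrative model only: resolves a spoken phrase to an Android key code.
// The phrases are hypothetical, not the real Vuzix base vocabulary.
class BaseVocabulary {
    // Values mirror the android.view.KeyEvent constants of the same name.
    static final int KEYCODE_BACK = 4;
    static final int KEYCODE_DPAD_UP = 19;
    static final int KEYCODE_DPAD_DOWN = 20;
    static final int KEYCODE_DPAD_CENTER = 23;

    private final Map<String, Integer> phraseToKeyCode = new HashMap<>();

    BaseVocabulary() {
        phraseToKeyCode.put("go up", KEYCODE_DPAD_UP);
        phraseToKeyCode.put("go down", KEYCODE_DPAD_DOWN);
        phraseToKeyCode.put("select this", KEYCODE_DPAD_CENTER);
        phraseToKeyCode.put("go back", KEYCODE_BACK);
    }

    // Normalizes whitespace and case, then looks for an exact phrase match.
    OptionalInt resolve(String spoken) {
        String key = spoken.trim().toLowerCase(Locale.ROOT);
        Integer code = phraseToKeyCode.get(key);
        return code == null ? OptionalInt.empty() : OptionalInt.of(code);
    }
}
```

Because the result is an ordinary key code, a client app that already handles D-pad and back keys needs no speech-specific code to respond to it.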

The Vuzix Speech Command engine is implemented as an Android service that runs locally on the device. No cloud servers are used, and the audio data never leaves the device.


Default Behavior

By default, the Vuzix software scans your user interface for clickable items and attaches a numeric speech bubble to each one, so that the user can select an item by voice after saying “Hello Vuzix”. This allows an unmodified application to be navigated with voice commands, as long as the android:clickable attribute is set correctly for each UI element.
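
As a rough mental model of this default behavior, the plain-Java sketch below numbers only the elements marked clickable and skips the rest. The `UiElement` and `SpeechBubbles` names are hypothetical, and the numbering order (layout order, starting at 1) is an assumption made for illustration; the engine's real traversal is not described here.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a view; only the clickable flag matters here.
class UiElement {
    final String id;
    final boolean clickable;

    UiElement(String id, boolean clickable) {
        this.id = id;
        this.clickable = clickable;
    }
}

class SpeechBubbles {
    // Assigns 1, 2, 3, ... to clickable elements in the order given (assumed order).
    static Map<Integer, String> assign(List<UiElement> elements) {
        Map<Integer, String> bubbles = new LinkedHashMap<>();
        int next = 1;
        for (UiElement element : elements) {
            if (element.clickable) {
                bubbles.put(next++, element.id);
            }
        }
        return bubbles;
    }
}
```

The practical consequence is that an element whose android:clickable attribute is missing or false receives no bubble and cannot be selected by number.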


Low-Code Customization Approach

Custom Vuzix speech commands can be used without integrating the Vuzix Speech SDK into your application, by adding XML tags to your application’s views. These tags customize the speech bubble behavior. Please see the Low Code Approach article for more information.


Custom Vocabulary Architecture

For the greatest flexibility over the user interface and the richest feature set, any third-party application running on Vuzix hardware can use the Vuzix Speech SDK directly.

For many applications, it is desirable to implement a custom vocabulary that performs application-specific actions when an application-specific phrase is spoken (e.g., capturing a still image when “take a picture” is spoken). The Vuzix Speech Command engine provides two mechanisms by which this can be achieved: Android key events and Android intents.

Each Activity can have its own vocabulary. The system will automatically switch to the proper vocabulary as Activities are paused and resumed. If no vocabulary is provided, the system will use the default navigation commands.
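
The per-Activity switching described above can be sketched as a small registry keyed by Activity. This is a conceptual plain-Java model, not the Vuzix Speech SDK: `VocabularyRegistry`, the `key:`/`intent:` action strings, and the intent action name are all hypothetical. The sketch also assumes that a custom vocabulary replaces the default one outright, which this article does not specify.

```java
import java.util.HashMap;
import java.util.Map;

// Conceptual model: one vocabulary per Activity, with a default fallback.
class VocabularyRegistry {
    private final Map<String, Map<String, String>> byActivity = new HashMap<>();
    private final Map<String, String> defaults = new HashMap<>();
    private String resumedActivity; // null while no Activity is resumed

    VocabularyRegistry() {
        // Hypothetical default navigation phrase mapped to a key event.
        defaults.put("go back", "key:BACK");
    }

    // An action is modeled as a string: "key:<name>" or "intent:<action>".
    void register(String activity, String phrase, String action) {
        byActivity.computeIfAbsent(activity, k -> new HashMap<>()).put(phrase, action);
    }

    void onResume(String activity) {
        resumedActivity = activity;
    }

    void onPause(String activity) {
        if (activity.equals(resumedActivity)) {
            resumedActivity = null;
        }
    }

    // Returns the action for a phrase in the active vocabulary, or null if none matches.
    String lookup(String phrase) {
        Map<String, String> custom = byActivity.get(resumedActivity);
        Map<String, String> active = (custom != null) ? custom : defaults;
        return active.get(phrase);
    }
}
```

For example, registering “take a picture” for a hypothetical `CameraActivity` makes that phrase resolve only while `CameraActivity` is resumed; an Activity with no registered vocabulary falls back to the default phrases.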

A third-party application can create a custom vocabulary for the Vuzix Speech Command engine by using the Vuzix Speech SDK, as described throughout this section of the knowledge base.


Supported Languages

The speech command engine supports the following languages:

Arabic

Bulgarian

Cantonese

Czech

Danish

Dutch

English

Farsi

Finnish

French

German

Greek

Hebrew

Hindi

Hungarian

Indonesian

Italian

Japanese

Korean

Mandarin

Norwegian

Polish

Portuguese

Russian

Spanish

Swedish

Thai

Turkish