Overview
  • 07 Jun 2024
  • 1 Minute to read
  • Contributors
  • Dark
    Light

Overview

  • Dark
    Light

Article summary

Introduction

The Vuzix Speech Recognizer is a fully embedded, fast, phrase-matching recognition system designed to interpret and respond to voice commands. A platform base vocabulary is available to all apps; it is intended to facilitate default navigation and selection without direction from the client app. That is, a client app can benefit from navigation provided by a base vocabulary with no setup or explicit awareness of the speech recognizer. This capability is implemented by mapping phrases to Android key events.

For many applications, it is desirable to implement a custom vocabulary which performs application-specific actions when an application-specific phrase is spoken (e.g. to capture a still image when “take a picture” is spoken.) The Vuzix Speech Recognition system provides two mechanisms by which this can be achieved: Android key events and Android intents.

Custom Vocabulary Architecture

The Vuzix Speech Recognizer is implemented as an Android service that runs locally on the device. No cloud servers are used, and the audio data never leaves the device.

Each Activity or Fragment can have its own vocabulary. The system will automatically switch to the proper vocabulary as Activities are paused and resumed. If no vocabulary is provided, the system will use the default navigation commands.

An 3rd party Application may create a custom vocabulary for the Vuzix Speech Recognizer service by utilizing the Vuzix Speech SDK as described throughout this section of the knowledge base.


Was this article helpful?

What's Next