Custom Word Databases

The speech-to-text models that are available for Media Server are trained on billions of words, but the performance of any model depends on how closely its training data matches the data being processed. When you run speech-to-text, you might find that some words, such as product names that were not included in the training data, are not recognized correctly because the model considers them less likely than other word choices.

You can create a custom word database to increase the probability that these words are recognized correctly.

NOTE: The speech-to-text models currently support custom word databases only for non-CJK languages.

A custom word database contains a list of words. To add a custom word, you only need to supply the word itself.

To create and use a custom word database

  1. Create the database with the action CreateCustomSpeechWordDatabase. For example:

    /action=CreateCustomSpeechWordDatabase&database=words

  2. Add each custom word with the action TrainCustomSpeechWord. For example:

    /action=TrainCustomSpeechWord&database=words&word=AcmeSoft

  3. When you configure your speech-to-text analysis task, use the configuration parameter CustomWordDatabase to specify the name of the custom word database that you created. For more information about configuring a speech-to-text analysis task, see Transcribe Speech.
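
If you have many words to add, you might prefer to generate the action URLs for steps 1 and 2 with a script. The following Python example is a minimal sketch, not part of Media Server. It assumes that Media Server is reachable at http://localhost:14000 (adjust the host and port for your installation), and the helper names action_url and custom_word_urls are hypothetical. It only builds the URLs, using the two actions and the parameters shown above.

```python
from urllib.parse import quote, urlencode

# Assumption for illustration: host and ACI port of your Media Server.
BASE_URL = "http://localhost:14000"


def action_url(action, base_url=BASE_URL, **params):
    """Build a URL of the form <base_url>/action=<Action>&<name>=<value>..."""
    # quote_via=quote percent-encodes spaces as %20 instead of '+'.
    query = urlencode({"action": action, **params}, quote_via=quote)
    return f"{base_url}/{query}"


def custom_word_urls(database, words, base_url=BASE_URL):
    """Return the URL that creates the database, then one URL per word."""
    urls = [action_url("CreateCustomSpeechWordDatabase", base_url, database=database)]
    for word in words:
        urls.append(
            action_url("TrainCustomSpeechWord", base_url, database=database, word=word)
        )
    return urls


if __name__ == "__main__":
    # Print the URLs so that you can review them before you send them.
    for url in custom_word_urls("words", ["AcmeSoft"]):
        print(url)
```

You can send each URL in order with any HTTP client, for example urllib.request.urlopen from the Python standard library. Create the database before you add words to it.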

For more information about the actions that you can use to manage custom word databases, see the Speech to Text training actions.