Recognition¶
Speech recognition related parameters.
Warning
For configurations made in the file /opt/cpqd/asr/config/engine/engine.conf
, do not leave blank spaces around the equal symbol “=”.
hints.words¶
Description: Adds new words to the Language Model or increases the probability of existing words appearing.
Values: Word list with or without
boost
orpronunciation
attributes. Default: empty.Format:
<palavra>:<boost> [<pronúncia>], <palavra>:<boost> [<pronúncia>], ...Where,
<boost>
and<pronúncia>
are optionals.Important:
The pronunciation must always be in brackets
[ ]
, and there can not be any blank spaces between the brackets and the pronunciation written there.Each word, with its attributes, must be separated by a comma
,
.Each word can only be assigned one pronunciation. If you wish to add more than one pronunciation per word, you will need to repeat the word.
Words can only contain letters and dashes.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--hints.words=mexirica:1.8 [mixirica], siciliano:2, castanha-do-pará
textify.enabled¶
Description: Enables the automatic formatting of numbers, dates, times, etc.
Values: ‘true’ or ‘false’. Default value: ‘false’.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--textify.enabled=true
am.models¶
Description: Indicates which acoustic models must be loaded by the ASR engine upon startup. Currently, only one model can be defined. The value entered here must be one of the directories contained in
/opt/cpqd/asr/lang
.Values: Text. Default value:
default
.Location:
/opt/cpqd/asr/config/engine/engine.conf
Example:
--am.models=default
lm.preloadModels¶
Description: Indicates which language models must be loaded by the ASR engine upon startup. Models loaded here are never unloaded from memory and can be updated only by restarting the ASR. It is generally used to load the free speech model or a very large grammar.
Values: The value must be a comma-separated model list.
Location:
/opt/cpqd/asr/config/engine/engine.conf
Example:
--lm.preloadModels=builtin:slm/general
lm.timeToLive¶
Description: Maximum time after which a language model will be unloaded from memory. This is a model’s life cycle in memory.
Values: Integer value in minutes. Default value: 60
Location:
/opt/cpqd/asr/config/engine/engine.conf
Example:
--lm.timeToLive=60
lm.timeToIdle¶
Description: Time after which a language model is to be unloaded from memory if not used for recognition, in other words, maximum idle time.
Values: Integer value in minutes. Default value: 10
Location:
/opt/cpqd/asr/config/engine/engine.conf
Example:
--lm.timeToIdle=10
decoder.partialResultEnabled¶
Description: Indicates whether partial results are enabled. A partial result is text recognized before the audio has been completely received.
Values: ‘true’ or ‘false’. Default value: ‘false’.
Location:
/opt/cpqd/asr/config/engine/engine.conf
Example:
--decoder.partialResultEnabled=false
decoder.partialResultInterval¶
Description: Indicates the time interval for generating a partial result.
Values: Integer value in milliseconds. Default value: 1000
Location:
/opt/cpqd/asr/config/engine/engine.conf
Example:
--decoder.partialResultInterval=1000
noInputTimeout.enabled¶
Description: Enables the noInputTimeout timer for all recognitions.
Values: ‘true’ or ‘false’. Default value: ‘true’.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--noInputTimeout.enabled=true
noInputTimeout.value¶
Description: Maximum time to wait for speech to begin. After this period, the system ends the recognition and returns NO_INPUT_TIMEOUT.
Values: Value in milliseconds. Default value: 10000.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--noInputTimeout.value=10000
recognitionTimeout.enabled¶
Description: Enables the recognitionTimeout timer for all recognitions.
Values: ‘true’ or ‘false’. Default value: ‘true’.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--recognitionTimeout.enabled=true
recognitionTimeout.value¶
Description: Maximum time to wait for speech recognition results. If the recognition has not finished by the end of the defined period, the system will end the recognition and return RECOGNITION_TIMEOUT.
Values: Integer value in milliseconds. Default value: 30000
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--recognitionTimeout.value=30000
decoder.confidenceThreshold¶
Description: Minimum recognition confidence value, to be considered valid and not return NO_MATCH.
Values: Integer number from 0 - 100. Default value: 30.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--decoder.confidenceThreshold=30
decoder.startInputTimers¶
Description: Automatically launches the enabled timers (noInputTimeout and recognitionTimeout) when recognition starts. If deactivated, the enabled timer will be started manually upon receiving the message START INPUT TIMERS.
Values: ‘true’ or ‘false’. Default value: ‘true’.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--decoder.startInputTimers=true
decoder.maxSentences¶
Description: Maximum number of probable results generated by the recognition (alternative sentences).
Values: Integer number greater than zero. Default value: 1.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--decoder.maxSentences=1
decoder.continuousMode¶
Description: Enables continuous mode recognition.
Values: ‘true’ or ‘false’. Default value ‘false’.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--decoder.continuousMode=true
decoder.wordDetails¶
Description: Controls if word details are to be displayed.
Values: Integer (0,1,2). Default value ‘1’.
no details,
only the first n-best result,
all the n-best results.
Location:
/opt/cpqd/asr/config/engine/engine.conf
, APIExample:
--decoder.wordDetails=1
loggingTag¶
Description: A tag provided by the client application that is registered in the CPQD ASR logs to track a given interaction with a user. The Logging-Tag is defined by the client application and sent to the CPQD ASR by the MRCP, REST and WS APIs.
Values: The ID the user wishes to register in the logs.
Location: API
Example:
--loggingTag=CompanhiaTelecom