Recognition Log

The recognition engine writes its logs in the /var/log/cpqd/asr/engine directory. The most recent log file is asr-engine.log. The oldest files follow the pattern asr-engine.log.-,.log.<index>, where index is a sequential integer. The ASR engine log is configured as explained in the configuration section.

A simplified example of an ASR engine log can be found in Listing 2 and a brief explanation of the messages in Table 4.

Warning

The log is merely for illustration. Some of the lines have been removed to make it simpler. The log messages displayed depend on how the log and the CPQD ASR have been configured.

Listing 2 ASR engine log example
 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
[*][ warning ] [tid=6425] [] [Engine] Library Version: 3.2.0

[*][  info   ] [tid=6425] [] [AcousticModel] Type: [Nnet3]
[*][  info   ] [tid=6425] [] [AcousticModel] Created acoustic model: file source="/opt/cpqd/asr//lang//default/am/6.dat"
[*][ warning ] [tid=6425] [] [Engine] Log level: debug
[*][ warning ] [tid=6425] [] [Engine] Max licenses: 2
[*][ warning ] [tid=6425] [] [Engine] License Type: server
[*][ warning ] [tid=6425] [] [Engine] License ID: XXXXXXXXXXXXXXXX
[*][ warning ] [tid=6425] [] [Engine] License Server Address: license38.cpqd.com.br:443

[*][  info   ] [tid=6425] [] [API] Entering "asr_createSession"
[*][  info   ] [tid=6425] [201805021718500000] [Session] Sampling frequency: 8000.00
[*][  info   ] [tid=6425] [201805021718500000] [Session] Bytes per sample: 2
[*][  info   ] [tid=6425] [201805021718500000] [Config] Tuning Log: Enabled
[*][  info   ] [tid=6425] [201805021718500000] [Config] NoInputTimeout: Enabled
[*][  info   ] [tid=6425] [201805021718500000] [Config] No Input Timeout(ms): 10000
[*][  info   ] [tid=6425] [201805021718500000] [Config] RecognitionTimeout: Enabled
[*][  info   ] [tid=6425] [201805021718500000] [Config] Recognition Timeout(ms): 30000
[*][  info   ] [tid=6425] [201805021718500000] [Config] PartialResult: Disabled
[*][  info   ] [tid=6425] [201805021718500000] [Config] StartInputTimers: Enabled
[*][  info   ] [tid=6425] [201805021718500000] [API] Leaving "asr_createSession": session=201805021718500000

[*][  info   ] [tid=6425] [201805021718500000] [API] Entering "asr_loadLMFromURI": uri="file:///opt/cpqd/asr/samples/grammar/ptbr/pizza.gram"
[*][  info   ] [tid=6425] [201805021718500000] [ModelCache] Grammar: 'file:///opt/cpqd/asr/samples/grammar/ptbr/pizza.gram' Compiled and loaded
[*][  debug  ] [tid=6425] [201805021718500000] [LanguageModel] TimeToIdle defined to 600 seconds for file:///opt/cpqd/asr/samples/grammar/ptbr/pizza.gram
[*][  debug  ] [tid=6425] [201805021718500000] [LanguageModel] TimeToLive defined to 3600 seconds for file:///opt/cpqd/asr/samples/grammar/ptbr/pizza.gram
[*][  info   ] [tid=6425] [201805021718500000] [API] Leaving "asr_loadLMFromURI"
[*][  info   ] [tid=6942] [] [API] Entering "asr_activateLMFromURI": uri="file:///opt/cpqd/asr/samples/grammar/ptbr/pizza.gram"
[*][  debug  ] [tid=6942] [] [API] Language model "file:///opt/cpqd/asr/samples/grammar/ptbr/pizza.gram" activated
[*][  info   ] [tid=6942] [] [API] Leaving "asr_activateLMFromURI"

[*][  info   ] [tid=6942] [] [API] Entering "asr_startRecognition"
[*][  info   ] [tid=6942] [] [SessionPipeline] TuningLog.saveInAudio: Enabled
[*][  info   ] [tid=6942] [] [SessionPipeline] TuningLog.saveEpAudio: Enabled
[*][  info   ] [tid=6942] [] [SessionPipeline] RemoveOffset: Disabled
[*][  info   ] [tid=6942] [] [SessionPipeline] Endpointer: Enabled
[*][  debug  ] [tid=6942] [] [Endpointer] endpointer.startThreshold = 100
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.endThreshold = 10
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.headMargin = 200
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.startWindow = 100
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.tailMargin = 400
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.endWindow = 200
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.waitEnd = 1000
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.wstep = 10
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.levelThreshold =5.000000
[*][  debug  ] [tid=6974] [] [Endpointer] endpointer.levelMode = 2
[*][  info   ] [tid=6974] [] [SessionPipeline] Pipeline message: START_RECOGNITION
[*][  info   ] [tid=6974] [] [API] Leaving "asr_startRecognition"

[*][  info   ] [tid=6976] [] [API] Entering "asr_sendAudio": length_bytes=126676, lastPacket=1
[*][  info   ] [tid=6976] [] [SessionPipeline] Pipeline message: SPEECH_START
[*][  info   ] [tid=6976] [] [SessionSink] Pipeline message: END_OF_SPEECH
[*][  info   ] [tid=6976] [] [API] Leaving "asr_sendAudio"

[*][  info   ] [tid=7546] [] [Recognizer] Creating final result: status=RECOGNIZED, last segment=false
[*][  info   ] [tid=7546] [] [Recognizer] Final result detail: text="eu quero uma pizza vegetariana", confidence=100, interp="pizza_vegetariana"

[*][  info   ] [tid=6977] [] [API] Entering "asr_releaseSession": session=201805021722730000

[*][  info   ] [tid=7556] [] [API] Entering "asr_releaseEngine"
[*][  info   ] [tid=7556] [] [API] Leaving "asr_releaseEngine"
Table 4 ASR engine log description

Lines

Description

1-9

The ASR engine has been launched The ASR engine version, the models that were loaded, and the number of channels can be seen

11-21

Recognition session created, with the used parameters

23-30

Used grammar loaded and enabled

32-48

Start of recognition

50-53

Audio sent

55-56

End of recognition and return of results

58-61

End of session and ASR engine shutdown