The following schema describes the preprocessing of an audio file to create a phoneme time track file required for phonetic phrase search.
[createFmd]
0 = a, ts <- audio(MONO, input)
1 = f <- frontend(_, a)
2 = nf <- normalizer(_, f)
3 = output <- phraseprematch(WRITE, nf, ts)
0 | The audio module processes the mono audio.
1 | The frontend module converts the audio data into speech front-end frame data.
2 | The normalizer module normalizes the frame data from 1 (f).
3 | The phraseprematch module processes the normalized frame data from 2 (nf) to generate phone-track information, which is written to the .fmd file for phonetic phrase search.
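To make the data flow between the steps easier to follow, the sketch below parses the schema text into a list of steps and checks that each step only consumes streams produced earlier. It is an illustration only, not part of the product: the function names `parse_schema` and `check_data_flow` are hypothetical, and treating the first argument of each module call (MONO, _, WRITE) as a mode while the remaining arguments are input streams is an assumption based on the schema shown above.

```python
import re

# Schema text copied from the example above.
SCHEMA = """\
[createFmd]
0 = a, ts <- audio(MONO, input)
1 = f <- frontend(_, a)
2 = nf <- normalizer(_, f)
3 = output <- phraseprematch(WRITE, nf, ts)
"""

# Matches lines of the form: <step> = <outputs> <- <module>(<args>)
STEP_RE = re.compile(r"^(\d+)\s*=\s*(.+?)\s*<-\s*(\w+)\((.*)\)$")


def parse_schema(text):
    """Return one dict per numbered step (hypothetical helper)."""
    steps = []
    for line in text.splitlines():
        match = STEP_RE.match(line.strip())
        if not match:
            continue  # skips the [createFmd] header and blank lines
        index, outputs, module, args = match.groups()
        # Assumption: the first argument is a mode, the rest are input streams.
        mode, *inputs = [arg.strip() for arg in args.split(",")]
        steps.append({
            "step": int(index),
            "module": module,
            "mode": mode,
            "inputs": inputs,
            "outputs": [name.strip() for name in outputs.split(",")],
        })
    return steps


def check_data_flow(steps, external=("input",)):
    """Return (step, stream) pairs for inputs that no earlier step produced."""
    available = set(external)
    missing = []
    for step in steps:
        for name in step["inputs"]:
            if name not in available:
                missing.append((step["step"], name))
        available.update(step["outputs"])
    return missing


steps = parse_schema(SCHEMA)
for step in steps:
    print(step["step"], step["module"], step["inputs"], "->", step["outputs"])
```

Calling `check_data_flow(steps)` on this schema returns an empty list, since step 3 reads `nf` from step 2 and `ts` from step 0, and every other step reads only what the step before it wrote.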