The following diagram shows the modules in IDOL Speech Server that enable speaker identification in a single step.
Here, a is the audio window series and w is the output time-marked word series.
The schema that implements this feature is:
[MySpeakerId]
a ← wav (MONO, input)
w ← speakerid (_, a)
output ← wout (_, w)
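The data flow in this schema can be traced with a short Python sketch. It is an illustration only and not part of IDOL Speech Server: parse_schema is a hypothetical helper, and it assumes that every step has the form output ← module (arguments), with the schema name in square brackets on its own line. It does not interpret the arguments; it only records which series each module reads and which series it produces.

```python
import re

# Hypothetical helper, not an IDOL Speech Server API.
# Assumed step shape: "<output> ← <module> (<arg>, <arg>, ...)".
STEP = re.compile(r"^\s*(\w+)\s*(?:←|<-)\s*(\w+)\s*\(([^)]*)\)\s*$")


def parse_schema(text):
    """Return (schema_name, steps); each step is (output, module, arguments)."""
    name = None
    steps = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # A bracketed line such as "[MySpeakerId]" names the schema.
        if line.startswith("[") and line.endswith("]"):
            name = line[1:-1]
            continue
        match = STEP.match(line)
        if match is None:
            raise ValueError(f"unrecognized schema line: {line!r}")
        output, module, args = match.groups()
        steps.append((output, module, [a.strip() for a in args.split(",")]))
    return name, steps


SCHEMA = """
[MySpeakerId]
a ← wav (MONO, input)
w ← speakerid (_, a)
output ← wout (_, w)
"""

name, steps = parse_schema(SCHEMA)
for output, module, arguments in steps:
    # Show which module produces which series, and from what arguments.
    print(f"{module}: {arguments} -> {output}")
```

Read top to bottom, the parsed steps show the chain: wav produces the series a, speakerid consumes a and produces w, and wout consumes w to write the output.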