The following diagram shows the modules in IDOL Speech Server that enable phonetic search in a single action.
Here, a is the resulting audio window series, f is the feature vector series, nf is the normalized feature vector series, and w is the output time-marked word series.
The schema that implements this feature is:
    [MyPhraseSearch]
    a ← wav (MONO, input)
    f ← frontend (_, a)
    nf ← normalizer (_, f)
    w ← phraseprematch (_, nf)
    output ← wout (_, w)
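The data flow in the schema can be traced with a short script. The following Python sketch is illustrative only and is not part of IDOL Speech Server: the `parse_schema` and `check_dataflow` helpers are hypothetical, and the sketch assumes that `MONO`, `input`, and `_` are supplied from outside the schema rather than produced by an earlier step. It reads each step as "series ← module (arguments)" and checks that every series a module consumes was produced by a previous step.

```python
import re

# The schema text from this section, one step per line.
SCHEMA = """\
[MyPhraseSearch]
a ← wav (MONO, input)
f ← frontend (_, a)
nf ← normalizer (_, f)
w ← phraseprematch (_, nf)
output ← wout (_, w)
"""

# One step has the form: <series> ← <module> (<arg>, <arg>, ...)
STEP = re.compile(r"^(\w+)\s*←\s*(\w+)\s*\(([^)]*)\)$")


def parse_schema(text):
    """Return the schema name and a list of (series, module, args) steps."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    name = lines[0].strip("[]")
    steps = []
    for ln in lines[1:]:
        m = STEP.match(ln)
        if m is None:
            raise ValueError(f"unrecognized step: {ln!r}")
        series, module, args = m.groups()
        steps.append((series, module, [a.strip() for a in args.split(",")]))
    return name, steps


def check_dataflow(steps, external=("MONO", "input", "_")):
    """Check each step only consumes series produced earlier.

    Assumption: the tokens in `external` come from outside the schema.
    Returns the module names in execution order.
    """
    defined = set()
    for series, module, args in steps:
        for arg in args:
            if arg not in external and arg not in defined:
                raise ValueError(f"{module} uses undefined series {arg!r}")
        defined.add(series)
    return [module for _, module, _ in steps]


name, steps = parse_schema(SCHEMA)
modules = check_dataflow(steps)
print(name + ": " + " -> ".join(modules))
```

Running the sketch prints the module chain in order, which matches the series definitions above: the wav module produces a, frontend turns a into f, normalizer turns f into nf, phraseprematch turns nf into w, and wout writes w to the output.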