The following schema describes the creation of language identification feature files that you can use to create and optimize language identification classifiers.
[LangIDFeature]
0 = a ← wav(MONO, input)
1 = f ← frontend(_, a)
2 = nf ← normalizer(_, f)
3 = lf ← lidfeature(_, nf)
4 = output ← lfout(_, lf)
0 | The wav module processes the mono audio data.
1 | The frontend module converts the audio data into speech front-end frame data.
2 | The normalizer module normalizes the frame data from 1 (f).
3 | The lidfeature module converts the normalized frame data (nf) into language identification feature data.
4 | The language identification feature data (lf) is written to the output file.
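To make the data flow through the schema easier to follow, here is a minimal Python sketch that reads the schema text and lists the modules in the order they run. It is for illustration only and is not part of the product. The names parse_schema and module_chain, and the regular expression, are assumptions. The sketch assumes every step has the form "index = name ← module(arguments)".

```python
import re

# The schema text exactly as shown above.
SCHEMA = """\
[LangIDFeature]
0 = a ← wav(MONO, input)
1 = f ← frontend(_, a)
2 = nf ← normalizer(_, f)
3 = lf ← lidfeature(_, nf)
4 = output ← lfout(_, lf)
"""

# Hypothetical pattern for one step: "<index> = <name> ← <module>(<args>)".
STEP = re.compile(r"^(\d+)\s*=\s*(\w+)\s*←\s*(\w+)\((.*)\)$")


def parse_schema(text):
    """Return one dict per schema step; lines that are not steps are skipped."""
    steps = []
    for line in text.splitlines():
        match = STEP.match(line.strip())
        if match is None:
            continue  # section header such as [LangIDFeature], or a blank line
        index, target, module, args = match.groups()
        steps.append({
            "index": int(index),
            "target": target,
            "module": module,
            "args": [arg.strip() for arg in args.split(",")],
        })
    return steps


def module_chain(steps):
    """Return the module names ordered by step index."""
    return [step["module"] for step in sorted(steps, key=lambda s: s["index"])]


steps = parse_schema(SCHEMA)
print(" -> ".join(module_chain(steps)))
# prints: wav -> frontend -> normalizer -> lidfeature -> lfout
```

In each parsed step, the last argument names the data produced by the previous step (a, f, nf, lf), which is how the schema chains the modules together.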