IDOL Speech Server processes both stored and live audio. Audio for processing can be acquired from the following sources:
IDOL Speech Server cannot directly capture audio from an audio device or handle media streams. To present audio data to IDOL Speech Server, you must either:
Note: IDOL Speech Server supports nearly all audio and video file formats if you set the FfmpegDirectory parameter in the speechserver.cfg configuration file. (For more information about this parameter, see the IDOL Speech Server Reference.) If you do not set this parameter, IDOL Speech Server accepts only 16-bit, linear Pulse Code Modulation (PCM) format WAV files.
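When FfmpegDirectory is not set, it can be useful to check a file before you send it. The following is a minimal sketch, not part of IDOL Speech Server, that uses Python's standard wave module to test whether a file is an uncompressed WAV file with 16-bit samples. The function name is_16bit_pcm_wav is hypothetical, and the check covers only the sample format stated in the note above, not any other requirement the server might impose.

```python
import wave


def is_16bit_pcm_wav(path):
    """Return True if `path` is an uncompressed WAV file with 16-bit samples.

    Hypothetical helper for illustration; it is not an IDOL Speech Server API.
    """
    try:
        with wave.open(path, "rb") as wav:
            # The wave module reads only uncompressed (linear PCM) WAV data,
            # so a successful open already rules out compressed encodings.
            # getsampwidth() returns the sample width in bytes: 2 means 16-bit.
            return wav.getcomptype() == "NONE" and wav.getsampwidth() == 2
    except (wave.Error, EOFError):
        # Not a WAV file, or a WAV encoding that the wave module cannot read.
        return False
```

A file for which this returns False would need to be converted to 16-bit linear PCM WAV first, or submitted to a server that has FfmpegDirectory configured.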
For best results, an audio file should:
An audio stream must also:
The following sections provide more detail about each aspect of audio quality.