IDOL Speech Server stores phoneme time track information in an .fmd file. You process an individual audio file to create a phoneme time track file. You can then search this file, or combine time track files for multiple audio files into a single file, which you can then search.
To create a phoneme time track file for an audio file
Send an AddTask
action to IDOL Speech Server, and set the following parameters:
Type
|
The task name. Set to WavToFMD . |
File
|
The audio file to process. To restrict processing to a section of the audio file, set the start and end times in the |
Lang
|
The phonetic phrase match language pack to use. For more information about available language packs, see Supported Resources. |
Out
|
The name of the phoneme time track file to produce. |
For example:
http://localhost:13000/action=AddTask&Type=WavToFMD&File=C:/myData/Speech.wav&Lang=ENUS-pm&Out=Speech1.fmd
This action uses port 13000
to instruct IDOL Speech Server, which is located on the local machine, to process the Speech.wav
file to produce the Speech1.fmd
time track file, using the ENUS-pm
language pack.
This action returns a token. You can use the token to:
To combine multiple phoneme time track files
Create a list that contains the names of the individual time track files to combine. Each entry in the list must be on a separate line, and must consist of label and file name pairs separated by a semicolon. The label corresponds to the name to give to the file, and the file names correspond to the full path of the files to be combined. For example:
Label1;filename1 Label2;filename2 ...
For more information about IDOL Speech Server's list manager, see Create and Manage Lists.
Send an AddTask
action to IDOL Speech Server, and set the following parameters:
Type
|
The task name. Set to CombineFMD . |
ListFile
|
The list that specifies the individual time track files to combine. |
ListPath
|
The path to the directory that contains the phoneme time track files. |
FileOut
|
The name of the combined phoneme time track file to produce. |
For example:
http://localhost:13000/action=AddTask&Type=CombineFMD&ListFile=ListManager/fmdList&ListPath=C:\PHRASESEARCH\fmd&FileOut=myData.fmd
This action uses port 13000
to instruct IDOL Speech Server, which is located on the local machine, to combine the individual phoneme time track files specified in the fmdList
list to produce the single file myData.fmd
.
This action returns a token. You can use the token to:
|