The following XML shows a single record produced by speaker identification.
<output> <record> <timestamp> ... </timestamp> <trackname>SpeakerId.Result</trackname> <SpeakerIdResult> <id>3543fda6-8fdb-4cda-b061-8b3765d24429</id> <identity> <identifier>newsreader3</identifier> <database>news</database> <confidence>1</confidence> <metadata> <item> <key>key1</key> <value>value1</value> </item> <item> <key>key2</key> <value>value2</value> </item> </metadata> </identity> <speakerName>newsreader3</speakerName> <gender>MALE</gender> <confidence>1</confidence> </SpeakerIdResult> </record> </output>
The record contains the following information:
id
element provides a unique identifier for the section of audio.The identity
element describes the speaker who was recognized. It contains the following information:
identifier
- the identifier of the speaker who was recognized.database
- the name of the database that contains the speaker.confidence
- the confidence score.metadata
- any custom metadata associated with the speaker. (You can add custom metadata to speakers in your training database).This element can be empty when a speaker is not recognized.
speakerName
element provides the name of the speaker. If the speaker is unknown, this element contains the name Unknown_
. If the section of audio does not contain speech this element contains NonSpeech_
.gender
element provides the gender of the speaker (FEMALE
, MALE
, or NonSpeech_
).confidence
element provides the confidence score for speaker identification. This element is deprecated. Micro Focus recommends using the identity/confidence
element instead.