Identify Speakers

To identify speakers

  1. Create a new configuration to send to Media Server with the process action, or open an existing configuration that you want to modify.

  2. In the [Session] section, add a new analysis task by setting the EngineN parameter. You can give the task any name, for example:

    [Session]
    Engine0=Ingest
    Engine1=RecognizeSpeakers
  3. Create a new section to contain the settings for the task, and set the following parameters:

    Type The analysis engine to use. Set this parameter to SpeakerID.
    Input (Optional) The audio track to process. If you do not specify an input track, Media Server processes the first track of the correct type produced by the ingest engine.
    Database The name of the database to use to recognize speakers. If you do not set this parameter Media Server cannot identify speakers, but can divide the audio into different speakers and detect the gender of each speaker.
    ClosedSet A Boolean value (default false) that specifies whether the audio contains only known speakers (who are in the database specified by the Database parameter). If the audio that you are processing contains only known speakers, set this parameter to true.
    SampleFrequency (Optional, default 16000) The sample frequency of the audio to send to the audio service for analysis, in samples per second (Hz).

    For example:

    [RecognizeSpeakers]
    Type=SpeakerID
    Database=news
    ClosedSet=false
    SampleFrequency=16000

    For more information about the parameters that you can use to configure this task, refer to the Media Server Reference.

  4. Save and close the configuration file. OpenText recommends that you save your configuration files in the location specified by the ConfigDirectory parameter.