The audiopreproc
module can perform several analysis operations on audio samples, covering audio quality as well as broad audio categorization.
The audiopreproc
module has five modes of operation. You can combine multiple modes into a single operation.
Mode | Input | Output | Description |
---|---|---|---|
A
|
a
|
w
|
Performs broad audio classification, breaking down the audio into segments of speech, silence, and non-speech/music. |
C
|
a
|
|
Reports overall percentages of clipping. |
S
|
a
|
|
Calculates the signal-to-noise ratio (SNR) over the entire file. |
S
|
a
|
w
|
Calculates the signal-to-noise ratio (SNR) over the entire file as well as producing SNR estimates across broad categorized speech segments using mode A . |
T
|
a
|
d
|
DTMF detection. Identifies tones corresponding to numbers 0-9, letters A-D, asterisk (*) and hash (#) keys. |
Examples:
w ← audiopreproc (A, a)
output ← audiopreproc (S, a)
w ← audiopreproc (ACS, a)
d ← audiopreproc (T, a)
MaxSegSize | OutputLogLabel |
MaxSilThresh | SampleFrequency |
MinSegSize | SNRFile |
MusicNoiseLab | SpeechLab |
OutputLog | SpeechThreshOffset |
|