To normalize the original transcript text, use the TextNorm
task. The normalized file must contain only one word on each line. Before you send the TextNorm
action, set the WordPerLine
parameter to True
in the [TextNorm]
section of the tasks configuration file.
Normalization might split single entities into multiple words. For example, 35 can become thirty five.
For more information about text normalization, see Normalize Text.
|