If you set soft_hyphens to true, soft hyphens are retained when text is filtered from PDF documents.
soft_hyphens
true
Default value: false
Configuration& Configuration::soft_hyphens(bool emit_softhyphens)