extract_text
The extract_text
function uses File Content Extraction to extract text from a data string.
Syntax
extract_text( data, isHtml)
Arguments
Argument | Description |
---|---|
data
|
(string) A string containing the data to extract text from. |
isHtml
|
(Boolean) Specifies whether the data is a snippet of HTML (default false ). If the data is a standard HTML document then you do not need to set this argument, because File Content Extraction automatically recognizes HTML documents. You might need to set this argument in cases where the data contains HTML markup but is not a complete document that begins with an <html> tag. |
Returns
String. Returns the text extracted from the string.
Example
local htmlSnippet = "some <b> html </b> with <i> unwanted </> markup"; local text = extract_text(htmlSnippet, true);