MinPageAge

The minimum age that a page must reach before it is ingested. The connector does not ingest pages that are newer than the specified age, though links from these pages might still be followed, depending on the value of SpiderDateFilteredPages.

To filter pages by date, the connector must be able to extract a date from the page URL, page content, or HTTP headers. Configure how to extract the date by setting DateInUrl, PageDateSelector, or PageDateHeader.

The value of this parameter is a time duration. Specify the duration in the following format:

     1years 2months 3weeks 4days 5hours 6minutes 7.8seconds

You can omit any of the individual durations when they are not required:

     6minutes 7seconds

You can also shorten the string by abbreviating each unit to its first letter. The exception is minutes, which abbreviates to n because m is used for months:

     6n 7s

If you want to specify the value in seconds, no units are required:

     60

The following are all equivalent:

     1minute 3seconds
     1n 3s
     63s
     63

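The equivalence of these values can be checked with a small script. The following is a minimal Python sketch of the duration format described above, not the connector's own parser. The function name duration_to_seconds is hypothetical, and the lengths used for months (30 days) and years (365 days) are assumptions made for illustration only.

```python
import re

# Seconds per unit. The month and year lengths are assumptions for this
# sketch (30 and 365 days); the connector may calculate them differently.
UNIT_SECONDS = {
    "years": 365 * 86400,
    "months": 30 * 86400,
    "weeks": 7 * 86400,
    "days": 86400,
    "hours": 3600,
    "minutes": 60,
    "seconds": 1,
}

# One-letter abbreviations. Minutes uses "n" because "m" means months.
ABBREVIATIONS = {
    "y": "years", "m": "months", "w": "weeks", "d": "days",
    "h": "hours", "n": "minutes", "s": "seconds",
}

# A component is a number followed directly by an optional unit, e.g. "7.8seconds".
COMPONENT = re.compile(r"(\d+(?:\.\d+)?)([a-z]*)")


def unit_to_seconds(unit):
    """Return the number of seconds in one of the given unit."""
    if unit == "":
        return 1  # no unit means seconds
    if unit in ABBREVIATIONS:
        unit = ABBREVIATIONS[unit]
    elif not unit.endswith("s"):
        unit += "s"  # accept singular names such as "minute"
    if unit not in UNIT_SECONDS:
        raise ValueError(f"unknown unit: {unit!r}")
    return UNIT_SECONDS[unit]


def duration_to_seconds(text):
    """Convert a duration string such as '1n 3s' to a number of seconds."""
    total = 0.0
    for component in text.lower().split():
        match = COMPONENT.fullmatch(component)
        if match is None:
            raise ValueError(f"unrecognized component: {component!r}")
        number, unit = match.groups()
        total += float(number) * unit_to_seconds(unit)
    return total


if __name__ == "__main__":
    for value in ["1minute 3seconds", "1n 3s", "63s", "63"]:
        print(value, "->", duration_to_seconds(value))
```

Under these assumptions, all four strings listed above convert to the same number of seconds.
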
Type: Time duration
Default: 0
Required: No
Configuration Section: TaskName or FetchTasks or Default
Example: MinPageAge=6hours
See Also:

MaxPageAge

MaxPageDate

MinPageDate

