MaxPageAge

The maximum age that a page can reach and still be ingested. The connector does not ingest pages that are older than the specified age, though links from these pages might still be followed, depending on the value of SpiderDateFilteredPages.

To filter pages by date the connector must be able to extract a date from the page URL, page content, or HTTP headers. Configure how to extract the date by setting DateInUrl, PageDateSelector, or PageDateHeader.

Tip: This parameter configures a rolling limit. For example, some pages might be ingested when you first synchronize with a Web site. On the next synchronize cycle, any pages that exceed the maximum age are removed from IDOL. To specify a fixed date limit, use the parameter MaxPageDate instead.

The value of this parameter is a time duration. Specify the duration in the following format:

     1years 2months 3weeks 4days 5hours 6minutes 7.8seconds

You can omit any of the individual durations when they are not required:

     6minutes 7seconds

You can also shorten the string by abbreviating the units to their first letter (months abbreviates to m, minutes to n):

     6n 7s

If you want to specify the value in seconds, no units are required:

     60

The following are all equivalent:

     1minute 3seconds
    1n 3s
     63s
    63
Type: Time duration
Default: No limit
Required: No
Configuration Section: TaskName or FetchTasks or Default
Example: MaxPageAge=1 week 2 days
See Also:

MaxPageDate

MinPageAge

MinPageDate


_HP_HTML5_bannerTitle.htm