MaxLinkPercentagePerPage

The maximum percentage of textual content on a page that can be represented by links, before the connector does not ingest the page. You can use this parameter to prevent the connector ingesting index and navigation pages. Pages that exceed the maximum percentage are still crawled and the connector follows the links that it finds.

Type: Double
Default: 100.0 (no limit, all pages are ingested)
Required: No
Configuration Section: TaskName or FetchTasks or Default
Example:

With the following example, the connector would not ingest a page if links made up more than half of the text:

MaxLinkPercentagePerPage=50.0
See Also: MaxLinksPerPage