SitemapUrl

The URL of a sitemap that lists the pages to ingest.

If you set this parameter, only the pages on the sitemap are ingested. The connector does not crawl the site by following links. You can set further parameters, including UrlCantHaveRegex and UrlMustHaveRegex, to filter the pages contained in the sitemap.
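As an illustration, a fetch task that reads a sitemap and then filters its pages might look like the sketch below. The task name MyTask0, the [FetchTasks] numbering layout, and both regular expressions are assumptions chosen for this example; check the exact matching behavior of the regex parameters in their own reference entries.

```ini
[FetchTasks]
Number=1
0=MyTask0

[MyTask0]
; Ingest only the pages listed in this sitemap; no link-following crawl takes place.
SitemapUrl=http://www.mywebsite.com/sitemap.xml
; Hypothetical filter: keep only sitemap entries under /products/.
UrlMustHaveRegex=.*/products/.*
; Hypothetical filter: skip sitemap entries that end in .pdf.
UrlCantHaveRegex=.*\.pdf
```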

TIP:

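Web Connector can retrieve information in one of the following ways:

By crawling a site, starting from the page set by Url.

By ingesting the pages listed in the sitemap set by SitemapUrl.

By ingesting the pages listed in the sitemap set by SitemapFile.

If you set more than one of these parameters, only one is used and the others are ignored. SitemapUrl takes precedence, followed by SitemapFile, followed by Url.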
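The precedence rule can be sketched with a task that sets two of these parameters at once. The task name MyTask0 and the Url value are hypothetical; the comments restate the behavior described above rather than observed connector output.

```ini
[MyTask0]
; Both parameters are set, so only one of them is used.
; SitemapUrl has precedence: the pages listed in the sitemap are ingested.
SitemapUrl=http://www.mywebsite.com/sitemap.xml
; Ignored because SitemapUrl is set: no crawl starts from this address.
Url=http://www.mywebsite.com/
```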

Type: String
Default:  
Required: You must set Url, SitemapUrl, or SitemapFile
Configuration Section: TaskName or FetchTasks or Default
Example: SitemapUrl=http://www.mywebsite.com/sitemap.xml
See Also:

IgnoreSitemapScopeErrors

Url

SitemapFile

