Url

A comma-separated list of strings where each string is the URL of a page to start crawling from.

TIP: Web Connector can retrieve information in one of the following ways:

  • To start from a URL and follow links to other pages, set the parameter Url.
  • To retrieve the pages contained in a sitemap, set the parameter SitemapUrl. A sitemap is an XML document that some websites use to present web crawlers with a list of pages to retrieve. If the site has a sitemap, using it is often the best option, because the connector retrieves the pages suggested by the site administrator. This can be easier than crawling the site and choosing the pages to ingest based on their URL or content.
  • To retrieve a list of URLs that are specified in a text file, set the parameter SitemapFile. You must create the file, which is not practical for large sites, but you might want to use this option if you have an external process generating the URLs.

If you set more than one of these parameters, the connector uses only one of them and ignores the others. SitemapUrl takes precedence, followed by SitemapFile, followed by Url.
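
The precedence rule can be illustrated with a short Python sketch. This is not the connector's implementation; the function names effective_source and split_urls, the dictionary of parameters, and the file name urls.txt are hypothetical and exist only to show which setting would take effect when several are present, and how a comma-separated Url value breaks down into individual start pages.

```python
# Illustrative sketch only; this is not how the connector is implemented.
# Precedence order as described in the text above.
PRECEDENCE = ("SitemapUrl", "SitemapFile", "Url")


def effective_source(task_params):
    """Return (name, value) for the parameter that takes effect, or None."""
    for name in PRECEDENCE:
        value = task_params.get(name, "").strip()
        if value:
            return name, value
    return None  # none of Url, SitemapUrl, or SitemapFile is set


def split_urls(value):
    """Split a comma-separated Url value into individual start URLs."""
    return [part.strip() for part in value.split(",") if part.strip()]


# Hypothetical task settings: both Url and SitemapFile are present.
params = {
    "Url": "http://www.mywebsite.com/,https://www.hpe.com/",
    "SitemapFile": "urls.txt",  # hypothetical file name
}

name, value = effective_source(params)
print(name, value)  # prints: SitemapFile urls.txt
```

In this sketch the Url value is ignored because SitemapFile is also set. To crawl from the listed URLs instead, the task would need to set Url alone.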

Type: String (comma-separated list)
Default:  
Required: You must set Url, SitemapUrl, or SitemapFile
Configuration Section: TaskName or FetchTasks or Default
Example:

Url=http://www.mywebsite.com/

or

Url0=http://www.mywebsite.com/
Url1=https://www.hpe.com/

See Also:

SitemapUrl

SitemapFile