This section describes how to troubleshoot common problems that might occur when you set up the HPE HTTP Connector.
If the connector cannot connect to the Web site that you want to index, check whether the connector machine is behind a proxy server. If this is the case, use the configuration parameters ProxyHost
and ProxyPort
(or ProxyFromLua
) to specify the host name or IP address, and port, of the proxy server.
If pages are not indexed, set the configuration parameter LogVerbose=true
. You can then view the synchronize
log file to see the links that are extracted from pages. Check your configuration to ensure that it does not exclude the pages that you want to index. The connector cannot parse Javascript, so any links contained in Javascript are not found by the connector and those pages are not indexed.
Some Web sites require visitors, and therefore the connector, to log on before they can retrieve content. You must set the LoginMethod
configuration parameter and provide credentials in the connector’s configuration file.
To determine the correct method to use to log in to a Web site, you can:
If you configure the connector to log on to a Web site by submitting a form, ensure that the connector submits all of the required fields.
|