Manipulate and Enrich Documents

CFS provides features to manipulate and enrich documents. This means that you can add additional information to the documents, and improve the quality of the information, before the documents are indexed into IDOL. For example, you can:

The simplest way to manipulate documents is to use the import tasks and index tasks that are included with CFS. For information about the tasks that are available, see Manipulate and Enrich Documents. You can configure these tasks by modifying configuration parameters in the CFS configuration file.

Import tasks can call other services, for example an IDOL Speech Server or Image Server. You can use an Image Server to perform Optical Character Recognition, logo detection, or face recognition on images. You can use an IDOL Speech Server to extract speech from audio and video files. This enables IDOL to use the data in images and speech for retrieval, clustering, and other operations.

CFS also supports Lua, an embedded scripting language. You can write Lua scripts to manipulate documents and define custom processing rules. For information about the Lua functions that are provided with CFS, refer to the Connector Framework Server Reference.


_HP_HTML5_bannerTitle.htm