HPE CFS provides features to manipulate and enrich documents. You can add information to documents, and improve the quality of the information they contain, before the documents are indexed into IDOL.
The simplest way to manipulate documents is to use the import tasks and index tasks that are included with CFS. For information about the tasks that are available, see Manipulate and Enrich Documents. You can configure these tasks by modifying configuration parameters in the HPE CFS configuration file.
Import tasks can call other services, for example Media Server or Speech Server. You can use Media Server to perform Optical Character Recognition, face recognition, and object recognition on images and video. You can use Speech Server to extract speech from audio and video files. This enables IDOL to use the data in images, video, and speech for retrieval, clustering, and other operations.
HPE CFS also supports Lua, an embedded scripting language. You can write Lua scripts to manipulate documents and define custom processing rules. For information about the Lua functions that are provided with HPE CFS, refer to the HPE Connector Framework Server Reference.
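As an illustration of a custom processing rule, the following is a minimal sketch of a Lua script that adds a field to a document. It assumes that HPE CFS calls a function named `handler` once for each document, and that the document object provides `getFieldValue` and `addField` methods; check these names and their exact signatures against the HPE Connector Framework Server Reference. The `TITLE_LENGTH` field name and the stand-in document object at the end are hypothetical and exist only so that the sketch can be run outside CFS.

```lua
-- Sketch of a Lua script for a CFS Lua task.
-- Assumption: CFS calls handler(document) once per document.
function handler(document)
    -- Assumption: getFieldValue returns the field's value, or nil if it is absent.
    local title = document:getFieldValue("DRETITLE")
    if title and title ~= "" then
        -- TITLE_LENGTH is a hypothetical field name used for illustration.
        document:addField("TITLE_LENGTH", tostring(#title))
    end
    -- Assumption: returning true lets the document continue through the pipeline.
    return true
end

-- Stand-in document object so the sketch runs in a plain Lua interpreter.
-- This is NOT part of the CFS API; it only mimics the two methods used above.
local function newMockDocument(fields)
    local doc = { fields = fields }
    function doc:getFieldValue(name) return self.fields[name] end
    function doc:addField(name, value) self.fields[name] = value end
    return doc
end

local doc = newMockDocument({ DRETITLE = "Annual report" })
handler(doc)
print(doc.fields.TITLE_LENGTH)
```

In a real deployment only the `handler` function would be needed; the mock object and the lines that follow it are a test harness and can be removed.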