You can customize the way that CFS processes a document by adding the following fields to the document. These fields are read by CFS and control processing. You can add a field to a document using a Lua script, configured to run on a Pre Import Task. Unless stated otherwise, the field can have any value.
Document Field | Description |
---|---|
AUTN_NO_FILTER
|
To prevent CFS extracting text, add this field to a document. This field can be useful if you do not want to extract text from certain file types. |
AUTN_NO_EXTRACT
|
To prevent CFS extracting subfiles, add this field to a document. This can be useful if you want to avoid fetching documents from inside a ZIP file. |
AUTN_NEEDS_IMAGE_SERVER_ANALYSIS
|
To run analysis on image files using the ImageServerAnalysis import task, you must add the field AUTN_NEEDS_IMAGE_SERVER_ANALYSIS to the documents. The ImageServerAnalysis task only runs on documents that have this field. |
AUTN_NEEDS_TRANSCRIPTION
|
To extract speech from audio and video, you must add the AUTN_NEEDS_TRANSCRIPTION field to a document. The IdolSpeech import task only runs on documents that have this field. |
AUTN_FORMAT_CORRECT_FOR_TRANSCRIPTION
|
To bypass the transcoding step of an IdolSpeech task, you can add this field to a document. Documents that have this field are not sent to a Transcode Server. |
AUTN_AUDIO_LANGUAGE
|
To bypass the language identification step of an IdolSpeech import task, you can add this field to a document. The value of the field must be the name of the IDOL Speech Server language pack to use for extracting speech. |
|