The filter software development kit (SDK) extracts text and metadata from a wide range of file formats on many platforms and can automatically recognize 1,500 document types.
Supports file-based and stream-based I/O operations.
Provides in-process and out-of-process filtering.
Convert virtually any document into HTML. Control the content, structure, and format of the HTML output with the HTML export SDK.
Give end users access to documents without requiring plugins or native applications.
Customizable templates and flexible APIs.
End users can view any document in PDF format, eliminating the need to download special applications.
Extract structure from documents to create well-formed, valid XML.
XSL style sheets or CSS can be used to display the XML data in a human-friendly form.
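As a rough illustration of the XSL route, the sketch below applies a small style sheet to a piece of XML using the standard Java `javax.xml.transform` API. The element names (`document`, `title`, `para`), the sample content, and the class name `XmlToHtml` are assumptions made for this example only; they are not the schema or API of the SDK described here.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

public class XmlToHtml {
    // Hypothetical stand-in for exported XML; the real output schema is not shown in this document.
    static final String SAMPLE_XML =
        "<document><title>Quarterly Report</title>"
        + "<para>Revenue grew.</para><para>Costs fell.</para></document>";

    // A minimal XSLT 1.0 style sheet that maps the assumed elements to HTML.
    static final String STYLESHEET =
        "<xsl:stylesheet version=\"1.0\" xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">"
        + "<xsl:output method=\"html\" indent=\"no\"/>"
        + "<xsl:template match=\"/document\">"
        + "<html><body><h1><xsl:value-of select=\"title\"/></h1>"
        + "<xsl:for-each select=\"para\"><p><xsl:value-of select=\".\"/></p></xsl:for-each>"
        + "</body></html>"
        + "</xsl:template>"
        + "</xsl:stylesheet>";

    // Applies the style sheet to the XML and returns the transformed markup as a string.
    static String render(String xml, String xsl) throws TransformerException {
        Transformer transformer = TransformerFactory.newInstance()
            .newTransformer(new StreamSource(new StringReader(xsl)));
        StringWriter out = new StringWriter();
        transformer.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
        return out.toString();
    }

    public static void main(String[] args) throws TransformerException {
        System.out.println(render(SAMPLE_XML, STYLESHEET));
    }
}
```

The same XML could instead be paired with a CSS file and displayed directly in a browser; the XSL approach is shown because it can also restructure the content, not only style it.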