You can develop custom document filters for single file formats like PDF, as well as for documents that may contain multiple formats like XML, ZIP, or e-mail messages. The multidocument file support allows separate documents in a single file to be indexed separately. Implement the following interface for the custom document filter:
interface DocumentFilter { void setName(java.lang.String name); java.lang.String getName(); void setMimeTypes(java.lang.String[] mimeTypes); java.lang.String[] getMimeTypes();
java.util.Properties getProperties(); void setProperties(java.util.Properties); FilteredDocumentIterator filter(FilterInput fltInput) throws java.io.IOException: }
The filter method allows custom document filters to return multiple documents from a single file or input stream argument. You can also implement the following interfaces in DocumentFilters.xml for better configuration support:
com.omniq.flt.InputMatchingFilter
com.omniq.flt.TempFileFilter
com.omniq.flt.TimeoutFilter
com.omniq.flt.TextDocumentFilter
For a detailed information about implementing this interface, see the Javadocs in: install_location\webapp\docs\api\index.html