Developing custom document filters

You can develop custom document filters for single file formats such as PDF, as well as for files that can contain multiple documents or formats, such as XML, ZIP, or e-mail messages. Multidocument file support allows the separate documents in a single file to be indexed separately. To create a custom document filter, implement the following interface:

interface DocumentFilter {
     void setName(java.lang.String name);
     java.lang.String getName();
     void setMimeTypes(java.lang.String[] mimeTypes);
     java.lang.String[] getMimeTypes();
     java.util.Properties getProperties();
     void setProperties(java.util.Properties properties);
     FilteredDocumentIterator filter(FilterInput fltInput)
            throws java.io.IOException;
}

The filter method returns a FilteredDocumentIterator, which allows a custom document filter to return multiple documents from a single file or input stream. You can also implement the following interfaces in DocumentFilters.xml for better configuration support:

For detailed information about implementing this interface, see the Javadoc at: install_location\webapp\docs\api\index.html
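
To illustrate how one input can yield several documents through the DocumentFilter interface, the following is a minimal sketch rather than a product sample. It assumes a plain-text input in which blank lines separate documents. The FilterInput and FilteredDocumentIterator types declared here are hypothetical stand-ins, because their real members are documented only in the Javadoc; the getInputStream method, the BlankLineSplitFilter class, and the splitToList helper are likewise invented for this example.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Properties;

public class BlankLineSplitFilterSketch {

    // HYPOTHETICAL stand-ins. The real FilterInput and FilteredDocumentIterator
    // types are defined by the product API and may look quite different.
    interface FilterInput {
        InputStream getInputStream() throws IOException;
    }

    interface FilteredDocumentIterator extends Iterator<String> {
    }

    // Same method set as the DocumentFilter interface shown in this topic.
    interface DocumentFilter {
        void setName(String name);
        String getName();
        void setMimeTypes(String[] mimeTypes);
        String[] getMimeTypes();
        Properties getProperties();
        void setProperties(Properties properties);
        FilteredDocumentIterator filter(FilterInput fltInput) throws IOException;
    }

    // Example filter: treats each blank-line-separated block as one document.
    static class BlankLineSplitFilter implements DocumentFilter {
        private String name = "blank-line-split";
        private String[] mimeTypes = {"text/plain"};
        private Properties properties = new Properties();

        public void setName(String name) { this.name = name; }
        public String getName() { return name; }
        public void setMimeTypes(String[] mimeTypes) { this.mimeTypes = mimeTypes.clone(); }
        public String[] getMimeTypes() { return mimeTypes.clone(); }
        public Properties getProperties() { return properties; }
        public void setProperties(Properties properties) { this.properties = properties; }

        public FilteredDocumentIterator filter(FilterInput fltInput) throws IOException {
            List<String> documents = new ArrayList<>();
            StringBuilder current = new StringBuilder();
            try (BufferedReader reader = new BufferedReader(
                    new InputStreamReader(fltInput.getInputStream(), StandardCharsets.UTF_8))) {
                String line;
                while ((line = reader.readLine()) != null) {
                    if (line.trim().isEmpty()) {
                        flush(current, documents); // a blank line ends the current document
                    } else {
                        if (current.length() > 0) current.append('\n');
                        current.append(line);
                    }
                }
            }
            flush(current, documents); // emit the final document if the input did not end with a blank line
            Iterator<String> it = documents.iterator();
            return new FilteredDocumentIterator() {
                public boolean hasNext() { return it.hasNext(); }
                public String next() { return it.next(); }
            };
        }

        // Adds the buffered text as one document, then clears the buffer.
        private static void flush(StringBuilder current, List<String> documents) {
            if (current.length() > 0) {
                documents.add(current.toString());
                current.setLength(0);
            }
        }
    }

    // Convenience wrapper for trying the filter on an in-memory string.
    static List<String> splitToList(String text) {
        FilterInput input = () -> new ByteArrayInputStream(text.getBytes(StandardCharsets.UTF_8));
        List<String> result = new ArrayList<>();
        try {
            new BlankLineSplitFilter().filter(input).forEachRemaining(result::add);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return result;
    }

    public static void main(String[] args) {
        for (String doc : splitToList("first message\nline two\n\nsecond message\n")) {
            System.out.println("--- document ---");
            System.out.println(doc);
        }
    }
}
```

The sketch collects all documents in memory before returning the iterator, which keeps the example short. A filter for large archives would more likely produce each document lazily as the iterator advances.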