Sybase Search uses the Content Adapter, which includes Stellent document filters for parsing many document formats. The Stellent document filter is a multifilter: a single filter instance handles all supported MIME types. The Stellent filter is therefore configured to handle the MIME type */*, indicating that it can filter text from documents of any MIME type presented to it.
When Sybase Search obtains a filter for a document, it first identifies the document's MIME type from the file extension. For example, C:\document.pdf has the type “application” and the subtype “pdf”, which gives the MIME type application/pdf. Sybase Search then requests a filter from the Filter Factory to handle documents with the identified MIME type.
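As an illustration only, identifying a MIME type from a file extension might look like the following Python sketch. The extension table and the function names mime_type_for and split_mime are hypothetical; this section does not show the mapping that Sybase Search actually uses.

```python
from pathlib import PureWindowsPath

# Hypothetical extension table for illustration; not the product's real mapping.
EXTENSION_TO_MIME = {
    ".pdf": "application/pdf",
    ".txt": "text/plain",
    ".html": "text/html",
}


def mime_type_for(path):
    """Return the MIME type implied by the file extension, or None if unknown."""
    # PureWindowsPath parses Windows-style paths on any operating system.
    suffix = PureWindowsPath(path).suffix.lower()
    return EXTENSION_TO_MIME.get(suffix)


def split_mime(mime_type):
    """Split a MIME type such as 'application/pdf' into (type, subtype)."""
    main_type, subtype = mime_type.split("/", 1)
    return main_type, subtype
```

With this table, mime_type_for(r"C:\document.pdf") yields application/pdf, which split_mime separates into the type “application” and the subtype “pdf”.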
The filter look-up is performed in this order:
1. If a filter is configured to handle the specific MIME type, that filter instance is returned.
2. Otherwise, if a multifilter (*/*) is configured, that filter instance is returned.
3. Otherwise, no filter is returned, denoting “not indexable.”
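The look-up order above can be sketched in Python as follows. This is a minimal model under stated assumptions, not the product's implementation: the FilterFactory class, its register and get_filter methods, and the string placeholders standing in for filter instances are all hypothetical.

```python
MULTIFILTER = "*/*"  # wildcard MIME type that a multifilter is registered under


class FilterFactory:
    """Hypothetical stand-in for the Filter Factory described above."""

    def __init__(self):
        self._filters = {}  # MIME type -> filter instance

    def register(self, mime_type, filter_instance):
        """Configure a filter instance for one MIME type (or for */*)."""
        self._filters[mime_type.lower()] = filter_instance

    def get_filter(self, mime_type):
        """Return the filter for mime_type, or None if it is not indexable."""
        # Step 1: a filter configured for this specific MIME type wins.
        specific = self._filters.get(mime_type.lower())
        if specific is not None:
            return specific
        # Step 2: fall back to the multifilter if one is configured.
        # Step 3: dict.get returns None when no multifilter exists either.
        return self._filters.get(MULTIFILTER)


# Placeholder strings stand in for real filter instances.
factory = FilterFactory()
factory.register(MULTIFILTER, "stellent-multifilter")
factory.register("text/plain", "plain-text-filter")

pdf_filter = factory.get_filter("application/pdf")   # falls back to the multifilter
text_filter = factory.get_filter("text/plain")       # uses the specific filter
no_filter = FilterFactory().get_filter("text/plain")  # nothing configured
```

In this sketch, a specific registration always takes precedence over the */* entry, and a factory with no matching entry returns None, which corresponds to the “not indexable” outcome.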