You can use preserved terms to ensure that certain terms are not removed during indexing and querying. For example, the term “US” would be removed from any extracted text if the term “us” were entered in the list of stopwords. Adding “US” to the case-sensitive list of preserved terms ensures that “us” is still removed, while “US” is indexed and made available to the query calculations.
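The interaction between stopwords and preserved terms can be illustrated with a minimal Python sketch. This is not OmniQ code: the function name `filter_terms` and its parameters are hypothetical, and the sketch assumes that stopword matching ignores case while preserved-term matching is case-sensitive, as the example above implies.

```python
def filter_terms(tokens, stopwords, preserved):
    """Drop stopwords (case-insensitive) unless the exact token is preserved.

    Hypothetical helper for illustration only; it does not reflect how
    the Text Manager is implemented internally.
    """
    stop = {w.lower() for w in stopwords}
    kept = []
    for tok in tokens:
        if tok in preserved:
            # Case-sensitive match against the preserved list: always keep.
            kept.append(tok)
        elif tok.lower() in stop:
            # Case-insensitive match against the stopword list: drop.
            continue
        else:
            kept.append(tok)
    return kept


tokens = "The US team met us in Boston".split()
print(filter_terms(tokens, stopwords={"the", "us", "in"}, preserved={"US"}))
# prints ['US', 'team', 'met', 'Boston']
```

Without “US” in the preserved set, both “US” and “us” would be dropped by the stopword check.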
You can change the list of preserved terms in one of two ways:
Edit the list of words in the default preserved terms file, preserved_terms_en.tx, located in %OMNIQ_3.0%\OmniQ\config, or
Create a new preserved terms file and configure the Text Manager to read from the new file by editing %OMNIQ_3.0%\OmniQ\config\TextModule.default.xml and changing the value of the preserved.terms.filename parameter to point to the new file.
The preserved term list must be UTF-8 encoded, and you must make
any changes to it before you index any documents, because preserved
terms require special handling during indexing. If you have already
indexed documents, changing the preserved terms has no effect: the
terms are fixed in the indexes and must still be queried exactly as
before to produce matches.
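Because the list must be UTF-8 encoded, it can be worth checking an edited file before indexing. The following is a minimal sketch in Python, independent of OmniQ; the function name `is_valid_utf8` is hypothetical, and the check only confirms that the bytes decode as UTF-8, not that the file contents are otherwise acceptable to the Text Manager.

```python
from pathlib import Path


def is_valid_utf8(path):
    """Return True if the file's bytes decode cleanly as UTF-8."""
    try:
        Path(path).read_bytes().decode("utf-8")
        return True
    except UnicodeDecodeError:
        # At least one byte sequence is not valid UTF-8, for example
        # text saved in a legacy single-byte code page.
        return False
```

Call `is_valid_utf8` with the path to your preserved terms file; a result of `False` suggests the file was saved in a different encoding and should be re-saved as UTF-8 before you index.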
Copyright © 2005. Sybase Inc. All rights reserved.