Creating, editing, and removing categories

You can set up categories by defining an initial query and a relevance threshold. You can also include metadata filtering. A category must have at least one search term or at least one metadata expression.

Using the initial query and the given threshold of the document’s relevance percent, Sybase Search assigns a document to the category if its relevance percent is equal to or greater than the threshold.

For example, a query that consists of search terms and a minimum document relevance creates a category of documents that are grouped by their relevance to search terms defined in the given query. The use of the document relevance helps ensure that the documents in the category are valid matches.

Another example is a category query that consists of only metadata, such as “fileType = HTML”. This creates a category that only contains HTML documents.Users can search within a category about a certain subject, such as “England World Cup football.” Or, the user can simply use a category to filter search results, such as searching within a category of HTML documents.

Another way of categorizing documents is based on the content from one or more training documents. In this method, Sybase Search extracts the most relevant content from the training documents and uses this information as a new internal query to generate matching documents. This process is similar to the "find similar" feature except, with category training, relevant content is extracted from more than one document, ensuring that the extracted content is relevant to each of the training documents.

This method has the following benefits:

StepsCreating a category using a base query

  1. Click Document Management. The Document Stores Summary page appears.

  2. Click Categories. The Categories page appears.

  3. Click Create. The Create Category page appears.

  4. In the Category Query Terms field, enter a natural-language query. The more information you provide, the more accurate your results are. For more information, see “Searching across documents”.

  5. In the Not Terms field, enter terms to indicate concepts dissimilar to those for which you are searching. For more information, see “Searching across documents”.

  6. Select the Details tab.

  7. In the Name field, enter a text to distinguish the category from others.

  8. In the Description field, enter text to further describe the category.

  9. Select the Document Groups tab.

  10. From the group list, select one or more document groups to restrict your search.

  11. Select the Metadata tab. To include metadata in the category:

    1. Select a metadata parameter from the metadata list. You can add as many as five metadata parameters to the category. Click Add to add more metadata parameters.

    2. Select an operator. All metadata types support the equal to (=) operator. The integer and date types also support greater than or equal to(>=) and the less than or equal to (<=) operators.

    3. Enter a value for the metadata parameter. Table 6-1 lists the predefined metadata parameters and types.

    4. If the metadata parameter contains a value that consists of more than one term, select the Within expression operator when the metadata parameter contains a value that consists of more than one term.

      When you set the operator to AND, every term must be present in the document metadata for the match to succeed. When you set the operator to OR, only one of the terms must be present in the document metadata for the match to succeed.

    5. If you have defined at least two metadata parameters, select the Across Expressions operator. When you set the operator to AND, both metadata parameters must succeed for the match to succeed. When you set the operator to OR, only one of the metadata parameters must succeed.

      For more information, see, “Searching across documents”.

  12. Select the Result Options tab. To set up result options:

    1. From the Minimum document relevance list, select a percentage. The percentage you select defines the minimum relevance ranking that a document must score for it to be included within the category. Documents with scores lower than the percentage that you enter are not included.

    2. Select the Score unknown terms check box to specify that terms unknown to the system—which, therefore do not exist in any indexed document—are considered by the scoring algorithm.

    3. Under Training Options, specify the number of results to display per page and the number of paragraphs to display for each document. Select the Term highlighting check box to highlight the query terms in the search results.

      NoteThe fields under Training Options assist category training and have no effect on category creation. The values specified in these fields are not saved during category creation.

  13. Click Create. Sybase Search creates the category, assigns a unique system-generated numeric ID to it, and automatically adds documents that match the category criteria. Sybase Search displays the new category and list of relevant documents on the View Category page.

StepsCreating a category using training documents

  1. Perform steps 1 through 12 of “Creating a category using a base query”.

  2. Click Run Category Query. The search results are displayed in the search results pane. Each search result contains the Add to training documents link.

  3. From the search results, determine the training document that matches the information you are searching and click Add to training documents. Name or title of the specified document appears in the Training documents box. You can add up to five training documents.

  4. Select the Training Documents option.

  5. Click Train Category. The search result displays documents that fall into the category, sorted by relevance.

  6. Click Create. Sybase Search creates the category and returns you to the Categories page.

StepsCreating a category

  1. From the Categories page, determine the category that you want to edit.

  2. Click Edit.

    The Edit Category page appears.

  3. Make the required changes. For more information about each category property, see “Creating a category using a base query”.

  4. Click Save. Sybase Search updates the category properties and returns you to the Categories page.

StepsRemoving a category

  1. From the Categories page, determine the category that you want to remove.

  2. Click Remove. A message appears, asking you to verify whether to remove the category.

  3. Click OK. Sybase Search removes the category and returns you to the Categories page.