Automatic indexing

The automatic indexing utility is part of Enterprise Portal’s concept-based search service. Use the indexing utility to schedule the indexing of documents in specified data sources. You must index the data sources before you can search them. See “Performing concept-based searches” for a description of concept-based searching.

The automatic indexing utility runs continuously and performs operations from one or more queue files or directories. Specify the names of the queue files or directories in the autoindexer.cfg file. See “Configuration file”.

NoteThe automatic indexing utility PortalSearchAutoindexer runs as a process on UNIX and as both a process and a service on Windows.

A queue file contains a list of file names with a full or relative path. The automatic indexing utility automatically processes new files that appear in the directory specified in the queue file, together with any files that have been replaced or updated. A file’s indexed contents are saved to another file with an idx extension, which can be read by the DRE. The automatic indexing utility also processes data formats such as HTML, Microsoft Word documents, and PDF files.

On Windows platforms, once you start the DRE, you can start the indexing utility as a Windows service that adds new data content dynamically. Communication between the indexing utility and the DRE is fully automated and, once you invoke it, the process can run continuously.

The PortalSearchautoindexer.log file records all the actions performed by the indexing utility, and is located in the PortalSearch/autoindexer subdirectory of your EP installation.

The indexing utility automatically attempts to import and index any new files that appear in the data source directories. Be aware of any applications that create temporary files in a data source directory. For example, if you run Microsoft Word from a data source directory, the temporary files it creates are processed by the indexing utility.

You can define the data source directories, file types, the polling period for data files, and so on, in the autoindexer’s configuration file. The indexing utility uses one configuration file, PortalSearchautoindexer.cfg, which is located in the PortalSearch/autoindexer subdirectory of your EP installation.

NoteIn some older installations, the indexing utility configuration file is DRE.INI.