Example 1: Improving data retrieval using concept-based searches

A global professional recruitment company handles millions of hiring clients and job seekers across the globe, painstakingly searching vast amounts of data in different sources to match job specifications against the resumes of job seekers.

Resumes are stored in relational databases and document systems based on the formats in which they are received. Job specifications are in Microsoft Word files on the Internet while enquiries from companies and job seekers are saved in the in-house file systems. Recruitment consultants must know the format and location of information they want to search.

Incoming resumes and vacancies are manually sorted. Each piece of data is manually tagged to aid search. If errors are made in specifying metadata information, that information may be lost to any future searches. Each resume is tagged manually with keywords to locate these resumes in a future search. This system is time-consuming and gives inaccurate results. Often, the same keywords are used for very different candidates and the search yields results that do not accurately match job descriptions.

Each format and file type is searched using that tool’s internal search capability. This requires the searcher to capably perform a smart search, which, if not executed properly, can yield many unwanted results. There is no capability to search in natural language.

The recruitment company wants to:

Sybase Search provides transparent access to structured and unstructured data in your organization using concept-based search capability across numerous data formats.

NoteSybase Search uses the Sybase Search Content Adapter, which is an add-on option you can purchase separately, to perform searches across proprietary document formats such as Microsoft Word and Adobe Acrobat PDF documents.

Figure 2-1: Sybase Search data flow

This is a Sybase Search data flow diagram. It illustrates the capability to query, search, and find data from structured information such as database and unstructured information such as file systems or network drives. It also illustrates the administration capabilities provided, by Data Services Administrator, of the Data Integration Suite.

Data flow

Sybase Search connects to each data source, which are file systems and databases in the customer’s organization. Using the content-based catalog and search tool, the Search component automatically analyzes, indexes, and categorizes data and prepares the system for users to perform category specific searches. It extracts and processes the text content from file systems, databases, and Internet where the content is unstructured.

Recruitment consultants can use Sybase Search to search for information in any of these ways:

Administration

Data Services Administrator (DSA) enables you to administer the Search component through a GUI-based server manager accessible via a Web console.