/research/projects/fastac/default.htm

originally: http://www.oclc.org/research/projects/fastac/default.htm

FAST as a knowledge base for automatic classification

Goal

Evaluate FAST as a database to support automatic classification.

Description

FAST is based on LCSH and is designed for ease of use in the online environment. This project will assess the suitability of FAST authority records as a knowledge base for automatic classification, employing techniques that were developed for creating, testing, and evaluating Scorpion databases derived from the Dewey Decimal Classification and the Library of Congress Classification.

A successful outcome of this investigation would be a knowledge base for automatic classification derived from a standard that is already in widespread use in digital libraries, which is also scalable, publicly accessible, and compatible with Open Source software.

Research methodology

The approach to this project will include system building and empirical evaluation.

Why OCLC is conducting this research and how it helps libraries

This project rates high on four OCLC Research project-selection criteria because it has the potential to:

  1. enhance existing assets
  2. promote collaboration and consensus
  3. leverage the value of the cooperative
  4. contribute to scholarship.

Anticipated deliverables

The main outcome anticipated for this project is the creation of automated systems of use to the library community. More specific results include:

  1. publicly accessible Scorpion databases
  2. a research paper
  3. algorithms and software for recovering concept hierarchies
  4. algorithms and software for mapping terms to records from an external source.

Schedule

Risks/Assumptions

  1. Hierarchies in FAST records may have internal inconsistencies or may not be easily recovered.
  2. Methods for creating subsets of FAST records may produce ad-hoc results.
  3. Automatic classification requires editorial support.

References

Project team