Investigations in Unsupervised Back-of-the-Book Indexing



This paper discusses investigations in unsupervised back-of-the-book indexing.

Physical Description

6 p.

Creation Information

Csomai, Andras & Mihalcea, Rada, 1974-. May 2007.


This paper is part of the collection entitled UNT Scholarly Works and was provided by the UNT College of Engineering to the UNT Digital Library, a digital repository hosted by the UNT Libraries.


People and organizations associated with either the creation of this paper or its content.


Provided By

UNT College of Engineering

The UNT College of Engineering strives to educate and train engineers and technologists who have the vision to recognize and solve the problems of society. The college comprises six degree-granting departments of instruction and research.





Abstract: This paper describes experiments with unsupervised methods for back-of-the-book index construction. Through comparative evaluations on a gold-standard data set of 29 books and their corresponding indexes, the authors draw conclusions about which unsupervised methods are most accurate for automatic index construction. They show that, with the right sequence of methods and heuristics, the performance of an unsupervised back-of-the-book index construction system can be raised by up to 250% (relative increase in F-measure) compared with a system based on the traditional tf*idf weighting scheme.
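The two quantities named in the abstract, the tf*idf baseline weighting and a relative increase in F-measure, can be illustrated with a minimal Python sketch. This is not the authors' system: the function names, the toy corpus, and the scores 0.10 and 0.35 are hypothetical values chosen for illustration, and the idf variant shown (natural log of document count over document frequency) is one common formulation that the paper may or may not use.

```python
import math
from collections import Counter

def tfidf_scores(doc_tokens, corpus):
    """Score each term of doc_tokens by tf * idf, where idf = ln(N / df).

    corpus is a list of token lists; terms absent from the corpus get 0.0.
    (Illustrative formulation only, not necessarily the paper's.)
    """
    tf = Counter(doc_tokens)
    n_docs = len(corpus)
    scores = {}
    for term, count in tf.items():
        df = sum(1 for doc in corpus if term in doc)
        idf = math.log(n_docs / df) if df else 0.0
        scores[term] = count * idf
    return scores

def f_measure(predicted, gold):
    """Balanced F-measure: harmonic mean of precision and recall."""
    predicted, gold = set(predicted), set(gold)
    true_pos = len(predicted & gold)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(predicted)
    recall = true_pos / len(gold)
    return 2 * precision * recall / (precision + recall)

def relative_increase(new, baseline):
    """Relative increase of new over baseline, as a percentage."""
    return 100.0 * (new - baseline) / baseline

# Hypothetical scores: a baseline F-measure of 0.10 raised to 0.35
# corresponds to a relative increase of about 250%.
gain = relative_increase(0.35, 0.10)
```

Under this reading, a 250% relative increase means the improved F-measure is 3.5 times the baseline, not 2.5 times.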


  • The Florida Artificial Intelligence Research Society (FLAIRS) Conference, 2007, Key West, Florida, United States



This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.



Dates and time periods associated with this paper.

Creation Date

  • May 2007

Added to The UNT Digital Library

  • Jan. 31, 2011, 2:01 p.m.

Description Last Updated

  • March 27, 2014, 12:07 p.m.


Csomai, Andras & Mihalcea, Rada, 1974-. Investigations in Unsupervised Back-of-the-Book Indexing, paper, May 2007 (accessed June 20, 2024), University of North Texas Libraries, UNT Digital Library; crediting UNT College of Engineering.
