Creating a Testbed for the Evaluation of Automatically Generated Back-of-the-book Indexes

PDF Version Also Available for Download.

Description

This paper discusses automatic generating of back-of-the-book indexes.

Physical Description

12 p.

Creation Information

Csomai, Andras & Mihalcea, Rada, 1974- February 2006.

Context

This paper is part of the collection entitled: UNT Scholarly Works and was provided by UNT College of Engineering to Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 147 times . More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

Organizer of meeting

Publisher

Provided By

UNT College of Engineering

The UNT College of Engineering promotes intellectual and scholarly pursuits in the areas of computer science and engineering, preparing innovative leaders in a variety of disciplines. The UNT College of Engineering encourages faculty and students to pursue interdisciplinary research among numerous subjects of study including databases, numerical analysis, game programming, and computer systems architecture.

Contact Us

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Degree Information

Description

This paper discusses automatic generating of back-of-the-book indexes.

Physical Description

12 p.

Notes

Abstract: The automatic generation of back-of-the-book indexes seems to be out of sight of the Information Retrieval and Natural Language Processing communities, although the increasingly large number of books available in electronic format, as well as recent advances in key-phrase extraction, should motivate an increased interest in this topic. In this paper, the authors describe the background relevant to the process of creating back-of-the-book indexes, namely (1) a short overview of the origin and structure of back-of-the-book indexes, and (2) the correspondence that can be established between techniques for automatic index construction and keyphrase extraction. Since the development of any automatic system requires in the first place an evaluation testbed, the authors describe their work in building a gold standard collection of books and indexes, and the authors present several metrics that can be used for the evaluation of automatically generated indexes against the gold standard. Finally, the authors investigate the properties of the gold standard index, such as index size, length of index entries, and upper bounds on coverage as indicated by the presence of index entries in the document.

Source

  • Conference on Computational Linguistics and Intelligent Text Processing (CICLing), 2006, Mexico City, Mexico

Language

Item Type

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

What responsibilities do I have when using this paper?

When

Dates and time periods associated with this paper.

Creation Date

  • February 2006

Added to The UNT Digital Library

  • Jan. 31, 2011, 2:01 p.m.

Description Last Updated

  • Sept. 25, 2015, 1:08 p.m.

Usage Statistics

When was this paper last used?

Yesterday: 0
Past 30 days: 1
Total Uses: 147

Interact With This Paper

Here are some suggestions for what to do next.

Start Reading

PDF Version Also Available for Download.

Citations, Rights, Re-Use

Csomai, Andras & Mihalcea, Rada, 1974-. Creating a Testbed for the Evaluation of Automatically Generated Back-of-the-book Indexes, paper, February 2006; [Berlin, Germany]. (digital.library.unt.edu/ark:/67531/metadc30982/: accessed December 13, 2017), University of North Texas Libraries, Digital Library, digital.library.unt.edu; crediting UNT College of Engineering.