An Automatic Method for Generating Sense Tagged Corpora

PDF Version Also Available for Download.

Description

This paper discusses an automatic method for generating sense tagged corpora.

Physical Description

6 p.

Creation Information

Mihalcea, Rada, 1974- & Moldovan, Dan I. 1999.

Context

This paper is part of the collection entitled: UNT Scholarly Works and was provided by UNT College of Engineering to Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 78 times . More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

Provided By

UNT College of Engineering

The UNT College of Engineering promotes intellectual and scholarly pursuits in the areas of computer science and engineering, preparing innovative leaders in a variety of disciplines. The UNT College of Engineering encourages faculty and students to pursue interdisciplinary research among numerous subjects of study including databases, numerical analysis, game programming, and computer systems architecture.

Contact Us

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Degree Information

Description

This paper discusses an automatic method for generating sense tagged corpora.

Physical Description

6 p.

Notes

Copyright 1999 American Association for Artificial Intelligence (AAAI). All rights reserved. http://www.aaai.org

Abstract: The unavailability of very large corpora with semantically disambiguated words is a major limitation in text processing research. For example, statistical methods for word sense disambiguation of free text are known to achieve high accuracy results when large corpora are available to develop context rules, to train and test them. This article presents a novel approach to automatically generate arbitrarily large corpora for word senses. The method is based on (1) the information provided in WordNet, used to formulate queries consisting of synonyms or definitions of word senses, and (2) the information gathered from Internet using existing search engines. The method was tested on 120 word senses and a precision of 91% was observed.

Source

  • Sixteenth National Conference on Artificial Intelligence, 1999, Orlando, Florida, United States

Language

Item Type

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

What responsibilities do I have when using this paper?

When

Dates and time periods associated with this paper.

Creation Date

  • 1999

Added to The UNT Digital Library

  • April 13, 2012, 9:48 a.m.

Description Last Updated

  • March 27, 2014, 11:26 a.m.

Usage Statistics

When was this paper last used?

Yesterday: 0
Past 30 days: 1
Total Uses: 78

Interact With This Paper

Here are some suggestions for what to do next.

Start Reading

PDF Version Also Available for Download.

Citations, Rights, Re-Use

Mihalcea, Rada, 1974- & Moldovan, Dan I. An Automatic Method for Generating Sense Tagged Corpora, paper, 1999; (digital.library.unt.edu/ark:/67531/metadc83300/: accessed December 12, 2017), University of North Texas Libraries, Digital Library, digital.library.unt.edu; crediting UNT College of Engineering.