Creating Large Annotated Data Collections with Web Users' Help

Description:

This paper discusses creating large annotated data collections with the help of Web users.

Creation Date: April 2003
Partner(s):
UNT College of Engineering
Collection(s):
UNT Scholarly Works
Creator (Author):
Mihalcea, Rada, 1974-

University of North Texas

Creator (Author):
Chklovski, Timothy A. (Timothy Anatolievich), 1977-

Massachusetts Institute of Technology

Publisher Info:
Place of Publication: [Stroudsburg, Pennsylvania]
Note:

Abstract: Open Mind Word Expert is an implemented active learning system that aims to create large annotated corpora by relying on the vast number of Web users who contribute their knowledge to data annotation. It focuses on building semantically annotated corpora by collecting word sense tags from the general public over the Web. During its first nine months of activity, the system yielded 90,000 high-quality tagged items at a much lower cost than the traditional method of hiring lexicographers.

Physical Description:

8 p.

Subject(s):
Keyword(s): word sense disambiguation | natural language processing | Open Mind Word Expert | annotated corpora | semantics
Source: Fourth International Workshop on Linguistically Interpreted Corpora (LINC), 2003, Budapest, Hungary
Identifier:
  • ARK: ark:/67531/metadc30949
Resource Type: Paper
Format: Text
Rights:
Access: Public