Search Results

open access

Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling

Description: This paper introduces a graph-based algorithm for sequence data labeling, using random walks on graphs encoding label dependencies. The algorithm is illustrated and tested in the context of an unsupervised word sense disambiguation problem, and shown to achieve significantly higher accuracy than individual label assignment, as measured on standard sense-annotated data sets.
Date: October 2005
Creator: Mihalcea, Rada, 1974-
open access

UNT: SubFinder: Combining Knowledge Sources for Automatic Lexical Substitution

Description: This paper describes the University of North Texas SubFinder system. The system provides the most likely set of substitutes for a word in a given context by combining several techniques and knowledge sources. SubFinder participated in the "best" and "out of ten" (oot) tracks of the SEMEVAL lexical substitution task, consistently ranking first or second.
Date: June 2007
Creator: Hassan, Samer; Csomai, Andras; Banea, Carmen; Sinha, Ravi & Mihalcea, Rada, 1974-
open access

Using Encyclopedic Knowledge for Automatic Topic Identification

Description: This paper presents a method for automatic topic identification using an encyclopedic graph derived from Wikipedia. The system is found to exceed the performance of previously proposed machine learning algorithms for topic identification, with an annotation consistency comparable to human annotations.
Date: May 2009
Creator: Coursey, Kino High; Mihalcea, Rada, 1974- & Moen, William E.
open access

Using Wikipedia for Automatic Word Sense Disambiguation

Description: This paper describes a method for generating sense-tagged data using Wikipedia as a source of sense annotations. Through word sense disambiguation experiments, the authors show that the Wikipedia-based sense annotations are reliable and can be used to construct accurate sense classifiers.
Date: April 2007
Creator: Mihalcea, Rada, 1974-

WiFi and WCDMA Network Design

Description: This presentation discusses WiFi access point selection and traffic balancing, multi-cell wideband code division multiple access (WCDMA) with multiple classes, user modeling using 2D Gaussian distribution, and intra-cell and inter-cell interference and capacity.
Date: April 2005
Creator: Akl, Robert G.
open access

Word Alignment for Languages with Scarce Resources

Description: This paper presents the task definition, resources, participating systems, and comparative results for the shared task on word alignment which was organized as part of the Association for Computational Linguistics (ACL) 2005 Workshop on Building and Using Parallel Texts. The shared task included English-Inuktitut, Romanian-English, and English-Hindi sub-tasks, and drew the participation of ten teams from around the world with a total of 50 systems.
Date: June 2005
Creator: Martin, Joel; Mihalcea, Rada, 1974- & Pedersen, Ted