Search Results

open access

Using Encyclopedic Knowledge for Automatic Topic Identification

Description: This paper presents a method for automatic topic identification using an encyclopedic graph derived from Wikipedia. The system is found to exceed the performance of previously proposed machine learning algorithms for topic identification, with an annotation consistency comparable to human annotations.
Date: May 2009
Creator: Coursey, Kino High; Mihalcea, Rada, 1974- & Moen, William E.
Partner: UNT College of Engineering
open access

Using Wikipedia for Automatic Word Sense Disambiguation

Description: This paper describes a method for generating sense-tagged data using Wikipedia as a source of sense annotations. Through word sense disambiguation experiments, the authors show that the Wikipedia-based sense annotations are reliable and can be used to construct accurate sense classifiers.
Date: April 2007
Creator: Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

Unsupervised Graph-based Word Sense Disambiguation Using Measures of Word Semantic Similarity

Description: This paper describes an unsupervised graph-based method for word sense disambiguation, and presents comparative evaluations using several measures of word semantic similarity and several algorithms for graph centrality. The results indicate that the right combination of similarity metrics and graph centrality algorithms can lead to a performance competing with the state-of-the-art in unsupervised word sense disambiguation, as measured on standard data sets.
Date: September 2007
Creator: Sinha, Ravi & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

Random-Walk Term Weighting for Improved Text Classification

Description: This paper describes a new approach for estimating term weights in a document, and shows how the new weighting scheme can be used to improve the accuracy of a text classifier.
Date: September 2007
Creator: Hassan, Samer; Mihalcea, Rada, 1974- & Banea, Carmen
Partner: UNT College of Engineering
open access

Explorations in Automatic Book Summarization

Description: This paper discusses explorations in automatic book summarization.
Date: June 2007
Creator: Ceylan, Hakan & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling

Description: This paper introduces a graph-based algorithm for sequence data labeling, using random walks on graphs encoding label dependencies. The algorithm is illustrated and tested in the context of an unsupervised word sense disambiguation problem, and shown to significantly outperform the accuracy achieved through individual label assignment, as measured on standard sense-annotated data sets.
Date: October 2005
Creator: Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

Text Mining for Automatic Image Tagging

Description: This paper introduces several extractive approaches for automatic image tagging, relying exclusively on information mined from texts. Through evaluations on two datasets, the authors show that their methods exceed competitive baselines by a large margin, and compare favorably with the state-of-the-art that uses both textual and image features.
Date: August 2010
Creator: Leong, Chee Wee; Mihalcea, Rada, 1974- & Hassan, Samer
Partner: UNT College of Engineering
open access

An Evaluation Exercise for Romanian Word Sense Disambiguation

Description: This paper discusses an evaluation exercise for Romanian word sense disambiguation.
Date: July 2004
Creator: Mihalcea, Rada, 1974-; Nastase, Vivi; Chklovski, Timothy A. (Timothy Anatolievich), 1977-; Tatar, Doina; Tufis, Dan & Hristea, Florentina T.
Partner: UNT College of Engineering
open access

Amazon Mechanical Turk for Subjectivity Word Sense Disambiguation

Description: In this paper, the authors discuss research on whether they can use Mechanical Turk (MTurk) to acquire good annotations with respect to gold-standard data, whether they can filter out low-quality workers (spammers), and whether there is a learning effect associated with repeatedly completing the same kind of task.
Date: June 2010
Creator: Akkaya, Cem; Conrad, Alexander; Wiebe, Janyce M. & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

Quantifying the Limits and Success of Extractive Summarization Systems Across Domains

Description: This paper analyzes the topic identification stage of single-document automatic text summarization across four different domains, consisting of newswire, literary, scientific and legal documents.
Date: June 2010
Creator: Ceylan, Hakan; Mihalcea, Rada, 1974-; Ozertem, Umut; Lloret, Elena & Palomar, Manuel
Partner: UNT College of Engineering
open access

Word Alignment for Languages with Scarce Resources

Description: This paper presents the task definition, resources, participating systems, and comparative results for the shared task on word alignment which was organized as part of the Association for Computational Linguistics (ACL) 2005 Workshop on Building and Using Parallel Texts. The shared task included English-Inuktitut, Romanian-English, and English-Hindi sub-tasks, and drew the participation of ten teams from around the world with a total of 50 systems.
Date: June 2005
Creator: Martin, Joel; Mihalcea, Rada, 1974- & Pedersen, Ted
Partner: UNT College of Engineering
open access

The SENSEVAL-3 English Lexical Sample Task

Description: This paper presents the task definition, resources, participating systems, and comparative results for the English lexical sample task, which was organized as part of the SENSEVAL-3 evaluation exercise.
Date: July 2004
Creator: Mihalcea, Rada, 1974-; Chklovski, Timothy A. (Timothy Anatolievich), 1977- & Kilgarriff, Adam
Partner: UNT College of Engineering
open access

The SENSEVAL-3 Multilingual English-Hindi Lexical Sample Task

Description: This paper describes the English-Hindi Multilingual lexical sample task in SENSEVAL-3.
Date: July 2004
Creator: Chklovski, Timothy A. (Timothy Anatolievich), 1977-; Mihalcea, Rada, 1974-; Pedersen, Ted & Purandare, Amruta
Partner: UNT College of Engineering
Back to Top of Screen