Search Results

open access

The SENSEVAL-3 English Lexical Sample Task

Description: This paper presents the task definition, resources, participating systems, and comparative results for the English lexical sample task, which was organized as part of the SENSEVAL-3 evaluation exercise.
Date: July 2004
Creator: Mihalcea, Rada, 1974-; Chklovski, Timothy A. (Timothy Anatolievich), 1977- & Kilgarriff, Adam
Partner: UNT College of Engineering
open access

SenseLearner: Minimally Supervised Word Sense Disambiguation for All Words in Open Text

Description: This paper introduces SenseLearner - a minimally supervised sense tagger that attempts to disambiguate all content words in a text using the sense from WordNet. SenseLearner participated in the SENSEVAL-3 English all words task, and achieved an average accuracy of 64.6%.
Date: 2004
Creator: Mihalcea, Rada, 1974- & Faruque, Ehsanul
Partner: UNT College of Engineering
open access

Wikify! Linking Documents to Encyclopedic Knowledge

Description: This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achieve state-of-the-art results on both these tasks.
Date: November 2007
Creator: Mihalcea, Rada, 1974- & Csomai, Andras
Partner: UNT College of Engineering
open access

Networks and Natural Language Processing

Description: Article discussing networks and natural language processing. The authors present some of the most successful graph-based representations and algorithms used in language processing and try to explain how and why they work.
Date: September 2008
Creator: Radev, Dragomir R. & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

Characterizing Humour: An Exploration of Features in Humorous Texts

Description: This paper investigates the problem of automatic humor recognition, and provides an in-depth analysis of two of the most frequently observed features of humorous text: human-centeredness and negative polarity. Through experiments performed on two collections of humorous texts, the authors show that these properties of verbal humor are consisted across different data sets.
Date: February 2007
Creator: Mihalcea, Rada, 1974- & Pulman, Stephen
Partner: UNT College of Engineering
open access

TextRank: Bringing Order into Texts

Description: In this paper, the authors introduce TextRank, a graph-based ranking model for text processing, and show how this model can be successfully used in natural language applications.
Date: July 2004
Creator: Mihalcea, Rada, 1974- & Tarau, Paul
Partner: UNT College of Engineering
open access

Co-training and Self-training for Word Sense Disambiguation

Description: This paper investigates the application of co-training and self-training to word sense disambiguation. Optimal and empirical parameter selection methods for co-training and self-training are investigated, with various degrees of error reduction. A new method that combines co-training with majority voting is introduced, with the effect of smoothing the bootstrapping learning curves, and improving the average performance.
Date: May 2004
Creator: Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
open access

The Decomposition of Human-Written Book Summaries

Description: In this paper, the authors evaluate the extent to which human-written book summaries can be obtained through cut-and-paste operations from the original book. The authors analyze the effect of the parameters involved in the decomposition algorithm, and highlight the distinctions in coverage obtained for different summary types.
Date: March 2009
Creator: Ceylan, Hakan & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering
Back to Top of Screen