Co-training and Self-training for Word Sense Disambiguation

Date: May 2004
Creator: Mihalcea, Rada, 1974-
Description: This paper investigates the application of co-training and self-training to word sense disambiguation. Optimal and empirical parameter selection methods for co-training and self-training are investigated, with various degrees of error reduction. A new method that combines co-training with majority voting is introduced, with the effect of smoothing the bootstrapping learning curves, and improving the average performance.
Contributing Partner: UNT College of Engineering
Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization

Date: July 2004
Creator: Mihalcea, Rada, 1974-
Description: This paper presents an innovative unsupervised method for automatic sentence extraction using graph-based ranking algorithms. We evaluate the method in the context of a text summarization task, and show that the results obtained compare favorably with previously published results on established benchmarks.
Contributing Partner: UNT College of Engineering
[Review] The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data

Date: March 2008
Creator: Mihalcea, Rada, 1974-
Description: This book review discusses 'The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data' by Ronen Feldman and James Sanger. The book is an introduction to text mining, covering the general architecture of text mining systems, along with the main techniques used by such systems.
Contributing Partner: UNT College of Engineering
Using Wikipedia for Automatic Word Sense Disambiguation

Date: April 2007
Creator: Mihalcea, Rada, 1974-
Description: This paper describes a method for generating sense-tagged data using Wikipedia as a source of sense annotations. Through word sense disambiguation experiments, the authors show that the Wikipedia-based sense annotations are reliable and can be used to construct accurate sense classifiers.
Contributing Partner: UNT College of Engineering
Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling

Date: October 2005
Creator: Mihalcea, Rada, 1974-
Description: This paper introduces a graph-based algorithm for sequence data labeling, using random walks on graphs encoding label dependencies. The algorithm is illustrated and tested in the context of an unsupervised word sense disambiguation problem, and shown to significantly outperform the accuracy achieved through individual label assignment, as measured on standard sense-annotated data sets.
Contributing Partner: UNT College of Engineering
Language Independent Extractive Summarization

Date: July 2005
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses language-independent extractive summarization.
Contributing Partner: UNT College of Engineering
Performance Analysis of a Part of Speech Tagging Task

Date: February 2003
Creator: Mihalcea, Rada, 1974-
Description: This article discusses performance analysis of a part-of-speech tagging task.
Contributing Partner: UNT College of Engineering
Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation

Date: August 2002
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses instance-based learning with automatic feature selection applied to word sense disambiguation.
Contributing Partner: UNT College of Engineering
Classifier Stacking and Voting for Text Filtering

Date: November 2002
Creator: Mihalcea, Rada, 1974-
Description: This article discusses classifier stacking and voting for text filtering.
Contributing Partner: UNT College of Engineering
Making Sense Out of the Web

Date: November 2004
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses the main lines of research in deriving efficient methods for word sense disambiguation.
Contributing Partner: UNT College of Engineering