Using Wikipedia for Automatic Word Sense Disambiguation

Using Wikipedia for Automatic Word Sense Disambiguation

Date: April 2007
Creator: Mihalcea, Rada, 1974-
Description: This paper describes a method for generating sense-tagged data using Wikipedia as a source of sense annotations. Through word sense disambiguation experiments, the authors show that the Wikipedia-based sense annotations are reliable and can be used to construct accurate sense classifiers.
Contributing Partner: UNT College of Engineering
Word Sense Disambiguation with Pattern Learning and Automatic Feature Selection

Word Sense Disambiguation with Pattern Learning and Automatic Feature Selection

Date: December 2002
Creator: Mihalcea, Rada, 1974-
Description: Article discussing word sense disambiguation with pattern learning and automatic feature selection.
Contributing Partner: UNT College of Engineering
Classifier Stacking and Voting for Text Filtering

Classifier Stacking and Voting for Text Filtering

Date: November 2002
Creator: Mihalcea, Rada, 1974-
Description: This article discusses classifier stacking and voting for text filtering.
Contributing Partner: UNT College of Engineering
Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation

Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation

Date: August 2002
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses instance based learning with automatic feature selection applied to word sense disambiguation.
Contributing Partner: UNT College of Engineering
Language Independent Extractive Summarization

Language Independent Extractive Summarization

Date: July 2005
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses language independent extractive summarization.
Contributing Partner: UNT College of Engineering
Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling

Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling

Date: October 2005
Creator: Mihalcea, Rada, 1974-
Description: This paper introduces a graph-based algorithm for sequence data labeling, using random walks on graphs encoding label dependencies. The algorithm is illustrated and tested in the context of an unsupervised word sense disambiguation problem, and shown to significantly outperform the accuracy achieved through individual label assignment, as measured on standard sense-annotated data sets.
Contributing Partner: UNT College of Engineering
The Multidisciplinary Facets of Research on Humour

The Multidisciplinary Facets of Research on Humour

Date: July 2007
Creator: Mihalcea, Rada, 1974-
Description: In this paper, the authors summarize the main theories of humor that emerged from philosophical and modern psychological research, and survey the past and present developments in the fields of theoretical and computational linguistics.
Contributing Partner: UNT College of Engineering
The Role of Non-Ambiguous Words in Natural Language Disambiguation

The Role of Non-Ambiguous Words in Natural Language Disambiguation

Date: September 2003
Creator: Mihalcea, Rada, 1974-
Description: This article discusses the role of non-ambiguous words in natural language disambiguation.
Contributing Partner: UNT College of Engineering
Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization

Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization

Date: July 2004
Creator: Mihalcea, Rada, 1974-
Description: Abstract: This paper presents an innovative unsupervised method for automatic sentence extraction using graph-based ranking algorithms. We evaluate the method in the context of a text summarization task, and show that the results obtained compare favorably with previously published results on established benchmarks.
Contributing Partner: UNT College of Engineering
Making Sense Out of the Web

Making Sense Out of the Web

Date: November 2004
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses the main lines of research in deriving efficient Word Sense Disambiguation.
Contributing Partner: UNT College of Engineering
Co-training and Self-training for Word Sense Disambiguation

Co-training and Self-training for Word Sense Disambiguation

Date: May 2004
Creator: Mihalcea, Rada, 1974-
Description: This paper investigates the application of co-training and self-training to word sense disambiguation. Optimal and empirical parameter selection methods for co-training and self-training are investigated, with various degrees of error reduction. A new method that combines co-training with majority voting is introduced, with the effect of smoothing the bootstrapping learning curves, and improving the average performance.
Contributing Partner: UNT College of Engineering
The Semantic Wildcard

The Semantic Wildcard

Date: May 2002
Creator: Mihalcea, Rada, 1974-
Description: This paper introduces the semantic wildcard, one of the most powerful operators implemented in IRSLO, which allows for searches along general-specific lines.
Contributing Partner: UNT College of Engineering
A Semi-Complete Disambiguation Algorithm for Open Text

A Semi-Complete Disambiguation Algorithm for Open Text

Date: 2000
Creator: Mihalcea, Rada, 1974-
Description: This paper discusses a semi-complete disambiguation algorithm for open text.
Contributing Partner: UNT College of Engineering
Performance Analysis of a Part of Speech Tagging Task

Performance Analysis of a Part of Speech Tagging Task

Date: February 2003
Creator: Mihalcea, Rada, 1974-
Description: This article discusses performance analysis of a part of speech tagging task.
Contributing Partner: UNT College of Engineering
[Review] The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data

[Review] The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data

Date: March 2008
Creator: Mihalcea, Rada, 1974-
Description: This article reviews the book "'The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data," by Ronen Feldman and James Sanger.
Contributing Partner: UNT College of Engineering
Wikify! Linking Documents to Encyclopedic Knowledge

Wikify! Linking Documents to Encyclopedic Knowledge

Date: November 2007
Creator: Mihalcea, Rada, 1974- & Csomai, Andras
Description: This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achieve state-of-the-art results on both these tasks.
Contributing Partner: UNT College of Engineering
BABYLON Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

BABYLON Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

Date: May 2008
Creator: Mohler, Michael & Mihalcea, Rada, 1974-
Description: This paper discusses BABYLON parallel text builder.
Contributing Partner: UNT College of Engineering
Linguistically Motivated Features for Enhanced Back-of-the-Book Indexing

Linguistically Motivated Features for Enhanced Back-of-the-Book Indexing

Date: June 2008
Creator: Csomai, Andras & Mihalcea, Rada, 1974-
Description: This paper discusses linguistically motivated features for enhanced back-of-the-book indexing.
Contributing Partner: UNT College of Engineering
Linguistic Ethnography: Identifying Dominant Word Classes in Text

Linguistic Ethnography: Identifying Dominant Word Classes in Text

Date: March 2009
Creator: Pulman, Stephen & Mihalcea, Rada, 1974-
Description: This paper discusses linguistic ethnography.
Contributing Partner: UNT College of Engineering
Learning to Identify Educational Materials

Learning to Identify Educational Materials

Date: 2009
Creator: Hassan, Samer & Mihalcea, Rada, 1974-
Description: This paper discusses learning to identify educational materials.
Contributing Partner: UNT College of Engineering
Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge

Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge

Date: August 2009
Creator: Hassan, Samer & Mihalcea, Rada, 1974-
Description: This paper discusses cross-lingual semantic relatedness using encyclopedic knowledge.
Contributing Partner: UNT College of Engineering
Combining Lexical Resources for Contextual Synonym Expansion

Combining Lexical Resources for Contextual Synonym Expansion

Date: 2009
Creator: Sinha, Ravi & Mihalcea, Rada, 1974-
Description: This paper discusses combining lexical resources for contextual synonym expansion.
Contributing Partner: UNT College of Engineering
Text-to-text Semantic Similarity for Automatic Short Answer Grading

Text-to-text Semantic Similarity for Automatic Short Answer Grading

Date: March 2009
Creator: Mohler, Michael & Mihalcea, Rada, 1974-
Description: In this paper, the authors explore unsupervised techniques for the task of automatic short answer grading.
Contributing Partner: UNT College of Engineering
Letter Level Learning for Language Independent Diacritics Restoration

Letter Level Learning for Language Independent Diacritics Restoration

Date: September 2002
Creator: Mihalcea, Rada, 1974- & Nastase, Vivi
Description: This paper discusses letter level learning for language independent diacritics restoration.
Contributing Partner: UNT College of Engineering
FIRST PREV 1 2 3 4 5 NEXT LAST