You limited your search to:

  Partner: UNT College of Engineering
 Department: Computer Science and Engineering
Computational Laughing: Automatic Recognition of Humorous One-liners

Computational Laughing: Automatic Recognition of Humorous One-liners

Date: July 2005
Creator: Mihalcea, Rada, 1974- & Strapparava, Carlo, 1962-
Description: This paper discusses automatic recognition of humor. Abstract: Humor is one of the most interesting and puzzling aspects of human behavior. Despite the attention it has received in fields such as philosophy, linguistics, and psychology, there have been only few attempts to create computational models for humor recognition or generation. In this paper, the authors bring empirical evidence that computational approaches can be successfully applied to the task of humor recognition. Through experiments performed on very large data sets, the authors show that automatic classification techniques can be effectively used to distinguish between humorous and non-humorous texts, with significant improvements observed over apriori known baselines.
Contributing Partner: UNT College of Engineering
Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge

Cross-lingual Semantic Relatedness Using Encyclopedic Knowledge

Date: August 2009
Creator: Hassan, Samer & Mihalcea, Rada, 1974-
Description: This paper discusses cross-lingual semantic relatedness using encyclopedic knowledge. Abstract: In this paper, we address the task of cross-lingual semantic relatedness. We introduce a method that relies on the information extracted from Wikipedia, by exploiting the interlanguage links available between Wikipedia versions in multiple languages. Through experiments performed on several language pairs, we show that the method performs well, with a performance comparable to monolingual measures of relatedness.
Contributing Partner: UNT College of Engineering
Current Research in Wireless at UNT

Current Research in Wireless at UNT

Date: October 2004
Creator: Akl, Robert G.
Description: This presentation discusses wireless networks, access point selections, traffic balancing, multi-cell CDMA, user distribution modeling, and call admission control.
Contributing Partner: UNT College of Engineering
Annotating and Identifying Emotions in Text

Annotating and Identifying Emotions in Text

Date: 2010
Creator: Strapparava, Carlo & Mihalcea, Rada
Description: This book chapter discusses annotating and identifying emotions in text. Abstract: This paper focuses on the classification of emotions and polarity in news headlines and it is meant as an exploration of the connection between emotions and lexical semantics. The authors first describe the construction of the data set used in evaluation exercise "Affective Text" task at SemEval 2007, annotated for six basic emotions: Anger, Disgust, Fear, Joy, Sadness, and Surprise, and for Positive and Negative polarity. The authors also briefly describe the participating systems and their results. Second, exploiting the same data set, the authors propose and evaluate several knowledge-based and corpus-based methods for the automatic identification of emotions in text.
Contributing Partner: UNT College of Engineering
Co-training and Self-training for Word Sense Disambiguation

Co-training and Self-training for Word Sense Disambiguation

Date: May 2004
Creator: Mihalcea, Rada, 1974-
Description: This paper investigates the application of co-training and self-training to word sense disambiguation. Optimal and empirical parameter selection methods for co-training and self-training are investigated, with various degrees of error reduction. A new method that combines co-training with majority voting is introduced, with the effect of smoothing the bootstrapping learning curves, and improving the average performance.
Contributing Partner: UNT College of Engineering
Combining Lexical Resources for Contextual Synonym Expansion

Combining Lexical Resources for Contextual Synonym Expansion

Date: 2009
Creator: Sinha, Ravi & Mihalcea, Rada, 1974-
Description: This paper discusses combining lexical resources for contextual synonym expansion. Abstract: In this paper, we experiment with the task of contextual synonym expansion, and compare the benefits of combining multiple lexical resources using both unsupervised and supervised approaches. Overall, the results obtained through the combination of several resources exceed the current state-of-the-art when selecting the best synonym for a given target word, and place second when selecting the top ten synonyms, thus demonstrating the usefulness of the approach.
Contributing Partner: UNT College of Engineering
Creating a Testbed for the Evaluation of Automatically Generated Back-of-the-book Indexes

Creating a Testbed for the Evaluation of Automatically Generated Back-of-the-book Indexes

Date: February 2006
Creator: Csomai, Andras & Mihalcea, Rada, 1974-
Description: This paper discusses automatic generating of back-of-the-book indexes. Abstract: The automatic generation of back-of-the-book indexes seems to be out of sight of the Information Retrieval and Natural Language Processing communities, although the increasingly large number of books available in electronic format, as well as recent advances in key-phrase extraction, should motivate an increased interest in this topic. In this paper, the authors describe the background relevant to the process of creating back-of-the-book indexes, namely (1) a short overview of the origin and structure of back-of-the-book indexes, and (2) the correspondence that can be established between techniques for automatic index construction and keyphrase extraction. Since the development of any automatic system requires in the first place an evaluation testbed, the authors describe their work in building a gold standard collection of books and indexes, and the authors present several metrics that can be used for the evaluation of automatically generated indexes against the gold standard. Finally, the authors investigate the properties of the gold standard index, such as index size, length of index entries, and upper bounds on coverage as indicated by the presence of index entries in the document.
Contributing Partner: UNT College of Engineering
Creating Large Annotated Data Collections with Web Users' Help

Creating Large Annotated Data Collections with Web Users' Help

Date: April 2003
Creator: Mihalcea, Rada, 1974- & Chklovski, Timothy A. (Timothy Anatolievich), 1977
Description: This paper discusses creating annotated data collections. Abstract: Open Mind Word Expert is an implemented active learning system that aims to create large annotated corpora by tapping into the world's vast pool of knowledge. It does this by relying on the vast number of Web users who contribute their knowledge to data annotation. Open Mind Word Expert focuses on building semantically annotated corpora, by collecting word sense tagging from the general public over the Web. During the first nine months of activity, the system yielded 90,000 high quality tagged items at a much lower cost than the traditional method of hiring lexicographers.
Contributing Partner: UNT College of Engineering
Automatic Keyword Extraction for Learning Object Repositories

Automatic Keyword Extraction for Learning Object Repositories

Date: October 2008
Creator: Coursey, Kino High; Mihalcea, Rada & Moen, William E.
Description: Abstract: This paper describes experiments in metadata generation for learning object repositories. Specifically, the authors present several methods for automatic keyword extraction and evaluate them on a collection of learning objects from an undergraduate history course. The results suggest that automatic keyword extraction is a viable solution for suggesting terms and phrases for metadata annotation.
Contributing Partner: UNT College of Engineering
BABYLON Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

BABYLON Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

Date: May 2008
Creator: Mohler, Michael & Mihalcea, Rada
Description: This paper discusses BABYLON parallel text builder. Abstract: This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gathered automatically from the Web. In addition to the identification of entire Web pages, the authors also propose a new feature specifically designed to find parallel text chunks within a single document. Experiments carried out on the Quechua-Spanish language pair show that the system is successful in automatically identifying a significant amount of parallel texts on the Web. Evaluations of a machine translation system trained on this corpus indicate that the Web-gathered parallel texts can supplement manually compiled parallel texts and perform significantly better than the manually compiled texts when tested on other Web-gathered data.
Contributing Partner: UNT College of Engineering