You limited your search to:

  Partner: UNT College of Engineering
 Department: Computer Science and Engineering
Automatic Keyword Extraction for Learning Object Repositories

Automatic Keyword Extraction for Learning Object Repositories

Date: October 2008
Creator: Coursey, Kino High; Mihalcea, Rada & Moen, William E.
Description: Abstract: This paper describes experiments in metadata generation for learning object repositories. Specifically, the authors present several methods for automatic keyword extraction and evaluate them on a collection of learning objects from an undergraduate history course. The results suggest that automatic keyword extraction is a viable solution for suggesting terms and phrases for metadata annotation.
Contributing Partner: UNT College of Engineering
An Automatic Method for Generating Sense Tagged Corpora

An Automatic Method for Generating Sense Tagged Corpora

Date: 1999
Creator: Mihalcea, Rada & Moldovan, Dan
Description: This paper discusses an automatic method for generating sense tagged corpora. Abstract: The unavailability of very large corpora with semantically disambiguated words is a major limitation in text processing research. For example, statistical methods for word sense disambiguation of free text are known to achieve high accuracy results when large corpora are available to develop context rules, to train and test them. This article presents a novel approach to automatically generate arbitrarily large corpora for word senses. The method is based on (1) the information provided in WordNet, used to formulate queries consisting of synonyms or definitions of word senses, and (2) the information gathered from Internet using existing search engines. The method was tested on 120 word senses and a precision of 91% was observed.
Contributing Partner: UNT College of Engineering
BABYLON Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

BABYLON Parallel Text Builder: Gathering Parallel Texts for Low-Density Languages

Date: May 2008
Creator: Mohler, Michael & Mihalcea, Rada
Description: This paper discusses BABYLON parallel text builder. Abstract: This paper describes BABYLON, a system that attempts to overcome the shortage of parallel texts in low-density languages by supplementing existing parallel texts with texts gathered automatically from the Web. In addition to the identification of entire Web pages, the authors also propose a new feature specifically designed to find parallel text chunks within a single document. Experiments carried out on the Quechua-Spanish language pair show that the system is successful in automatically identifying a significant amount of parallel texts on the Web. Evaluations of a machine translation system trained on this corpus indicate that the Web-gathered parallel texts can supplement manually compiled parallel texts and perform significantly better than the manually compiled texts when tested on other Web-gathered data.
Contributing Partner: UNT College of Engineering
A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources

A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources

Date: May 2008
Creator: Banea, Carmen; Wiebe, Janyce M. & Mihalcea, Rada, 1974-
Description: Abstract: This paper introduces a method for creating a subjectivity lexicon for languages with scarce resources. The method is able to build a subjectivity lexicon by using a small seed set of subjectivity words, and online dictionary, and a small raw corpus, coupled with a bootstrapping process that ranks new candidate words based on a similarity measure. Experiments performed with a rule-based sentence level subjectivity classifier show an 18% absolute improvement in F-measure as compared to previously proposed semi-supervised methods.
Contributing Partner: UNT College of Engineering
Building a Sense Tagged Corpus with Open Mind Word Expert

Building a Sense Tagged Corpus with Open Mind Word Expert

Date: July 2002
Creator: Chklovski, Timothy & Mihalcea, Rada
Description: This paper discusses Open Mind Word Expert, an implemented active learning system for collecting word sense tagging from the general public over the Web. It is available at http://teach-computers.org. The authors expect the system to yield a large volume of high-quality training data at a much lower cost than the traditional method of hiring lexicographers. The authors thus propose a Senseval-3 lexical sample activity where the training data is collected via Open Mind Word Expert. If successful, the collection process can be extended to create the definitive corpus of word sense information.
Contributing Partner: UNT College of Engineering
Building Multilingual Semantic Networks with Non-Expert Contributions over the Web

Building Multilingual Semantic Networks with Non-Expert Contributions over the Web

Date: November 2003
Creator: Ayewah, Nathanial; Mihalcea, Rada, 1974- & Nastase, Vivi
Description: This paper discusses building multilingual semantic networks. Abstract: We present a system that allows non-expert Web users to contribute towards building a multilingual lexical resource. Our study focuses on the Romanian-English language pair, and the target resource is a Romanian WordNet strongly connected to the English WordNet. We use a bilingual dictionary, a monolingual definition dictionary and documents on the Web to build synsets, attach them a gloss, and provide some examples. The results of the semi-automatic acquisition system are judged by two human judges, and they are compared to automatic approaches to building a Romanian WordNet.
Contributing Partner: UNT College of Engineering
Call Admission Control Scheme for Arbitrary Traffic Distribution in CDMA Cellular Systems

Call Admission Control Scheme for Arbitrary Traffic Distribution in CDMA Cellular Systems

Date: September 2000
Creator: Akl, Robert G.; Hegde, Manju V.; Naraghi-Pour, Mort & Min, Paul S.
Description: This presentation discusses call admission control (CAC). The authors define a set of feasible call configurations that results in a CAC algorithm that captures the effect of having an arbitrary traffic distribution and whose complexity scales linearly with the number of cells.
Contributing Partner: UNT College of Engineering
Capacity Allocations in Multi-cell UMTS Networks for Different Spreading Factors with Perfect and Imperfect Power Control

Capacity Allocations in Multi-cell UMTS Networks for Different Spreading Factors with Perfect and Imperfect Power Control

Date: January 2006
Creator: Akl, Robert G. & Nguyen, Son
Description: This presentation discusses user and interference models, wideband code division multiple access (WCDMA) capacity with perfect and imperfect power control, and spreading factors with numerical results.
Contributing Partner: UNT College of Engineering
Capacity Allocations in Multi-cell UMTS Networks for Different Spreading Factors with Perfect and Imperfect Power Control

Capacity Allocations in Multi-cell UMTS Networks for Different Spreading Factors with Perfect and Imperfect Power Control

Date: January 2006
Creator: Akl, Robert G. & Nguyen, Son
Description: This paper discusses capacity allocation in multi-cell UMTS networks. Abstract: An analytical model for calculating capacity in multi-cell UMTS networks is presented. Capacity is maximized for different spreading factors and for perfect and imperfect power control. An analytical model is presented for approximating the user distributions in multi-cell third generation WCDMA networks using 2-dimensional Gaussian distributions by determining the means and the standard deviations of the distribution for every cell. This allows for the calculation of the inter-cell interference and the reverse-link capacity of the network. The capacity was determined for signal-to-interference threshold from 5 dB to 10 dB and spreading factor values of 256, 64, 16, and 4.
Contributing Partner: UNT College of Engineering
CCAP: A Strategic Tool for Managing Capacity of CDMA Networks

CCAP: A Strategic Tool for Managing Capacity of CDMA Networks

Date: 1998
Creator: Akl, Robert G.
Description: This presentation discusses CCAP, a strategic tool for managing capacity of CDMA networks. CCAP is a graphical interactive tool for CDMA that calculates the coverage area, call capacity of a CDMA network, and subscriber network performance to optimize capacity.
Contributing Partner: UNT College of Engineering