41 Matching Results

Search Results

Co-training and Self-training for Word Sense Disambiguation

Description: This paper investigates the application of co-training and self-training to word sense disambiguation. Optimal and empirical parameter selection methods for co-training and self-training are investigated, with various degrees of error reduction. A new method that combines co-training with majority voting is introduced, with the effect of smoothing the bootstrapping learning curves, and improving the average performance.
Date: May 2004
Creator: Mihalcea, Rada, 1974-
Partner: UNT College of Engineering

Finding Semantic Associations on Express Lane

Description: This paper introduces a new codification scheme for efficient computation of measures in semantic networks. The scheme is particularly useful for fast computation of semantic associations between words and for implementing an information retrieval operator for efficient search in semantic spaces. Other applications may also be possible.
Date: May 2004
Creator: Nastase, Vivi & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering

Topic Identification Using Wikipedia Graph Centrality

Description: This paper presents a method for automatic topic identification using a graph-centrality algorithm applied to an encyclopedic graph derived from Wikipedia. When tested on a data set with manually assigned topics, the system is found to significantly improve over a simpler baseline that does not make use of the external encyclopedic knowledge.
Date: May 2009
Creator: Coursey, Kino High & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering

Using Encyclopedic Knowledge for Automatic Topic Identification

Description: This paper presents a method for automatic topic identification using an encyclopedic graph derived from Wikipedia. The system is found to exceed the performance of previously proposed machine learning algorithms for topic identification, with an annotation consistency comparable to human annotations.
Date: May 2009
Creator: Coursey, Kino High; Mihalcea, Rada, 1974- & Moen, William E.
Partner: UNT College of Engineering

Preserving Access to Government Websites: Development and Practice in the CyberCemetery

Description: This paper discusses the development and practice of the CyberCemetery. In the late 1990s, online U.S. government information was appearing and disappearing at a rapid pace. In 1999, the University of North Texas Libraries (UNT) formed a partnership with the U.S. Government Printing Office (GPO) to address this issue by archiving electronic government websites. This archive, known as the CyberCemetery, provides permanent public access to the websites and publications of defunct U.S. government agencies and commissions. This partnership between UNT and GPO has expanded to include the National Archives and Records Administration (NARA). This paper covers the CyberCemetery's development and the process of identifying, capturing, and publishing content in the archive.
Date: May 26, 2008
Creator: Hoffman, Starr
Partner: UNT Libraries

Maya: A Novel Block Encryption Function

Description: This paper discusses a novel block encryption function.
Date: May 2009
Creator: Gomathisankaran, Mahadevan & Lee, Ruby Bei-Loh
Partner: UNT College of Engineering

Organizational Alignment

Description: This paper discusses organizational alignment and digital preservation.
Date: May 23, 2011
Creator: Angevaare, Inge; Gallinger, Michelle; Anderson, Martha; Giaretta, David & Halbert, Martin
Partner: UNT Libraries

The Semantic Wildcard

Description: This paper introduces the semantic wildcard, one of the most powerful operators implemented in IRSLO, which allows for searches along general-specific lines.
Date: May 2002
Creator: Mihalcea, Rada, 1974-
Partner: UNT College of Engineering

Philosophy Matters - Examining the Value of Knowledge

Description: This paper discusses the University of North Texas' (UNT) Center for the Study of Interdisciplinarity (CSID), where philosophers continue to examine the value of knowledge. The authors also discuss one example of CSID's work with the Comparative Assessment of Peer Review (CAPR) project. CAPR is a four-year project (2008-2012) studying the changing nature of peer review processes across six U.S. and foreign public science agencies. CAPR is funded by the U.S. National Science Foundation's (NSF) Science of Science and Innovation Policy (SciSIP) program.
Date: May 10, 2012
Creator: Frodeman, Robert & Holbrook, J. Britt
Partner: UNT College of Arts and Sciences

Metadata: Batch Editing of MARC Records (work area D)

Description: Report for an Institute of Museum and Library Services (IMLS) Grant Partner Uplift Project. This report documents workflows developed for the conversion of two sets of metadata records and serves the following objectives: research different avenues for the batch import and export of MARC records from popular integrated library systems, identify tools and software that aid in batch editing of MARC records, and document a workflow for batch editing MARC records.
Date: May 2, 2007
Creator: Phillips, Mark Edward & Polyakov, Serhiy
Partner: UNT Libraries

OAI Static Repositories

Description: Report for an Institute of Museum and Library Services (IMLS) Grant Partner Uplift Project. This report gives an overview of Open Archives Initiative (OAI) static repositories and their restrictions and conformance rules. It also covers intermediation with a Static Repository Gateway, workflows and best practices, and interfaces for creating metadata for OAI static repositories.
Date: May 31, 2007
Creator: Polyakov, Serhiy & Phillips, Mark Edward
Partner: UNT Libraries