Search Results

[Dataset of Web Archiving Research Articles]

Description: Datasets used in the presentation, "Towards Building a Collection of Web Archiving Research Articles." The files included here were used to conduct several Machine Learning classification experiments that result in a corpus of scholarly research articles on the topic of web archiving.
Date: August 2014
Creator: Reyes Ayala, Brenda & Caragea, Cornelia
Partner: UNT College of Information

The Effects of Increased Equipment Speed on Online Database Searching Practices

Description: This study reports changes in online database searching at North Texas State University when equipment speed was increased. Data were from database vendor invoices and price and sale data of online equipment. The hypotheses examined the relationship between the decrease in the cost of online equipment and the change to faster online equipment and the change in the number of databases that changed for online types. The change in equipment was related to changes in the number of offline prints per hour, the average time per search, the average number of descriptors per search, the number of searches per month, and the rank order of database use over the studied period. The increase in the number of databases with billed types was related to the number of online billed types produced. The number of prints was related to the number of billed types. Time spent online was examined for annual seasonal cycles. The major statistical tool was time-series analysis, although other methods were applied.
Date: May 1987
Creator: Masters, Gary E. (Gary Everett)
Partner: UNT Libraries

Shifts of focus among dimensions of user information problems as represented during interactive information retrieval

Description: The goal of this study is to increase understanding of information problems as they are revealed in interactions among users and search intermediaries during information retrieval. Specifically, this study seeks to investigate: (a) how interaction between users and search intermediaries reveals aspects of user information problems; (b) to explore the concept of representation with respect to information problems in interactive information retrieval; and (c) how user and search intermediaries focus on aspects of user information problems during the course of searches.
Date: May 1998
Creator: Robins, David B. (David Bruce)
Partner: UNT Libraries

Smoothing the information seeking path: Removing representational obstacles in the middle-school digital library.

Description: Middle school student's interaction within a digital library is explored. Issues of interface features used, obstacles encountered, search strategies and search techniques used, and representation obstacles are examined. A mechanism for evaluating user's descriptors is tested and effects of augmenting the system's resource descriptions with these descriptors on retrieval is explored. Transaction log data analysis (TLA) was used, with external corroborating achievement data provided by teachers. Analysis was conducted using quantitative and qualitative methods. Coding schemes for the failure analysis, search strategies and techniques analysis, as well as extent of match analysis between terms in student's questions and their search terms, and extent of match analysis between search terms and controlled vocabulary were developed. There are five chapters with twelve supporting appendixes. Chapter One presents an introduction to the problem and reviews the pilot study. Chapter Two presents the literature review and theoretical basis for the study. Chapter Three describes the research questions, hypotheses and methods. Chapter Four presents findings. Chapter Five presents a summary of the findings and their support of the hypotheses. Unanticipated findings, limitations, speculations, and areas of further research are indicated. Findings indicate that middle school users interact with the system in various sequences of patterns. User groups' interactions and scaffold use are influenced by the teacher's objectives for using the ADL. Users preferred to use single word searches over Boolean, phrase or natural language searches. Users tended to use a strategy of repeating the same exact search, instead of using the advanced scaffolds. A high percent of users attempted at least one search that included spelling or typographical errors, punctuation, or sequentially repeated searches. Search terms matched the DQ's in some instantiation 54% of all searches. Terms used by the system to represent the resources do not adequately represent the user groups' information needs, however, ...
Date: May 2002
Creator: Abbas, June M.
Partner: UNT Libraries

Using Encyclopedic Knowledge for Automatic Topic Identification

Description: This paper presents a method for automatic topic identification using an encyclopedic graph derived from Wikipedia. The system is found to exceed the performance of previously proposed machine learning algorithms for topic identification, with an annotation consistency comparable to human annotations.
Date: May 2009
Creator: Coursey, Kino High; Mihalcea, Rada, 1974- & Moen, William E.
Partner: UNT College of Engineering

Book Reviews in an Electronic Age

Description: Poster presented at the 2012 IAMSLIC Annual Conference. This poster discusses research on book reviews in marine and aquatic journals to gain insight into the characteristics of the reviews and their value to librarians and research.
Date: August 2012
Creator: Avery, Elizabeth Fuseler; Heil, Kathy & Wiest, Natalie H., 1948-
Partner: UNT Libraries

UNT 2005 TREC QA Participation: Using Lemur as IR Search Engine

Description: This paper reports the authors' TREC 2005 QA participation. The authors' QA system Eagle QA developed last year was expanded and modified for this year's QA experiments. Particularly, the authors used Lemur 4.1 as the Information Retrieval (IR) Engine this year to find documents that may contain answers for the test questions from the document collection. The authors' result shows Lemur did a reasonable job on finding relevant documents. But certainly there is room for further improvement.
Date: 2005
Creator: Chen, Jiangping; Yu, Ping & Ge, He
Partner: UNT College of Information

Resource Discovery Using Z39.50: Promise and Reality

Description: This paper discusses the Z39.50 protocol. The ANSI/NISO Z39.50 protocol for information retrieval addresses the complex challenges of intersystem communication. Original uses envisioned for the protocol look very little like current implementations and uses. In the 1980s, users on one library catalog system would search and retrieve bibliographic records on a remote system. By the late 1990s, there was a need for discovering networked resources and integrating access to them. Yet, the Z39.50 protocol has addressed both these scenarios. This paper provides a portrayal of Z39.50 that explains its flexibility in response to a variety of information retrieval requirements in the networked environment.
Date: November 2000
Creator: Moen, William E.
Partner: UNT College of Information

Improving Z39.50 Interoperability: Z39.50 Profiles and Testbeds for Library Applications

Description: An operating assumption for the networked environment is that many different information systems need to interoperate for users to successfully discover and retrieve distributed resources. Meaningful interoperability is often elusive. In the library community, the Z39.50 standard protocol (ISO 23950/ANSI/NISO Z39.50) for information retrieval promised seamless and transparent networked access to library resources. Too often, the reality has not lived up to the promise. This paper discusses two efforts that offer solution paths to Z39.50 interoperability.
Date: August 2001
Creator: Moen, William E.
Partner: UNT College of Information

Text Semantic Similarity, with Applications

Description: In this paper, the authors present a knowledge-based method for measuring the semantic-similarity of texts. Through experiments performed on two different applications: (1) paraphrase and entailment identification, and (2) word sense similarity, the authors show that this method outperforms the traditional text similarity metrics based on lexical matching.
Date: September 2005
Creator: Corley, Courtney; Csomai, Andras & Mihalcea, Rada, 1974-
Partner: UNT College of Engineering

Hunting and Gathering on the Information Savanna

Description: In this book, the authors offer three different arenas of nontrivial information seeking for our consideration: "Submarine Chasing" explores the thoughts of a highly decorated Cold War submarine hunter. "Bounty Hunting" involves a long and convoluted search for a reported bond skipper. "Engineering Design" presents a content analysis of the few works in epistemological foundations of engineering design activity. These stories, told at great length and in considerable detail, are framed within a foundational model that links the simple act of document seeking to the broader issue of making one's way through life in the physical world. In each case, the authors ramble, mull, and stumble upon ideas without the least prior constraint, developing some threads quite fully and leaving others to tease us, but never ever throwing us to the lions.
Date: 2003
Creator: O'Connor, Brian Clark; Copeland, Jud H., 1943- & Kearns, Jodi L.
Partner: UNT College of Information