Analysis Logic and Procedures for Creating a Test Dataset of MARC 21 Records for the Z39.50 Interoperability Testbed: Phase 1 Testing
Date: January 1, 2002
Creator: Moen, William E. & Holmes, Haley K.
Description: This document describes the logic and procedures to create a test dataset of more than 400,000 (400K) MARC 21 records from the OCLC WorldCat database. This test dataset (hereafter referred to as the dataset) provides a controlled set of data for use in the Z39.50 Interoperability Testbed Project (hereafter referred to as Z-Interop). OCLC selected a 1% weighted sample from its WorldCat database, which contains approximately 45 million records. This document focused on the analysis procedures used to prepare for Phase 1 Testing in Spring 2002. A subsequent version of this document will address the additional procedures for Phase 2 Testing scheduled for Summer 2002.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc110994/
Creating Radioactive MARC Records and Z Queries Using the MARCdocs Database
Date: December 2, 2004
Creator: Moen, William E.
Description: This document describes how the authors can extend a relational database of MARC documentation to store the appropriate information that will support the automatic generation of the special, diagnostic MARC records the authors will call radioactive MARC (RadMARC) records. The information contained in the database will also support the generation of the Z queries used in the interoperability testing.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc111003/
Data Normalization Procedures on Decomposed MARC 21 Records
Date: October 25, 2001
Creator: Kim, Ed & Moen, William E.
Description: In this document, the authors present some aspects of data normalization of the decomposed records to improve the results of analysis. The data normalization processes use pattern-matching techniques to eliminate and/or generalize anomalous characters and terms. Since the unit of analysis in preparing the test dataset of 400,000 MARC 21 records is a "word," there was a need for data normalization to provide reliability in the subsequent analysis.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc111005/
Decomposing MARC 21 Records for Analysis
Date: October 1, 2002
Creator: Moen, William E.
Description: This document discusses decomposing MARC 21 records for analysis. To prepare the test dataset of the 1% sample of MARC 21 records from the WorldCat database for use in the Z39.50 Interoperability Testbed, the authors need to be able to efficiently analyze the records to determine relevant records to be returned for a set of test searches. The first step in that analysis is to determine the occurrence of test search terms in specific records. This document describes the general approach for this analysis and identifies specifications for the analysis.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc110995/
Developing an Alternative Approach for Interoperability Testing of Library Z39.50 Servers
Date: March 30, 2004
Creator: Moen, William E.
Description: This document describes a plan of work to develop and test an alternative approach for interoperability testing. This approach builds on the conceptual and technical infrastructure developed during the Z-Interop Project.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc111279/
Digital Information Curation for 21st Century Science and Scholarship: Experience-Based Learning for Information Professionals and Disciplinary Researchers
Date: 2011
Creator: Moen, William E.; Kim, Jeonghyun & Halbert, Martin
Description: This is the narrative for a proposal to the Institute of Museum and Library Services' (IMLS) Laura Bush 21st Century Librarian Program. The proposed initiative's goal is to build capacity in the University of North Texas' (UNT) Library and Information Sciences (LIS) curriculum to increase the number of appropriately trained information professionals and disciplinary researchers and scholars for digital curation and data management responsibilities.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc86945/
An Event Model for Herbarium Specimen Data in XML Poster Abstract
Date: 2010
Creator: Moen, William E.; Neill, Amanda K.; Best, Jason H.; McCotter, Melody; Xu, Hong & Huang, Jane Q.
Description: This abstract describes a poster about the Apiary Project. The Apiary Project, a collaboration of the Texas Center for Digital Knowledge at the University of North Texas and the Botanical Research Institute of Texas, is building a framework and web-based workflow for the extraction and parsing of herbarium specimen data. The workflow will support the transformation of written or printed specimen data into a high-quality machine-processable XML format. This poster describes an event model that informed the development of the Apiary XML Application Schema
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc81384/
Indexing Guidelines to Support Z39.50 Profile Searches
Date: February 1, 2002
Creator: Moen, William E.
Description: This document provides guidelines for indexing MARC 21 records to support a set of searches using Z39.50. The Z39.50 Interoperability Testbed Project (Z-Interop) uses these guidelines to index the 400,000 MARC 21 records that comprise the Z-Interop reference implementation of the Z39.50 server and online catalog.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc111000/
MARCdocs: The MARC 21 Bibliographic Format Database
Date: October 28, 2004
Creator: Moen, William E. & Thomale, Jason
Description: This document discusses MARCdocs. MARCdocs, the MARC 21 Documentation Database, is a pilot effort aimed at structuring the textual documentation from the MARC 21 Format for Bibliographic Data into a relational database.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc111004/
Metadata: A Networked Information Strategy to Improve Access to and Management of Government Information
Date: 2001
Creator: Moen, William E.
Description: This document is part of a Government Information Quarterly Special Issue. The author serves as the editor of this issue focusing on the use of metadata as a strategy to improve access to and management of electronic government information. Contributions by writers address federal and state metadata activities and issues.
Contributing Partner: UNT College of Information
Permallink:digital.library.unt.edu/ark:/67531/metadc102300/