UNT College of Information - 4 Matching Results

Search Results

Data Normalization Procedures on Decomposed MARC 21 Records

Description: In this document, the authors present some aspects of data normalization of the decomposed records to improve the results of analysis. The data normalization processes use pattern-matching techniques to eliminate and/or generalize anomalous characters and terms. Since the unit of analysis in preparing the test dataset of 400,000 MARC 21 records is a "word," there was a need for data normalization to provide reliability in the subsequent analysis.
Date: October 25, 2001
Creator: Kim, Ed & Moen, William E.

SQL Data Analysis Procedures to Create Aggregate and Candidate Record Groups on Sample of Decomposed MARC Records Phase 1 Testing

Description: This document describes the data analysis procedures developed to create the Aggregate and Candidate Record Groups using SQL statements. This is the preliminary version of these procedures tested and validated on a sample of decomposed MARC records. (For a description of how the MARC records were decomposed see the Z-Interop document, Decomposing MARC 21 Records for Analysis. A subsequent version may be necessary as the authors move to the procedures for the entire file of decomposed records.
Date: October 14, 2001
Creator: Yoon, JungWon & Moen, William E.