Data Normalization Procedures on Decomposed MARC 21 Records

One of 28 items in the Z-Interop series available on this site.

Description

In this document, the authors present some aspects of data normalization of the decomposed records to improve the results of analysis. The data normalization processes use pattern-matching techniques to eliminate and/or generalize anomalous characters and terms. Since the unit of analysis in preparing the test dataset of 400,000 MARC 21 records is a "word," there was a need for data normalization to provide reliability in the subsequent analysis.
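The kind of word-level, pattern-based normalization described above might look roughly like the following Python sketch. It is illustrative only: the function name `normalize_words`, the two regular expressions, and the `<year>` placeholder are assumptions made here, not rules taken from the report.

```python
import re

# Hypothetical rules for illustration; the report's actual patterns are not given here.
PUNCTUATION = re.compile(r"[^\w\s]")  # anomalous characters to eliminate
YEAR = re.compile(r"^\d{4}$")         # terms to generalize to a placeholder


def normalize_words(field_value: str) -> list:
    """Split a field value into lowercase words after applying the rules above."""
    # Replace punctuation with spaces so adjacent words do not fuse together.
    cleaned = PUNCTUATION.sub(" ", field_value.lower())
    # Generalize any word that is exactly four digits; keep other words as-is.
    return ["<year>" if YEAR.match(word) else word for word in cleaned.split()]


if __name__ == "__main__":
    print(normalize_words("Published 2001!"))
```

Treating each rule as a separate compiled pattern keeps "eliminate" and "generalize" steps distinct, so either can be adjusted without touching the other.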

Physical Description

5 p.

Creation Information

Kim, Ed & Moen, William E. October 25, 2001.

Context

This text is part of the collection entitled: UNT Scholarly Works and was provided by the UNT College of Information to the UNT Digital Library, a digital repository hosted by the UNT Libraries.

Who

People and organizations associated with either the creation of this text or its content.

Authors

  • Kim, Ed University of North Texas; Z-Interop Research Assistant
  • Moen, William E. University of North Texas; Principal Investigator

Provided By

UNT College of Information

Situated at the intersection of people, technology, and information, the College of Information's faculty, staff and students invest in innovative research, collaborative partnerships, and student-centered education to serve a global information society. The college offers programs of study in information science, learning technologies, and linguistics.

What

Descriptive information to help identify this text.

Titles

  • Main Title: Data Normalization Procedures on Decomposed MARC 21 Records
  • Series Title: Z-Interop

Collections

This text is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

When

Dates and time periods associated with this text.

Creation Date

  • October 25, 2001

Added to The UNT Digital Library

  • Oct. 25, 2012, 2:42 p.m.

Description Last Updated

  • March 21, 2013, 2:01 p.m.

Kim, Ed & Moen, William E. Data Normalization Procedures on Decomposed MARC 21 Records, text, October 25, 2001; (https://digital.library.unt.edu/ark:/67531/metadc111005/: accessed March 21, 2025), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT College of Information.
