Curation of the End-of-Term Web Archive

PDF Version Also Available for Download.

Description

Paper for the 2011 IS&T Archiving Conference. This paper discusses the Classification of the End-of-Term Archive research project at the University of North Texas.

Physical Description

6 p.

Creation Information

Murray, Kathleen R.; Ko, Lauren & Phillips, Mark Edward 2011.

Context

This paper is part of the collection entitled: UNT Scholarly Works and was provided by UNT Libraries to Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 160 times . More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

Publisher

Provided By

UNT Libraries

Library facilities at the University of North Texas function as the nerve center for teaching and academic research. In addition to a major collection of electronic journals, books and databases, five campus facilities house just under six million cataloged holdings, including books, periodicals, maps, documents, microforms, audiovisual materials, music scores, full-text journals and books. A branch library is located at the University of North Texas Dallas Campus.

Contact Us

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Degree Information

Description

Paper for the 2011 IS&T Archiving Conference. This paper discusses the Classification of the End-of-Term Archive research project at the University of North Texas.

Physical Description

6 p.

Notes

Abstract: The Classification of the End-of-Term Archive research project at the University of North Texas Libraries is investigating the feasibility of machine-generated classification of websites in the 16-terabyte End-of-Term (EOT) Web Archive. The research is being conducted concurrently in two areas: Archive Classification and Web Archive Metrics. A set of 1,151 URLs within the EOT Archive was analyzed using link analysis methods to identify related groupings or clusters. Investigations into visualization of the underlying relationships among the URLs were also conducted. Subject Matter Experts (SMEs) in the classification of government information manually classified the same set of URLs using the Superintendent of Documents (SuDocs) Classification Numbering System, which is a hierarchical scheme that groups government publications by federal agencies. The SME-classification will serve as the criterion to evaluate the effectiveness of the link analysis. In a parallel work area of the project, metrics for Web archives were discussed in a focus group with the SMEs, who identified key criteria libraries would likely employ in acquiring materials from Web archives. Participants also identified two service models libraries will need from Web archive service providers: acquisition and access models. A subsequent survey of Federal Depository Libraries measured the demand for each of these models, as well as libraries' perceived capabilities to support long-term preservation and local hosting of materials from Web archives. It appears that some existing library metrics, but more importantly, standard usage statistics will be essential metrics.

Source

  • IS & T--the Society for Imaging Science and Technology Archiving Conference, 2011, Salt Lake City, Utah, United States

Language

Item Type

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

The Scholarly Works Collection is home to materials from the University of North Texas community's research, creative, and scholarly activities and serves as UNT's Open Access Repository. It brings together articles, papers, artwork, music, research data, reports, presentations, and other scholarly and creative products representing the expertise in our university community. Access to some items in this collection may be restricted.

What responsibilities do I have when using this paper?

When

Dates and time periods associated with this paper.

Creation Date

  • 2011

Added to The UNT Digital Library

  • June 7, 2011, 8:36 a.m.

Description Last Updated

  • Feb. 26, 2014, 12:29 p.m.

Usage Statistics

When was this paper last used?

Yesterday: 1
Past 30 days: 1
Total Uses: 160

Interact With This Paper

Here are some suggestions for what to do next.

Start Reading

PDF Version Also Available for Download.

Citations, Rights, Re-Use

Murray, Kathleen R.; Ko, Lauren & Phillips, Mark Edward. Curation of the End-of-Term Web Archive, paper, 2011; [Springfield, Virginia]. (digital.library.unt.edu/ark:/67531/metadc36301/: accessed April 27, 2017), University of North Texas Libraries, Digital Library, digital.library.unt.edu; .