End of Term 2008 Presidential Web Archive: PDF Content Analysis Metadata

Metadata describes a digital item, providing (if known) such information as creator, publisher, contents, size, relationship to other resources, and more. Metadata may also contain "preservation" components that help us to maintain the integrity of digital files over time.

Title

  • Main Title End of Term 2008 Presidential Web Archive: PDF Content Analysis

Creator

  • Author: Phillips, Mark Edward
    Creator Type: Personal
    Creator Info: University of North Texas

Date

  • Creation: 2012-12-05

Language

  • English

Description

  • Content Description: This presentation discusses the End of Term 2008 Presidential Web Archive. The University of North Texas (UNT) Libraries collaborated with members of the International Internet Preservation Consortium (IIPC) on the End of Term 2008 Presidential Web Harvest from October, 2008 to February, 2009. The project team archived 160,211,356 URIs during this collaboration, which became a research dataset for an IMLS-funded grant to investigate collection development using web archives. The project team analyzed the 10,318,073 PDFs and developed a retrieval and exploration system for collection developers interested in acquiring and developing born-digital collections from the End of Term Web Archive.
  • Physical Description: 104 p.

Subject

  • Keyword: archives
  • Keyword: harvests
  • Keyword: End of Term Archives
  • Keyword: Presidential campaigns

Source

  • Conference: Best Practices Exchange Conference, 2012, Annapolis, Maryland, United States

Collection

  • Name: UNT Scholarly Works
    Code: UNTSW

Institution

  • Name: UNT Libraries
    Code: UNT

Rights

  • Rights Access: public

Resource Type

  • Presentation

Format

  • Image

Identifier

  • Archival Resource Key: ark:/67531/metadc130188

Degree

  • Academic Department: Libraries

Note

  • Display Note: Abstract: This presentation discusses the End of Term 2008 Presidential Web Archive. The University of North Texas (UNT) Libraries collaborated with members of the International Internet Preservation Consortium (IIPC) on the End of Term 2008 Presidential Web Harvest from October, 2008 to February, 2009. The project team archived 160,211,356 URIs during this collaboration, which became a research dataset for an IMLS-funded grant to investigate collection development using web archives. The project team analyzed the 10,318,073 PDFs and developed a retrieval and exploration system for collection developers interested in acquiring and developing born-digital collections from the End of Term Web Archive.