Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse

PDF Version Also Available for Download.

Description

Short paper presented at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. The paper discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.

Physical Description

4 p.

Creation Information

Phillips, Mark Edward & Alam, Sawood June 24, 2022.

Context

This paper is part of the collection entitled: UNT Scholarly Works and was provided by the UNT Libraries to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 154 times, with 4 in the last month. More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

Rights Holder

For guidance see Citations, Rights, Re-Use.

  • © The Authors

Provided By

UNT Libraries

The UNT Libraries serve the university and community by providing access to physical and online collections, fostering information literacy, supporting academic research, and much, much more.

Contact Us

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Degree Information

Description

Short paper presented at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. The paper discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.

Physical Description

4 p.

Notes

Abstract: The End of Term Web (EOT) Archive is a collaborative project with a goal of collecting the United States federal web, loosely defines as .gov and .mil, every four years coinciding with presidential elections and often a transition in the Executive Branch of the government. In 2021 the End of Term team began to process the longitudinal web archive for EOT-2008, EOT-2012, EOT-2016, and EOT-2020 to move into the Amazon S3 storage service as part of the Amazon Open Data Program. This effort adopted tools, structures, and documentation developed by Common Crawl in an effort to maximize potential research access and reuse of existing tools and documentation. This paper presents the process of organizing, staging, processing, and moving these collections into the Amazon cloud.

This is the pre-print version of the paper. Ok

Language

Item Type

Identifier

Unique identifying numbers for this paper in the Digital Library or other systems.

Relationships

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

Related Items

Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse (Presentation)

Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse

Presentation given at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. This presentation discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.

Relationship to this item: (Is Basis For)

Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse, ark:/67531/metadc1998718

What responsibilities do I have when using this paper?

When

Dates and time periods associated with this paper.

Creation Date

  • June 24, 2022

Added to The UNT Digital Library

  • Sept. 30, 2022, 11:32 a.m.

Description Last Updated

  • Oct. 6, 2022, 9:19 a.m.

Usage Statistics

When was this paper last used?

Yesterday: 0
Past 30 days: 4
Total Uses: 154

Interact With This Paper

Here are some suggestions for what to do next.

Start Reading

PDF Version Also Available for Download.

International Image Interoperability Framework

IIF Logo

We support the IIIF Presentation API

Phillips, Mark Edward & Alam, Sawood. Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse, paper, June 24, 2022; (https://digital.library.unt.edu/ark:/67531/metadc1998717/: accessed June 15, 2024), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; .

Back to Top of Screen