Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse
PDF Version Also Available for Download.
Description
Short paper presented at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. The paper discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.
This
paper
is part of the collection entitled:
UNT Scholarly Works
and
was provided by the UNT Libraries
to the
UNT Digital Library,
a digital repository hosted by the
UNT Libraries.
It has been viewed 209 times, with 8 in the last month.
More information about this paper can be viewed below.
The UNT Libraries serve the university and community by providing access to physical and online collections, fostering information literacy, supporting academic research, and much, much more.
Short paper presented at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. The paper discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.
Physical Description
4 p.
Notes
Abstract: The End of Term Web (EOT) Archive is a collaborative project with a goal of collecting the United States federal web, loosely defines as .gov and .mil, every four years coinciding with presidential elections and often a transition in the Executive Branch of the government. In 2021 the End of Term team began to process the longitudinal web archive for EOT-2008, EOT-2012, EOT-2016, and EOT-2020 to move into the Amazon S3 storage service as part of the Amazon Open Data Program. This effort adopted tools, structures, and documentation developed by Common Crawl in an effort to maximize potential research access and reuse of existing tools and documentation. This paper presents the process of organizing, staging, processing, and moving these collections into the Amazon cloud.
Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse, ark:/67531/metadc1998718
Collections
This paper is part of the following collection of related materials.
UNT Scholarly Works
Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.
Presentation given at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. This presentation discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.
Relationship to this item: (Is Basis For)
Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse, ark:/67531/metadc1998718