Analyzing WARC on Serverless Computing Metadata

Metadata describes a digital item, providing (if known) such information as creator, publisher, contents, size, relationship to other resources, and more. Metadata may also contain "preservation" components that help us to maintain the integrity of digital files over time.

Title

  • Main Title: Analyzing WARC on Serverless Computing

Creator

  • Author: Chen, Yinlin
    Creator Type: Personal
    Creator Info: Virginia Tech

Contributor

  • Organizer of meeting: Bibliothèque nationale (Luxembourg)
    Contributor Type: Organization
  • Organizer of meeting: International Internet Preservation Consortium
    Contributor Type: Organization

Date

  • Creation: 2021-06-15

Language

  • English

Description

  • Content Description: Presentation for the IIPC General Assembly and Web Archiving Conference, held virtually on June 14-16, 2021. This presentation highlights Virginia Tech's serverless architecture design and implementations, elaborates on the technical solution for integrating multiple AWS services with other techniques, and describes their streamlined and scalable approach to analyzing large WARC datasets.
  • Physical Description: 22 p.

Subject

  • Keyword: web archiving
  • Keyword: data visualization
  • Keyword: data analysis

Source

  • Conference: 2021 International Internet Preservation Consortium (IIPC) General Assembly and Web Archiving Conference, June 14-16, 2021

Collection

  • Name: International Internet Preservation Consortium (IIPC) General Assembly and Web Archiving Conference
    Code: IIPCM

Institution

  • Name: International Internet Preservation Consortium
    Code: IIPC

Rights

  • Rights Access: public

Resource Type

  • Presentation

Format

  • Image

Identifier

  • Accession or Local Control No: Chen_Analyzing_WARC
  • Archival Resource Key: ark:/67531/metadc1827556