Analyzing WARC on Serverless Computing Metadata
Metadata describes a digital item, providing (if known) such information as creator, publisher, contents, size, relationship to other resources, and more. Metadata may also contain "preservation" components that help us to maintain the integrity of digital files over time.
Title
- Main Title Analyzing WARC on Serverless Computing
Creator
-
Author: Chen, YinlinCreator Type: PersonalCreator Info: Virginia Tech
Contributor
-
Organizer of meeting: Bibliothèque nationale (Luxembourg)Contributor Type: Organization
-
Organizer of meeting: International Internet Preservation ConsortiumContributor Type: Organization
Date
- Creation: 2021-06-15
Language
- English
Description
- Content Description: Presentation for the IIPC General Assembly and Web Archiving Conference virtually held on June 14-16, 2021. This presentation highlights Virginia Tech's serverless architecture design and implementations, elaborate the technical solution on integrating multiple AWS services with other techniques, and describes their streamlined and scalable approach to analyze large WARC datasets.
- Physical Description: 22 p.
Subject
- Keyword: web archiving
- Keyword: data visualization
- Keyword: data analysis
Source
- Conference: 2021 International Internet Preservation Coalition (IIPC) General Assembly and Web Archiving Conference, June 14-16, 2021
Collection
-
Name: International Internet Preservation Consortium (IIPC) General Assembly and Web Archiving ConferenceCode: IIPCM
Institution
-
Name: International Internet Preservation ConsortiumCode: IIPC
Rights
- Rights Access: public
Resource Type
- Presentation
Format
- Image
Identifier
- Accession or Local Control No: Chen_Analyzing_WARC
- Archival Resource Key: ark:/67531/metadc1827556