This system will be undergoing maintenance December 14th from 8:00AM to 1:00PM CST.

Search Results

Unlocking the Archive: Open Access to News Content as Corpora [Presentation]

Description: Presentation describing the experience of analyzing web news at scale, enriching its metadata and providing it as data for computational analysis, using open API, the workflow, limitations found and opportunities. It was presented at the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Creator: Tønnessen, Jon Carlstedt & Birkenes, Magnus Breder
Partner: International Internet Preservation Consortium
open access

Rocket-Borne IR Scanning Radiometer for High Altitude Radiation Measurements

Description: Report describing a transistorized, scanning, filter radiometer designed for use in an Atlas piggy-back pod. The instrument is designed for moderately narrow-band measurements at 2.7, 3.5, 4.3, 4.7, and 6.3 microns of the irradiance arriving at the radiometer from the earth's atmosphere within a small field of view as a function of height above the true horizon. The measurements will be made while the vehicle is in the altitude range 3 x 100,000 to 10 to the 6th power feet. A fairly complete de… more
Date: April 10, 1962
Creator: Smiley, Vern N.
Partner: UNT Libraries Government Documents Department
captions transcript

Zombie E-Journals and the National Library of Spain [Video]

Description: Recording of a presentation exploring the efforts of the Spanish Web Archive to preserve electronic journals in Spain to ensure that the content does not become in accessible. It was presented at the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025, in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 35 seconds
Creator: Cerdán Medina, José Carlos
Partner: International Internet Preservation Consortium
captions transcript

Unlocking the Archive: Open Access to News Content as Corpora [Video]

Description: Recording of a presentation describing the experience of analyzing web news at scale, enriching its metadata and providing it as data for computational analysis, using open API, the workflow, limitations found and opportunities. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 14 minutes 42 seconds
Creator: Tønnessen, Jon Carlstedt & Birkenes, Magnus Breder
Partner: International Internet Preservation Consortium
captions transcript

From Posts to Archives: The National Library of Singapore’s Journey in Collecting Social Media [Video]

Description: Recording of a presentation highlights the National Library of Singapore's (NLS) journey in collecting social media, our collecting framework and strategy, as well as learning points and future plans. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 14 minutes 37 seconds
Creator: Tay, Shereen & Lee, Meiyu
Partner: International Internet Preservation Consortium
captions transcript

So You’ve Got a WACZ: How Archives Become Verifiable Evidence [Video]

Description: Recording of a presentation that shares a workflow and toolkit, developed by the Starling Lab for Data Integrity, for collecting and organizing web archives alongside integrity and provenance data. It also showcases case studies and projects with our collaborators including Black Voice News, the Atlantic Council’s DFRLab, and conflict monitors. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 16 minutes 05 seconds
Creator: Simon, Basile & Walker, Lindsay
Partner: International Internet Preservation Consortium
captions transcript

Recently Orphaned Newspapers: From Archived Webpages to Reusable Datasets and Research Outlooks [Video]

Description: Recording of a presentation reporting on the progress of converting the web archives of a recently orphaned newspaper into accessible article collections in IPTC (International Press Telecommunications Council) standard format for news representation. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 17 minutes 54 seconds
Creator: Chuang, Tyng-Ruey; Wang, Chia-Hsun & Wu, Hung-Yen
Partner: International Internet Preservation Consortium
captions transcript

Analysing the Publications Office of the European Union Web Archive for the Rationalisation of Digital Content Generation [Video]

Description: Recording of a presentation that show various interesting statistics generated about the content the Publications Office of the EU has crawled for their web archive.. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 14 seconds
Creator: Angers, Alexandre
Partner: International Internet Preservation Consortium
captions transcript

NewsWARC: Analyzing News Over Time in the Web Archive [Video]

Description: Recording of a presentation highlighting NewsWARC, a tool, developed as an internship project, for aiding researchers to explore news content in a web archive collection over time. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 13 minutes 10 seconds
Creator: Emara, Amr; Ezz, Khaled; Hazem, Shaden & Eldakar, Youssef
Partner: International Internet Preservation Consortium
captions transcript

Making Research Data Published to the Web FAIR [Video]

Description: Recording of a presentation discussing the work undertaken by the University of Sheffield’s Library to mitigate potential data loss from research published online. It includes a case study of the capturing of a research group’s website to deposit in our institutional data repository, the creation of collaboratively created guidance for researchers and research data managers, and the embedding good practice at the University to enable Open Research and Open Data will remain open and FAIR. This r… more
Date: April 10, 2025
Duration: 15 minutes 47 seconds
Creator: Hooper, Bryony
Partner: International Internet Preservation Consortium
captions transcript

Innovative Web Archiving Amid Crisis: Leveraging Browsertrix and Hybrid Working Models to Capture the UK General Election 2024 [Video]

Description: Recording of a presentation, that discuss the challenges and opportunities encountered during the process of archiving websites for the UK general election in 2024, providing valuable insights for those interested in Browsertrix’s capabilities and in executing web archiving with a mixed-model approach across different institutions with diverse interests and expertise in unusually challenging circumstances within the framework provided by a historic time series. This recording originated from th… more
Date: April 10, 2025
Duration: 20 minutes 11 seconds
Creator: Bingham, Nicola & Grimshaw, Jennie
Partner: International Internet Preservation Consortium
captions transcript

From Pages to People: Tailoring Web Archives for Different Use Cases [Video]

Description: Recording of a presentation describing work being done to improve the useability of the UK Web Archive within Cambridge University Libraries and the National Library of Scotland with the help of developing additional materials (datasets, interfaces) and planning outreach events (exhibitions, calls, workshops) to ensure the web archives meet the expectations of readers, data users, and the digitally curious.. This recording originated from the IIPC General Assembly and Web Archiving Conference h… more
Date: April 10, 2025
Duration: 13 minutes 01 second
Creator: Kocsis, Andrea & Talboom, Leontien
Partner: International Internet Preservation Consortium
captions transcript

Enhancing Accessibility to Belgian Born-Digital Heritage [Video]

Description: Recording of a presentation that provides an overview of the BelgicaWeb project’s system architecture, the technical challenges encountered in collecting content, and the solutions implemented. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 16 minutes 30 seconds
Creator: Vandendyck, Christina
Partner: International Internet Preservation Consortium
captions transcript

Detecting and Diagnosing Errors in Replaying Archived Web Pages [Video]

Description: Recording of a presentation that describes the presenters' work in developing a new approach for a) more reliably detecting whether the replay of an archived page violates fidelity, and b) pinpointing the cause when this occurs. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 15 minutes 50 seconds
Creator: Zhu, Jingyuan; Sun, Huanchen & Madhyastha, Harsha
Partner: International Internet Preservation Consortium
captions transcript

Building a Toolchain for Screen Recording-Based Web Archiving of SVOD Platforms [Video]

Description: Recording of a presentation sharing the ongoing development of a generic toolchain based on screen recording designed to effectively address DRM restrictions, capture high-quality content, and scale efficiently. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 13 minutes 58 seconds
Creator: Di Lisi, Alexis
Partner: International Internet Preservation Consortium
captions transcript

Better Together: Building a Scalable Multi-Crawler Web Harvesting Toolkit [Video]

Description: Recording of a presentation that outline some of the many lessons and best practices the Internet Archive has learned from the challenges, requirements, research, and practical experience from collaborating with other memory institutions for over 25 years to meet the harvesting needs of the preservation community. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 14 minutes 09 seconds
Creator: Dempsey, Alex
Partner: International Internet Preservation Consortium
captions transcript

Building Towards Environmentally Sustainable Web Archiving: The UK Government Web Archive and Beyond [Video]

Description: Recording of a presentation that discuss one approach to the development of a framework for more environmentally sustainable web archiving, using the UK Government Web Archive as a case study. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 15 minutes 50 seconds
Creator: Winters, Jane; Goudarouli, Eirini & Bickford, Jake
Partner: International Internet Preservation Consortium
captions transcript

Where Fashion Meets Science: Collecting and Curating a Creative Web Archive [Video]

Description: Recording of a presentation highlighting the work of the University of the Arts London to preserve the websites of the Helen Storey Foundation Archive. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 28 seconds
Creator: Thurlow, Elisabeth
Partner: International Internet Preservation Consortium
captions transcript

Preservation of Mexico's Web Heritage [Video]

Description: Recording of a presentation that discusses the challenges faced during the first stage of selection and awareness raising of what will be the Mexican Web Archive, through the National Library of Mexico. This presentation will describe the decision-making process, ethical problems, strategies developed in this stage of selection and awareness of the subject because it is little known in Mexico. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-… more
Date: April 10, 2025
Duration: 5 minutes 11 seconds
Creator: Silva Bretón, Carolina
Partner: International Internet Preservation Consortium
captions transcript

Modernizing Web Archives: The Bumpy Road Towards a General ARC2WARC Conversion Tool [Video]

Description: Recording of a presentation sharing the presenters' experience converting Common Crawl’s older ARC archives to WARC. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 35 seconds
Creator: Ortiz Suarez, Pedro
Partner: International Internet Preservation Consortium
captions transcript

"What You See No One Saw" [Video]

Description: Recording of a presentation that explores a technical and human-centered approach to better preserve the web by focusing on personas—archetypes of web users with distinct behaviors, preferences, and interactions. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 01 second
Creator: Kelly, Mat; Poole, Alex H.; Weigle, Michele C.; Nelson, Michael L.; Reid, Travis; Rauch, Christopher B. et al.
Partner: International Internet Preservation Consortium
captions transcript

Modifying ePADD for Entity Extraction in Non-English Languages [Video]

Description: Recording of a presentation sharing recent advancements in enhancing ePADD, an open-source email archiving tool, by modifying its lexical search and entity extraction pipeline to accommodate a non-English language. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 21 seconds
Creator: Beauguitte, Pierre
Partner: International Internet Preservation Consortium
captions transcript

Modelling Archived Web Objects as Semantic Entities to Manage Contextual and Versioning Issues [Video]

Description: Recording of a presentation demonstrating a data model with the ontology layering that describes it and discusses how this model could support more sustainable versioning practices. This recording originated from the IIPC General Assembly and Web Archiving Conference held on April 8-10, 2025 in Oslo, Norway.
Date: April 10, 2025
Duration: 5 minutes 30 seconds
Creator: Storrar, Tom
Partner: International Internet Preservation Consortium
Back to Top of Screen