Politely Downloading Millions of WARC Files Without Burning the Servers Down

Description

Poster sharing the development and release of an open-source, cross-platform, dependency-free, and user-friendly tool that implements an HTTP client with an easy-to-configure retry strategy based on exponential backoff and jitter. These more polite retry strategies help users avoid download errors and download data more quickly, because exponential backoff with jitter spreads retries out over time and lets the server handle concurrent requests more easily. It was presented at the IIPC General Assembly and Web Archiving Conference, held April 8-10, 2025, in Oslo, Norway.
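
The poster's tool is not reproduced in this record. As a rough illustration of the general technique the description names, the following Python sketch retries a download with exponential backoff and "full" jitter. The function names (backoff_delay, fetch_with_retries), the default values, and the set of retried status codes are all assumptions chosen for the example and are not taken from the poster.

```python
import random
import time
import urllib.error
import urllib.request

# Status codes treated as temporary here (an assumption for this sketch).
RETRYABLE_STATUS = (429, 500, 502, 503, 504)


def backoff_delay(attempt, base=1.0, cap=60.0, rng=random.random):
    """Full-jitter delay: a random fraction of min(cap, base * 2**attempt)."""
    return rng() * min(cap, base * (2 ** attempt))


def fetch_with_retries(url, max_attempts=5, base=1.0, cap=60.0,
                       opener=urllib.request.urlopen, sleep=time.sleep):
    """Download url, retrying temporary failures with backoff and jitter."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    last_error = None
    for attempt in range(max_attempts):
        try:
            with opener(url) as response:
                return response.read()
        except urllib.error.HTTPError as err:
            # HTTPError subclasses URLError, so it must be caught first.
            if err.code not in RETRYABLE_STATUS:
                raise
            last_error = err
        except urllib.error.URLError as err:
            # Connection-level failure (DNS, refused connection, timeout).
            last_error = err
        if attempt < max_attempts - 1:
            # The wait ceiling doubles each attempt; jitter keeps many
            # clients from retrying at the same instant.
            sleep(backoff_delay(attempt, base, cap))
    raise last_error
```

With these assumed defaults, the upper bound on the wait grows as 1, 2, 4, and 8 seconds across the retries, and each actual wait is drawn uniformly below that bound, so concurrent clients that fail together are unlikely to retry together.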

Physical Description

2 p.: ill.; 23.39 x 33.11 in.

Creation Information

Ortiz Suarez, Pedro; Vaughan, Thom & Lindahl, Greg. April 9, 2025.

Context

This poster is part of the collection entitled: International Internet Preservation Consortium (IIPC) General Assembly and Web Archiving Conference and was provided by the International Internet Preservation Consortium to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 12,994 times, with 21 in the last month. More information about this poster can be viewed below.

Who

People and organizations associated with either the creation of this poster or its content.

Provided By

International Internet Preservation Consortium

The mission of the IIPC is to acquire, preserve and make accessible knowledge and information from the Internet for future generations everywhere, promoting global exchange and international relations.

What

Descriptive information to help identify this poster. Follow the links below to find similar items on the Digital Library.

Source

  • 2025 International Internet Preservation Consortium (IIPC) General Assembly and Web Archiving Conference, April 8-10, 2025. Oslo, Norway

Collections

This poster is part of the following collection of related materials.

International Internet Preservation Consortium (IIPC) General Assembly and Web Archiving Conference

Presentations, abstracts, posters and other materials from the International Internet Preservation Consortium's (IIPC) annual General Assembly and Web Archiving Conference.

When

Dates and time periods associated with this poster.

Creation Date

  • April 9, 2025

Added to The UNT Digital Library

  • June 30, 2025, 6:42 p.m.

Description Last Updated

  • Sept. 19, 2025, 11:30 a.m.

Ortiz Suarez, Pedro; Vaughan, Thom & Lindahl, Greg. Politely Downloading Millions of WARC Files Without Burning the Servers Down, poster, April 9, 2025; (https://digital.library.unt.edu/ark:/67531/metadc2472441/: accessed June 7, 2026), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting International Internet Preservation Consortium.
