Search Results

open access

TwitterVane Administrators Guide

Description: This document serves as the TwitterVane Administrators Guide. It describes how to control, monitor, and maintain Tweet Stream and Tweet processing for the TwitterVane web application.
Date: February 21, 2013
Partner: International Internet Preservation Consortium
open access

WARC implementation guidelines

Description: This report gathers advice and best practice to help institutions designing and creating WARC files for collection management, access, preservation, and interoperability with collections from different institutions.
Date: January 27, 2009
Creator: Oury, Clément
Partner: International Internet Preservation Consortium
open access

Crowdsourcing Workshop & Use Cases

Description: This report describes a crowdsourcing workshop at the 2012 International Internet Preservation Coalition General Assembly. This report contains a workshop report, the discussion paper "Can Crowdsourcing Play a Role in Archiving the Web?, workshop schedule, a list of resources, questions to ask of crowdsourcing sites, crowdsourcing use case templates, and the article "The Crowd & the Library: The Agony and Exstasy of 'Crowdsourcing' Our Cultural Heritage."
Date: May 4, 2012
Creator: Pennock, Maureen E.; Hockx-Yu, Helen & Owens, Trevor
Partner: International Internet Preservation Consortium
open access

Putting it all together: creating a unified web harvesting workflow at the Bibliothèque nationale de France

Description: This article presents the complete web harvesting workflow at the Bibliothèque Nationale de France for the International Internet Preservation Consortium sponsored workshop "How to fit in? Integrating a web archiving program in your organisation."
Date: November 2012
Creator: Le Follic, Annick; Stirling, Peter & Wendland, Bert
Partner: International Internet Preservation Consortium
open access

Web Harvesting Survey

Description: This document contains a survey to identify and classify many of the conditions found on web sites that influence the harvesting of content and the quality of an archival crawl.
Date: March 8, 2004
Creator: Library of Congress
Partner: International Internet Preservation Consortium
open access

Web Archives: The Future(s)

Description: This report aims to stimulate further discussion among web archivists and researchers about the future ways in which web archives can be used by researchers.
Date: June 30, 2011
Creator: Meyer, Eric T.; Thomas, Arthur & Schroeder, Ralph
Partner: International Internet Preservation Consortium
open access

Harvesting Practices Report

Description: This report summarizes the results of the International Internet Preservation Consortium (IIPC) Harvesting Practices Survey, developed in order to understand, analyze and to collate the current Internet archiving processes and experiences amongst IIPC members.
Date: June 10, 2011
Creator: Mayr, Michaela
Partner: International Internet Preservation Consortium
open access

Facing the Challenge of Web Archives Preservation: the Role and Work of the IIPC Preservation Working Group

Description: This paper documents the results of a survey about the current state of preservation in International Internet Preservation Consortium (IIPC) member web archives.
Date: October 2014
Creator: Goethals, Andrea; Oury, Clément; Pearson, David; Sierman, Barbara & Steinke, Tobias
Partner: International Internet Preservation Consortium
open access

Twittervane Guide

Description: This document contains a guide for using Twittervane, a tool that can extract and analyse URLs embedded in a tweet, allowing for the capture of URLs related to a specific topic of interest in a collection.
Date: unknown
Partner: International Internet Preservation Consortium
open access

Information and documentation — Statistics and Quality Indicators for Web Archiving

Description: This technical report defines statistical terms and quality criteria for Web archiving. It considers the needs and practices across a wide range of heritage and research organisations such as national and research libraries, archives, museums, research centres and heritage foundations.
Date: 2012
Partner: International Internet Preservation Consortium
open access

A Vision of the Role and Future of Web Archives

Description: This text was presented at the 2012 General Assembly of the International Internet Preservation Coalition, and appears as a three-part blog post in The Signal, a blog hosted by the Library of Congress. This text discusses the role and future of web archives.
Date: 2012
Creator: Leetaru, Kalev H.
Partner: International Internet Preservation Consortium
open access

Web Harvesting Survey

Description: This report contains a survey of the conditions found on web sites that influence the harvesting of content and the quality of an archival crawl.
Date: July 2004
Creator: Marill, Jennifer; Boyko, Andrew; Ashenfelder, Michael & Jones, Gina
Partner: International Internet Preservation Consortium
open access

Web Archive Profiling Via Sampling Final Report

Description: This report covers the results, deliverables, and ongoing status of the International Internet Preservation Consortium (IIPC) funded project "Web Archive Profiling Via Sampling" with links to code, datasets, presentations, and papers as appropriate.
Date: September 16, 2016
Creator: Alam, Sawood; Nelson, Michael L.; Van de Sompel, Herbert; Balakireva, Lyudmila; Shankar, Harihar; Bornand, Nicolas J. et al.
Partner: International Internet Preservation Consortium
open access

Evaluating Twittervane: Project Final Report

Description: This report provides the final update on the Twittervane project, a prototype application capable of collecting and analyzing Twitter feeds and outputting URLs mentioned in the Tweets.
Date: June 16, 2013
Creator: Pitt, Mary & Hockx-Yu, Helen
Partner: International Internet Preservation Consortium
Back to Top of Screen