Search Results

open access

Programmatic Extraction of ‘Documents’ from Web Archives: Identifying Document Characteristics from Content Selector Interviews

Description: White paper documenting the results of interviews with professionals who manage collections of state or federal documents, and institutional repositories. These interviews gathered information about collection policies and characteristics of born-digital publications that are incorporated into these bodies of materials, to inform future machine learning algorithms.
Date: 2020
Creator: Fox, Nathaniel T.; Phillips, Mark Edward & Tarver, Hannah
Back to Top of Screen