Date: April 2013
Creator: Phillips, Mark Edward & Murray, Kathleen R.
Description: This paper discusses improving access to web archives through innovative analysis of PDF content. The paper discusses the overall workflow and describes the tools used to extract document features. Findings suggest opportunities for the development of retrieval tools that will provide new ways of selecting content and building collections from large Web archives.
Contributing Partner: UNT Libraries