Description: This paper discusses improving access to web archives through innovative analysis of PDF content. The paper discusses the overall workflow and describes the tools used to extract document features. Findings suggest opportunities for the development of retrieval tools that will provide new ways of selecting content and building collections from large Web archives.
Date: April 2013
Creator: Phillips, Mark Edward & Murray, Kathleen R.
Item Type: Refine your search to only Paper
Partner: UNT Libraries