Towards Building a Collection of Web Archiving Research Articles

Description

Paper for the 2014 Association for Information Science and Technology (ASIS&T) Annual Meeting. This paper discusses building a collection of web archiving research articles.

Physical Description

5 p.

Creation Information

Reyes Ayala, Brenda & Caragea, Cornelia. November 3, 2014.

Context

This paper is part of the collection entitled UNT Scholarly Works and was provided by the UNT College of Information to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 130 times, with 7 views in the last month. More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

  • Reyes Ayala, Brenda
  • Caragea, Cornelia

Provided By

UNT College of Information

The UNT College of Information educates students and advances domains of knowledge in information science, library science, computing and technology systems, learning and cognition, and human performance.

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Notes

Abstract: The field of Web Archiving exists in a fluid, fragmented, and heterogeneous state. Part of the problem is that this field is relatively new and its literature is scattered across a wide range of journal and conference venues. This makes the state of Web Archiving as a discipline particularly difficult to ascertain. This paper presents an approach to building a collection of articles about the subject. We begin with a small dataset of articles taken from a Web Archiving Bibliography and then proceed to expand it by crawling the Web and collecting additional documents. The crawled documents are then classified using machine learning classification techniques. We show that by extracting the documents' titles and abstracts and representing them using the "bag of words" approach, we are able to accurately identify documents from the Web crawler as documents that are about Web Archiving. We also discuss our results in the context of Web Archiving as an emerging field.
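The classification step described in the abstract can be illustrated with a minimal sketch. The abstract does not name the classifier or the features beyond "bag of words", so everything below is an assumption made for illustration: the multinomial Naive Bayes model, the add-one smoothing, the tokenizer, the class labels "relevant" and "other", and the four toy training texts (invented title-plus-abstract strings, not data from the paper).

```python
import math
import re
from collections import Counter


def bag_of_words(text):
    """Lowercase the text, split it into alphabetic tokens, and count them."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing (illustrative choice only)."""

    def fit(self, texts, labels):
        self.class_docs = Counter(labels)  # number of training documents per class
        self.word_counts = {label: Counter() for label in self.class_docs}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(bag_of_words(text))
        self.vocab = set()
        for counts in self.word_counts.values():
            self.vocab.update(counts)
        return self

    def predict(self, text):
        bag = bag_of_words(text)
        total_docs = sum(self.class_docs.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior of the class
            for word, freq in bag.items():
                if word in self.vocab:  # skip words never seen in training
                    score += freq * math.log((counts[word] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical toy training set: each text stands in for a title plus abstract.
train = [
    ("Crawling strategies for web archives. We study how crawlers capture archived web pages.", "relevant"),
    ("Preserving web content in national web archives. Archived pages and crawl quality.", "relevant"),
    ("Deep learning for image segmentation. A convolutional network for medical images.", "other"),
    ("Protein folding simulation. Molecular dynamics of protein structures.", "other"),
]
model = NaiveBayes().fit([text for text, _ in train], [label for _, label in train])

print(model.predict("Quality of archived web pages in a web archive crawl"))
print(model.predict("Molecular simulation of protein images"))
```

A realistic setup would train on the bibliography articles as positive examples and score each crawled document's extracted title and abstract. The toy data here only shows the mechanics of counting words and comparing class scores.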

Source

  • Association for Information Science and Technology (ASIS&T) Annual Meeting, 2014, Seattle, Washington, United States

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

When

Dates and time periods associated with this paper.

Creation Date

  • November 3, 2014

Added to The UNT Digital Library

  • Dec. 4, 2014, 2:16 p.m.

Citations, Rights, Re-Use

Reyes Ayala, Brenda & Caragea, Cornelia. Towards Building a Collection of Web Archiving Research Articles, paper, November 3, 2014; (digital.library.unt.edu/ark:/67531/metadc461721/: accessed December 11, 2017), University of North Texas Libraries, Digital Library, digital.library.unt.edu; crediting UNT College of Information.