Towards Building a Collection of Web Archiving Research Articles

PDF Version Also Available for Download.

Description

Paper for the 2014 Association for Information Science and Technology (ASIS&T) Annual Meeting. This paper discusses building a collection of web archiving research articles.

Physical Description

5 p.

Creation Information

Reyes Ayala, Brenda & Caragea, Cornelia November 3, 2014.

Context

This paper is part of the collection entitled: UNT Scholarly Works and was provided by the UNT College of Information to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 351 times. More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

Provided By

UNT College of Information

Situated at the intersection of people, technology, and information, the College of Information's faculty, staff and students invest in innovative research, collaborative partnerships, and student-centered education to serve a global information society. The college offers programs of study in information science, learning technologies, and linguistics.

Contact Us

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Description

Paper for the 2014 Association for Information Science and Technology (ASIS&T) Annual Meeting. This paper discusses building a collection of web archiving research articles.

Physical Description

5 p.

Notes

Abstract: The field of Web Archiving exists in a fluid, fragmented, and heterogeneous state. Part of the problem is that this field is relatively new and its literature is scattered across a wide range of journal and conference venues. This makes the state of Web Archiving as a discipline particularly difficult to ascertain. This paper presents an approach to building a collection of articles about the subject. We begin with a small dataset of articles taken from a Web Archiving Bibliography and then proceed to expand it by crawling the Web and collecting additional documents. The crawled documents are then classified using machine learning classification techniques. We show that by extracting the documents' titles and abstracts and representing them using the "bag of words" approach, we are able to accurately identify documents from the Web crawler as documents that are about Web Archiving. We also discuss our results in the context of Web Archiving as an emerging field.

Source

  • Association for Information Science and Technology (ASIS&T) Annual Meeting, 2014, Seattle, Washington, United States

Language

Item Type

Identifier

Unique identifying numbers for this paper in the Digital Library or other systems.

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

What responsibilities do I have when using this paper?

When

Dates and time periods associated with this paper.

Creation Date

  • November 3, 2014

Added to The UNT Digital Library

  • Dec. 4, 2014, 2:16 p.m.

Usage Statistics

When was this paper last used?

Yesterday: 0
Past 30 days: 0
Total Uses: 351

Interact With This Paper

Here are some suggestions for what to do next.

Start Reading

PDF Version Also Available for Download.

International Image Interoperability Framework

IIF Logo

We support the IIIF Presentation API

Reyes Ayala, Brenda & Caragea, Cornelia. Towards Building a Collection of Web Archiving Research Articles, paper, November 3, 2014; (https://digital.library.unt.edu/ark:/67531/metadc461721/: accessed March 26, 2023), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT College of Information.

Back to Top of Screen