UNT Libraries - Browse

Extended Date/Time Format (EDTF) Dates Research Datasets

Description: Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

One Million Pages of Texas Newspapers: Dataset

Description: This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally, the 3,349,156 item uses associated with this dataset as of April 7, 2013, are included.
Date: April 7, 2013
Creator: Phillips, Mark Edward & Hicks, William
Item Type: Dataset

Quality Assurance Practices in Web Archiving [Dataset]

Description: This dataset contains the results of a survey of quality assurance (QA) practices among web archiving practitioners. To understand current QA practices, the authors surveyed institutions engaged in web archiving, which included national libraries, colleges and universities, and museums and art libraries. The survey was administered online. The dataset includes the completed responses of 54 participants. The data has been anonymized for privacy reasons. This dataset was used in the "Current Quality Assurance Practices in Web Archiving" paper, available from the UNT Digital Library.
Date: December 2014
Creator: Reyes Ayala, Brenda; Phillips, Mark Edward & Ko, Lauren
Item Type: Dataset

Restricted University of North Texas Electronic Theses and Dissertations

Description: This dataset contains responses to a survey questionnaire distributed by the University of North Texas (UNT) Libraries asking 125 authors of electronic theses and dissertations (ETDs) whether they agree to change the existing restricted permission status on their ETDs.
Date: February 24, 2014
Creator: Alemneh, Daniel Gelaw
Item Type: Dataset

Texas Newspapers Natural Language Processing

Description: This dataset includes data on natural language processing from the Texas Newspapers Project. The dataset includes word counts, named entity recognition results, and topic models.
Date: April 7, 2013
Creator: Torget, Andrew J., 1978-
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP001]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP002]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP003]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP004]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP005]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP006]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP007]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP008]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP009]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP010]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP011]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP012]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP013]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP014]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP015]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP016]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP017]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP018]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP019]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset