UNT Data Repository - Browse

ABOUT BROWSE FEED

One Million Pages of Texas Newspapers: Dataset

Description: This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally the 3,349,156 item uses associated with this dataset as of April 7, 2013 are included.
Date: April 7, 2013
Creator: Phillips, Mark Edward & Hicks, William
Item Type: Dataset
Partner: UNT Libraries

Extended Date/Time Format (EDTF) Dates Research Datasets

Description: Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

"Stand With Wendy" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries collected the week following the filibuster by Wendy Davis in the Texas Senate related to Senate Bill 5, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 560,954 Tweets make up the combined dataset.
Date: 2013-06-25/2013-07-03
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

DataRes Project Secondary Survey

Description: Dataset from the DataRes Project. This dataset is the secondary survey on data management needs of researchers.
Date: October 2012
Creator: Keralis, Spencer D. C.; Stark, Shannon; Halbert, Martin & Moen, William E.
Item Type: Dataset
Partner: UNT Libraries

DataRes Project Primary Survey

Description: Dataset from the DataRes Project. This dataset is the primary survey on data management needs of researchers.
Date: June 2012
Creator: Keralis, Spencer D. C.; Stark, Shannon; Halbert, Martin & Moen, William E.
Item Type: Dataset
Partner: UNT Libraries

Texas Newspapers Natural Language Processing

Description: This dataset includes data on natural language processing from the Texas Newspapers Project. The dataset includes word counts, name entity recognition results, and topic models.
Date: April 7, 2013
Creator: Torget, Andrew J., 1978-
Item Type: Dataset
Partner: UNT Libraries

DataRes Project Institution Policy Scan Data

Description: Dataset from the DataRes Project indicating the name of the institutions in the study, funding awarded by the National Science Foundation (NSF) and the National Institute of Health (NIH) during the 2010-2011 fiscal year, whether institutions have a Data Management Policy, and the URL is a policy exists.
Date: 2011-10/2013-09
Creator: Keralis, Spencer D. C.; Stark, Shannon; Najmi, Anjum; Freese, Ephraim & Ugartechea, Monica
Item Type: Dataset
Partner: UNT Libraries

"Nelson Mandela" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the death of Nelson Mandela on December 5, 2013 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 10,678,479 Tweets make up the combined dataset.
Date: December 15, 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

Data Annex to the United Nations Truth Commission on the civil war in El Salvador from 1979--1991 (digitized text)

Description: This dataset contains statistical information transcribed from the supplementary documentation of a United Nations (UN) report compiled by The Commission on the Truth for El Salvador (La Comision de la Verdad para El Salvador). It includes information about approximately 20,000 civilian/noncombatant victims of the civil war in El Salvador (from 1979 to 1991) taken from interviews of those who survived or knew/knew of those who were victims.
Date: October 2012
Creator: Mason, T. David; Hamner, Jesse & Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP031]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP008]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP034]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP014]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP019]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP009]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP002]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP020]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP024]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP007]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP001]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP028]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP004]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP029]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries

[U.S. Patent OCR Files: Disk USP022]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset
Partner: UNT Libraries