Search Results

Labeled PDF Dataset from UNT.edu

Description: This dataset contains a random sample of 2000 PDF documents from the Spring 2017 Web Archive of the unt.edu domain (https://digital.library.unt.edu/ark:/67531/metadc993363/) that have been sorted into two categories, ForRepo and NotForRepo.
Date: November 15, 2017
Creator: Andrews, Pamela & Phillips, Mark Edward

Water Quality Corridor Management for Restoration (WQCM-R) Modeling Dataset

Description: The dataset was developed to support research intended to develop a spatially-explicit model that prioritizes riparian areas in terms of potential for ecosystem restoration specifically to improve water quality downstream of the riparian area, and ultimately improve drinking water quality. The model was developed and then tested on the Lewisville Lake watershed (north central Texas, just north of Dallas, Texas, USA). The dataset contains environmental data for 90 sub-watersheds that form the ov… more
Date: June 10, 2019
Creator: Atkinson, Samuel F.

[Response Data: Survey of Benchmarks in Metadata Quality]

Description: Complete, anonymized dataset of responses to the Survey of Benchmarks in Metadata Quality. Date, time, IP addresses, and geographic data have been omitted. Responses that included project, organization, and/or repository names were removed from this data, as well as potentially identifying names, acronyms, and/or links.
Date: July 2019
Creator: Digital Library Federation. Assessment Interest Group. Metadata Working Group. Benchmarks Sub-Group.

Political Science Curriculum Map

Description: This dataset provides a data analysis of how student learning objectives from PSCI syllabi map to threshold concepts from the ACRL Framework for Information Literacy for Higher Education (2016) and the AAC&U Information Literacy Value Rubric (2013). The data includes non-core courses offered from the Fall 2017 semester to the Spring 2020 semester. This data analysis is conducted every three years. This curriculum map excludes core courses, as they were previously examined in the UNT Lib… more
Date: May 11, 2020
Creator: Henson, Brea

DataRes Project Institution Policy Scan Data

Description: Dataset from the DataRes Project indicating the names of the institutions in the study, funding awarded by the National Science Foundation (NSF) and the National Institutes of Health (NIH) during the 2010-2011 fiscal year, whether institutions have a Data Management Policy, and the URL if a policy exists.
Date: 2011-10/2013-09
Creator: Keralis, Spencer D. C.; Stark, Shannon; Najmi, Anjum; Freese, Ephraim & Ugartechea, Monica

Labeled PDF Dataset from End of Term (EOT) 2008 Web Archive

Description: This dataset contains a random sample of 2000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples were categorized as being of interest for possible inclusion in the Technical Report Archive and Image Library (TRAIL). Each PDF has been sorted into two categories, Technical_Report and Not_Technical_Report.
Date: July 2018
Creator: Kirkwood, Patricia; Phillips, Mark Edward & Caldwell, Christopher

Data Annex to the United Nations Truth Commission on the civil war in El Salvador from 1979--1991 (digitized text)

Description: This dataset contains statistical information transcribed from the supplementary documentation of a United Nations (UN) report compiled by The Commission on the Truth for El Salvador (La Comision de la Verdad para El Salvador). It includes information about approximately 20,000 civilian/noncombatant victims of the civil war in El Salvador (from 1979 to 1991) taken from interviews of those who survived or knew/knew of those who were victims.
Date: October 2012
Creator: Mason, T. David; Hamner, Jesse & Phillips, Mark Edward

2015 FIFA Corruption Scandal Twitter Dataset

Description: This dataset is comprised of tweets that are related to the 2015 FIFA corruption scandal. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,615,937 Tweets make up the combined dataset.
Date: 2015-05-21/2015-06-05
Creator: Phillips, Mark Edward

2016 Democratic National Convention in Philadelphia Twitter Dataset

Description: This dataset is comprised of tweets that are related to the 2016 Democratic National Convention in Philadelphia, Pennsylvania that took place on July 25–28, 2016. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward

2018 Texas Senate Debate Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz. This dataset contains Tweets captured around their first debate on September 21, 2018. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward

Badlands National Park Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the Badlands National Park (BadlandsNPS) account's tweets about climate change and the Trump administration. This dataset was collected from a few days before to a few days after the phenomenon on Twitter. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward

Dallas Police Shooting Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7th, 2016, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward

#DescribeTrumpWithOneWord Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtag #DescribeTrumpWithOneWord. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2017-09-02/2017-09-22
Creator: Phillips, Mark Edward

#DiaperDon Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtag #DiaperDon. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 866,987 Tweets make up the combined dataset.
Date: 2020-11-18/2020-12-01
Creator: Phillips, Mark Edward

ERCOT/2021 Texas Power Crisis Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the Electric Reliability Council of Texas (ERCOT) during the 2021 Texas power crisis from February 10th through February 27th, 2021. The dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 612,082 Tweets make up the combined dataset.
Date: 2021-02-09/2021-02-24
Creator: Phillips, Mark Edward

Extended Date/Time Format (EDTF) Dates Research Datasets

Description: Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward