UNT Data Repository - 83 Matching Results

Search Results

2015 FIFA Corruption Scandal Twitter Dataset

Description: This dataset is comprised of tweets that are related to the 2015 FIFA corruption scandal. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2015-05-21/2015-06-05
Creator: Phillips, Mark Edward
Partner: UNT Libraries

2016 Democratic National Convention in Philadelphia Twitter Dataset

Description: This dataset is comprised of tweets that are related to the 2016 Democratic National Committee meeting in Philadelphia, Pennsylvania that took place on July 25–28, 2016. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Badlands National Park Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the Badlands National Park (BadlandsNPS) user's tweets related to climate change and the Trump administration. This dataset was collected a few days before and following the phenomenon on Twitter. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Coda Archival Digital Repository Dataset

Description: This dataset contains information extracted from the UNT Libraries' Coda Digital Repository. It contains information related to number of files, size, and ingest date of digital objects added to that system. It can be used for analysis and investigation of the growth and makeup of digital repositories.
Date: April 1, 2014
Creator: Phillips, Mark Edward & Ko, Lauren
Partner: UNT Libraries

Dallas Police Shooting Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7th 2017, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Data Annex to the United Nations Truth Commission on the civil war in El Salvador from 1979--1991 (digitized text)

Description: This dataset contains statistical information transcribed from the supplementary documentation of a United Nations (UN) report compiled by The Commission on the Truth for El Salvador (La Comision de la Verdad para El Salvador). It includes information about approximately 20,000 civilian/noncombatant victims of the civil war in El Salvador (from 1979 to 1991) taken from interviews of those who survived or knew/knew of those who were victims.
Date: October 2012
Creator: Mason, T. David; Hamner, Jesse & Phillips, Mark Edward
Partner: UNT Libraries

DataRes Project Institution Policy Scan Data

Description: Dataset from the DataRes Project indicating the name of the institutions in the study, funding awarded by the National Science Foundation (NSF) and the National Institute of Health (NIH) during the 2010-2011 fiscal year, whether institutions have a Data Management Policy, and the URL is a policy exists.
Date: 2011-10/2013-09
Creator: Keralis, Spencer D. C.; Stark, Shannon; Najmi, Anjum; Freese, Ephraim & Ugartechea, Monica
Partner: UNT Libraries

#DescribeTrumpWithOneWord Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtag #DescribeTrumpWithOneWord. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2017-09-02/2017-09-22
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Extended Date/Time Format (EDTF) Dates Research Datasets

Description: Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Hurricane Harvey Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Impact of Library Instruction

Description: This dataset contains anonymized data for students who were enrolled in English 1320 at the University of North Texas between the Fall 2012 semester and the Spring 2016 semester.
Date: July 2017
Creator: Hargis, Carol; Leuzinger, Julie & Rowe, Jennifer
Partner: UNT Libraries

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and ISupportKaepernickBecause This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward
Partner: UNT Libraries

"Nelson Mandela" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the death of Nelson Mandela on December 5, 2013 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 10,678,479 Tweets make up the combined dataset.
Date: December 15, 2013
Creator: Phillips, Mark Edward
Partner: UNT Libraries

One Million Pages of Texas Newspapers: Dataset

Description: This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally the 3,349,156 item uses associated with this dataset as of April 7, 2013 are included.
Date: April 7, 2013
Creator: Phillips, Mark Edward & Hicks, William
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Abilene

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Abilene Texas from the years 1888 to 1923. Titles included in this dataset include: Abilene Daily Reporter, Abilene Morning Reporter, Abilene Semi-Weekly Farm Reporter, Abilene Semi-Weekly Reporter, Abilene Weekly Reporter, The Abilene Reporter, The Abilene Semi-Weekly Reporter, and the Abilene Weekly Reporter. In all there are 7,208 issues comprised of 62,871 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Brenham

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Brenham Texas from the years 1876 to 1923. Titles included in this dataset include: Brenham Banner, Brenham Daily Banner, Brenham Daily Banner-Press, Brenham Evening Press, Brenham Weekly Banner, Brenham WEekly Banner-Press, and The Daily Banner. In all there are 10,720 issues comprised of 50,368 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Bryan

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Bryan Texas from the years 1883 to 1922. Titles included in this dataset include: Bryan Daily Eagle, Bryan Daily Eagle and Pilot, Bryan Morning Eagle, Bryan Morning Eagle and Pilot, The Brazos Weekly Pilot, The Bryan Daily Eagle, The Bryan Eagle, and The Bryan Weekly Eagle and Pilot . In all there are 5,843 issues comprised of 27,360 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Denton

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Denton Texas from the years 1892 to 1911. Titles included in this dataset include: Denton County News, Denton County Record and Chronicle, Denton Evening News, Legal Tender, Record and Chronicle, The Denton County Record, and The Denton Monitor. In all there are 690 issues comprised of 4,686 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: El Paso

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from El Paso Texas from the years 1881 to 1921. Titles included in this dataset include: El Paso Daily Herald, El Paso Daily Times, El Paso Herald, El Paso International Daily Times, El Paso Morning Times, El Paso Sunday Times, El Paso Times, The El Paso Daily Times, and The El Paso Time. In all there are 17,104 issues comprised of 177,640 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries