Search Results

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and #ISupportKaepernickBecause. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Labeled PDF Dataset from End of Term (EOT) 2008 Web Archive

Description: This dataset contains a random sample of 2000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples were categorized as being of interest for possible inclusion in the Technical Report Archive and Image Library (TRAIL). Each PDF has been sorted into one of two categories, Technical_Report or Not_Technical_Report.
Date: July 2018
Creator: Kirkwood, Patricia; Phillips, Mark Edward & Caldwell, Christopher
Partner: UNT Libraries

Labeled PDF Dataset from UNT.edu

Description: This dataset contains a random sample of 2000 PDF documents from the Spring 2017 Web Archive of the unt.edu domain (https://digital.library.unt.edu/ark:/67531/metadc993363/) that have been sorted into two categories, ForRepo and NotForRepo.
Date: November 15, 2017
Creator: Andrews, Pamela & Phillips, Mark Edward
Partner: UNT Libraries

"Nelson Mandela" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the death of Nelson Mandela on December 5, 2013 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 10,678,479 Tweets make up the combined dataset.
Date: December 15, 2013
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Notre Dame Cathedral Fire Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the fire at Notre Dame Cathedral in Paris, France. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,046,185 Tweets and 163,055 media files make up the combined dataset.
Date: 2019-04-08/2019-04-29
Creator: Phillips, Mark Edward
Partner: UNT Libraries

One Million Pages of Texas Newspapers: Dataset

Description: This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally, the 3,349,156 item uses associated with this dataset as of April 7, 2013 are included.
Date: April 7, 2013
Creator: Phillips, Mark Edward & Hicks, William
Partner: UNT Libraries

Political Science Curriculum Map

Description: This dataset provides a data analysis of how student learning objectives from PSCI syllabi map to threshold concepts from the ACRL Framework for Information Literacy for Higher Education (2016) and the AAC&U Information Literacy Value Rubric (2013). The data includes non-core courses offered from the Fall 2017 semester to the Spring 2020 semester. This data analysis is conducted every three years. This curriculum map excludes core courses, as they were previously examined in the UNT Lib…
Date: May 11, 2020
Creator: Henson, Brea
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Abilene

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Abilene, Texas, from the years 1888 to 1923. Titles included in this dataset include: Abilene Daily Reporter, Abilene Morning Reporter, Abilene Semi-Weekly Farm Reporter, Abilene Semi-Weekly Reporter, Abilene Weekly Reporter, The Abilene Reporter, The Abilene Semi-Weekly Reporter, and the Abilene Weekly Reporter. In all there are 7,208 issues comprised of 62,871 pages…
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Brenham

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Brenham, Texas, from the years 1876 to 1923. Titles included in this dataset include: Brenham Banner, Brenham Daily Banner, Brenham Daily Banner-Press, Brenham Evening Press, Brenham Weekly Banner, Brenham Weekly Banner-Press, and The Daily Banner. In all there are 10,720 issues comprised of 50,368 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Bryan

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Bryan, Texas, from the years 1883 to 1922. Titles included in this dataset include: Bryan Daily Eagle, Bryan Daily Eagle and Pilot, Bryan Morning Eagle, Bryan Morning Eagle and Pilot, The Brazos Weekly Pilot, The Bryan Daily Eagle, The Bryan Eagle, and The Bryan Weekly Eagle and Pilot. In all there are 5,843 issues comprised of 27,360 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Denton

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Denton, Texas, from the years 1892 to 1911. Titles included in this dataset include: Denton County News, Denton County Record and Chronicle, Denton Evening News, Legal Tender, Record and Chronicle, The Denton County Record, and The Denton Monitor. In all there are 690 issues comprised of 4,686 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: El Paso

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from El Paso, Texas, from the years 1881 to 1921. Titles included in this dataset include: El Paso Daily Herald, El Paso Daily Times, El Paso Herald, El Paso International Daily Times, El Paso Morning Times, El Paso Sunday Times, El Paso Times, The El Paso Daily Times, and The El Paso Time. In all there are 17,104 issues comprised of 177,640 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Fort Worth

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Fort Worth, Texas, from the years 1883 to 1896. Titles included in this dataset include: Fort Worth Daily Gazette, Fort Worth Gazette, and Fort Worth Weekly Gazette. In all there are 4,146 issues comprised of 36,199 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Gainesville

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Gainesville, Texas, from the years 1888 to 1897. Titles included in this dataset include: The Daily Hesperian and The Gainesville Daily Hesperian. In all there are 2,286 issues comprised of 9,359 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries