Search Results

2015 FIFA Corruption Scandal Twitter Dataset

Description: This dataset consists of tweets related to the 2015 FIFA corruption scandal. It was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 8,615,937 Tweets make up the combined dataset.
Date: 2015-05-21/2015-06-05
Creator: Phillips, Mark Edward

2016 Democratic National Convention in Philadelphia Twitter Dataset

Description: This dataset consists of tweets related to the 2016 Democratic National Convention in Philadelphia, Pennsylvania, which took place on July 25–28, 2016. It was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward

2018 Texas Senate Debate Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz. This dataset contains Tweets captured around their first debate on September 21, 2018. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward

3D Printable Punctuation and Spaces Type Setting Kit

Description: Individual 3D dataset files for punctuation and typesetting spaces. Punctuation pieces include ampersand, apostrophe, colon, comma, double parentheses, forward slash, period, question mark, and semicolon. Space pieces include three em, four em, five em, em quad, and en quad.
Date: January 21, 2021
Creator: Strait, Bob

[Age of the UNT Libraries Collection Dataset, 2013]

Description: Dataset generated for the University of North Texas Libraries collection tabulating the number of items published by decade within each subject area.
Date: December 2013
Creator: University of North Texas. Libraries.

Badlands National Park Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the Badlands National Park (BadlandsNPS) account's tweets about climate change and the Trump administration. The data was collected from a few days before to a few days after the phenomenon on Twitter. This dataset was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward

Coda Archival Digital Repository Dataset

Description: This dataset contains information extracted from the UNT Libraries' Coda Digital Repository, including the number of files, size, and ingest date of digital objects added to that system. It can be used to analyze and investigate the growth and makeup of digital repositories.
Date: April 1, 2014
Creator: Phillips, Mark Edward & Ko, Lauren

Congressional Globe OCR Dataset

Description: Dataset of OCR text from the Congressional Globe collection in the UNT Digital Library. In all there are 112 volumes and 104,615 pages of text in this dataset.
Date: April 6, 2015
Creator: Phillips, Mark Edward

Corpus del Español: Web/Dialects

Description: Dataset of words collected from text available on the Internet from twenty-one different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of Contemporary American English (1990-2012)

Description: Dataset of American English words collected from spoken language, fiction, popular magazines, newspapers, and academic texts; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of Contemporary American English (2012-2015 update)

Description: Dataset of American English words collected from spoken language, fiction, popular magazines, newspapers, and academic texts; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of Contemporary American English (2020 update)

Description: Dataset of American English words collected from spoken language, fiction, popular magazines, newspapers, and academic texts; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: April 2020
Creator: Davies, Mark

Corpus of Global Web-Based English (GloWbE)

Description: Dataset of words collected from text available on the Internet from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of News on the Web (NOW) - April 2017

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of News on the Web (NOW) - April 2018

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: April 2018
Creator: Davies, Mark

Corpus of News on the Web (NOW) - August 2017

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of News on the Web (NOW) - December 2016

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Corpus of News on the Web (NOW) - December 2017

Description: Dataset of words collected from newspapers and magazines from twenty different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark