Search Results

[U.S. Patent OCR Files: Disk USP031]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

[U.S. Patent OCR Files: Disk USP033]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

[U.S. Patent OCR Files: Disk USP013]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

[U.S. Patent OCR Files: Disk USP009]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

[U.S. Patent OCR Files: Disk USP028]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

[U.S. Patent OCR Files: Disk USP030]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

[U.S. Patent OCR Files: Disk USP008]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward

UNT Scholarly Works Record Discoveries Dataset

Description: Dataset containing four tab-delimited data files and associated code for analysis of data. Dataset represents record discoveries identified from Apache log files created by the UNT Digital Library platform. Record discoveries are for the UNT Scholarly Works Repository from May 2014 to January 2015.
Date: January 24, 2015
Creator: Phillips, Mark Edward

Corpus del Español: Web/Dialects

Description: Dataset of words collected from text available on the Internet from twenty-one different countries; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark

Notre Dame Cathedral Fire Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the fire at Notre Dame Cathedral in Paris, France. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,046,185 Tweets and 163,055 media files make up the combined dataset.
Date: 2019-04-08/2019-04-29
Creator: Phillips, Mark Edward

ETS Corpus of Non-Native Written English

Description: The ETS Corpus of Non-Native Written English was developed by Educational Testing Service and comprises 12,100 English essays written by speakers of 11 non-English native languages as part of an international test of academic English proficiency, TOEFL (Test of English as a Foreign Language). The test includes reading, writing, listening, and speaking sections and is delivered by computer in a secure test center. This release contains 1,100 essays for each of the 11 native languages sampled f…
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: June 16, 2014
Creator: Blanchard, Daniel; Tetreault, Joel; Higgins, Derrick; Cahill, Aoife & Chodorow, Martin

Restricted University of North Texas Electronic Theses and Dissertations

Description: This dataset contains responses to a survey questionnaire distributed by the University of North Texas (UNT) Libraries asking 125 authors of electronic theses and dissertations (ETDs) whether they agree to change the existing restricted permission status on their ETDs.
Date: February 24, 2014
Creator: Alemneh, Daniel Gelaw

UNT Digital Library Value Study Data

Description: This dataset contains collected responses to a study of the UNT Digital Library collections, including the UNT Scholarly Works institutional repository and The Portal to Texas History. It contains responses to 26 survey questions regarding the use and perceived value of the UNT Digital Library collections and 13 demographic questions.
Date: October 2013
Creator: Waugh, Laura; Murray, Kathleen R.; Phillips, Mark Edward & Belden, Dreanna

UNT Libraries Metadata Edit Dataset

Description: This dataset contains data samples from metadata records extracted from the UNT Libraries' Digital Collections. It contains one sample per metadata record version in the system, with aggregate counts of fields and hash values of an element. Data was collected in March 2014, with dates ranging from May 19, 2004 to February 4, 2014.
Date: April 2014
Creator: Phillips, Mark Edward

"Yes All Women" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected around the #YesAllWomen Twitter "conversation" between May 25, 2014 and June 8, 2014 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 2,805,763 Tweets and 34,532 images make up the combined dataset.
Date: June 8, 2014
Creator: Phillips, Mark Edward

UNT Libraries Edit Event Dataset 2014

Description: Dataset containing metadata edit events for the UNT Libraries Digital Collections from January 1, 2014 until December 31, 2014. There are a total of 94,222 samples in the dataset from 193 different metadata editors.
Date: February 2015
Creator: Phillips, Mark Edward

Corpus of Contemporary American English (2012-2015 update)

Description: Dataset of American English words collected from spoken language, fiction, popular magazines, newspapers, and academic texts; the individual files include concordance information, parts-of-speech, and other arrangements of the data.
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: unknown
Creator: Davies, Mark