Search Results

[Dataset: Piece of movable type B with jet detached]

Description: 3D dataset model of a piece of movable type with jet detached for a majuscule sans serif B. The resulting 3D printed model will replicate the historical artifact from the hand press period. This dataset will print a piece of type for a majuscule modern sans-serif B without the jet attached. To use this model for instruction, you must also print the removable jet, which will insert into the piece of type to illustrate the process of jet removal. These models are for teaching purposes only and c…
Date: April 1, 2017
Creator: Jacobs, Courtney E.; McIntosh, Marcia; O'Sullivan, Kevin M. & Strait, Bob

[Dataset: Punch-B]

Description: 3D dataset model of a punch for a majuscule sans serif B. The resulting 3D printed model will replicate the historical artifact used to design and cast type during the hand press period. These models are for teaching purposes only and cannot be used to cast type using molten type metal, nor can they be used for printing. This dataset is an individual file and is part of a complete set of teaching tools.
Date: April 1, 2017
Creator: Jacobs, Courtney E.; McIntosh, Marcia; O'Sullivan, Kevin M. & Strait, Bob

[Dataset: Removable-Jet]

Description: 3D dataset model of a removable jet for a majuscule sans serif B. The resulting 3D printed model will replicate the historical artifact used to design and cast type during the hand press period. This dataset will print a single detachable jet piece. To use this model for instruction, you must also print a model of the type without jet; the removable jet inserts into the type piece to illustrate the process of jet removal. These models are for teaching purposes only and cannot be used to cast ty…
Date: April 1, 2017
Creator: Jacobs, Courtney E.; McIntosh, Marcia; O'Sullivan, Kevin M. & Strait, Bob

[Dataset: Woodcut Facsimile]

Description: 3D dataset model of a facsimile of a 16th century woodcut illustrating a typecaster at work. The model is 4 inches by 4 inches square, and should print type high: 0.918 inches. To print on a standard FDM printer, a smaller mm. nozzle is necessary. A resin print of this dataset was set in a locked form of a tabletop roller press, inked, and printed successfully.
Date: April 1, 2017
Creator: Jacobs, Courtney E.; McIntosh, Marcia; O'Sullivan, Kevin M. & Strait, Bob

#DescribeTrumpWithOneWord Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtag #DescribeTrumpWithOneWord. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2017-09-02/2017-09-22
Creator: Phillips, Mark Edward

ETS Corpus of Non-Native Written English

Description: ETS Corpus of Non-Native Written English was developed by Educational Testing Service and is comprised of 12,100 English essays written by speakers of 11 non-English native languages as part of an international test of academic English proficiency, TOEFL (Test of English as a Foreign Language). The test includes reading, writing, listening, and speaking sections and is delivered by computer in a secure test center. This release contains 1,100 essays for each of the 11 native languages sampled f…
Access: Restricted to UNT Community Members. Login required if off-campus.
Date: June 16, 2014
Creator: Blanchard, Daniel; Tetreault, Joel; Higgins, Derrick; Cahill, Aoife & Chodorow, Martin

Extended Date/Time Format (EDTF) Dates Research Datasets

Description: Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward

Gaming Census Dataset

Description: This dataset represents survey feedback gathered about games in libraries, collections, cataloging, outreach, and programming.
Date: December 3, 2018
Creator: Brannon, Sian; Robson, Diane & Dewitt-Miller, Erin

Hurricane Dorian Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to Hurricane Dorian, which is the most intense tropical cyclone on record to strike the Bahamas and is regarded as the worst natural disaster in the country's history. This dataset was created using the twarc (https://github.com/DocNow/twarc) package that makes use of Twitter's search API. A total of 3,000,553 Tweets and 84,216 media files make up the combined dataset.
Date: 2019-08-25/2019-09-14
Creator: Phillips, Mark Edward

Hurricane Florence Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to Hurricane Florence and the subsequent flooding along the Carolina coastal region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 4,971,575 Tweets and 347,205 media files make up the combined dataset.
Date: 2018-09-05/2018-10-03
Creator: Phillips, Mark Edward

Hurricane Harvey Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and #ISupportKaepernickBecause. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward

Labeled PDF Dataset from End of Term (EOT) 2008 Web Archive

Description: This dataset contains a random sample of 2000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples were categorized as being of interest for possible inclusion in the Technical Report Archive and Image Library (TRAIL). Each PDF has been sorted into two categories, Technical_Report and Not_Technical_Report.
Date: July 2018
Creator: Kirkwood, Patricia; Phillips, Mark Edward & Caldwell, Christopher

Labeled PDF Dataset from UNT.edu

Description: This dataset contains a random sample of 2000 PDF documents from the Spring 2017 Web Archive of the unt.edu domain (https://digital.library.unt.edu/ark:/67531/metadc993363/) that have been sorted into two categories, ForRepo and NotForRepo.
Date: November 15, 2017
Creator: Andrews, Pamela & Phillips, Mark Edward

Link Resolver Testing

Description: This Excel file accompanies a workshop presentation titled 'Is it really that bad? Verifying the extent of full-text linking problems'.
Date: August 9, 2012
Creator: Harker, Karen

"Nelson Mandela" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the death of Nelson Mandela on December 5, 2013 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 10,678,479 Tweets make up the combined dataset.
Date: December 15, 2013
Creator: Phillips, Mark Edward

Notre Dame Cathedral Fire Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the fire at Notre Dame Cathedral in Paris, France. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,046,185 Tweets and 163,055 media files make up the combined dataset.
Date: 2019-04-08/2019-04-29
Creator: Phillips, Mark Edward

One Million Pages of Texas Newspapers: Dataset

Description: This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally, the 3,349,156 item uses associated with this dataset as of April 7, 2013 are included.
Date: April 7, 2013
Creator: Phillips, Mark Edward & Hicks, William

Portal to Texas History Newspaper OCR Text Dataset: Abilene

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Abilene, Texas from the years 1888 to 1923. Titles included in this dataset include: Abilene Daily Reporter, Abilene Morning Reporter, Abilene Semi-Weekly Farm Reporter, Abilene Semi-Weekly Reporter, Abilene Weekly Reporter, The Abilene Reporter, The Abilene Semi-Weekly Reporter, and the Abilene Weekly Reporter. In all there are 7,208 issues comprised of 62,871 pages…
Date: November 12, 2015
Creator: Phillips, Mark Edward

Portal to Texas History Newspaper OCR Text Dataset: Brenham

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Brenham, Texas from the years 1876 to 1923. Titles included in this dataset include: Brenham Banner, Brenham Daily Banner, Brenham Daily Banner-Press, Brenham Evening Press, Brenham Weekly Banner, Brenham Weekly Banner-Press, and The Daily Banner. In all there are 10,720 issues comprised of 50,368 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward

Portal to Texas History Newspaper OCR Text Dataset: Bryan

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Bryan, Texas from the years 1883 to 1922. Titles included in this dataset include: Bryan Daily Eagle, Bryan Daily Eagle and Pilot, Bryan Morning Eagle, Bryan Morning Eagle and Pilot, The Brazos Weekly Pilot, The Bryan Daily Eagle, The Bryan Eagle, and The Bryan Weekly Eagle and Pilot. In all there are 5,843 issues comprised of 27,360 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward

Portal to Texas History Newspaper OCR Text Dataset: Denton

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Denton, Texas from the years 1892 to 1911. Titles included in this dataset include: Denton County News, Denton County Record and Chronicle, Denton Evening News, Legal Tender, Record and Chronicle, The Denton County Record, and The Denton Monitor. In all there are 690 issues comprised of 4,686 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward