Search Results

Portal to Texas History Newspaper OCR Text Dataset: Galveston

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Galveston Texas from the years 1849 to 1897. Titles included in this dataset include: Galveston Weekly News, and The Galveston Daily News. In all there are 8,136 issues comprised of 56,953 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: San Antonio

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from San Antonio Texas from the years 1874 to 1920. Titles included in this dataset include: San Antonio Daily Express, San Antonio Daily Light, San Antonio Express, The Daily Express, and The San Antonio Light. In all there are 6,866 issues comprised of 130,726 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Brenham

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Brenham Texas from the years 1876 to 1923. Titles included in this dataset include: Brenham Banner, Brenham Daily Banner, Brenham Daily Banner-Press, Brenham Evening Press, Brenham Weekly Banner, Brenham WEekly Banner-Press, and The Daily Banner. In all there are 10,720 issues comprised of 50,368 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Bryan

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Bryan Texas from the years 1883 to 1922. Titles included in this dataset include: Bryan Daily Eagle, Bryan Daily Eagle and Pilot, Bryan Morning Eagle, Bryan Morning Eagle and Pilot, The Brazos Weekly Pilot, The Bryan Daily Eagle, The Bryan Eagle, and The Bryan Weekly Eagle and Pilot . In all there are 5,843 issues comprised of 27,360 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Houston

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Houston, Texas from the years 1893 to 1924. Titles included in this dataset include: The Houston Daily Post and The Houston Post. In all there are 9,855 issues comprised of 184,900 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: Fort Worth

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Fort Worth Texas from the years 1883 to 1896. Titles included in this dataset include: Fort Worth Daily Gazette, Fort Worth Gazette, and Fort Worth Weekly Gazette. In all there are 4,146 issues comprised of 36,199 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Portal to Texas History Newspaper OCR Text Dataset: El Paso

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from El Paso Texas from the years 1881 to 1921. Titles included in this dataset include: El Paso Daily Herald, El Paso Daily Times, El Paso Herald, El Paso International Daily Times, El Paso Morning Times, El Paso Sunday Times, El Paso Times, The El Paso Daily Times, and The El Paso Time. In all there are 17,104 issues comprised of 177,640 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

UNT Libraries Edit Event Dataset 2014

Description: Dataset containing metadata edit events for the UNT Libraries Digital Collections from January 1, 2014 until December 31, 2014. There are a total of 94,222 samples in the dataset from 193 different metadata editors.
Date: February 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

UNT Scholarly Works Record Discoveries Dataset

Description: Dataset containing four tab-delimited data files and associated code for analysis of data. Dataset represents record discoveries identified from Apache log files created by the UNT Digital Library platform. Record discoveries are for the UNT Scholarly Works Repository from May 2014 to January 2015.
Date: January 24, 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Quality Assurance Practices in Web Archiving [Dataset]

Description: This dataset contains the results of a survey of quality assurance practices within the field of web archiving and its practitioners. To understand current QA practices, the authors surveyed institutions engaged in web archiving, which included national libraries, colleges and universities, and museums and art libraries. The survey was administered online. It includes the completed responses of 54 participants. The data has been anonymized for privacy reasons. This dataset was used in the "Curr… more
Date: December 2014
Creator: Reyes Ayala, Brenda; Phillips, Mark Edward & Ko, Lauren
Partner: UNT Libraries

UNT Digital Library Value Study Data

Description: This dataset contains collected responses to a study of the UNT Digital Library collections, including the UNT Scholarly Works institutional repository and The Portal to Texas History. It contains responses to 26 survey questions regarding the use and perceived value of the UNT Digital Library collections and 13 demographic questions.
Date: October 2013
Creator: Waugh, Laura; Murray, Kathleen R.; Phillips, Mark Edward & Belden, Dreanna
Partner: UNT Libraries

"Yes All Women" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected around the #YesAllWomen Twitter "conversation" between May 25, 2014 and June 8, 2014 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 2,805,763 Tweets and 34,532 images make up the combined dataset.
Date: June 8, 2014
Creator: Phillips, Mark Edward
Partner: UNT Libraries

UNT Libraries Metadata Edit Dataset

Description: This dataset contains data samples from metadata records extracted from the UNT Libraries' Digital Collections. It contains one sample per metadata record version in the system with aggregate counts of fields and also hash values of an element as well. Data was collected in March 2014 with dates from May 19, 2004 to February 4, 2014.
Date: April 2014
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Coda Archival Digital Repository Dataset

Description: This dataset contains information extracted from the UNT Libraries' Coda Digital Repository. It contains information related to number of files, size, and ingest date of digital objects added to that system. It can be used for analysis and investigation of the growth and makeup of digital repositories.
Date: April 1, 2014
Creator: Phillips, Mark Edward & Ko, Lauren
Partner: UNT Libraries

Texas Digital Newspaper Program Issue Dataset for IFLA/Rootstech Analysis

Description: This dataset contains the descriptive metadata harvested from the Texas Digital Newspaper Program collection on The Portal to Texas History and is accompanied by a dataset derived from the harvested metadata. This dataset was used for an IFLA Newspaper Section and Rootstech presentation.
Date: January 16, 2014
Creator: Phillips, Mark Edward & Krahmer, Ana
Partner: UNT Libraries
Back to Top of Screen