UNT Libraries - Browse

ABOUT BROWSE FEED

Portal to Texas History Newspaper OCR Text Dataset: McKinney

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from McKinney Texas from the years 1880 to 1936. Titles included in this dataset include: Collin County Mercury, McKinney Weekly Democrat-Gazette, The Daily Courier, The Daily Gazette, The Democrat, The Democrat-Gazette, The Lion Roar, The McKinney Advocate, The McKinney Examiner, The McKinney Gazette, The Semi-Weekly Courier, The Southern Jerseyite, and The Weekly Democrat-Gazette. In all there are 1,568 issues comprised of 12,975 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Item Type: Dataset

Portal to Texas History Newspaper OCR Text Dataset: San Antonio

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from San Antonio Texas from the years 1874 to 1920. Titles included in this dataset include: San Antonio Daily Express, San Antonio Daily Light, San Antonio Express, The Daily Express, and The San Antonio Light. In all there are 6,866 issues comprised of 130,726 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Item Type: Dataset

Portal to Texas History Newspaper OCR Text Dataset: Temple

Description: Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Temple Texas from the years 1907 to 1922. Titles included in this dataset include: Temple Daily Telegram. In all there are 4,627 issues comprised of 44,633 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
Item Type: Dataset

Quality Assurance Practices in Web Archiving [Dataset]

Description: This dataset contains the results of a survey of quality assurance practices within the field of web archiving and its practitioners. To understand current QA practices, the authors surveyed institutions engaged in web archiving, which included national libraries, colleges and universities, and museums and art libraries. The survey was administered online. It includes the completed responses of 54 participants. The data has been anonymized for privacy reasons. This dataset was used in the "Current Quality Assurance Practices in Web Archiving" paper, available from the UNT Digital Library.
Date: December 2014
Creator: Reyes Ayala, Brenda; Phillips, Mark Edward & Ko, Lauren
Item Type: Dataset

Restricted University of North Texas Electronic Theses and Dissertations

Description: This dataset contains responses to a survey questionnaire distributed by the University of North Texas (UNT) Libraries asking 125 authors of electronic theses and dissertations (ETDs) whether they agree to change the existing restricted permission status on their ETDs.
Date: February 24, 2014
Creator: Alemneh, Daniel Gelaw
Item Type: Dataset

"Stand With Wendy" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries collected the week following the filibuster by Wendy Davis in the Texas Senate related to Senate Bill 5, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 560,954 Tweets make up the combined dataset.
Date: 2013-06-25/2013-07-03
Creator: Phillips, Mark Edward
Item Type: Dataset

Succession Planning Through Mentoring in the Library Survey

Description: Survey Instrument used for study, "Succession Planning Through Mentoring in the Library." The purpose of this study is to determine if libraries are incorporating succession planning in their hiring, recruitment and retention plans and if there is perceived value among librarians in incorporating mentoring in their succession plans.
Date: January 2016
Creator: Leuzinger, Julie & Rowe, Jennifer
Item Type: Text

Texas Digital Newspaper Program Issue Dataset for IFLA/Rootstech Analysis

Description: This dataset contains the descriptive metadata harvested from the Texas Digital Newspaper Program collection on The Portal to Texas History and is accompanied by a dataset derived from the harvested metadata. This dataset was used for an IFLA Newspaper Section and Rootstech presentation.
Date: January 16, 2014
Creator: Phillips, Mark Edward & Krahmer, Ana
Item Type: Dataset

Texas Newspapers Natural Language Processing

Description: This dataset includes data on natural language processing from the Texas Newspapers Project. The dataset includes word counts, name entity recognition results, and topic models.
Date: April 7, 2013
Creator: Torget, Andrew J., 1978-
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP001]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP002]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP003]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP004]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP005]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP006]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP007]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP008]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP009]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP010]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP011]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset

[U.S. Patent OCR Files: Disk USP012]

Description: This dataset contains the compiled Optical Character Recognition (OCR) text files for the content of patent grants issued by the United States Patent Office from ## to ## (non-inclusive).
Date: 2013
Creator: Phillips, Mark Edward
Item Type: Dataset