316 Matching Results

Search Results

Extended Date/Time Format (EDTF) Dates Research Datasets

Description: Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward
Partner: UNT Libraries

"Nelson Mandela" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the death of Nelson Mandela on December 5, 2013, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 10,678,479 Tweets make up the combined dataset.
Date: December 15, 2013
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Hurricane Harvey Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Badlands National Park Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the Badlands National Park (BadlandsNPS) account's tweets about climate change and the Trump administration. The collection covers a few days before and after the phenomenon on Twitter. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward
Partner: UNT Libraries

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

Description: This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and #ISupportKaepernickBecause. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward
Partner: UNT Libraries

2016 Democratic National Convention in Philadelphia Twitter Dataset

Description: This dataset comprises Tweets related to the 2016 Democratic National Convention in Philadelphia, Pennsylvania, which took place on July 25–28, 2016. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Dallas Police Shooting Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7, 2016, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Legacy TRAIL Content Conversion Plan: Blue Angel Technology Crawls

Description: This report was prepared for the Texas State Library and Archives Commission (TSLAC). It outlines the current state of and future conversion roadmap for the content captured by Blue Angel Technologies from 2002 to 2006 for TSLAC as part of the Texas Resource and Information Locator Service (TRAIL). The document gives a history of the data stored at the University of North Texas Libraries and information about the makeup of the collection as it exists in 2011, including the physical layout on disk, number of files, and MIME types. Finally, it outlines a conversion path from the current file-based organization system to a standard archival data format for inclusion in a future TRAIL discovery interface.
Date: January 31, 2011
Creator: Phillips, Mark Edward
Partner: UNT Libraries

End of Term 2008 Presidential Web Archive: PDF Content Analysis

Description: This presentation discusses the End of Term 2008 Presidential Web Archive. The University of North Texas (UNT) Libraries collaborated with members of the International Internet Preservation Consortium (IIPC) on the End of Term 2008 Presidential Web Harvest from October 2008 to February 2009. The project team archived 160,211,356 URIs during this collaboration, which became a research dataset for an IMLS-funded grant to investigate collection development using web archives. The project team analyzed the 10,318,073 PDFs and developed a retrieval and exploration system for collection developers interested in acquiring and developing born-digital collections from the End of Term Web Archive.
Date: December 5, 2012
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Open Access at the UNT Libraries

Description: This presentation discusses the University of North Texas (UNT) Libraries' Digital Library collection. It showcases the collections included in the UNT Digital Library, the statistics of use, and highlights the UNT Scholarly Works institutional repository.
Date: May 20, 2011
Creator: Phillips, Mark Edward
Partner: UNT Libraries

"Stand With Wendy" Twitter Dataset

Description: This dataset contains Twitter JSON data for several Twitter search queries collected the week following the filibuster by Wendy Davis in the Texas Senate related to Senate Bill 5, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 560,954 Tweets make up the combined dataset.
Date: 2013-06-25/2013-07-03
Creator: Phillips, Mark Edward
Partner: UNT Libraries

UNT Libraries Edit Event Dataset 2014

Description: Dataset containing metadata edit events for the UNT Libraries Digital Collections from January 1, 2014 until December 31, 2014. There are a total of 94,222 samples in the dataset from 193 different metadata editors.
Date: February 2015
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Web Archiving at UNT and End of Term Harvest 2008

Description: This presentation illustrates the collaborative project between the University of North Texas (UNT) and the United States Government Printing Office as part of the Federal Depository Library Program. The CyberCemetery was created as an online archive of government websites that have ceased operation. The presentation also discusses the UNT collaboration with the United States Government Printing Office, the Library of Congress, the Internet Archive, and the California Digital Library in an End of Term Harvest project in 2008 to archive the Presidential campaign websites before they ceased to exist.
Date: November 7, 2008
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Infrastructure for the UNT Digital Library

Description: This presentation discusses the University of North Texas Libraries' Digital Projects Unit workflows, how they overcame challenges, and how the use of established models and standards helped them find solutions to the workflow. It presents the editing system created for the Digital Projects Unit, the organizational structure for each object, and identifiers.
Date: August 2, 2010
Creator: Phillips, Mark Edward
Partner: UNT Libraries

Metadata to fit your needs... How much is too much?

Description: This presentation briefly introduces the University of North Texas (UNT) Libraries and their mission. It explains the structure of the Digital Projects Unit, which includes the Digital Library and The Portal to Texas History, and discusses the unit's metadata structure and its role in Digital Projects.
Date: March 16, 2009
Creator: Phillips, Mark Edward
Partner: UNT Libraries