Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally the 3,349,156 item uses associated with this dataset as of April 7, 2013 are included.