The UNT College of Arts and Sciences educates students in traditional liberal arts, performing arts, sciences, professional, and technical academic programs. In addition to its departments, the college includes academic centers, institutes, programs, and offices providing diverse courses of study.
White paper on combining text-mining and geo-visualization to unlock the research potential of historical newspapers.
Abstract: In September 2010, the University of North Texas (in partnership with Stanford University) was awarded a National Endowment for the Humanities Level II Digital Humanities Start-up Grant (Award #HD-51188-10) to develop a series of experimental models for combining the possibilities of text-mining with geospatial mapping in order to unlock the research potential of large-scale collections of historical newspapers. Using a sample of approximately 230,000 pages of historical newspapers from the 'Chronicling America' digital newspaper database, we developed two interactive visualizations of the language content of these massive collections of historical documents as they spread across both time and space: one measuring the quantity and quality of the digitized content, and a second measuring several of the most widely used large-scale language pattern metrics common in natural language processing work. This white paper documents those experiments and their outcomes, as well as our recommendations for future work.
Torget, Andrew J., 1978-; Mihalcea, Rada, 1974-; Christensen, Jon & McGhee, Geoff. Mapping Texts: Combining Text-Mining and Geo-Visualization To Unlock The Research Potential of Historical Newspapers,
(accessed March 29, 2023),
University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu;
crediting UNT College of Arts and Sciences.