Mapping Texts: Combining Text-Mining and Geo-Visualization To Unlock The Research Potential of Historical NewspapersMapping TextsTorget, Andrew J., 1978-perUniversity of North TexasMihalcea, Rada, 1974-perUniversity of North TexasChristensen, JonperStanford UniversityMcGhee, GeoffperStanford UniversityNational Endowment for the Humanitiesorg2011engPaper on mapping texts and combining text-mining and geo-visualization to unlock the research potential of historical newspapers.53 p.text mininggeo-visualizationnewspapershistorical documentsUNTSWUNTCASpublictext_papertextHD-51188-10HistoryComputer Science and EngineeringAbstract: In September 2010, the University of North Texas (in partnership with Stanford University) was awarded a National Endowment for the Humanities Level II Digital Humanities Start-up Grant (Award #HD-51188-10) to develop a series of experimental models for combining the possibilities of text-mining with geospatial mapping in order to unlock the research potential of large-scale collections of historical newspapers. Using a sample of approximately 230,000 pages of historical newspapers from the 'Chronicling America' digital newspaper database, we developed two interactive visualizations of the language content of these massive collections of historical documents as they spread across both time and space: one measuring the quantity and quality of the digitized content, and a second measuring several of the most widely used large-scale language pattern metrics common in natural language processing work. This white paper documents those experiments and their outcomes, as well as our recommendations for future work.
lwaugh
DC
ark:/67531/metadc83797
2012-04-27, 10:13:11
awarrenfells
2023-11-10, 12:28:21
False