Mapping Texts: Combining Text-Mining and Geo-Visualization To Unlock The Research Potential of Historical Newspapers Page: 9
53 p.
well as the creation and operation of this model, is also described in greater detail below). The
overarching purpose of this visualization is to provide users with the ability to survey the collected
language patterns that emanate from the newspaper collection for any particular location or time
period for the available data.
PROJECT TEAMS
Because the project required deep expertise in multiple fields, we built two project teams
that each tackled a distinct side of the project. A team based at the University of North Texas
focused on the language assessment, quantification, and overall text-mining side of the project. A
team at Stanford University worked on designing and constructing the dynamic visualizations of
those language patterns. The two teams worked in tandem, as parallel processes, to continually
tailor, adjust, and refine the work on both sides of the project as we sought to fit these two sides
together.
The University of North Texas team was headed by Andrew J. Torget, a digital historian
specializing in the American Southwest, and Rada Mihalcea, a nationally recognized computer
science expert in natural language processing. Tze-I "Elisa" Yang (a graduate student in UNT's
computer science department) took the lead in data manipulation and processing of the text-
mining efforts, while Mark Phillips (Assistant Dean for Digital Libraries at UNT) provided technical
assistance in accessing the digital newspapers.
The Stanford team was headed by Jon Christensen (Executive Director for the Bill Lane
Center for the American West) and Geoff McGhee (Creative Director for Media and
Communications at the Lane Center). Yinfeng Qin, Rio Akasaka and Jason Ningxuan Wang
Torget, Andrew J., 1978-; Mihalcea, Rada, 1974-; Christensen, Jon & McGhee, Geoff. Mapping Texts: Combining Text-Mining and Geo-Visualization To Unlock The Research Potential of Historical Newspapers, paper, 2011; (https://digital.library.unt.edu/ark:/67531/metadc83797/m1/9/: accessed April 25, 2024), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT College of Arts and Sciences.