Paper for the 2011 ACL Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities. This paper discusses topic modeling on historical newspapers.
The UNT College of Arts and Sciences educates students in the traditional liberal arts, performing arts, and sciences, as well as in professional and technical academic programs. In addition to its departments, the college includes academic centers, institutes, programs, and offices providing diverse courses of study.
Physical Description
9 p.
Notes
Abstract: In this paper, we explore the task of automatic text processing applied to collections of historical newspapers, with the aim of assisting historical research. In particular, in this first stage of the project, we experiment with the use of topical models as a means to identify potential issues of interest for historians.
Association for Computational Linguistics (ACL) Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LATECH), 2011, Portland, Oregon, United States
This paper is part of the following collection of related materials.
UNT Scholarly Works
Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.