Article examining how digital language archives can address challenges of legacy field materials through a case study from the Computational Resource for South Asian Languages (CoRSAL), a repository dedicated to preserving and documenting the languages of South Asia. It focuses on two CoRSAL legacy projects: the curation of Norman Zide’s extensive collection on the Munda languages of north-central India (Gutob, Gta’, Korku, and Mundari), and the classroom fieldnotes of James Matisoff on Tibeto-Burman languages. These cases illustrate the opportunities for deriving rich metadata and contextual information from legacy field materials, demonstrating how digital curation can extend the life and reach …
continued below
Situated at the intersection of people, technology, and information, the College of Information's faculty, staff and students invest in innovative research, collaborative partnerships, and student-centered education to serve a global information society. The college offers programs of study in information science, learning technologies, and linguistics.
Article examining how digital language archives can address challenges of legacy field materials through a case study from the Computational Resource for South Asian Languages (CoRSAL), a repository dedicated to preserving and documenting the languages of South Asia. It focuses on two CoRSAL legacy projects: the curation of Norman Zide’s extensive collection on the Munda languages of north-central India (Gutob, Gta’, Korku, and Mundari), and the classroom fieldnotes of James Matisoff on Tibeto-Burman languages. These cases illustrate the opportunities for deriving rich metadata and contextual information from legacy field materials, demonstrating how digital curation can extend the life and reach of fieldworkers’ original documentation. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Physical Description
3 p.
Notes
Abstract: Pre-digital to born-digital formats are vital to the preservation and transmission of linguistic and cultural knowledge. Yet, these materials present distinctive challenges: their content and formats reflect the specialized practices of the subfield of Language Documentation and Description, they were originally organized for analog rather than digital use, and they typically assumed a single user—the fieldworker. This paper examines how digital language archives can address these challenges through a case study from the Computational Resource for South Asian Languages (CoRSAL), a repository dedicated to preserving and documenting the languages of South Asia. We focus on two CoRSAL legacy projects: the curation of Norman Zide’s extensive collection on the Munda languages of north-central India (Gutob, Gta’, Korku, and Mundari), and the classroom fieldnotes of James Matisoff on Tibeto-Burman languages. These cases illustrate the opportunities for deriving rich metadata and contextual information from legacy field materials, demonstrating how digital curation can extend the life and reach of fieldworkers’ original documentation.
Publication Title:
Proceedings of the 3rd International Workshop on Digital Language Archives: LangArc 2025
Page Start:
49
Page End:
51
Peer Reviewed:
Yes
Relationships
Proceedings of the International Workshop on Digital Language Archives: LangArc-2025, ark:/67531/metadc2543332
Collections
This article is part of the following collection of related materials.
International Workshop on Digital Language Archives
This interactive workshop explores a broad scope of issues related to digital language archives—digital libraries that preserve and provide online access to language data. The collection includes proceedings and articles from the workshop.
Conference proceedings of the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025. It includes 11 peer-reviewed papers that were presented at the workshop and an introduction from the workshop organizers.
Relationship to this item: (Is Part Of)
Proceedings of the International Workshop on Digital Language Archives: LangArc-2025, ark:/67531/metadc2543332
Chelliah, Shobhana Lakshmi; Lowe, John Brandon & Fos, Jonas.Finding Metadata for Digital Archiving in Linguistic Fieldnotes,
article,
December 30, 2025;
(https://digital.library.unt.edu/ark:/67531/metadc2543327/:
accessed March 16, 2026),
University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu;
crediting UNT College of Information.