Search Results

open access

Welcome from the LangArc-2025 Workshop Organizers

Description: Proceedings introduction for the 3rd International Workshop on Digital Language Archives (LangArc-2025). It provides an overview of the workshop and proceedings which focus on a wide range of issues related to digital language archives. The workshop was held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Zavalina, Oksana; Chelliah, Shobhana Lakshmi & Burke, Mary
Partner: UNT College of Information
open access

Proceedings of the International Workshop on Digital Language Archives: LangArc-2025

Description: Conference proceedings of the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025. It includes 11 peer-reviewed papers that were presented at the workshop and an introduction from the workshop organizers.
Date: December 30, 2025
Creator: Zavalina, Oksana; Chelliah, Shobhana Lakshmi & Burke, Mary
Partner: UNT College of Information
open access

The digital language archive as a platform for exploring language data, analyses, and publications

Description: Article outlining a platform for a digital language archive which integrates original data, analyses, and publications. Thereby, the digital language archive becomes the host and facilitator of research, which supports the integrity of research projects. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Weber, Tobias
Partner: UNT College of Information
open access

Replicating hospitality: Reframing a documentary linguistics digital archiving curriculum to meet the needs of refugees and their service workers

Description: Article detailing the development of the Archiving Our Refugee Stories (ORSA) curriculum, adapted from CoRSAL’s existing digital archiving curriculum. The ORSA is an emerging repository created by and for refugees to house their memories and equip them with the knowledge and skills to preserve their stories. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Frederick, Merrion; Roeschley, Ana; Brown, Nathan; Braga, Damares & Khader, Deama
Partner: UNT College of Information
open access

Learning to Assess Use of Materials in Language Archival Collections

Description: Article presenting a learning module for a graduate course on community language archiving for information professionals offered at the University of North Texas. The focus is a practical assignment in which students evaluate digital language archive usage through comparative analyses of stratified random samples and interpret their findings. It reports selected results of this learning in the summer semester of the 2024/2025 academic year and discusses future steps. It was presented at the 3rd … more
Date: December 30, 2025
Creator: Zavalina, Oksana; Savchenko, Viktoriia & Savchenko, Kostiantyn
Partner: UNT College of Information
open access

Advancing Language Revitalization Goals Through Community-based Archiving

Description: Article examining how community-based archiving initiatives reflect language revitalization goals through a case study of an archival collection created by community members engaged in revitalization efforts. Understanding the decision-making and intentions behind such collections provides insight and guidance for future community-based archiving projects. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint C… more
Date: December 30, 2025
Creator: Burke, Mary
Partner: UNT College of Information
open access

Assessing the Impact of Image Resolution on OCR Transcription Accuracy

Description: Article investigating the relationship between image resolution and OCR (optical character recognition) performance, with a focus on both character-level accuracy and the integrity of subsequent text processing pipelines. The findings have practical implications for document digitization workflows, especially in resource-constrained environments where high-resolution image storage and processing may be questionable. It was presented at the 3rd International Workshop on Digital Language Archives… more
Date: December 30, 2025
Creator: Boubehziz, Toufik; Koudoro-Parfait, Caroline & Lejeune, Gaël
Partner: UNT College of Information
open access

Training Future Librarians and Archivists to Create and Describe Digital Language Archive Collections

Description: Article presenting a major practical project to provide training in language archiving for future information professionals. In this course, graduate students in library science, information science, archival studies, and other disciplines learn to create the items for their own collections of language-focused materials, provide metadata to describe items, and make collections available through the Omeka-powered digital repository. The learning objectives of this project are discussed and compa… more
Date: December 30, 2025
Creator: Zavalina, Oksana
Partner: UNT College of Information
open access

Curation as Collective Practice: Archiving the Inga-Lil Hansson Akha Materials

Description: Article discussing the archival curation of research material collected by Professor Inga-Lill Hansson from 1974 to 2022 through extensive fieldwork on the Akha language. It describes the relationship-centered collaboration between Hansson’s network and the archiving team and the gradual curation workflow, and reflects on the technical, interpretive, and ethical decisions required to make a single scholar’s lifetime of work discoverable and responsibly reusable. It was presented at the 3rd Intern… more
Date: December 30, 2025
Creator: Benavides, Jose; Chelliah, Shobhana Lakshmi; Borch, Søren; Burke, Mary & Weber, Sydney
Partner: UNT College of Information
open access

A Short Reflection on the Long History of Language Archives

Description: Article presenting a historical narrative surrounding the term language archives. Three elements are highlighted: (1) the history of the use of the term, (2) the sociological connections between individual voices central to the modern development of language archives, and (3) the evolution of metadata schemes across institutions in newly established language archives. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE… more
Date: December 30, 2025
Creator: Paterson, Hugh, III
Partner: UNT College of Information
open access

Community Archiving Across Languages: Strategies and Tools for Multilingual Praxis

Description: Article describing how the Archivo de Respuestas Emergencias de Puerto Rico / Emergency Archive of Puerto Rico (AREPR) team put its values into praxis through participatory design of both the project’s collections and the platform in which they are housed. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Boyles, Christina
Partner: UNT College of Information
open access

Enhancing Photograph Description in Language Archive Metadata with Artificial Intelligence

Description: Article presenting a project in which generative AI was used to enhance free-text descriptions in metadata records that represent a typical kind of resource in language archives: photographs. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Zavalin, Vyacheslav
Partner: UNT College of Information
open access

Finding Metadata for Digital Archiving in Linguistic Fieldnotes

Description: Article examining how digital language archives can address challenges of legacy field materials through a case study from the Computational Resource for South Asian Languages (CoRSAL), a repository dedicated to preserving and documenting the languages of South Asia. It focuses on two CoRSAL legacy projects: the curation of Norman Zide’s extensive collection on the Munda languages of north-central India (Gutob, Gta’, Korku, and Mundari), and the classroom fieldnotes of James Matisoff on Tibeto-… more
Date: December 30, 2025
Creator: Chelliah, Shobhana Lakshmi; Lowe, John Brandon & Fos, Jonas
Partner: UNT College of Information
open access

A CARE- and FAIR-Ready Distributed Access Control System for Human-Created Data

Description: Article describing the approach that the authors are taking to access control and a design for a distributed access control system that can look after the A-is-for-accessible in FAIR data while respecting the CARE principles. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: July 3, 2023
Creator: Sefton, Peter; Sacal Bonequi, Moises; Musgrave, Simon & Fewster, Jenny
Partner: UNT College of Information
open access

Multiperspectivity and Neutrality in Language Archives

Description: Article discussing linguistic data and their creation with a focus on the human actions and decisions that shape them. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: July 3, 2023
Creator: Weber, Tobias
Partner: UNT College of Information
open access

Why it Can be Difficult to Make Historic Language Recordings Accessible: A View from a Corpus of Historic Dialect Recordings

Description: Article reporting on experiences in preparing a corpus of historic Austrian dialect recordings from the Phonogrammarchiv’s holdings and on the real-life issues encountered in the process. It discusses what needs to be done with such a corpus before something can be done with that corpus. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: July 3, 2023
Creator: Huber, Christian
Partner: UNT College of Information
open access

Making Photographs in Language Archives Maximally Useful: Metadata Guidelines for Community and Academic Depositors

Description: Article focusing on metadata creation for photographs in language archives. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: July 3, 2023
Creator: Chelliah, Shobhana Lakshmi
Partner: UNT College of Information
open access

Bharatavani Project - Reviving Linguistic Diversity and Cultural Heritage in India: A Case Study

Description: Article presenting an overview of the Bharatavani project, which focuses on recording socio-cultural and linguistic information about 121 Indian languages and making it accessible to a broader audience. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: July 3, 2023
Creator: Choudhary, Narayan; Premkumar, LR; Singh, Chandan; Mondal, Shubhanan; Priya, Shivangi; Sudarshan, Beluru et al.
Partner: UNT College of Information
open access

Proceedings of the International Workshop on Digital Language Archives: LangArc-2023

Description: Conference proceedings of the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023. It includes 10 peer-reviewed papers that were presented at the workshop and an introduction from the workshop organizers.
Date: July 3, 2023
Creator: Zavalina, Oksana & Chelliah, Shobhana Lakshmi
Partner: UNT College of Information
open access

Exploration of Metadata Practices in Digital Collections of Archives with Arabian Language Materials

Description: Article on a study that aimed to develop an understanding of the current state of metadata practices in digital collections of archival institutions in the Arabian Gulf region. It also explored perspectives (including attitudes and possible barriers) on the development of large-scale regional portals that would facilitate discovery of Arab digital archives (including language collections) by aggregating metadata. It was presented at the 2nd International Workshop on Digital Language Archives held on June… more
Date: June 16, 2023
Creator: Aljalahmah, Saleh & Zavalina, Oksana
Partner: UNT College of Information
open access

Language Archiving Training: A Case Study of a Metadata Course in Library and Information Science Graduate Program, 2020 - 2023

Description: Article exploring the training gap between the way these materials are organized and represented and the understanding of that data – and expectations towards the more functional ways of its organization and representation – by language preservation and revitalization researchers, and by members of language communities. Information resources collected by language archives have unique attributes of importance to their target user groups, and these attributes and their representation are not curre… more
Date: June 16, 2023
Creator: Zavalina, Oksana
Partner: UNT College of Information
open access

OLAC and Serials: An Appraisal

Description: Article reporting on how journal articles are presented within the Open Language Archives Community’s (OLAC) OAI-PMH aggregator for language resources. Understanding how secondary journal materials are presented in OLAC records is a first step towards increasing the end-user utility of the OLAC aggregator. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: June 16, 2023
Creator: Paterson, Hugh, III
Partner: UNT College of Information
open access

Ukrainian Archival Metadata in WorldCat: Exploratory Analysis

Description: Article documenting a study examining a sample of WorldCat records representing Ukrainian-language archival materials (including digital resources) that were not officially published or released with the goal to examine the extent to which these metadata records support the user tasks of find, identify, select, obtain, and explore. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2… more
Date: June 16, 2023
Creator: Zavalin, Vyacheslav
Partner: UNT College of Information
open access

Towards Making Shared Metadata Interoperable across the Open Language Archives Community

Description: Article presenting two methods for connecting aggregated records to their source institutional metadata profiles. The use case of the Open Language Archives Community (OLAC) application profile is considered and evaluated. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: June 16, 2023
Creator: Paterson, Hugh, III
Partner: UNT College of Information