Search Results

open access

Wikify! Linking Documents to Encyclopedic Knowledge

Description: This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achieve state-of-the-art results on both these tasks.
Date: November 2007
Creator: Mihalcea, Rada, 1974- & Csomai, Andras
Partner: UNT College of Engineering
open access

Expanding the Search for Digital Preservation Solutions: Adopting PREMIS in Cultural Heritage Institutions

Description: Paper accompanying a poster presentation for the 2009 ACM/IEEE-CS Joint Conference on Digital Libraries. This paper provides some preliminary results on factors that affect the adoption of PREMIS (Preservation Metadata Implementation Strategies) in cultural heritage institutions.
Date: 2009
Creator: Alemneh, Daniel Gelaw
Partner: UNT Libraries

The University of North Texas Libraries' Portal to Texas History: Archival Challenges and Solutions [Poster]

Description: Poster presented at the 2004 Joint Conference on Digital Libraries (JCDL). This poster details the system that automates the collection of metadata records to coordinate access to web-viewable files and preservation of archived master files.
Date: 2004
Creator: Hartman, Cathy Nelson; Nordstrom, Kurt & Phillips, Mark Edward
Partner: UNT Libraries
open access

The University of North Texas Libraries' Portal to Texas History: Archival Challenges and Solutions

Description: This paper discusses the University of North Texas (UNT) Libraries' Portal to Texas History's archival challenges and solutions. The UNT Texas History Portal Project strives to balance the goals of accessibility of information and long-term preservation of digital objects. This poster details the system that automates the collection of metadata records to coordinate access to web-viewable files and preservation of archived master files.
Date: 2004
Creator: Nordstrom, Kurt; Hartman, Cathy Nelson & Phillips, Mark Edward
Partner: UNT Libraries
open access

Welcome from the LangArc-2025 Workshop Organizers

Description: Proceedings introduction for the 3rd International Workshop on Digital Language Archives (LangArc-2025). It provides an overview of the workshop and proceedings which focus on a wide range of issues related to digital language archives. The workshop was held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Zavalina, Oksana; Chelliah, Shobhana Lakshmi & Burke, Mary
Partner: UNT College of Information
open access

Proceedings of the International Workshop on Digital Language Archives: LangArc-2025

Description: Conference proceedings of the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025. It includes 11 peer-reviewed papers that were presented at the workshop and an introduction from the workshop organizers.
Date: December 30, 2025
Creator: Zavalina, Oksana; Chelliah, Shobhana Lakshmi & Burke, Mary
Partner: UNT College of Information
open access

The digital language archive as a platform for exploring language data, analyses, and publications

Description: Article outlining a platform for a digital language archive which integrates original data, analyses, and publications. Thereby, the digital language archive becomes the host and facilitator of research, which supports the integrity of research projects. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Weber, Tobias
Partner: UNT College of Information
open access

Advancing Language Revitalization Goals Through Community-based Archiving

Description: Article examining how community-based archiving initiatives reflect language revitalization goals through a case study of an archival collection created by community members engaged in revitalization efforts. Understanding the decision-making and intentions behind such collections provides insight and guidance for future community-based archiving projects. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint C… more
Date: December 30, 2025
Creator: Burke, Mary
Partner: UNT College of Information
open access

Assessing the Impact of Image Resolution on OCR Transcription Accuracy

Description: Article investigating the relationship between image resolution and OCR (optical character recognition) performance, with a focus on both character-level accuracy and the integrity of subsequent text processing pipelines. The findings have practical implications for document digitization workflows, especially in resource-constrained environments where high-resolution image storage and processing may be questionable. It was presented at the 3rd International Workshop on Digital Language Archives… more
Date: December 30, 2025
Creator: Boubehziz, Toufik; Koudoro-Parfait, Caroline & Lejeune, Gaël
Partner: UNT College of Information
open access

Training Future Librarians and Archivists to Create and Describe Digital Language Archive Collections

Description: Article presenting a major practical project to provide training in language archiving for future information professionals. In this course, graduate students in library science, information science, archival studies, and other disciplines learn to create the items for their own collections of language-focused materials, provide metadata to describe items, and make collections available through the Omeka-powered digital repository. The learning objectives of this project are discussed and compa… more
Date: December 30, 2025
Creator: Zavalina, Oksana
Partner: UNT College of Information
open access

Learning to Assess Use of Materials in Language Archival Collections

Description: Article presenting a learning module for a graduate course on community language archiving for information professionals offered at the University of North Texas. The focus is the practical assignment in which students evaluate digital language archive usage through comparative analyses of stratified random samples and interpret their findings. It reports selected results of this learning in the summer semester of the 2024/2025 academic year and discusses future steps. It was presented at the 3rd … more
Date: December 30, 2025
Creator: Zavalina, Oksana; Savchenko, Viktoriia & Savchenko, Kostiantyn
Partner: UNT College of Information
open access

Curation as Collective Practice: Archiving the Inga-Lil Hansson Akha Materials

Description: Article discussing the archival curation of research material collected by professor Inga-Lill Hansson from 1974 to 2022 through extensive fieldwork on the Akha language. It describes the relationship-centered collaboration between Hansson’s network and the archiving team, the gradual curation workflow, and reflects on the technical, interpretive, and ethical decisions required to make a single scholar’s lifetime of work discoverable and responsibly reusable. It was presented at the 3rd Intern… more
Date: December 30, 2025
Creator: Benavides, Jose; Chelliah, Shobhana Lakshmi; Borch, Søren; Burke, Mary & Weber, Sydney
Partner: UNT College of Information
open access

A Short Reflection on the Long History of Language Archives

Description: Article presenting a historical narrative surrounding the term language archives. Three elements are highlighted: (1) the history of the use of the term, (2) the sociological connections between individual voices central to the modern development of language archives, (3) the evolution of metadata schemes across institutions in newly established language archives. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE… more
Date: December 30, 2025
Creator: Paterson, Hugh, III
Partner: UNT College of Information
open access

Community Archiving Across Languages: Strategies and Tools for Multilingual Praxis

Description: Article describing how the Archivo de Respuestas Emergencias de Puerto Rico / Emergency Archive of Puerto Rico (AREPR) team put its values into praxis through participatory design of both the project’s collections and the platform in which they are housed. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Boyles, Christina
Partner: UNT College of Information
open access

Replicating hospitality: Reframing a documentary linguistics digital archiving curriculum to meet the needs of refugees and their service workers

Description: Article detailing the development of the Archiving Our Refugee Stories (ORSA) curriculum, adapted from CoRSAL’s existing digital archiving curriculum. The ORSA is an emerging repository created by and for refugees to house their memories and equip them with the knowledge and skills to preserve their stories. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Frederick, Merrion; Roeschley, Ana; Brown, Nathan; Braga, Damares & Khader, Deama
Partner: UNT College of Information
open access

Enhancing Photograph Description in Language Archive Metadata with Artificial Intelligence

Description: Article presenting a project in which generative AI was utilized to enhance free-text descriptions in metadata records that represent a typical kind of resource in language archives: photographs. It was presented at the 3rd International Workshop on Digital Language Archives held on December 15-16, 2025 as part of the ACM/IEEE Joint Conference on Digital Libraries 2025.
Date: December 30, 2025
Creator: Zavalin, Vyacheslav
Partner: UNT College of Information
open access

Finding Metadata for Digital Archiving in Linguistic Fieldnotes

Description: Article examining how digital language archives can address challenges of legacy field materials through a case study from the Computational Resource for South Asian Languages (CoRSAL), a repository dedicated to preserving and documenting the languages of South Asia. It focuses on two CoRSAL legacy projects: the curation of Norman Zide’s extensive collection on the Munda languages of north-central India (Gutob, Gta’, Korku, and Mundari), and the classroom fieldnotes of James Matisoff on Tibeto-… more
Date: December 30, 2025
Creator: Chelliah, Shobhana Lakshmi; Lowe, John Brandon & Fos, Jonas
Partner: UNT College of Information

Moving the End of Term Web Archive to the Cloud to Encourage Research Use and Reuse

Description: Presentation given at the 2022 Web Archiving and Digital Libraries Virtual Workshop, in conjunction with the Joint Conference on Digital Libraries (JCDL), on June 24, 2022. This presentation discusses the End of Term (EOT) Web Archive project and process of organizing, staging, processing, and moving these collections into the Amazon cloud.
Date: June 24, 2022
Creator: Phillips, Mark Edward & Alam, Sawood
Partner: UNT Libraries
open access

Bharatavani Project - Reviving Linguistic Diversity and Cultural Heritage in India: A Case Study

Description: Article presenting an overview of the Bharatavani project, which focuses on recording socio-cultural and linguistic information about 121 Indian languages and making it accessible to a broader audience. It was presented at the 2nd International Workshop on Digital Language Archives held on June 30, 2023 as part of the ACM/IEEE Joint Conference on Digital Libraries 2023.
Date: July 3, 2023
Creator: Choudhary, Narayan; Premkumar, LR; Singh, Chandan; Mondal, Shubhanan; Priya, Shivangi; Sudarshan, Beluru et al.
Partner: UNT College of Information