Search Results

open access

Challenges to Representing Personal Names and Language Names in Language Archives: Examples from Northeast India

Description: Article reviewing one particular challenge to data management relevant to South Asia, which is the complexity of names (of individuals, groups, and languages). It was presented at the 1st International Workshop on Digital Language Archives held on September 30-October 1, 2021 as part of the ACM/IEEE Joint Conference on Digital Libraries 2021.
Date: October 7, 2021
Creator: Burke, Mary & Chelliah, Shobhana Lakshmi
Partner: UNT College of Information
open access

A Corpus of Negations and their Underlying Positive Interpretations

Description: Article presenting a corpus of negations and their underlying positive interpretations using negations from Simple Wikipedia, automatically generating potential positive interpretations, and collecting manual annotations that effectively rewrite the negation in positive terms. This article was presented at the Eighth Joint Conference on Lexical and Computational Semantics (SEM 2019) in Minneapolis, Minnesota, June 6-7, 2019.
Date: June 2019
Creator: Sarabi, Zahra; Killian, Erin; Blanco, Eduardo & Palmer, Alexis
Partner: University of North Texas
open access

It’s not a Non-Issue: Negation as a Source of Error in Machine Translation

Description: Article investigates whether translating negation is an issue for modern MT systems using 17 translation directions as test bed and provides a linguistically motivated analysis that explains the majority of the findings. The authors release their annotations and code to replicate analysis here: https://github.com/mosharafhossain/negation-mt.
Date: November 2020
Creator: Hossain, Md Mosharaf; Blanco, Eduardo; Palmer, Alexis & Anastasopoulos, Antonios
Partner: University of North Texas
open access

Neural classification of Norwegian radiology reports: using NLP to detect findings in CT-scans of children

Description: This article trained machine learning techniques to classify Norwegian radiology reports of pediatric CT examinations according to their description of abnormal findings. The developed models are robust with respect to different contexts, and may be used in quality assurance processes.
Date: March 4, 2021
Creator: Dahl, Fredrik A.; Rama, Taraka; Hurlen, Petter; Brekke, Pål H.; Husby, Haldor; Gundersen, Tore et al.
Partner: UNT College of Information
open access

Phrasal Proper Names in German and Norwegian

Description: Article discusses the morpho-syntax of phrasal proper names like Deutsche Bahn 'German Railway' and Norske Skog 'Norwegian Forest' in German and Norwegian. The authors document that phrasal proper names may show features of recursivity evidenced most clearly in Norwegian.
Date: September 9, 2023
Creator: Julien, Marit & Roehrs, Dorian
Partner: UNT College of Information
open access

Prenominal possessives in Yiddish: mayn khaver versus mayner a khaver

Description: Article provides a systematic comparison and detailed analysis of two prenominal possessive constructions in Yiddish, the familiar mayn khaver ‘my friend’ and the less well-known mayner a khaver ‘a friend of mine.’
Date: February 21, 2022
Creator: Roehrs, Dorian
Partner: UNT College of Information
open access

Serial Verbs and Complex Paths in Klallam

Description: This article describes and illustrates the construction of complex paths and discusses the place of Klallam in a typology of motion events.
Date: 2008
Creator: Montler, Timothy
Partner: UNT College of Arts and Sciences
open access

Synthetic data for annotation and extraction of family history information from clinical text

Description: This article investigates the use of synthetic data for the annotation and automated extraction of family history information relating to cases of cardiac disease from Norwegian clinical text. This work assesses the validity and applicability of the annotated synthetic corpus using machine learning techniques. The methodology outlined in this article may be useful in other situations where limited availability of clinical text hinders NLP tasks.
Date: July 14, 2021
Creator: Brekke, Pål H.; Kasicheyanula, Taraka; Pilán, Ildikó; Nytrø, Øystein & Øvrelid, Lilja
Partner: UNT College of Information
open access

A test of Generalized Bayesian dating: A new linguistic dating method

Description: Article addressing if a new Bayesian framework can be introduced and ways to overcome subjectivity. The authors introduce a new method called Generalized Bayesian Dating (GBD) for inferring dates of language groups from lexical and phonological data. This work has implications for future performance testing in the area of linguistic dating.
Date: August 12, 2020
Creator: Kasicheyanula, Taraka & Søren Wichmann
Partner: UNT College of Information
open access

User needs in language archives: Findings from interviews with language archive managers, depositors, and end-users

Description: This article is an exploratory study providing empirical data on language archive user needs and supports some anecdotal evidence of known issues facing language archive end-users, depositors, and managers in primarily academic contexts.
Date: April 2022
Creator: Burke, Mary; Zavalina, Oksana; Chelliah, Shobhana Lakshmi & Phillips, Mark Edward
Partner: University of North Texas
open access

What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity

Description: Article describes how the authors present an analysis of eight measures used for quantifying morphological complexity of natural languages. The measures they study are corpus-based measures of morphological complexity with varying requirements for corpus annotation.
Date: September 22, 2022
Creator: Çöltekin, Çağrı & Rama, Taraka
Partner: UNT College of Information
open access

WikiPossessions: Possession Timeline Generation as an Evaluation Benchmark for Machine Reading Comprehension of Long Texts

Description: Article presents WikiPossessions, a new benchmark corpus for the task of temporally-oriented possession (TOP), or tracking objects as they change hands over time. In addition to the corpus, the authors release evaluation scripts and a baseline model for the task.
Date: May 2020
Creator: Blanco, Eduardo; Palmer, Alexis & Chinnappa, Dhivya
Partner: University of North Texas
Back to Top of Screen