Multilingual Information Access for Digital Libraries - The Metadata Records Translation Project

Date: 2011
Creator: Chen, Jiangping
Description: This presentation was given as an invited talk to faculty and students at Wuhan University, Beijing Normal University, Nankai University, and the Library of the Chinese Academy of Sciences. The presentation discusses research on multilingual information access for digital libraries and the Metadata Records Translation (MRT) Project.
Contributing Partner: UNT College of Information
Cross-language search: The case of Google Language Tools

Date: March 2, 2009
Creator: Chen, Jiangping & Bao, Yu
Description: Article on cross-language searching and a case study of Google Language Tools, especially its cross-language search service.
Contributing Partner: UNT College of Information
Metadata Records Translation And Evaluation for Multilingual Information Access

Date: September 2011
Creator: Chen, Jiangping
Description: This presentation was given as an invited talk to faculty and students at Universidad Autónoma del Estado de México (UAEM) in Toluca, Mexico. The presentation discusses research on multilingual information access and the Metadata Records Translation (MRT) Project.
Contributing Partner: UNT College of Information
Multilingual Word Sense Disambiguation Using Wikipedia

Date: August 2013
Creator: Dandala, Bharath
Description: Ambiguity is inherent to human language. In particular, word sense ambiguity is prevalent in all natural languages, with a large number of the words in any given language carrying more than one meaning. Word sense disambiguation is the task of automatically assigning the most appropriate meaning to a polysemous word within a given context. In the literature, work on resolving ambiguity has generally revolved around the famous quote “you shall know the meaning of the word by the company it keeps.” In this thesis, we investigate the role of context in resolving ambiguity through three different approaches. Instead of using a predefined monolingual sense inventory such as WordNet, we use a language-independent framework in which the word senses and sense-tagged data are derived automatically from Wikipedia. Using Wikipedia as a source of sense annotations provides a much-needed solution to the knowledge acquisition bottleneck. To evaluate the viability of Wikipedia-based sense annotations, we cast the task of disambiguating polysemous nouns as a monolingual classification task and experimented on lexical samples from four different languages (viz. English, German, Italian and Spanish). The experiments confirm that the Wikipedia-based sense annotations are reliable and can be used to construct accurate monolingual sense classifiers. ...
Contributing Partner: UNT Libraries