Letter Level Learning for Language Independent Diacritics Restoration

PDF Version Also Available for Download.

Description

This paper discusses letter level learning for language independent diacritics restoration.

Physical Description

7 p.

Creation Information

Mihalcea, Rada, 1974- & Nastase, Vivi September 2002.

Context

This paper is part of the collection entitled: UNT Scholarly Works and was provided by the UNT College of Engineering to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 1340 times. More information about this paper can be viewed below.

Who

People and organizations associated with either the creation of this paper or its content.

Authors

Provided By

UNT College of Engineering

The UNT College of Engineering strives to educate and train engineers and technologists who have the vision to recognize and solve the problems of society. The college comprises six degree-granting departments of instruction and research.

Contact Us

What

Descriptive information to help identify this paper. Follow the links below to find similar items on the Digital Library.

Degree Information

Description

This paper discusses letter level learning for language independent diacritics restoration.

Physical Description

7 p.

Notes

Abstract: This paper presents a method for diacritics restoration based on learning mechanisms that act at letter level. The method requires no additional tagging tools or resources other than raw text, which makes it independent of the language, and particularly appealing for languages for which there are few resources available. The algorithm was evaluated on four different languages, namely Czech, Hungarian, Polish, and Romanian, and an average accuracy of over 98% was observed.

Source

  • Sixth Conference on Natural Language Learning (CoNLL), 2002, Taipei, Taiwan

Language

Item Type

Identifier

Unique identifying numbers for this paper in the Digital Library or other systems.

Collections

This paper is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

What responsibilities do I have when using this paper?

When

Dates and time periods associated with this paper.

Creation Date

  • September 2002

Added to The UNT Digital Library

  • Jan. 31, 2011, 2:01 p.m.

Description Last Updated

  • March 27, 2014, 12:14 p.m.

Usage Statistics

When was this paper last used?

Yesterday: 0
Past 30 days: 2
Total Uses: 1,340

Interact With This Paper

Here are some suggestions for what to do next.

Start Reading

PDF Version Also Available for Download.

International Image Interoperability Framework

IIF Logo

We support the IIIF Presentation API

Mihalcea, Rada, 1974- & Nastase, Vivi. Letter Level Learning for Language Independent Diacritics Restoration, paper, September 2002; (https://digital.library.unt.edu/ark:/67531/metadc30944/: accessed April 24, 2024), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT College of Engineering.

Back to Top of Screen