Metadata Quality Enhancement for Large Digital Collections: Web Browser Automation with Selenium IDE

PDF Version Also Available for Download.

Description

Poster presented at the 2012 TCDL Annual Conference. This poster discusses metadata quality enhancement for large digital collections.

Physical Description

1 p.

Creation Information

Weidner, Andrew & Alemneh, Daniel Gelaw May 24, 2012.

Context

This poster is part of the collection entitled: UNT Scholarly Works and was provided by UNT Libraries to Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 674 times , with 9 in the last month . More information about this poster can be viewed below.

Who

People and organizations associated with either the creation of this poster or its content.

Authors

Organizer of meeting

Provided By

UNT Libraries

With locations on the Denton campus of the University of North Texas and one in Dallas, UNT Libraries serves the school and the community by providing access to physical and online collections; The Portal to Texas History and UNT Digital Libraries; academic research, and much, much more.

Contact Us

What

Descriptive information to help identify this poster. Follow the links below to find similar items on the Digital Library.

Degree Information

Description

Poster presented at the 2012 TCDL Annual Conference. This poster discusses metadata quality enhancement for large digital collections.

Physical Description

1 p.

Notes

Abstract: Creating and maintaining accurate descriptive metadata for digital objects is one of the best ways to connect with digital library users and maintain those connections over the long term. Good metadata empowers users to not only discover exactly what they searched for, but also to locate relevant resources that they did not expect to find. Metadata quality characteristics for digital libraries depend on many factors, including: the types of resources the repository offers and the users' needs, which vary across the spectrum of user communities. The metadata quality issue is particularly acute if there are multiple institutions participating in collaborative digital projects that employ diverse naming schemes for their documents and files. Furthermore, harvesting large sets of documents from open repositories presents a number of challenges for creating accurate descriptive metadata. For example, metadata schema do not always map well, creating disconnects when published in the local repository. In the aforementioned cases, substantial rework is usually required to create descriptive data that meets local repository standards. The University of North Texas (UNT) digital libraries group utilizes various tools and mechanisms to ensure metadata consistency and precision across all digital resources. Pre-populated controlled vocabulary terms in its Web-based dashboard editing interface enable metadata operators to easily select standard values via drop-down menus and auto-suggest for text input fields. In addition, careful mapping prior to ingest facilitates accurate conversions among various metadata element sets. Crosswalks also facilitate exporting metadata records to other systems. To support these activities - in cases where post-ingest metadata normalization will enhance recall and precision for its digital objects - the UNT Libraries recently implemented Selenium IDE as a tool for streamlining the process of editing large sets of metadata records. Created by the Web development community in order to simplify the process of testing Web applications, Selenium IDE is a Firefox browser plug-in that provides an integrated development environment for creating, debugging, and running Web browser automation scripts. This poster discusses the complex set of tools and actions required to maintain usable and sustainable digital collections and demonstrate how Selenium IDE facilitates metadata editing for large digital collections by automating a range of data entry tasks. Any institution that employs a content management system with a Web-based metadata editing interface can potentially benefit from Selenium IDE's automation capabilities.

Source

  • Texas Conference on Digital Libraries (TCDL), 2012, Austin, Texas, United States

Language

Item Type

Collections

This poster is part of the following collection of related materials.

UNT Scholarly Works

Materials from the UNT community's research, creative, and scholarly activities and UNT's Open Access Repository. Access to some items in this collection may be restricted.

What responsibilities do I have when using this poster?

When

Dates and time periods associated with this poster.

Creation Date

  • May 24, 2012

Added to The UNT Digital Library

  • June 8, 2012, 10:10 a.m.

Description Last Updated

  • Oct. 15, 2014, 12:51 p.m.

Usage Statistics

When was this poster last used?

Yesterday: 0
Past 30 days: 9
Total Uses: 674

Interact With This Poster

Here are some suggestions for what to do next.

Enlarge

PDF Version Also Available for Download.

Citations, Rights, Re-Use

Weidner, Andrew & Alemneh, Daniel Gelaw. Metadata Quality Enhancement for Large Digital Collections: Web Browser Automation with Selenium IDE, poster, May 24, 2012; (digital.library.unt.edu/ark:/67531/metadc86138/: accessed October 22, 2017), University of North Texas Libraries, Digital Library, digital.library.unt.edu; .