Metadata Quality Enhancement for Large Digital Collections: Web Browser Automation with Selenium IDE

PDF Version Also Available for Download.


Poster presented at the 2012 TCDL Annual Conference. This poster discusses metadata quality enhancement for large digital collections.

Physical Description

1 p.

Creation Information

Weidner, Andrew & Alemneh, Daniel Gelaw May 24, 2012.


This poster is part of the collection entitled UNT Scholarly Works and was provided by UNT Libraries to the UNT Digital Library, a digital repository hosted by the UNT Libraries. It has been viewed 627 times, with 5 in the last month. More information about this poster can be viewed below.


People and organizations associated with either the creation of this poster or its content.


Organizer of meeting

Rights Holders

For guidance see Citations, Rights, Re-Use.

  • Unknown

Provided By

UNT Libraries

Library facilities at the University of North Texas function as the nerve center for teaching and academic research. In addition to a major collection of electronic journals, books and databases, five campus facilities house just under six million cataloged holdings, including books, periodicals, maps, documents, microforms, audiovisual materials, music scores, full-text journals and books. A branch library is located at the University of North Texas Dallas Campus.


Descriptive information to help identify this poster. Follow the links below to find similar items on the Digital Library.



Abstract: Creating and maintaining accurate descriptive metadata for digital objects is one of the best ways to connect with digital library users and maintain those connections over the long term. Good metadata empowers users not only to discover exactly what they searched for, but also to locate relevant resources that they did not expect to find. Metadata quality characteristics for digital libraries depend on many factors, including the types of resources the repository offers and the users' needs, which vary across the spectrum of user communities. The metadata quality issue is particularly acute when multiple institutions participating in collaborative digital projects employ diverse naming schemes for their documents and files. Furthermore, harvesting large sets of documents from open repositories presents a number of challenges for creating accurate descriptive metadata. For example, metadata schemas do not always map well to one another, creating disconnects when records are published in the local repository. In these cases, substantial rework is usually required to create descriptive data that meets local repository standards. The University of North Texas (UNT) digital libraries group uses various tools and mechanisms to ensure metadata consistency and precision across all digital resources. Pre-populated controlled vocabulary terms in its Web-based dashboard editing interface enable metadata operators to easily select standard values via drop-down menus and auto-suggest for text input fields. In addition, careful mapping prior to ingest facilitates accurate conversions among various metadata element sets. Crosswalks also facilitate exporting metadata records to other systems. To support these activities in cases where post-ingest metadata normalization will enhance recall and precision for its digital objects, the UNT Libraries recently implemented Selenium IDE as a tool for streamlining the process of editing large sets of metadata records.
Created by the Web development community in order to simplify the process of testing Web applications, Selenium IDE is a Firefox browser plug-in that provides an integrated development environment for creating, debugging, and running Web browser automation scripts. This poster discusses the complex set of tools and actions required to maintain usable and sustainable digital collections and demonstrates how Selenium IDE facilitates metadata editing for large digital collections by automating a range of data entry tasks. Any institution that employs a content management system with a Web-based metadata editing interface can potentially benefit from Selenium IDE's automation capabilities.
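
To make the idea of automated post-ingest normalization concrete, the following is a minimal sketch, not the poster's actual scripts. It maps variant field values to a preferred controlled-vocabulary form and emits a list of command/target/value steps, the same three-column layout that Selenium IDE test steps use. The vocabulary entries, record identifiers, the "/edit/" path, and the "id=..." locators are all hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass

# Hypothetical controlled vocabulary: lower-cased variant -> preferred form.
PREFERRED = {
    "unt": "University of North Texas",
    "univ. of north texas": "University of North Texas",
}


@dataclass(frozen=True)
class Step:
    """One automation step in command/target/value form."""
    command: str
    target: str
    value: str = ""


def normalize(value: str) -> str:
    """Collapse whitespace, then swap in the preferred term if one is known."""
    cleaned = " ".join(value.split())
    return PREFERRED.get(cleaned.lower(), cleaned)


def plan_edits(records: dict, field: str, base_url: str = "/edit/") -> list:
    """Build edit steps only for records whose field value needs changing."""
    steps = []
    for record_id, fields in records.items():
        current = fields.get(field, "")
        fixed = normalize(current)
        if fixed == current:
            continue  # already in the preferred form; nothing to automate
        steps.append(Step("open", f"{base_url}{record_id}"))   # hypothetical URL
        steps.append(Step("type", f"id={field}", fixed))       # hypothetical locator
        steps.append(Step("clickAndWait", "id=save"))          # hypothetical locator
    return steps


if __name__ == "__main__":
    sample = {
        "rec1": {"publisher": "UNT"},
        "rec2": {"publisher": "University of North Texas"},
        "rec3": {"publisher": "  Univ. of  North Texas "},
    }
    for step in plan_edits(sample, "publisher"):
        print(step.command, step.target, step.value)
```

Separating the planning step from the browser interaction means the list of intended changes can be reviewed before anything is replayed against a live editing interface.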


  • Texas Conference on Digital Libraries (TCDL), 2012, Austin, Texas, United States


Item Type


This poster is part of the following collection of related materials.

UNT Scholarly Works

The Scholarly Works Collection is home to materials from the University of North Texas community's research, creative, and scholarly activities and serves as UNT's Open Access Repository. It brings together articles, papers, artwork, music, research data, reports, presentations, and other scholarly and creative products representing the expertise in our university community. Access to some items in this collection may be restricted.


Dates and time periods associated with this poster.

Creation Date

  • May 24, 2012

Added to The UNT Digital Library

  • June 8, 2012, 10:10 a.m.

Description Last Updated

  • Oct. 15, 2014, 12:51 p.m.

Usage Statistics

When was this poster last used?

Yesterday: 0
Past 30 days: 5
Total Uses: 627

Citations, Rights, Re-Use

Weidner, Andrew & Alemneh, Daniel Gelaw. Metadata Quality Enhancement for Large Digital Collections: Web Browser Automation with Selenium IDE, poster, May 24, 2012; accessed March 23, 2017, University of North Texas Libraries, Digital Library.