Extracting and Parsing of Herbarium Specimen Data: Exploring the Use of the Dublin Core Application Profile Framework

Description:

This paper discusses extraction and parsing of herbarium specimen data to make the critical information available in digital form.

Creation Date: February 2010
Partner(s):
UNT College of Information
Collection(s):
UNT Scholarly Works
Creator (Author):
Moen, William E.

University of North Texas

Creator (Author):
Huang, Jane Q.

University of North Texas

Creator (Author):
McCotter, Melody

University of North Texas

Creator (Author):
Neill, Amanda K.

Botanical Research Institute of Texas

Creator (Author):
Best, Jason H.

Botanical Research Institute of Texas

Note:

Abstract: Herbaria around the world house millions of plant specimens, which botanists and other researchers value as resources for biodiversity research. Even when the specimen sheets are digitized and made available online, the critical information about the specimen recorded on the sheet is not in a usable (i.e., machine-processable) form. This paper describes a current research and development project that is designing and testing high-throughput workflows that combine machine and human processes to extract and parse the specimen label data. The primary focus of the paper is the metadata needs of the workflow and the creation of structured metadata records describing the plant specimens. In the project, the authors are exploring the use of the new Dublin Core Metadata Initiative framework for application profiles. The framework was first articulated in 2007 as the Singapore Framework for Dublin Core Application Profiles, and its use is in its infancy. The framework promises maximum interoperability, documentation of metadata use for maximum reusability, and support for metadata applications that conform to Web architectural principles; these promises provide the incentive to explore the framework and add implementation experience with it.

Physical Description:

7 p.

Subject(s):
Keyword(s): metadata application profiles | Darwin Core | Dublin Core | application profiles | Singapore Framework | biodiversity information | herbarium specimens
Source: iConference, 2010, Champaign-Urbana, Illinois, United States
Series Title: Apiary Project
Identifier:
  • ARK: ark:/67531/metadc81386
Resource Type: Paper
Format: Text
Rights:
Access: Public