Scientific Data Management Center for Enabling Technologies Page: 4 of 64
This report is part of the collection entitled: Office of Scientific & Technical Information Technical Reports and was provided to UNT Digital Library by the UNT Libraries Government Documents Department.
The following text was automatically extracted from the image on this page using optical character recognition software:
The SDM center was established under the SciDAC program to address these issues. The SciDAC-1
Scientific Data Management (SDM) Center succeeded in bringing an initial set of advanced data
management technologies to DOE application scientists in astrophysics, climate, fusion, and biology.
Equally important, it established collaborations with these scientists to better understand their science as
well as their forthcoming data management and data analytics challenges. Building on our early
successes, we have greatly enhanced, robustified, and deployed our technology to these communities. In
some cases, we identified new needs that have been addressed in order to simplify the use of our
technology by scientists. This report summarizes our work so far in SciDAC-2.
Our approach is to employ an evolutionary development and deployment process: from research through
prototypes to deployment and infrastructure. Accordingly, we have organized our activities in three layers
that abstract the end-to-end data flow described above. We labeled the layers (from bottom to top):
" Storage Efficient Access (SEA)
" Data Mining and Analysis (DMA)
" Scientific Process Automation (SPA)
The SEA layer is immediately on top of hardware, operating systems, file systems, and mass storage
systems, and provides parallel data access technology, and transparent access to archival storage. The
DMA layer, which builds on the functionality of the SEA layer, consists of indexing, feature
identification, and parallel statistical analysis technology. The SPA layer, which is on top of the DMA
layer, provides the ability to compose scientific workflows from the components in the DMA layer as
well as application specific modules. The organization of the center and technologies included in each
layer are shown below.
Scientific Process Automation (SPA) Layer
Management Specialized Scientific
Engine Workflow Dashboard
Data Mining and Analysis (DMA) Layer
Parallel R Data Efficient
Analysis and indexing
Statistical Feature (Bitmap
Analysis Identification Index)
Storage Efficient Access (SEA) Layer
Active Resource Adaptable Parallel Parallel Virtual
Storage Manager 1/O System I/O NetCDF File
(SRM) (ADIOS) (ROMIO) System
Hardware, Operating Systems, and Storage Systems
Here’s what’s next.
This report can be searched. Note: Results may vary based on the legibility of text within the document.
Tools / Downloads
Get a copy of this page or view the extracted text.
Citing and Sharing
Basic information for referencing this web page. We also provide extended guidance on usage rights, references, copying or embedding.
Reference the current page of this Report.
Vouk, Mladen A. Scientific Data Management Center for Enabling Technologies, report, January 15, 2013; United States. (https://digital.library.unt.edu/ark:/67531/metadc839220/m1/4/: accessed November 19, 2019), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT Libraries Government Documents Department.