Steps toward fault-tolerant quantum chemistry. Metadata

Title

  • Main Title Steps toward fault-tolerant quantum chemistry.

Creator

  • Author: Taube, Andrew Garvin
    Creator Type: Personal

Contributor

  • Sponsor: United States. Department of Energy.
    Contributor Type: Organization
    Contributor Info: USDOE

Publisher

  • Name: Sandia National Laboratories
    Place of Publication: United States

Date

  • Creation: 2010-05-01

Language

  • English

Description

  • Content Description: Developing quantum chemistry programs on the coming generation of exascale computers will be a difficult task. The programs will need to be fault-tolerant and minimize the use of global operations. This work explores the use of a task-based model that uses a data-centric approach to allocate work to different processes as it applies to quantum chemistry. After introducing the key problems that appear when trying to parallelize a complicated quantum chemistry method such as coupled-cluster theory, we discuss the implications of that model as it pertains to the computational kernel of a coupled-cluster program - matrix multiplication. Also, we discuss the extensions that would be required to build a full coupled-cluster program using the task-based model. Current programming models for high-performance computing are fault-intolerant and use global operations. Those properties are unsustainable as computers scale to millions of CPUs; instead one must recognize that these systems will be hierarchical in structure and prone to constant faults, and that global operations will be infeasible. The FAST-OS HARE project is introducing a scale-free computing model to address these issues. This model is hierarchical and fault-tolerant by design, allows for the clean overlap of computation and communication, reducing the network load, does not require checkpointing, and avoids the complexity of many HPC runtimes. Development of an algorithm within this model requires a change in focus from imperative programming to a data-centric approach. Quantum chemistry (QC) algorithms, in particular electronic structure methods, are an ideal test bed for this computing model. These methods describe the distribution of electrons in a molecule, which determines the properties of the molecule. The computational cost of these methods is high, scaling quartically or higher in the size of the molecule, which is why QC applications are major users of HPC resources. The complexity of these algorithms means that MPI alone is insufficient to achieve parallel scaling; QC developers have been forced to use alternative approaches to achieve scalability and would be receptive to radical shifts in the programming paradigm. Initial work in adapting the simplest QC method, Hartree-Fock, to this new programming model indicates that the approach is beneficial for QC applications. However, the advantages of being able to scale to exascale computers are greatest for the computationally most expensive algorithms; within QC these are the high-accuracy coupled-cluster (CC) methods. Parallel coupled-cluster programs are available; however, they are based on the conventional MPI paradigm. Much of the effort is spent handling the complicated data dependencies between the various processors, especially as the size of the problem becomes large. The current paradigm will not survive the move to exascale computers. Here we discuss the initial steps toward designing and implementing a CC method within this model. First, we introduce the general concepts behind a CC method, focusing on the aspects that make these methods difficult to parallelize with conventional techniques. Then we outline the computational core of the CC method - a matrix multiply - within the task-based approach that the FAST-OS project is designed to take advantage of. Finally, we outline the general setup to implement the simplest CC method in this model, linearized CC doubles (LinCC).
  • Physical Description: 30 p.
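
The abstract's central idea, a matrix multiply split into tasks that are assigned by the data they own and can be rerun after a fault, can be illustrated with a minimal sketch. This is not the report's code and does not use the FAST-OS HARE runtime: the names `make_tasks`, `run_tile`, `flaky`, and `task_matmul` are hypothetical, the matrices are assumed to be square Python lists, a thread pool stands in for distributed processes, and faults are simulated with a random exception.

```python
from concurrent.futures import ThreadPoolExecutor
import random

def make_tasks(n, tile):
    """One task per output tile of C; each task owns its tile, so no writes are shared."""
    return [(i, j) for i in range(0, n, tile) for j in range(0, n, tile)]

def run_tile(A, B, n, tile, i0, j0):
    """Compute the tile of C with top-left corner (i0, j0) from read-only A and B."""
    i1, j1 = min(i0 + tile, n), min(j0 + tile, n)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(j0, j1)]
            for i in range(i0, i1)]

def flaky(fn, fail_rate, rng):
    """Wrap fn so that it fails at random (a stand-in for a real node fault)."""
    def wrapped(*args):
        if rng.random() < fail_rate:
            raise RuntimeError("simulated worker fault")
        return fn(*args)
    return wrapped

def task_matmul(A, B, tile=2, workers=4, fail_rate=0.0, seed=0):
    """Multiply square matrices A and B tile by tile, rerunning only failed tiles."""
    n = len(A)
    work = flaky(run_tile, fail_rate, random.Random(seed))
    C = [[0] * n for _ in range(n)]
    pending = make_tasks(n, tile)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        while pending:
            futures = {pool.submit(work, A, B, n, tile, i0, j0): (i0, j0)
                       for i0, j0 in pending}
            pending = []
            for fut, (i0, j0) in futures.items():
                try:
                    block = fut.result()
                except RuntimeError:
                    pending.append((i0, j0))  # resubmit this tile only
                    continue
                for di, row in enumerate(block):
                    C[i0 + di][j0:j0 + len(row)] = row
    return C
```

Because each task reads shared inputs and writes only its own tile, a failed task can be resubmitted without a global checkpoint or a barrier over the tiles that already finished. A real runtime would also have to place tiles near the processes that need them, which this sketch ignores.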

Subject

  • Keyword: Electronic Structure
  • Keyword: Quantum Chemistry.
  • Keyword: Coupled Problems (Complex Systems)
  • STI Subject Categories: 99 General And Miscellaneous//Mathematics, Computing, And Information Science
  • Keyword: Computers
  • Keyword: Programming
  • Keyword: Kernels
  • Keyword: Radicals
  • Keyword: Algorithms
  • Keyword: Electrons
  • Keyword: Chemistry
  • Keyword: Focusing
  • Keyword: Distribution
  • Keyword: Design

Collection

  • Name: Office of Scientific & Technical Information Technical Reports
    Code: OSTI

Institution

  • Name: UNT Libraries Government Documents Department
    Code: UNTGD

Resource Type

  • Report

Format

  • Text

Identifier

  • Report No.: SAND2010-3388
  • Grant Number: AC04-94AL85000
  • DOI: 10.2172/992330
  • Office of Scientific & Technical Information Report Number: 992330
  • Archival Resource Key: ark:/67531/metadc1012213