Farm batch system and Fermi inter-process communication and synchronization toolkit

Mandrichenko, I.V.

You Are Here:
University Libraries
UNT Digital Library
UNT Libraries Government Documents Department
This Article
Page: 4

Farm batch system and Fermi inter-process communication and synchronization toolkit Page: 4 of 13

151 Kilobytes pages

This article is part of the collection entitled: Office of Scientific & Technical Information Technical Reports and was provided to UNT Digital Library by the UNT Libraries Government Documents Department.

View a full description of this article.

Previous search

Adjust Image
Rotate Left
Rotate Right
Brightness, Contrast, etc. (Experimental)
Cropping Tool
Download Sizes
Preview all sizes/dimensions or...
Download Thumbnail
Download Small
Download Medium
Download Large
High Resolution Files
IIIF Image JSON
IIIF Image URL
Accessibility
View Extracted Text

zoom Next

These controls are experimental and have not yet been optimized for user experience.

brightness

Reset Brightness 0

contrast

Reset Contrast 0

saturation

Reset Saturation 0

sharpen

Reset Sharpness 0

exposure

Reset Exposure 0

hue

Reset Hue 0

gamma

Reset Gama 0

Applying filters

Farm batch system and Fermi inter-process communication and synchronization toolkit

[Sequence #]: 4 of 13

Previous item Next item

Extracted Text

The following text was automatically extracted from the image on this page using optical character recognition software:

"Fgr 2:~ FBS..P Desig
FI .e andw wi i
" FARM isaFSdeoIhtrn nea wke ne.. cree ironetfrue
pr c ss s st rt them as+o requste byJ ,rp rst ersau o M a d U ,n tfe M w e
U ie ar an ts a
e Lggr r ogdamo i rspnsbl fr eevn n rr strn erradeetlgifra
- m ti rtl' I -i
Figure 2: FBS Design
n Job Manager (JM) is an FBS process that controls single section running on the farm. LSF
starts JMs as LSF batch processes. JM is responsible for allocating resources on farm nodes
with FLIMD, and communicating with FARMD(s) on nodes allocated for the section by
FLIMD to start user processes, and wait for their completion.
T FARMD is a FBS daemon that runs on each worker node. It creates environment for user
processes, starts them as requested by JM, reports their status to JM and UI, notifies JM when
user process exits.
e Historian is FBS historical database manager. It receives section start/exit statistics and stores
it on disk. UI provides a tool for reading this database and generating reports as requested
by user.
" Logger or log daemon is responsible for receiving and storing error and event log informa-
tion sent by other FBS components. This information is primarily used for FBS debugging
and trouble shooting.
2.2.4 Robustness and Reliability
FBS design makes it highly reliable and robust with respect to failure of individual components.
This is achieved by distributing run-time information among different FBS components and avoid-
ing redundancy of the information. Basic idea is that FLIMD, as the most critical component, can
recover after failure based on information received from JMs. JM is highly reliable component and
most likely reason for its failure is failure of the node where it runs. Since in typical configuration
all JMs and LSF run on the same node, failure of the node inevitably means failure of LSF and the
whole farm, and necessarily leads to re-initialization of the batch system. Unlikely failure of an in-

rB& "npQnPmc

Upcoming Pages

Here’s what’s next.

5 of 13

6 of 13

7 of 13

8 of 13

Show all pages in this article.

Search Inside

This article can be searched. Note: Results may vary based on the legibility of text within the document.

or search this site for other articles

Tools / Downloads

Get a copy of this page or view the extracted text.

Preview all sizes/dimensions or...

Download Thumbnail
Download Small
Download Medium
Download Large
IIIF Image JSON
IIIF Image

View Extracted (OCR) Text

Citing and Sharing

Basic information for referencing this web page. We also provide extended guidance on usage rights, references, copying or embedding.

Reference the current page of this Article.

Mandrichenko, I.V. Farm batch system and Fermi inter-process communication and synchronization toolkit, article, February 20, 2001; Batavia, Illinois. (https://digital.library.unt.edu/ark:/67531/metadc718290/m1/4/: accessed April 24, 2024), University of North Texas Libraries, UNT Digital Library, https://digital.library.unt.edu; crediting UNT Libraries Government Documents Department.

Farm batch system and Fermi inter-process communication and synchronization toolkit Page: 4 of 13

Upcoming Pages

Search Inside

Tools / Downloads

Citing and Sharing

Reference the current page of this Article.

Print / Share This Page

Permanent URL (This Page)

Univesal Viewer

International Image Interoperability Framework (This Page)