Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"
RUS  ENG    JOURNALS   PEOPLE   ORGANISATIONS   CONFERENCES   SEMINARS   VIDEO LIBRARY   PACKAGE AMSBIB  
General information
Latest issue
Archive

Search papers
Search references

RSS
Latest issue
Current issues
Archive issues
What is RSS



Vestn. YuUrGU. Ser. Vych. Matem. Inform.:
Year:
Volume:
Issue:
Page:
Find






Personal entry:
Login:
Password:
Save password
Enter
Forgotten password?
Register


Vestn. YuUrGU. Ser. Vych. Matem. Inform., 2015, Volume 4, Issue 3, Pages 5–12 (Mi vyurv1)  

Computer Science, Engineering and Control

Simulation of failures in high-performance computing systems under MPI-ULFM

A. A. Bondarenko, M. V. Iakobovski

Keldysh Institute of Applied Mathematics (Moscow, Russian Federation)

Abstract: In this paper, we consider one of the main problems that occur in the area of highperformance computing is to continue computations despite of failures. For the programs running on such systems it is very important to handle failures and continue computations on working nodes. One of the MPI 3.1 standardization efforts aim is adding new techniques, approaches, or concepts to support for fault tolerance in MPI applications. The paper briefly describes a library for simulation of failures and testing fault-tolerant algorithms using functional of developing MPI 3.1 standard. In the test problem we describe one of the techniques of fault tolerance and we compare checkpoint in operational memory versus checkpoint in the distributed file system.

Keywords: parallel computing, fault tolerance, checkpoint, simulation of failures, MPI, ULFM.

DOI: https://doi.org/10.14529/cmse150301

Full text: PDF file (784 kB)
References: PDF file   HTML file

UDC: 004.052.3
Received: 13.04.2015

Citation: A. A. Bondarenko, M. V. Iakobovski, “Simulation of failures in high-performance computing systems under MPI-ULFM”, Vestn. YuUrGU. Ser. Vych. Matem. Inform., 4:3 (2015), 5–12

Citation in format AMSBIB
\Bibitem{BonIak15}
\by A.~A.~Bondarenko, M.~V.~Iakobovski
\paper Simulation of failures in high-performance computing systems under MPI-ULFM
\jour Vestn. YuUrGU. Ser. Vych. Matem. Inform.
\yr 2015
\vol 4
\issue 3
\pages 5--12
\mathnet{http://mi.mathnet.ru/vyurv1}
\crossref{https://doi.org/10.14529/cmse150301}
\elib{https://elibrary.ru/item.asp?id=23790220}


Linking options:
  • http://mi.mathnet.ru/eng/vyurv1
  • http://mi.mathnet.ru/eng/vyurv/v4/i3/p5

    SHARE: VKontakte.ru FaceBook Twitter Mail.ru Livejournal Memori.ru


    Citing articles on Google Scholar: Russian citations, English citations
    Related articles on Google Scholar: Russian articles, English articles
  • Vestnik Yuzhno-Ural'skogo Gosudarstvennogo Universiteta. Seriya "Vychislitelnaya Matematika i Informatika"
    Number of views:
    This page:181
    Full text:65
    References:20

     
    Contact us:
     Terms of Use  Registration to the website  Logotypes © Steklov Mathematical Institute RAS, 2022