Produktnummer:
1829d55c1696ba48b586e47cff846218ba
Themengebiete: | Algorithm-Based Fault Tolerance Fault-Tolerance Fault Predictors High-Performance Computing Resilience Silent Errors |
---|---|
Veröffentlichungsdatum: | 15.07.2015 |
EAN: | 9783319209425 |
Sprache: | Englisch |
Seitenzahl: | 320 |
Produktart: | Gebunden |
Herausgeber: | Herault, Thomas Robert, Yves |
Verlag: | Springer International Publishing |
Produktinformationen "Fault-Tolerance Techniques for High-Performance Computing"
This timely text presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC). The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as ABFT. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models. Features: provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems; reviews the spectrum of techniques that can be applied to design a fault-tolerant MPI; investigates different approaches to replication; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems.

Sie möchten lieber vor Ort einkaufen?
Sie haben Fragen zu diesem oder anderen Produkten oder möchten einfach gerne analog im Laden stöbern? Wir sind gerne für Sie da und beraten Sie auch telefonisch.
Juristische Fachbuchhandlung
Georg Blendl
Parcellistraße 5 (Maxburg)
8033 München
Montag - Freitag: 8:15 -18 Uhr
Samstags geschlossen