Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics

PDF

University of Tennessee, Knoxville

Theses/Dissertations

2015

ABFT

Articles 1 - 1 of 1

Full-Text Articles in Entire DC Network

Algorithm-Based Fault Tolerance For Two-Sided Dense Matrix Factorizations, Yulu Jia Dec 2015

Algorithm-Based Fault Tolerance For Two-Sided Dense Matrix Factorizations, Yulu Jia

Doctoral Dissertations

The mean time between failure (MTBF) of large supercomputers is decreasing, and future exascale computers are expected to have a MTBF of around 30 minutes. Therefore, it is urgent to prepare important algorithms for future machines with such a short MTBF. Eigenvalue problems (EVP) and singular value problems (SVP) are common in engineering and scientific research. Solving EVP and SVP numerically involves two-sided matrix factorizations: the Hessenberg reduction, the tridiagonal reduction, and the bidiagonal reduction. These three factorizations are computation intensive, and have long running times. They are prone to suffer from computer failures.

We designed algorithm-based fault tolerant (ABFT) …