Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Doctoral Dissertations

2005

High-performance

Full-Text Articles in Physical Sciences and Mathematics

Availability Modeling And Evaluation On High Performance Cluster Computing Systems, Hertong Song, Oct 2005

Doctoral Dissertations

Cluster computing has been attracting growing attention from both industry and academia for its enormous computing power, cost-effectiveness, and scalability. A Beowulf-type cluster, for example, is a typical High Performance Computing (HPC) cluster system. Availability, as a key attribute of the system, needs to be considered at the system design stage and monitored at mission time. Moreover, system monitoring is essential for identifying defects and ensuring that the system meets its availability requirement.

This study investigates novel solutions that provide availability modeling, model evaluation, and data analysis within a single framework. …