Using accelerated life tests to estimate time to software aging failure

TitleUsing accelerated life tests to estimate time to software aging failure
Publication TypeJournal Article
Year of Publication2010
AuthorsR Matias, KS Trivedi, and PRM Maciel
JournalProceedings - International Symposium on Software Reliability Engineering, ISSRE
Start Page211
Pagination211 - 219
Date Published12/2010
Abstract

Software aging is a phenomenon defined as the continuing degradation of software systems during runtime, being particularly noticeable in long-running applications. Aging-related failures are very difficult to observe, because the accumulation of aging effects usually requires a long-term execution. Thus, collecting a statistically significant sample of times to aging-related failures so as to estimate the system's lifetime distribution is a very hard task. This is an important problem that prevents many experimental and analytical studies, mainly those focused on modeling of software aging aspects, of using representative parameter values. In this paper we propose and evaluate the use of quantitative accelerated life tests (QALT) to reduce the time to obtain the lifetime distribution of systems that fail due to software aging. Since QALT was developed for hardware failures, in this paper, we adapt it to software aging experiments. We test the proposed approach experimentally, estimating the lifetime distribution of a real web server system. The accuracy of the estimated distribution is evaluated by comparing its reliability estimates with a sample of failure times observed from the real system under test. The mean time to failure calculated from the real sample falls inside the 90% confidence interval constructed from the estimated lifetime distribution, demonstrating the high accuracy of the estimated model. The proposed approach reduces the time required to obtain the failure times by a factor of seven, for the real system investigated. © 2010 IEEE.

DOI10.1109/ISSRE.2010.42
Short TitleProceedings - International Symposium on Software Reliability Engineering, ISSRE