Title | Third workshop on proactive failure avoidance, recovery, and maintenance (PFARM) |
Publication Type | Journal Article |
Year of Publication | 2011 |
Authors | M Malek, F Salfner, and KS Trivedi |
Journal | Proceedings of the International Conference on Dependable Systems and Networks |
Start Page | 595 |
Pagination | 595 - 596 |
Date Published | 08/2011 |
Abstract | Over the last decade, research on dependable computing has undergone a shift from reactive towards proactive methods: In classical fault tolerance a system reacts to errors or component failures in order to prevent them from turning into system failures, and maintenance follows fixed, time-based plans. However, due to an ever increasing system complexity, use of commercial-off-the-shelf components, virtualization, ongoing system patches and updates and dynamicity such approaches have become difficult to apply. Therefore, a new area in dependability research has emerged focusing on proactive approaches that start acting before a problem arises in order to increase time-to-failure and/or reduce time-to-repair. These techniques frequently build on the anticipation of upcoming problems based on runtime monitoring. Industry and academia use several terms for such techniques, each focusing on different aspects, including self-* computing, autonomic computing, proactive fault management, trustworthy computing, software rejuvenation, or preventive/proactive maintenance. © 2011 IEEE. |
DOI | 10.1109/DSN.2011.5958234 |
Short Title | Proceedings of the International Conference on Dependable Systems and Networks |