Third workshop on proactive failure avoidance, recovery, and maintenance (PFARM)

Abstract

Over the last decade, research on dependable computing has undergone a shift from reactive towards proactive methods: In classical fault tolerance a system reacts to errors or component failures in order to prevent them from turning into system failures, and maintenance follows fixed, time-based plans. However, due to an ever increasing system complexity, use of commercial-off-the-shelf components, virtualization, ongoing system patches and updates and dynamicity such approaches have become difficult to apply. Therefore, a new area in dependability research has emerged focusing on proactive approaches that start acting before a problem arises in order to increase time-to-failure and/or reduce time-to-repair. These techniques frequently build on the anticipation of upcoming problems based on runtime monitoring. Industry and academia use several terms for such techniques, each focusing on different aspects, including self-computing, autonomic computing, proactive fault management, trustworthy computing, software rejuvenation, or preventive/proactive maintenance. © 2011 IEEE.

DOI

10.1109/DSNW.2011.5958822

Year

2011

Chicago Citation

Malek, M., F. Salfner, and K. S. Trivedi. “Third workshop on proactive failure avoidance, recovery, and maintenance (PFARM).” Proceedings of the International Conference on Dependable Systems and Networks, September 2, 2011, 257–58. https://doi.org/10.1109/DSNW.2011.5958822.