ARES: A Framework for Management of Aging and Rejuvenation in Softwarized Networks

TitleARES: A Framework for Management of Aging and Rejuvenation in Softwarized Networks
Publication TypeJournal Article
Year of Publication2021
AuthorsP Vizarreta, C Sieber, A Blenk, A Van Bemten, V Ramachandra, W Kellerer, C Mas-Machuca, and K Trivedi
JournalIEEE Transactions on Network and Service Management
Volume18
Issue2
Start Page1389
Pagination1389 - 1400
Date Published06/2021
Abstract

The recent trend of network softwarization suggests a radical shift in the implementation of traditional network intelligence. In Software Defined Networking (SDN), for instance, the control plane functions of forwarding devices are outsourced to the controller. Softwarized network components are expected to provide uninterrupted service during long periods of time, which makes them prone to the effects of software aging, a phenomena that has been observed in operational software systems where the failure rate increases or the performance of the software degrades with the elapsed time since the last restart. The effects of software aging in operational networks are typically mitigated by software rejuvenation, i.e., planned restarts cleaning the internal system state in order to prevent or postpone aging-related failures. This article presents ARES, a three-step methodological framework for the management of the effects of software aging in softwarized networks, applied to the case study of open source SDN orchestration platforms. Using ARES, we demonstrate that software aging is a systematic problem that cannot be neglected in network orchestration systems. It stems not only from aging-related bugs and natural aging due to fragmentation, but also from design choices, e.g., when implementing distributed systems. Measurements for Open Network Operating System (ONOS) and OpenDaylight (ODL) demonstrate how 'simple' and common networking tasks let network performance degrade rapidly and even lead to crashes: for instance, adding and removing 300 intents per second in ONOS significantly increases the response time by 50% per day and depletes the memory at the rate of 18GB per day. Moreover, we demonstrate a first rejuvenation approach that can mitigate the effects of aging in ONOS.

DOI10.1109/TNSM.2020.3030589
Short TitleIEEE Transactions on Network and Service Management