This paper proposes an approach to software faults diagnosis in complex fault tolerant systems, encompassing the phases of error detection, fault location, and system recovery. Errors are detected in the first phase, exploiting the operating system support. Faults are identified during the location phase, adopting on a machine learning approach; this phase then triggers the proper recovery action for the occurred fault - actuated in the third phase. Feedback actions are also adopted in the location phase to improve detection quality over time. A real world application from the Air Traffic Control field has been used as case study for evaluating the proposed approach. Experimental results, achieved by means of fault injection, show that the diagnosis engine is able to diagnose faults with high accuracy and at a low overhead.

A Recovery-Oriented Approach for Software Fault Diagnosis in Complex Critical Systems / Carrozza, Gabriella; Natella, Roberto. - In: INTERNATIONAL JOURNAL OF ADAPTIVE, RESILIENT AND AUTONOMIC SYSTEMS. - ISSN 1947-9220. - 2:1(2011), pp. 77-104. [10.4018/jaras.2011010105]

A Recovery-Oriented Approach for Software Fault Diagnosis in Complex Critical Systems

Carrozza, Gabriella;Natella, Roberto
2011

Abstract

This paper proposes an approach to software faults diagnosis in complex fault tolerant systems, encompassing the phases of error detection, fault location, and system recovery. Errors are detected in the first phase, exploiting the operating system support. Faults are identified during the location phase, adopting on a machine learning approach; this phase then triggers the proper recovery action for the occurred fault - actuated in the third phase. Feedback actions are also adopted in the location phase to improve detection quality over time. A real world application from the Air Traffic Control field has been used as case study for evaluating the proposed approach. Experimental results, achieved by means of fault injection, show that the diagnosis engine is able to diagnose faults with high accuracy and at a low overhead.
2011
A Recovery-Oriented Approach for Software Fault Diagnosis in Complex Critical Systems / Carrozza, Gabriella; Natella, Roberto. - In: INTERNATIONAL JOURNAL OF ADAPTIVE, RESILIENT AND AUTONOMIC SYSTEMS. - ISSN 1947-9220. - 2:1(2011), pp. 77-104. [10.4018/jaras.2011010105]
File in questo prodotto:
File Dimensione Formato  
jaras-2-1-article-5.pdf

solo utenti autorizzati

Tipologia: Documento in Post-print
Licenza: Accesso privato/ristretto
Dimensione 2.02 MB
Formato Adobe PDF
2.02 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/697442
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact