On-line timing error detection entails gathering and analyzing monitoring data to pinpoint deviations from the expected timing behavior of a given software system. Current solutions for system monitoring and runtime analysis present several practical drawbacks that limit their usability in real industrial systems, such as the need of kernel-level probes or the coarse per-node/per-process monitoring granularity. This paper proposes a novel framework for timing error detection that capitalizes on the systematic interleaving of logging instructions across the functional code in order to overcome above limitations. The paper faces the practical challenges related to the specification and implementation of a log weaving technique, detection algorithms, and a data centralization platform to collect and analyze fine-grained execution traces in distributed systems. We experiment the proposed framework in two real-world critical information systems from the Crisis Management and the Air Traffic Control domains. Results show that our framework achieves 95% timing error coverage and allows reconstructing error trends with high statistical confidence at negligible performance overhead.

A framework for on-line timing error detection in software systems / Cinque, Marcello; Cotroneo, Domenico; Della Corte, Raffaele; Pecchia, Antonio. - In: FUTURE GENERATION COMPUTER SYSTEMS. - ISSN 0167-739X. - 90:(2019), pp. 521-538. [10.1016/j.future.2018.08.025]

A framework for on-line timing error detection in software systems

Cinque, Marcello;Cotroneo, Domenico;Della Corte, Raffaele;Pecchia, Antonio
2019

Abstract

On-line timing error detection entails gathering and analyzing monitoring data to pinpoint deviations from the expected timing behavior of a given software system. Current solutions for system monitoring and runtime analysis present several practical drawbacks that limit their usability in real industrial systems, such as the need of kernel-level probes or the coarse per-node/per-process monitoring granularity. This paper proposes a novel framework for timing error detection that capitalizes on the systematic interleaving of logging instructions across the functional code in order to overcome above limitations. The paper faces the practical challenges related to the specification and implementation of a log weaving technique, detection algorithms, and a data centralization platform to collect and analyze fine-grained execution traces in distributed systems. We experiment the proposed framework in two real-world critical information systems from the Crisis Management and the Air Traffic Control domains. Results show that our framework achieves 95% timing error coverage and allows reconstructing error trends with high statistical confidence at negligible performance overhead.
2019
A framework for on-line timing error detection in software systems / Cinque, Marcello; Cotroneo, Domenico; Della Corte, Raffaele; Pecchia, Antonio. - In: FUTURE GENERATION COMPUTER SYSTEMS. - ISSN 0167-739X. - 90:(2019), pp. 521-538. [10.1016/j.future.2018.08.025]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0167739X18309609-main.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: Copyright dell'editore
Dimensione 4.24 MB
Formato Adobe PDF
4.24 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/729326
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 6
social impact