In a recent study, Reinforcement Learning (RL) used in combination with many-objective search, has been shown to outperform alternative techniques (random search and many-objective search) for online testing of Deep Neural Network-enabled systems. The empirical evaluation of these techniques was conducted on a state-of-the-art Autonomous Driving System (ADS). This work is a replication and extension of that empirical study. Our replication shows that RL does not outperform pure random test generation in a comparison conducted under the same settings of the original study, but with no confounding factor coming from the way collisions are measured. Our extension aims at eliminating some of the possible reasons for the poor performance of RL observed in our replication: (1) the presence of reward components providing contrasting feedback to the RL agent; (2) the usage of an RL algorithm (Q-learning) which requires discretization of an intrinsically continuous state space. Results show that our new RL agent is able to converge to an effective policy that outperforms random search. Results also highlight other possible improvements, which open to further investigations on how to best leverage RL for online ADS testing.

Reinforcement learning for online testing of autonomous driving systems: a replication and extension study / Giamattei, Luca; Biagiola, Matteo; Pietrantuono, Roberto; Russo, Stefano; Tonella, Paolo. - In: EMPIRICAL SOFTWARE ENGINEERING. - ISSN 1573-7616. - 30:1(2025). [10.1007/s10664-024-10562-5]

Reinforcement learning for online testing of autonomous driving systems: a replication and extension study

Luca Giamattei
;
Roberto Pietrantuono;Stefano Russo;
2025

Abstract

In a recent study, Reinforcement Learning (RL) used in combination with many-objective search, has been shown to outperform alternative techniques (random search and many-objective search) for online testing of Deep Neural Network-enabled systems. The empirical evaluation of these techniques was conducted on a state-of-the-art Autonomous Driving System (ADS). This work is a replication and extension of that empirical study. Our replication shows that RL does not outperform pure random test generation in a comparison conducted under the same settings of the original study, but with no confounding factor coming from the way collisions are measured. Our extension aims at eliminating some of the possible reasons for the poor performance of RL observed in our replication: (1) the presence of reward components providing contrasting feedback to the RL agent; (2) the usage of an RL algorithm (Q-learning) which requires discretization of an intrinsically continuous state space. Results show that our new RL agent is able to converge to an effective policy that outperforms random search. Results also highlight other possible improvements, which open to further investigations on how to best leverage RL for online ADS testing.
2025
Reinforcement learning for online testing of autonomous driving systems: a replication and extension study / Giamattei, Luca; Biagiola, Matteo; Pietrantuono, Roberto; Russo, Stefano; Tonella, Paolo. - In: EMPIRICAL SOFTWARE ENGINEERING. - ISSN 1573-7616. - 30:1(2025). [10.1007/s10664-024-10562-5]
File in questo prodotto:
File Dimensione Formato  
s10664-024-10562-5 (3).pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 3.99 MB
Formato Adobe PDF
3.99 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/1002998
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact