Ensemble methods are supervised learning algorithms that provide highly accurate solutions by training many models. Random forest is probably the most widely used in regression and classifcation problems. It builds decision trees on diferent samples and takes their majority vote for classifcation and average in case of regression. However, such an algorithm sufers from a lack of explainability and thus does not allow users to understand how particular decisions are made. To improve on that, we propose a new way of interpreting an ensemble tree structure. Starting from a random forest model, our approach is able to explain graphically the relationship structure between the response variable and predictors. The proposed method appears to be useful in all real-world cases where model interpretation for predictive purposes is crucial. The proposal is evaluated by means of real data sets.

Explainable Ensemble Trees / Aria, Massimo; Gnasso, Agostino; Iorio, Carmela; Pandolfo, Giuseppe. - In: COMPUTATIONAL STATISTICS. - ISSN 0943-4062. - (2023). [10.1007/s00180-022-01312-6]

Explainable Ensemble Trees

Massimo Aria
;
Agostino Gnasso;Carmela Iorio;Giuseppe Pandolfo
2023

Abstract

Ensemble methods are supervised learning algorithms that provide highly accurate solutions by training many models. Random forest is probably the most widely used in regression and classifcation problems. It builds decision trees on diferent samples and takes their majority vote for classifcation and average in case of regression. However, such an algorithm sufers from a lack of explainability and thus does not allow users to understand how particular decisions are made. To improve on that, we propose a new way of interpreting an ensemble tree structure. Starting from a random forest model, our approach is able to explain graphically the relationship structure between the response variable and predictors. The proposed method appears to be useful in all real-world cases where model interpretation for predictive purposes is crucial. The proposal is evaluated by means of real data sets.
2023
Explainable Ensemble Trees / Aria, Massimo; Gnasso, Agostino; Iorio, Carmela; Pandolfo, Giuseppe. - In: COMPUTATIONAL STATISTICS. - ISSN 0943-4062. - (2023). [10.1007/s00180-022-01312-6]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/906526
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 4
social impact