In recent years, the automatic segmentation and classification of digital survey data has been experimented with in built heritage studies. Despite the encouraging progress in the use of Machine and Deep Learning techniques, the semantic segmentation of point clouds is more complex, especially for the historic environment for which, due to the heterogeneity of shapes, it is more difficult to recognize homogeneous regions with similar properties. Given the need to process a large volume of already annotated data for the training and recognition of new scenes, the type and quality of the initial data play a fundamental role in the classification process, as they influence the subdivision into predefined categories that are not always consistent with a decomposition into architectural elements and sub-elements shared by the scientific community. This is an interpretative problem that already emerges from the traditional manual labelling, which, being highly subjective, reduces the reproducibility of the results. The paper focuses on understanding to what extent the recognition of homogeneous regions is influenced by factors such as:—manual labelling carried out by annotators with different specializations;—density value of the point clouds;—type of data acquired depending on the acquisition sensor. These evaluations were conducted by employing the Random Forest algorithm on specific pre-processed point cloud datasets, with reference to the typology of the Franciscan cloister, in order to make the recognition flows more controlled and less ambiguous, aiming at an advancement towards the more efficient modelling and management of the existing architectural heritage.

The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage / Antuono, Giuseppe; Cera, Valeria; Campi, Massimiliano; D’Agostino, Pierpaolo. - (2025), pp. 595-613. [10.1007/978-3-032-04711-3_34]

The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage

Giuseppe Antuono
;
Valeria Cera;Massimiliano Campi;Pierpaolo D’Agostino
2025

Abstract

In recent years, the automatic segmentation and classification of digital survey data has been experimented with in built heritage studies. Despite the encouraging progress in the use of Machine and Deep Learning techniques, the semantic segmentation of point clouds is more complex, especially for the historic environment for which, due to the heterogeneity of shapes, it is more difficult to recognize homogeneous regions with similar properties. Given the need to process a large volume of already annotated data for the training and recognition of new scenes, the type and quality of the initial data play a fundamental role in the classification process, as they influence the subdivision into predefined categories that are not always consistent with a decomposition into architectural elements and sub-elements shared by the scientific community. This is an interpretative problem that already emerges from the traditional manual labelling, which, being highly subjective, reduces the reproducibility of the results. The paper focuses on understanding to what extent the recognition of homogeneous regions is influenced by factors such as:—manual labelling carried out by annotators with different specializations;—density value of the point clouds;—type of data acquired depending on the acquisition sensor. These evaluations were conducted by employing the Random Forest algorithm on specific pre-processed point cloud datasets, with reference to the typology of the Franciscan cloister, in order to make the recognition flows more controlled and less ambiguous, aiming at an advancement towards the more efficient modelling and management of the existing architectural heritage.
2025
978-3-032-04710-6
The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage / Antuono, Giuseppe; Cera, Valeria; Campi, Massimiliano; D’Agostino, Pierpaolo. - (2025), pp. 595-613. [10.1007/978-3-032-04711-3_34]
File in questo prodotto:
File Dimensione Formato  
The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage_.pdf

solo utenti autorizzati

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 8 MB
Formato Adobe PDF
8 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/1032275
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact