In recent years, the automatic segmentation and classification of digital survey data has been experimented with in built heritage studies. Despite the encouraging progress in the use of Machine and Deep Learning techniques, the semantic segmentation of point clouds is more complex, especially for the historic environment for which, due to the heterogeneity of shapes, it is more difficult to recognize homogeneous regions with similar properties. Given the need to process a large volume of already annotated data for the training and recognition of new scenes, the type and quality of the initial data play a fundamental role in the classification process, as they influence the subdivision into predefined categories that are not always consistent with a decomposition into architectural elements and sub-elements shared by the scientific community. This is an interpretative problem that already emerges from the traditional manual labelling, which, being highly subjective, reduces the reproducibility of the results. The paper focuses on understanding to what extent the recognition of homogeneous regions is influenced by factors such as:—manual labelling carried out by annotators with different specializations;—density value of the point clouds;—type of data acquired depending on the acquisition sensor. These evaluations were conducted by employing the Random Forest algorithm on specific pre-processed point cloud datasets, with reference to the typology of the Franciscan cloister, in order to make the recognition flows more controlled and less ambiguous, aiming at an advancement towards the more efficient modelling and management of the existing architectural heritage.
The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage / Antuono, Giuseppe; Cera, Valeria; Campi, Massimiliano; D’Agostino, Pierpaolo. - (2025), pp. 595-613. [10.1007/978-3-032-04711-3_34]
The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage
Giuseppe Antuono
;Valeria Cera;Massimiliano Campi;Pierpaolo D’Agostino
2025
Abstract
In recent years, the automatic segmentation and classification of digital survey data has been experimented with in built heritage studies. Despite the encouraging progress in the use of Machine and Deep Learning techniques, the semantic segmentation of point clouds is more complex, especially for the historic environment for which, due to the heterogeneity of shapes, it is more difficult to recognize homogeneous regions with similar properties. Given the need to process a large volume of already annotated data for the training and recognition of new scenes, the type and quality of the initial data play a fundamental role in the classification process, as they influence the subdivision into predefined categories that are not always consistent with a decomposition into architectural elements and sub-elements shared by the scientific community. This is an interpretative problem that already emerges from the traditional manual labelling, which, being highly subjective, reduces the reproducibility of the results. The paper focuses on understanding to what extent the recognition of homogeneous regions is influenced by factors such as:—manual labelling carried out by annotators with different specializations;—density value of the point clouds;—type of data acquired depending on the acquisition sensor. These evaluations were conducted by employing the Random Forest algorithm on specific pre-processed point cloud datasets, with reference to the typology of the Franciscan cloister, in order to make the recognition flows more controlled and less ambiguous, aiming at an advancement towards the more efficient modelling and management of the existing architectural heritage.| File | Dimensione | Formato | |
|---|---|---|---|
|
The Influence of Data Quality in Supervised ML-AI Classification Approaches for Historical Heritage_.pdf
solo utenti autorizzati
Tipologia:
Versione Editoriale (PDF)
Licenza:
Creative commons
Dimensione
8 MB
Formato
Adobe PDF
|
8 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


