This research explores the application of Probabilistic Distance Clustering (PDC) to compositional data, specifically addressing the challenge of handling a bounded sample space in distance computations. It first reviews existing log-ratio transformations that map compositions into Euclidean space for clustering. Then, it adapts PDC to handle compositional data by using the additive ratio combined with the Box-Cox transformation. A simulation study will evaluate this method compared to the well-established isometric log-ratio transformation.
Probabilistic Distance Clustering for Compositional Data / Simonacci, Violetta; Tortora, Cristina; Palumbo, Francesco; Gallo, Michele. - (2025), pp. 345-350. ( SIS2025 - Statistics for Innovation Genova 16-18 Giugno) [10.1007/978-3-031-96033-8_56].
Probabilistic Distance Clustering for Compositional Data
Simonacci, Violetta
;Palumbo, Francesco;
2025
Abstract
This research explores the application of Probabilistic Distance Clustering (PDC) to compositional data, specifically addressing the challenge of handling a bounded sample space in distance computations. It first reviews existing log-ratio transformations that map compositions into Euclidean space for clustering. Then, it adapts PDC to handle compositional data by using the additive ratio combined with the Box-Cox transformation. A simulation study will evaluate this method compared to the well-established isometric log-ratio transformation.| File | Dimensione | Formato | |
|---|---|---|---|
|
SIS2025_CoDaPDC.pdf
solo utenti autorizzati
Descrizione: Short paper SIS2025
Tipologia:
Versione Editoriale (PDF)
Licenza:
Copyright dell'editore
Dimensione
243.51 kB
Formato
Adobe PDF
|
243.51 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


