Comparison of genomic strings via alignment is widely applied for mining and retrieving information in biological databases. In some situations, the effectiveness of such alignment-based comparison is still unclear, e.g., for sequences of non-uniform length and with significant shuffling of identical substrings. An alternative approach is based on information-theoretic distances. We present four of the most representative alignment-free distance measures based on mutual information. Each has a different origin and expression. Our comparison involves recasting the measures so that the different concepts reduce to a single formalism, which makes it possible to construct a phylogenetic tree for each of them.
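To make the idea of an alignment-free, information-based distance concrete, here is a minimal sketch of the normalized compression distance (NCD), a well-known measure of this family. It is an illustration only: the abstract does not name the four distances compared, so NCD is not necessarily one of them. The use of `zlib` as the compressor, the synthetic sequences, and the helper names `compressed_size` and `ncd` are all assumptions made for this example.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed input (assumed stand-in for information content)."""
    return len(zlib.compress(data, 9))


def ncd(x: str, y: str) -> float:
    """Normalized compression distance: near 0 for similar strings, near 1 for unrelated ones."""
    bx, by = x.encode("ascii"), y.encode("ascii")
    cx, cy = compressed_size(bx), compressed_size(by)
    cxy = compressed_size(bx + by)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Synthetic data (hypothetical, not taken from the paper).
rng = random.Random(0)
original = "".join(rng.choice("ACGT") for _ in range(2000))

# Same substrings as `original`, but with the 200-character blocks in shuffled order.
blocks = [original[i:i + 200] for i in range(0, len(original), 200)]
rng.shuffle(blocks)
shuffled = "".join(blocks)

# An independent random sequence of the same length.
unrelated = "".join(rng.choice("ACGT") for _ in range(2000))

print("original vs shuffled :", round(ncd(original, shuffled), 3))
print("original vs unrelated:", round(ncd(original, unrelated), 3))
```

Because the compressor can reuse substrings it has already seen regardless of their position, the block-shuffled sequence should score much closer to the original than the unrelated one does. Shuffled substrings are one of the cases in which the abstract calls the effectiveness of alignment-based comparison unclear.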
Analysis and Comparison of Information Theory-Based Distances for Genomic Strings / Balzano, Walter; Cicalese, F.; Del Sorbo, M. R.; Vaccaro, U. - Print. - (2008), pp. 292-310. (Paper presented at the conference Collective Dynamics: Topics on Competition and Cooperation in the Biosciences, A.I.P. American Institute of Physics Conference Proceedings).
Analysis and Comparison of Information Theory-Based Distances for Genomic Strings
BALZANO, WALTER
2008
File | Type | License | Size | Format
---|---|---|---|---
Analysis and Comparison of Information Theory-based Distances for Genomic Strings.pdf (not available) | Post-print document | Private/restricted access | 4.07 MB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.