A Two-Step Iterative Procedure for Clustering of Binary Sequences

A. Iodice D'Enza; Francesco Palumbo
2010

Abstract

Association Rules (AR) are a well-known data mining tool for detecting patterns of association in databases. The major drawback of knowledge extraction through AR mining is the huge number of rules produced when dealing with large amounts of data. Several proposals in the literature tackle this problem with different approaches. Within this framework, the present proposal aims to identify patterns of association in large binary data sets. We propose an iterative procedure that combines clustering and dimensionality reduction techniques: each iteration involves a quantification of the starting binary attributes, followed by an agglomerative algorithm applied to the resulting quantitative variables. The objective is to find a quantification that emphasizes the presence of groups of co-occurring attributes in the data.
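
To make the notion of "groups of co-occurring attributes" concrete, the following is a minimal sketch only, not the procedure proposed in the paper. It omits the quantification step entirely and, as a stand-in, groups binary attributes agglomeratively using Jaccard similarity with average linkage. The function names, the `threshold` stopping rule, and the toy data are all assumptions made for illustration.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two sets of row indices."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group_attributes(rows, threshold=0.5):
    """Average-linkage agglomerative grouping of binary attributes.

    rows: list of equal-length 0/1 lists, one per binary sequence.
    threshold: hypothetical stopping rule; merging stops once the best
    average similarity between two groups falls below this value.
    """
    n_attrs = len(rows[0])
    # Represent each attribute by the set of rows in which it equals 1.
    support = [{i for i, r in enumerate(rows) if r[j]} for j in range(n_attrs)]
    groups = [[j] for j in range(n_attrs)]  # start with singleton groups

    def avg_sim(g, h):
        total = sum(jaccard(support[a], support[b]) for a in g for b in h)
        return total / (len(g) * len(h))

    while len(groups) > 1:
        # Find the pair of groups with the highest average similarity.
        (p, q), best = max(
            (((p, q), avg_sim(groups[p], groups[q]))
             for p, q in combinations(range(len(groups)), 2)),
            key=lambda item: item[1],
        )
        if best < threshold:
            break
        groups[p] = sorted(groups[p] + groups[q])
        del groups[q]  # q > p, so index p is unaffected
    return sorted(groups)


# Toy data (illustrative): attributes 0 and 1 tend to co-occur, as do 2 and 3.
example_rows = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 1, 1, 1],
]
print(group_attributes(example_rows))  # [[0, 1], [2, 3]]
```

On this toy input the two attribute pairs each have Jaccard similarity 0.75, while the average similarity between the two resulting groups is about 0.17, so merging stops at two groups with the default threshold.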
ISBN: 9783642037382
A Two-Step Iterative Procedure for Clustering of Binary Sequences / Iodice D'Enza, A.; Palumbo, Francesco. - Print. - Studies in Classification, Data Analysis, and Knowledge Organization (2010), pp. 33-+. [10.1007/978-3-642-03739-9_4]

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/368898