In modern applications large amounts of data are produced at a high rate and are characterized by relationship structures changing over time. Principal component analysis (PCA) and multiple correspondence analysis (MCA) are well established dimension reduction methods to explore relationships within a set of variables. A critical step of the PCA and MCA algorithms is a singular value decomposition (SVD) or an eigenvalue decomposition (EVD) of a suitably transformed matrix. The high computational and memory requirements of ordinary SVD and EVD make their application impractical on massive or sequential data sets. A series of incremental SVD/EVD approaches are available to address these issues. The idm R package is introduced that implements two efficient incremental SVD approaches. The procedures in question share desirable properties that ease their embedding in PCA and MCA. The package also provides functions for producing animated visualizations of the obtained solutions. A comparison of online MCA implementations in terms of accuracy is also included.

The idm package: Incremental decomposition methods in R / IODICE D'ENZA, Alfonso; Markos, Angelos; Buttarazzi, Davide. - In: JOURNAL OF STATISTICAL SOFTWARE. - ISSN 1548-7660. - 86:CN4(2018), pp. 1-24. [10.18637/jss.v086.c04]

The idm package: Incremental decomposition methods in R

IODICE D'ENZA Alfonso
;
2018

Abstract

In modern applications large amounts of data are produced at a high rate and are characterized by relationship structures changing over time. Principal component analysis (PCA) and multiple correspondence analysis (MCA) are well established dimension reduction methods to explore relationships within a set of variables. A critical step of the PCA and MCA algorithms is a singular value decomposition (SVD) or an eigenvalue decomposition (EVD) of a suitably transformed matrix. The high computational and memory requirements of ordinary SVD and EVD make their application impractical on massive or sequential data sets. A series of incremental SVD/EVD approaches are available to address these issues. The idm R package is introduced that implements two efficient incremental SVD approaches. The procedures in question share desirable properties that ease their embedding in PCA and MCA. The package also provides functions for producing animated visualizations of the obtained solutions. A comparison of online MCA implementations in terms of accuracy is also included.
2018
The idm package: Incremental decomposition methods in R / IODICE D'ENZA, Alfonso; Markos, Angelos; Buttarazzi, Davide. - In: JOURNAL OF STATISTICAL SOFTWARE. - ISSN 1548-7660. - 86:CN4(2018), pp. 1-24. [10.18637/jss.v086.c04]
File in questo prodotto:
File Dimensione Formato  
06_JSS_idm_ready.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Non specificato
Dimensione 8.22 MB
Formato Adobe PDF
8.22 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/741476
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 6
social impact