Denoising algorithms are widely studied as a way to improve image quality in many application fields, such as Magnetic Resonance Imaging (MRI). In real scenarios, computationally expensive schemes have become practicable thanks to programming methodologies that exploit massively parallel architectures such as Graphics Processing Units (GPUs). In this paper, we propose a hybrid CPU and GPU parallel implementation of Overcomplete Local Principal Component Analysis (OLPCA) for image denoising. We have implemented some computational tasks of the denoising procedure on the GPU using a strategy that combines shared-memory and global-memory mapping approaches, with the aim of optimizing the performance of OLPCA. The experimental results show improvements in GFlops and memory throughput, with a promising speedup over the CPU version, which encourages its use as a filter for noisy images in the computationally expensive application of Diffusion Weighted Imaging (DWI).

Local principal component analysis overcomplete method: A GPU parallel implementation combining shared and global memories / Cuomo, Salvatore; DE MICHELE, Pasquale; Galletti, Ardelio; Marcellino, Livia. - (2016), pp. 81-87. (Paper presented at the 14th International Conference on High Performance Computing and Simulation, HPCS 2016, held in aut in 2016) [10.1109/HPCSim.2016.7568319].

Local principal component analysis overcomplete method: A GPU parallel implementation combining shared and global memories

CUOMO, SALVATORE; DE MICHELE, PASQUALE; MARCELLINO, LIVIA
2016

ISBN: 9781509020881

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/647828
Citations
  • PMC: ND
  • Scopus: 3
  • ISI: 3