The quality of datasets is a critical issue in big data mining. More interesting things could be found for datasets with higher quality. The existence of missing values in geographical data would worsen the quality of big datasets. To improve the data quality, the missing values are generally needed to be estimated using various machine learning algorithms or mathematical methods such as approximations and interpolations. In this paper, we propose an adaptive Radial Basis Function (RBF) interpolation algorithm for estimating missing values in geographical data. In the proposed method, the samples with known values are considered as the data points, while the samples with missing values are considered as the interpolated points. For each interpolated point, first, a local set of data points are adaptively determined. Then, the missing value of the interpolated point is imputed via interpolating using the RBF interpolation based on the local set of data points. Moreover, the shape factors of the RBF are also adaptively determined by considering the distribution of the local set of data points. To evaluate the performance of the proposed method, we compare our method with the commonly used k-Nearest Neighbor (kNN) interpolation and Adaptive Inverse Distance Weighted (AIDW) interpolation, and conduct three groups of benchmark experiments. Experimental results indicate that the proposed method outperforms the kNN interpolation and AIDW interpolation in terms of accuracy, but worse than the kNN interpolation and AIDW interpolation in terms of efficiency.

Adaptive RBF Interpolation for Estimating Missing Values in Geographical Data / Gao, K.; Mei, G.; Cuomo, S.; Piccialli, F.; Xu, N.. - 11973:(2020), pp. 122-130. [10.1007/978-3-030-39081-5_12]

Adaptive RBF Interpolation for Estimating Missing Values in Geographical Data

Cuomo S.
;
Piccialli F.;
2020

Abstract

The quality of datasets is a critical issue in big data mining. More interesting things could be found for datasets with higher quality. The existence of missing values in geographical data would worsen the quality of big datasets. To improve the data quality, the missing values are generally needed to be estimated using various machine learning algorithms or mathematical methods such as approximations and interpolations. In this paper, we propose an adaptive Radial Basis Function (RBF) interpolation algorithm for estimating missing values in geographical data. In the proposed method, the samples with known values are considered as the data points, while the samples with missing values are considered as the interpolated points. For each interpolated point, first, a local set of data points are adaptively determined. Then, the missing value of the interpolated point is imputed via interpolating using the RBF interpolation based on the local set of data points. Moreover, the shape factors of the RBF are also adaptively determined by considering the distribution of the local set of data points. To evaluate the performance of the proposed method, we compare our method with the commonly used k-Nearest Neighbor (kNN) interpolation and Adaptive Inverse Distance Weighted (AIDW) interpolation, and conduct three groups of benchmark experiments. Experimental results indicate that the proposed method outperforms the kNN interpolation and AIDW interpolation in terms of accuracy, but worse than the kNN interpolation and AIDW interpolation in terms of efficiency.
2020
978-3-030-39080-8
978-3-030-39081-5
Adaptive RBF Interpolation for Estimating Missing Values in Geographical Data / Gao, K.; Mei, G.; Cuomo, S.; Piccialli, F.; Xu, N.. - 11973:(2020), pp. 122-130. [10.1007/978-3-030-39081-5_12]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/806646
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 3
social impact