Applying support vector regression for web effort estimation using a cross-company dataset

Corazza, Anna; Di Martino, Sergio
2009

Abstract

Support Vector Regression (SVR) belongs to a new generation of machine learning algorithms suited to predictive data modeling problems. The objective of this paper is to investigate the effectiveness of SVR for Web effort estimation, in particular when dealing with a cross-company dataset. To gain deeper insight into the method, we carried out an empirical study using four SVR kernels: linear, polynomial, Gaussian, and sigmoid. Moreover, we used two variable preprocessing strategies (normalization and logarithmic transformation) and two different dependent variables (effort and inverse effort). As a result, SVR was applied using six different configurations for each kernel. As for the dataset, we employed the Tukutuku database, which is widely adopted in Web effort estimation studies. A hold-out approach was adopted to evaluate the prediction accuracy of all the configurations, using two training sets, each containing data on 130 randomly selected projects, and two test sets, each containing the remaining 65 projects. As a benchmark, SVR-based predictions were also compared with predictions obtained using Manual StepWise Regression, Case-Based Reasoning, and Bayesian Networks. Our results suggest that SVR performed well: on the first hold-out, the linear kernel with a logarithmic transformation of variables provided significantly better prediction accuracy than all the other techniques, while on the second hold-out, the Gaussian kernel achieved significantly better predictions than all other techniques except Manual StepWise Regression.
ISBN: 9781424448425
Applying support vector regression for web effort estimation using a cross-company dataset / Corazza, Anna; Di Martino, Sergio; Ferrucci, F.; Gravino, C.; Mendes, E. - Print. - (2009), pp. 191-202. (Paper presented at the Empirical Software Engineering and Measurement conference, held in Lake Buena Vista, Florida, USA, in 2009) [10.1109/ESEM.2009.5315991].

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/365496