A variable selection procedure based on predictive ability: a preliminary study on logistic regression

Mariarosaria Coppola; Rosaria Simone
2023

Abstract

This contribution is a pilot study on the development of a variable selection technique that combines fitting and prediction performance. As a first step towards this goal, we consider a forward selection algorithm for binary classification via logistic regression. At each step, the algorithm selects, among the covariates that are significantly associated with the outcome, the predictor that yields the largest statistically significant increment in AUC (area under the ROC curve) with respect to the previous model. For illustration, we present examples concerning the search for the most predictive factors of the propensity for pension planning, health insurance subscription, and the use of Financial Technology services such as home banking, using data from the Survey on Household Income and Wealth 2020 run by the Bank of Italy. Concluding remarks on further developments are provided.
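The forward step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several choices are assumptions made only for illustration: the likelihood-ratio test used to screen covariates, the fixed `min_gain` threshold (a stand-in for a formal significance test on the AUC increment), the in-sample AUC, the gradient-ascent fit, and the synthetic data. All function and parameter names are hypothetical.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_logistic(design, y, steps=200, lr=1.0):
    """Fit by batch gradient ascent; returns (fitted probabilities, log-likelihood).
    Assumes the first design column is the intercept and features are roughly standardized."""
    n, k = len(y), len(design[0])
    w = [0.0] * k
    for _ in range(steps):
        grad = [0.0] * k
        for row, yi in zip(design, y):
            err = yi - sigmoid(sum(wj * xj for wj, xj in zip(w, row)))
            for j, xj in enumerate(row):
                grad[j] += err * xj
        w = [wj + lr * g / n for wj, g in zip(w, grad)]
    probs = [sigmoid(sum(wj * xj for wj, xj in zip(w, row))) for row in design]
    eps = 1e-12  # guards against log(0)
    loglik = sum(yi * math.log(p + eps) + (1 - yi) * math.log(1 - p + eps)
                 for p, yi in zip(probs, y))
    return probs, loglik


def auc(scores, y):
    """AUC as the probability that a positive case outranks a negative one (ties count 1/2)."""
    pos = [s for s, yi in zip(scores, y) if yi == 1]
    neg = [s for s, yi in zip(scores, y) if yi == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def forward_select(data, y, alpha=0.05, min_gain=0.01):
    """Greedy forward selection driven by the AUC increment (illustrative only)."""
    selected, remaining = [], list(range(len(data[0])))
    probs, ll = fit_logistic([[1.0] for _ in data], y)  # intercept-only model
    best_auc = auc(probs, y)
    while remaining:
        best = None
        for j in remaining:
            design = [[1.0] + [row[c] for c in selected + [j]] for row in data]
            p_j, ll_j = fit_logistic(design, y)
            # Assumed screen: likelihood-ratio test with 1 degree of freedom.
            lr_stat = max(0.0, 2.0 * (ll_j - ll))
            p_value = math.erfc(math.sqrt(lr_stat / 2.0))
            if p_value >= alpha:
                continue
            # Assumed stand-in for a significance test on the AUC increment.
            gain = auc(p_j, y) - best_auc
            if gain > min_gain and (best is None or gain > best[0]):
                best = (gain, j, ll_j)
        if best is None:
            break
        gain, j, ll = best
        selected.append(j)
        remaining.remove(j)
        best_auc += gain
    return selected, best_auc


# Synthetic data (not the survey data): columns 0 and 1 carry signal, 2 and 3 are noise.
random.seed(0)
data = [[random.gauss(0, 1) for _ in range(4)] for _ in range(300)]
y = [1 if random.random() < sigmoid(1.5 * r[0] + 0.8 * r[1]) else 0 for r in data]
selected, final_auc = forward_select(data, y)
print("selected columns:", selected, "in-sample AUC:", round(final_auc, 3))
```

In practice one would likely replace the fixed threshold with a proper test for the difference between two correlated AUCs and evaluate the AUC out of sample; the sketch only conveys the structure of the selection loop.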
ISBN: 9788891935618
A variable selection procedure based on predictive ability: a preliminary study on logistic regression / Coppola, Mariarosaria; Simone, Rosaria. - (2023), pp. 1285-1290. (Paper presented at the conference SIS 2023 - Statistical Learning, Sustainability and Impact Evaluation - SEAS IN, held in Ancona, Italy, 21-23 June 2023).

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/949665