Although in the last decade several fact-checking organizations have emerged to verify misinformation, fake news has continued to proliferate, especially through social media platforms. Even though adopting improved detection strategies is of utmost importance, the fact-checking process could be optimized by verifying whether a claim has been previously fact-checked. Despite some ad-hoc information retrieval approaches having been recently proposed, the utility of modern (neural) retrieval systems have not been investigated yet. In this paper, we consider the standard two-phases retriever-reranker architecture and benchmark different state-of-the-art techniques from the information retrieval and Q&A literature. We design several experiments on a real-world Twitter dataset to analyze the efficiency and the effectiveness of the benchmark approaches. Our results show that combining standard and neural approaches is the most promising research direction to improve retrievers performance and that complex (neural) rerankers might still be efficient in practice since there is no need to process a high number of documents to improve ranking performance.

Information retrieval algorithms and neural ranking models to detect previously fact-checked information / Chakraborty, Tanmoy; LA GATTA, Valerio; Moscato, Vincenzo; Sperli', Giancarlo. - In: NEUROCOMPUTING. - ISSN 0925-2312. - 557:(2023). [10.1016/j.neucom.2023.126680]

Information retrieval algorithms and neural ranking models to detect previously fact-checked information

Valerio La Gatta;Vincenzo Moscato;Giancarlo Sperli'
2023

Abstract

Although in the last decade several fact-checking organizations have emerged to verify misinformation, fake news has continued to proliferate, especially through social media platforms. Even though adopting improved detection strategies is of utmost importance, the fact-checking process could be optimized by verifying whether a claim has been previously fact-checked. Despite some ad-hoc information retrieval approaches having been recently proposed, the utility of modern (neural) retrieval systems have not been investigated yet. In this paper, we consider the standard two-phases retriever-reranker architecture and benchmark different state-of-the-art techniques from the information retrieval and Q&A literature. We design several experiments on a real-world Twitter dataset to analyze the efficiency and the effectiveness of the benchmark approaches. Our results show that combining standard and neural approaches is the most promising research direction to improve retrievers performance and that complex (neural) rerankers might still be efficient in practice since there is no need to process a high number of documents to improve ranking performance.
2023
Information retrieval algorithms and neural ranking models to detect previously fact-checked information / Chakraborty, Tanmoy; LA GATTA, Valerio; Moscato, Vincenzo; Sperli', Giancarlo. - In: NEUROCOMPUTING. - ISSN 0925-2312. - 557:(2023). [10.1016/j.neucom.2023.126680]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0925231223008032-main.pdf

accesso aperto

Licenza: Creative commons
Dimensione 782.27 kB
Formato Adobe PDF
782.27 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/952843
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 8
  • ???jsp.display-item.citation.isi??? 5
social impact