Semantic summarization of news from heterogeneous sources

Amato, F.; Colace, F.; Moscato, V.; Picariello, A.
2017

Abstract

Summarization techniques are becoming an essential part of everyday life, largely because summaries let users reach the information they need in less time. In this paper, we present a general framework for retrieving relevant information from news articles and a novel summarization algorithm based on a deep semantic analysis of texts. In particular, a set of (subject, predicate, object) triples is extracted from each document and then used to build a summary through an unsupervised clustering algorithm that exploits the notion of semantic similarity. Finally, we use the cluster centroids, together with some heuristics, to determine the most significant summary sentences. Experiments carried out using the standard DUC methodology and the ROUGE software show that the proposed method outperforms several summarization systems in terms of recall and readability. © Springer International Publishing AG 2017.
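The pipeline described in the abstract (triples, clustering by similarity, centroid-based sentence selection) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' method: the abstract does not specify the similarity measure, the clustering algorithm, or the selection heuristics. Here a token-overlap (Jaccard) score stands in for semantic similarity, a greedy single-pass grouping stands in for the clustering step, a medoid stands in for the centroid, and each sentence is assumed to yield exactly one triple. All function names are hypothetical.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def similarity(a: Triple, b: Triple) -> float:
    """Toy stand-in for semantic similarity: Jaccard overlap of lowercase tokens."""
    ta = set(" ".join(a).lower().split())
    tb = set(" ".join(b).lower().split())
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def cluster(triples: List[Triple], threshold: float = 0.3) -> List[List[int]]:
    """Greedy single-pass grouping: a triple joins the first cluster whose
    seed (first member) is at least `threshold` similar, else starts a new one."""
    clusters: List[List[int]] = []
    for i, t in enumerate(triples):
        for members in clusters:
            if similarity(t, triples[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def medoid(members: List[int], triples: List[Triple]) -> int:
    """Index of the member with the highest total similarity to its cluster
    (used here in place of a centroid, since triples are not vectors)."""
    return max(
        members,
        key=lambda i: sum(similarity(triples[i], triples[j]) for j in members),
    )


def summarize(sentences: List[str], triples: List[Triple],
              max_sentences: int = 2) -> List[str]:
    """Assumes triples[i] was extracted from sentences[i]. Picks the sentence
    behind each cluster's medoid, largest clusters first, in document order."""
    clusters = sorted(cluster(triples), key=len, reverse=True)
    chosen = [medoid(members, triples) for members in clusters[:max_sentences]]
    return [sentences[i] for i in sorted(chosen)]
```

Called as `summarize(sentences, triples)`, with one hand-written triple per sentence, the sketch returns one representative sentence per cluster, so near-duplicate statements collapse into a single summary sentence. A real system would replace `similarity` with a genuinely semantic measure and obtain the triples from a parser.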
ISBN: 978-3-319-49109-7
ISBN: 978-3-319-49108-0
Semantic summarization of news from heterogeneous sources / Amato, F.; D’Acierno, A.; Colace, F.; Moscato, V.; Penta, A.; Picariello, A. - 1:(2017), pp. 305-314. (Paper presented at the International Conference on P2P, Parallel, Grid, Cloud and Internet Computing (3PGCIC 2017), held at Soonchunhyang University, Asan, Korea, November 5–7, 2016) [10.1007/978-3-319-49109-7_29].

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/822325