Documents' summarization techniques automatically extract relevant information from different sources with respect to a list of topics: they can be profitably used by a variety of applications and in particular for automatic indexing and categorization in order to facilitate the production and delivery of new multimedia contents. In this paper we propose a novel approach for summarizing documents retrieved from the Internet: we propose to capture the semantic nature of a document, expressed in natural language, in order to retrieve a number of RDF triplets and to clusterize these ones aggregating similar information. An overview of the system and some preliminary results are described.
Semantic Summarization of Web Documents / A., D'Acierno; Moscato, Vincenzo; A., Penta; F., Persia; Picariello, Antonio. - ELETTRONICO. - (2010), pp. 430-435. (Intervento presentato al convegno 4th IEEE International Conference on Semantic Computing (ICSC 2010) tenutosi a Pittsburgh, PA, USA nel September 22-24, 2010) [10.1109/ICSC.2010.28].
Semantic Summarization of Web Documents
MOSCATO, VINCENZO;PICARIELLO, ANTONIO
2010
Abstract
Documents' summarization techniques automatically extract relevant information from different sources with respect to a list of topics: they can be profitably used by a variety of applications and in particular for automatic indexing and categorization in order to facilitate the production and delivery of new multimedia contents. In this paper we propose a novel approach for summarizing documents retrieved from the Internet: we propose to capture the semantic nature of a document, expressed in natural language, in order to retrieve a number of RDF triplets and to clusterize these ones aggregating similar information. An overview of the system and some preliminary results are described.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.