Big Data paradigm is leading both research and industry effort calling for new approaches in many computer science areas. In this paper, we show how semantic similarity search for natural language texts can be leveraged in biomedical domain by Word Embedding models obtained by word2vec algorithm, exploiting a specifically developed Big Data architecture. We tested our approach using a dataset extracted from the whole PubMed library. Moreover, we describe a user friendly web front-end able to show the usability of this methodology on a real context that allowed us to learn some useful lessons about this peculiar kind of data.

Some lessons learned using health data literature for smart information retrieval / Ciampi, M.; De Pietro, G.; Masciari, E.; Silvestri, S.. - (2020), pp. 931-934. (Intervento presentato al convegno 35th Annual ACM Symposium on Applied Computing, SAC 2020 tenutosi a cze nel 2020) [10.1145/3341105.3374128].

Some lessons learned using health data literature for smart information retrieval

De Pietro G.;Masciari E.
;
2020

Abstract

Big Data paradigm is leading both research and industry effort calling for new approaches in many computer science areas. In this paper, we show how semantic similarity search for natural language texts can be leveraged in biomedical domain by Word Embedding models obtained by word2vec algorithm, exploiting a specifically developed Big Data architecture. We tested our approach using a dataset extracted from the whole PubMed library. Moreover, we describe a user friendly web front-end able to show the usability of this methodology on a real context that allowed us to learn some useful lessons about this peculiar kind of data.
2020
9781450368667
Some lessons learned using health data literature for smart information retrieval / Ciampi, M.; De Pietro, G.; Masciari, E.; Silvestri, S.. - (2020), pp. 931-934. (Intervento presentato al convegno 35th Annual ACM Symposium on Applied Computing, SAC 2020 tenutosi a cze nel 2020) [10.1145/3341105.3374128].
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/806758
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
social impact