In the age of Semantic Web, one of the most valuable challenges is the one connected with the information extraction from raw data. Information must be managed with sophisticated linguistic and computational architectures, which are able to approach the semantic dimension of words and sentences. In this paper we propose a morphosemantic method for the automatic creation and population of medical lexical resources. Our approach is grounded on a list of neoclassical formative elements pertaining to the medical domain an on a large sized corpus of medical diagnoses. The outcomes of this work are automatically built electronic dictionaries and thesauri and an annotated corpus for the NLP in the medical domain. © 2014 IEEE.
Automatic population of Italian medical thesauri: A morphosemantic approach / Amato, Flora; Elia, A.; Maisto, A.; Mazzeo, A.; Pelosi, S.. - (2014), pp. 432-436. (Intervento presentato al convegno 9th International Conference on P2P, Parallel, Grid, Cloud and Internet Computing, 3PGCIC 2014; Guangzhou, Guangdong; China; 8 November 2014 through 10 November 2014; Category numberE5391; Code 110463) [10.1109/3PGCIC.2014.89].
Automatic population of Italian medical thesauri: A morphosemantic approach
AMATO, FLORA;
2014
Abstract
In the age of Semantic Web, one of the most valuable challenges is the one connected with the information extraction from raw data. Information must be managed with sophisticated linguistic and computational architectures, which are able to approach the semantic dimension of words and sentences. In this paper we propose a morphosemantic method for the automatic creation and population of medical lexical resources. Our approach is grounded on a list of neoclassical formative elements pertaining to the medical domain an on a large sized corpus of medical diagnoses. The outcomes of this work are automatically built electronic dictionaries and thesauri and an annotated corpus for the NLP in the medical domain. © 2014 IEEE.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.