The corpus VoLIP (The Voice of LIP) is an Italian speech resource which associates the audio signals to the orthographic transcriptions of the LIP Corpus. The LIP Corpus was designed to represent diaphasic, diatopic and diamesic variation. The Corpus was collected in the early '90s to compile a frequency lexicon of spoken Italian and its size was tailored to produce a reliable frequency lexicon for the first 3,000 lemmas. Therefore, it consists of about 500,000 word tokens for 60 hours of recording. The speech materials belong to five different text registers and they were collected in four different cities. Thanks to a modern technological approach VoLIP web service allows users to search the LIP corpus using IMDI metadata, lexical or morpho-syntactic entry keys, receiving as result the audio portions aligned to the corresponding required entry. The VoLIP corpus is freely available at the URL http://www.parlaritaliano.it.

VOLIP: A corpus of spoken Italian and a virtuous example of reuse of linguistic resources / Alfano, I.; Cutugno, F.; De Rosa, A.; Iacobini, C.; Savy, R.; Voghera, M.. - (2014), pp. 3897-3901. ( 9th International Conference on Language Resources and Evaluation, LREC 2014 Harpa Concert Hall and Conference Center, isl 2014).

VOLIP: A corpus of spoken Italian and a virtuous example of reuse of linguistic resources

Alfano I.;Cutugno F.
;
Voghera M.
2014

Abstract

The corpus VoLIP (The Voice of LIP) is an Italian speech resource which associates the audio signals to the orthographic transcriptions of the LIP Corpus. The LIP Corpus was designed to represent diaphasic, diatopic and diamesic variation. The Corpus was collected in the early '90s to compile a frequency lexicon of spoken Italian and its size was tailored to produce a reliable frequency lexicon for the first 3,000 lemmas. Therefore, it consists of about 500,000 word tokens for 60 hours of recording. The speech materials belong to five different text registers and they were collected in four different cities. Thanks to a modern technological approach VoLIP web service allows users to search the LIP corpus using IMDI metadata, lexical or morpho-syntactic entry keys, receiving as result the audio portions aligned to the corresponding required entry. The VoLIP corpus is freely available at the URL http://www.parlaritaliano.it.
2014
VOLIP: A corpus of spoken Italian and a virtuous example of reuse of linguistic resources / Alfano, I.; Cutugno, F.; De Rosa, A.; Iacobini, C.; Savy, R.; Voghera, M.. - (2014), pp. 3897-3901. ( 9th International Conference on Language Resources and Evaluation, LREC 2014 Harpa Concert Hall and Conference Center, isl 2014).
File in questo prodotto:
File Dimensione Formato  
906_Paper.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Dominio pubblico
Dimensione 598.93 kB
Formato Adobe PDF
598.93 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/1009836
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? ND
social impact