In this paper we explore the usefulness of prosodic features for syllable classification. To do this, we represent the syllable as a static analysis unit, so that its acoustic-temporal dynamics can be merged into a set of features that the SVM classifier considers as a whole. In the first part of our experiment we used MFCCs as classification features, obtaining a maximum accuracy of 86.66%. The second part of the study tests whether prosodic information is complementary to cepstral information for syllable classification. The results show that combining the two types of information does improve classification, but further analysis is needed to combine the two feature types more effectively.

Syllable classification using static matrices and prosodic features / Ludusan, B.; Origlia, Antonio; Cutugno, Francesco. - Electronic. - (2010), pp. 1-4. (Paper presented at the Speech Prosody 2010 conference, held in Chicago, 11-14 May 2010).

Syllable classification using static matrices and prosodic features

ORIGLIA, ANTONIO; CUTUGNO, FRANCESCO
2010

ISBN: 9780557519316
Files in this record:

3.txt (not available)
Type: Abstract
License: Private/restricted access
Size: 768 B
Format: Text

2.pdf (open access)
Type: Post-print document
License: Public domain
Size: 145.42 kB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/391232