Acoustic and language model adaptation in a voice interactive system for elderly people





1 Aalto University
2 TIPIC-SAMOVAR - Traitement de l'Information Pour Images et Communications, SAMOVAR - Services répartis, Architectures, MOdélisation, Validation, Administration des Réseaux
3 EPH - Département Electronique et Physique
4 SAMOVAR - Services répartis, Architectures, MOdélisation, Validation, Administration des Réseaux
5 TSI - Département Traitement du Signal et des Images
6 LTCI - Laboratoire Traitement et Communication de l'Information

Abstract: Automatic Speech Recognition (ASR) systems can perform better if they are trained for a specific application. However, because training models requires a large amount of data, it is not feasible to train such a system from scratch for each deployment; adaptation can instead make the ASR system more appropriate for its final use. In this work we address adaptation to the vocal characteristics of the speaker, to environmental noise, and of the language model. Acoustic model adaptation is done with Speaker Adaptive Training (SAT), linear Vocal Tract Length Normalization (lVTLN), and constrained Maximum Likelihood Linear Regression (cMLLR). Interpolation is applied for language model adaptation. Using cMLLR gave a relative word error rate (WER) reduction of 9.44%, and the perplexity of the language model improved by 14.47% relative.
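To make the language model part concrete, the following is a minimal sketch of interpolation and of the two relative measures quoted in the abstract. It assumes plain linear interpolation of two toy unigram models; the abstract does not say which interpolation scheme, model order, or weights the authors used, so the function names, the weight `lam`, and all probabilities below are hypothetical.

```python
import math


def interpolate(p_general, p_domain, lam):
    """Linearly interpolate two unigram models (assumed scheme, not the paper's).

    lam is the weight given to the in-domain model; 1 - lam goes to the
    general model.
    """
    vocab = set(p_general) | set(p_domain)
    return {
        w: lam * p_domain.get(w, 0.0) + (1.0 - lam) * p_general.get(w, 0.0)
        for w in vocab
    }


def perplexity(model, words):
    """Perplexity of a unigram model on a word sequence."""
    log_sum = sum(math.log(model[w]) for w in words)
    return math.exp(-log_sum / len(words))


def relative_reduction(baseline, adapted):
    """Relative reduction in percent, as used for WER and perplexity figures."""
    return 100.0 * (baseline - adapted) / baseline


# Hypothetical toy distributions, for illustration only.
p_general = {"the": 0.5, "pill": 0.1, "call": 0.4}
p_domain = {"the": 0.3, "pill": 0.4, "call": 0.3}
held_out = ["the", "pill", "call", "pill"]

p_adapted = interpolate(p_general, p_domain, lam=0.5)
ppl_general = perplexity(p_general, held_out)
ppl_adapted = perplexity(p_adapted, held_out)
print(relative_reduction(ppl_general, ppl_adapted))
```

In practice the interpolation weight would be tuned on held-out in-domain text, and the same `relative_reduction` formula applies to a baseline and an adapted WER.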

Keywords: Automatic speech recognition; Speaker model adaptation; Vocal interactive system for Elderly; Language model adaptation





Authors: Saeideh Mirzaei, Jerome Boudy, Pierrick Milhorat, Gérard Chollet, Mikko Kurimo

Source: https://hal.archives-ouvertes.fr/






