Combinaison de modèles de langage pour lidentification de thèmesReportar como inadecuado




Combinaison de modèles de langage pour lidentification de thèmes - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

1 LIA - Laboratoire Informatique d-Avignon

Abstract : A new statistical method for Language Modeling and spoken document classification is proposed. It is based on a mixture of topic dependent probabilities. Each topic dependent probability is in turn a mixture of n-gram probabilities and the probability of Kullback-Lieber KL distances between key-word unigrams and distribution obtained from the content of a cache memory. Experimental result on topic classification using a corpus of 60 Mwords from the French newspaper Le Monde show the excellent performance of the cache memory and its complementary role in providing different statistics for the decision process.

Keywords : Topic identification





Autor: Brigitte Bigi - Renato De Mori - Marc El Bèze - Thierry Spriet -

Fuente: https://hal.archives-ouvertes.fr/



DESCARGAR PDF




Documentos relacionados