Supervised Machine Learning Techniques to Detect TimeML Events in French and EnglishReportar como inadecuado

Supervised Machine Learning Techniques to Detect TimeML Events in French and English - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

1 TEXMEX - Multimedia content-based indexing IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique 2 LinkMedia - Creating and exploiting explicit links between multimedia fragments IRISA-D6 - MEDIA ET INTERACTIONS, Inria Rennes – Bretagne Atlantique 3 LIMSI - Laboratoire d-Informatique pour la Mécanique et les Sciences de l-Ingénieur

Abstract : Identifying events from texts is an information extraction task necessary for many NLP applications. Through the TimeML specifications and TempEval challenges, it has received some attention in the last years; yet, no reference result is available for French. In this paper, we try to fill this gap by proposing several event extraction systems, combining for instance Conditional Random Fields, language modeling and k-nearest-neighbors. These systems are evaluated on French corpora and compared with state-of-the-art methods on English. The very good results obtained on both languages validate our whole approach.

Keywords : Event identification information extraction TimeML Tem-pEval CRF language modeling English French

Autor: Béatrice Arnulphy - Vincent Claveau - Xavier Tannier - Anne Vilnat -



Documentos relacionados