Proceedings of the 2nd workshop on Multi-source, Multilingual Information Extraction and SummarizationReportar como inadecuado

Proceedings of the 2nd workshop on Multi-source, Multilingual Information Extraction and Summarization - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

1 LIPN - Laboratoire d-Informatique de Paris-Nord 2 Computer Science Lab. 3 Computer Science Dpt 4 University of Helsinki Helsinki

Abstract : Information extraction IE and text summarization TS are key technologies aiming at extracting relevant information from texts and presenting the information to the user in a condensed form. The ongoing information explosion makes IE and TS particularly critical for successful functioning within the information society. These technologies, however, face new challenges with the adoption of the Web 2.0 paradigm e.g., blogs, wikis due to their inherent multi-source nature. These technologies must no longer deal only with isolated texts or narratives, but with large-scale repositories or sources—possibly in several languages—containing a multiplicity of views, opinions, and commentaries on particular topics, entities and events. There is thus a need to adapt and-or develop new techniques to deal with these new phenomena. Recognising similar information across different sources and-or in different languages is of paramount importance in this multi-source, multi-lingual context. In information extraction, merging information from multiple sources can lead to increased accuracy, as compared to extraction from a single source. In text summarization, similar facts found across sources can inform sentence scoring algorithms. In question answering, the distribution of answers in similar contexts can inform answer-ranking components. Often, it is not the similarity of information that matters, but its complementary nature. In a multi-lingual context, information extraction and text summarization can provide solutions for crosslingual access: key pieces of information can be extracted from different texts in one or many languages, merged, and then conveyed in natural language in concise form. Applications need to be able to cope with the idiosyncratic nature of the new Web 2.0 media: mixed input, new jargon, ungrammatical and mixed-language input, emotional discourse, etc. In this context, synthesizing or inferring opinions from multiple sources is a new and exciting challenge for NLP. On another level, profiling of individuals who engage in the new social Web, and identifying whether a particular opinion is appropriate-relevant in a given context are important topics to be addressed. The objective of this second Multi-source Multilingual Information Extraction and Summarization MMIES workshop is to bring together researchers and practitioners in information-access technologies, to discuss recent approaches for dealing with multi-source and multi-lingual challenges.

Keywords : Information Extraction Automatic Summarization

Autor: Thierry Poibeau - Sivaji Bandyopadhyay - Horacio Saggion - Roman Yangarber -



Documentos relacionados