Open information extraction from real internet texts in spanish using constraints over part-of- speech sequences: problems of the method, their causes, and ways for improvement Reportar como inadecuado




Open information extraction from real internet texts in spanish using constraints over part-of- speech sequences: problems of the method, their causes, and ways for improvement - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

Alexander Gelbukh ;Revista Signos 2016, 49 90

Autor: Alisa Zhila

Fuente: http://www.redalyc.org/


Introducción



Revista Signos ISSN: 0035-0451 revista.signos@ucv.cl Pontificia Universidad Católica de Valparaíso Chile Zhila, Alisa; Gelbukh, Alexander Open Information Extraction from real Internet texts in Spanish using constraints over part -of- speech sequences: Problems of the method, their causes, and ways for improvement Revista Signos, vol.
49, núm.
90, marzo, 2016, pp.
119-142 Pontificia Universidad Católica de Valparaíso Valparaíso, Chile Available in: http:--www.redalyc.org-articulo.oa?id=157044553006 How to cite Complete issue More information about this article Journals homepage in redalyc.org Scientific Information System Network of Scientific Journals from Latin America, the Caribbean, Spain and Portugal Non-profit academic project, developed under the open access initiative R evista Signos.
Estudios de Lingüística ISSN 0718-0934 © 2016 PUCV, Chile • DOI: 10.4067-S0718-09342016000100006 • 49(90) 119-142 Open Information Extraction from real Internet texts in Spanish using constraints over part-ofspeech sequences: Problems of the method, their causes, and ways for improvement Extracción abierta de información a partir de textos de Internet en español utilizando reglas sobre categorías de palabras en secuencias: Problemas del método, sus causas y posibles mejoras Alisa Zhila Alexander Gelbukh Centro de investigación en Computación I nstituto Politécnico Nacional M éxico alisa.zhila@gmail.com Centro de investigación en Computación I nstituto Politécnico Nacional M éxico gelbukh@gelbukh.com Recibido: 17-III-2013 -Aceptado: 16-VI-2015 Abstract Usually we do not know the domain of an arbitrary text from the Internet, or the semantics of the relations it conveys.
While humans identify such information easily, for a computer this task is far from straightforward.
The task of detecting relations of arbitrary semantic type in texts is known as Open Information Extraction (Open IE).
The approach to this task based on heuristic constrai...





Documentos relacionados