Information retrieval system using multiwords expressions mwe as descriptors Reportar como inadecuado




Information retrieval system using multiwords expressions mwe as descriptors - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

Renato Rocha Souza ;JISTEM: Journal of Information Systems and Technology Management 2012, 9 2

Autor: Edson Marchetti da Silva

Fuente: http://www.redalyc.org/


Introducción



JISTEM: Journal of Information Systems and Technology Management E-ISSN: 1807-1775 tecsi@usp.br Universidade de São Paulo Brasil Marchetti da Silva, Edson; Rocha Souza, Renato INFORMATION RETRIEVAL SYSTEM USING MULTIWORDS EXPRESSIONS (MWE) AS DESCRIPTORS JISTEM: Journal of Information Systems and Technology Management, vol.
9, núm.
2, mayo-agosto, 2012, pp.
213-234 Universidade de São Paulo São Paulo, Brasil Available in: http:--www.redalyc.org-articulo.oa?id=203223859003 How to cite Complete issue More information about this article Journals homepage in redalyc.org Scientific Information System Network of Scientific Journals from Latin America, the Caribbean, Spain and Portugal Non-profit academic project, developed under the open access initiative JISTEM - Journal of Information Systems and Technology Management Revista de Gestão da Tecnologia e Sistemas de Informação Vol.
9, No.
2, May-Aug.
2012, pp.213-234 ISSN online: 1807-1775 DOI: 10.4301-S1807-17752012000200002 INFORMATION RETRIEVAL SYSTEM USING MULTIWORDS EXPRESSIONS (MWE) AS DESCRIPTORS Edson Marchetti da Silva Federal University of Minas Gerais, MG, Brazil Renato Rocha Souza Getúlio Vargas Foundation - FGV, RJ, Brazil _______________________________________________________________ ABSTRACT This paper aims to propose an alternative method for retrieving documents using Multiwords Expressions (MWE) extracted from a document base to be used as descriptors in search of an Information Retrieval System (IRS).
In this sense, unlike methods that consider the text as a set of words, bag of words, we propose a method that takes into account the characteristics of the physical structure of the document in the extraction process of MWE.
From this set of terms comparing pre-processed using an exhaustive algorithmic technique proposed by the authors with the results obtained for thirteen different measures of association statistics generated by the software Ngram Statistics Package (NSP).
To perform th...





Documentos relacionados