Efficient and Privacy-Preserving k-Means Clustering for Big Data Mining

1 Laboratoire de Méthodes de Conception de Systèmes (LMCS)
2 CERIST - Centre de Recherche sur l'Information Scientifique et Technique

Abstract: Recent advances in sensing and storage technologies have ushered in the big data age, in which huge amounts of data are distributed across sites to be stored and analysed. Cluster analysis is a data mining task that aims to discover patterns and knowledge through algorithmic techniques such as k-means. However, running k-means over distributed big data stores raises serious privacy issues. Many proposed works have attempted to address this concern using cryptographic protocols, but these solutions degrade the performance of analysis tasks to a degree that does not meet the requirements of big data. In this work, we propose a novel privacy-preserving k-means algorithm based on a simple yet secure and efficient multi-party additive scheme that is cryptography-free. We designed our solution for horizontally partitioned data. Moreover, we demonstrate that our scheme resists adversaries in the passive adversary model.

Keywords: efficiency, horizontally partitioned data, k-means clustering, privacy
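
The abstract does not spell out the additive scheme, so the following is only a minimal sketch of the general idea it names: a k-means centroid update over horizontally partitioned data in which parties exchange random additive shares instead of raw rows. It uses a textbook additive secret-sharing sum, which may differ from the authors' actual protocol. The names `MODULUS`, `SCALE`, `secure_sum`, and `update_centroids` are hypothetical, the parties are simulated in a single process, and the sketch assumes passive (honest-but-curious) parties with private pairwise channels.

```python
import secrets

MODULUS = 2**61 - 1   # hypothetical modulus for share arithmetic
SCALE = 10**6         # hypothetical fixed-point scale for real-valued features


def encode(x):
    """Map a real number to a fixed-point residue modulo MODULUS."""
    return round(x * SCALE) % MODULUS


def decode(v):
    """Map a residue back to a real number, treating large residues as negatives."""
    if v > MODULUS // 2:
        v -= MODULUS
    return v / SCALE


def make_shares(value, n_parties):
    """Split one residue into n_parties random shares that sum to it mod MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(n_parties - 1)]
    shares.append((value - sum(shares)) % MODULUS)
    return shares


def secure_sum(private_values):
    """Return the sum of the parties' private values using only shares."""
    n = len(private_values)
    # Row i holds the shares party i hands out; column j is what party j receives.
    distributed = [make_shares(encode(v), n) for v in private_values]
    # Each party publishes only the sum of the shares it received.
    partials = [sum(distributed[i][j] for i in range(n)) % MODULUS for j in range(n)]
    return decode(sum(partials) % MODULUS)


def assign(point, centroids):
    """Index of the nearest centroid by squared Euclidean distance."""
    dists = [sum((p - c) ** 2 for p, c in zip(point, centroid)) for centroid in centroids]
    return dists.index(min(dists))


def local_stats(points, centroids):
    """Per-cluster coordinate sums and counts for one party's own rows."""
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for point in points:
        k = assign(point, centroids)
        counts[k] += 1
        for d in range(dim):
            sums[k][d] += point[d]
    return sums, counts


def update_centroids(parties, centroids):
    """One k-means update in which only secure sums cross party boundaries."""
    stats = [local_stats(points, centroids) for points in parties]
    new_centroids = []
    for k, old in enumerate(centroids):
        total = secure_sum([counts[k] for _, counts in stats])
        if total == 0:
            new_centroids.append(list(old))  # keep an empty cluster's centroid
            continue
        new_centroids.append([
            secure_sum([sums[k][d] for sums, _ in stats]) / total
            for d in range(len(old))
        ])
    return new_centroids
```

In this sketch each party sees only uniformly random shares from the others plus the published partial sums, so individual rows and per-party statistics are not exchanged directly. The global per-cluster sums and counts are still revealed to everyone, and the sketch makes no claim about collusion or active attackers.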

Authors: Zakaria Gheid, Yacine Challal

Source: https://hal.archives-ouvertes.fr/

