On Achieving Efficient Data Transfer for Graph Processing in Geo-Distributed Datacenters

1 ASCOLA - Aspect and Composition Languages, Inria Rennes – Bretagne Atlantique, LS2N - Laboratoire des Sciences du Numérique de Nantes
2 NUS - School of Computing, Singapore

Abstract: Graph partitioning is important for optimizing the performance and communication cost of large graph processing jobs. Recently, many graph applications, such as social networks, store their data in geo-distributed datacenters (DCs) to provide services worldwide with low latency. This poses new challenges for existing graph partitioning methods, due to the costly Wide Area Network (WAN) usage and the multiple levels of network heterogeneity in geo-distributed DCs. In this paper, we propose a geo-aware graph partitioning method named G-Cut, which aims to minimize the inter-DC data transfer time of graph processing jobs in geo-distributed DCs while satisfying the WAN usage budget. G-Cut adopts two novel optimization phases, which address the two challenges of WAN usage and network heterogeneity separately. G-Cut can also be applied to partition dynamic graphs thanks to its lightweight runtime overhead. We evaluate the effectiveness and efficiency of G-Cut using real-world graphs, both on real geo-distributed DCs and in simulations. Evaluation results show that, compared to state-of-the-art graph partitioning methods, G-Cut can reduce the inter-DC data transfer time by up to 58% and the WAN usage by up to 70%, while incurring a low runtime overhead.

Keywords: Graph partitioning, Heterogeneous network, Geo-distributed datacenters

Authors: Amelie Zhou, Shadi Ibrahim, Bingsheng He

Source: https://hal.archives-ouvertes.fr/

