Minimizing Inter-Server Communications by Exploiting Self-Similarity in Online Social Networks
Author: | Hanhua Chen, Hai Jin, Shaoliang Wu |
---|---|
Year of publication: | 2016 |
Subject: |
020203 distributed computing;
Computer science; business.industry; Distributed computing; 020206 networking & telecommunications; 02 engineering and technology; Load balancing (computing); Partition (database); Computational Theory and Mathematics; Hardware and Architecture; Server; Signal Processing; Scalability; 0202 electrical engineering, electronic engineering, information engineering; The Internet; Data center; business; Inter-server; Computer network |
Source: | IEEE Transactions on Parallel and Distributed Systems. 27:1116-1130 |
ISSN: | 1045-9219 |
DOI: | 10.1109/tpds.2015.2427155 |
Description: | Efficiently operating on relevant data for users in large-scale online social network (OSN) systems is a challenging problem. Storage systems used by popular OSNs often rely on key-value stores, where randomly partitioning users' data among servers across data centers is the de facto standard. Although the random partition scheme, built on DHTs, scales well to a large number of users, it leads to costly inter-server communications across data centers because of the complex interconnections and interactions between OSN users. In this paper, we explore how to reduce inter-server communications while retaining the simple and robust nature of OSNs. We propose a data placement solution atop OSN systems that divides users among servers according to an interaction-locality-based structure. Our approach exploits a simple yet powerful principle of OSN interactions, self-similarity, which reveals that the inter-server communication cost is minimized under such an intrinsic structure. Our algorithm avoids a significant amount of inter-server traffic and also achieves load balance among servers across the data centers. We demonstrate the existence of self-similarity in large-scale Facebook traces covering 10 million Facebook users and 24 million interaction events. We conduct comprehensive trace-driven simulations to evaluate this design. Results show that our scheme significantly reduces the traffic and latency of OSN systems compared to existing schemes. |
Database: | OpenAIRE |
External link: |
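
The contrast the abstract draws between random (hash-based) partitioning and interaction-locality-based placement can be illustrated with a toy cost count. The following is a minimal sketch, not the paper's algorithm: the helper names (`hash_server`, `inter_server_cost`, `server_loads`), the eight-user trace, and the assumption that interaction groups are already known are all hypothetical and chosen only for illustration.

```python
import hashlib


def hash_server(user_id: int, num_servers: int) -> int:
    """Hash-based placement, standing in for the random partitioning of a key-value store."""
    digest = hashlib.md5(str(user_id).encode()).digest()
    return int.from_bytes(digest[:8], "big") % num_servers


def inter_server_cost(interactions, placement) -> int:
    """Count interaction events whose two endpoints are stored on different servers."""
    return sum(1 for u, v in interactions if placement[u] != placement[v])


def server_loads(placement, num_servers: int) -> list:
    """Number of users stored on each server."""
    loads = [0] * num_servers
    for server in placement.values():
        loads[server] += 1
    return loads


# Hypothetical toy trace: two tightly interacting groups of four users,
# plus a single interaction that crosses the two groups.
groups = [[0, 1, 2, 3], [4, 5, 6, 7]]
interactions = [(u, v) for g in groups for u in g for v in g if u < v] + [(3, 4)]

num_servers = 2
users = [u for g in groups for u in g]

# Random partitioning ignores who interacts with whom.
random_placement = {u: hash_server(u, num_servers) for u in users}

# Locality-aware placement (assumed here: groups are given) keeps each group together.
locality_placement = {u: i for i, g in enumerate(groups) for u in g}

print("random   cost:", inter_server_cost(interactions, random_placement),
      "loads:", server_loads(random_placement, num_servers))
print("locality cost:", inter_server_cost(interactions, locality_placement),
      "loads:", server_loads(locality_placement, num_servers))
```

In this toy setting, the locality-aware placement leaves only the single cross-group interaction as inter-server traffic while keeping both servers equally loaded. How such groups are discovered from real traces is the subject of the paper and is not reproduced here.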