VITAL: Structured and clustered super-peer network for similarity search
Autor: | Sahar M. Ghanem, Mohamed A. Ismail, Samia G. Omar |
---|---|
Rok vydání: | 2014 |
Předmět: | |
Zdroj: | Peer-to-Peer Networking and Applications. 8:965-991 |
ISSN: | 1936-6450 1936-6442 |
DOI: | 10.1007/s12083-014-0304-0 |
Popis: | VITAL is a novel P2P indexing structure that provides on top of exact search a similarity search of multidimensional vectors. It is designed to scale to millions of peers and billions of shared documents and to adapt to high network dynamics. To exploit peer heterogeneity, VITAL is a super-peer (SP) network where every peer is an SP candidate and a simple election protocol is run to select SPs. On the other hand, every SP locally monitors its “vital” signs of memory, processing, and bandwidth and initiates the SP election protocol based on its capacity and load limits. In addition, the SP overlay is structured as CAN distributed hash table to guarantee both the correctness and responsiveness of the query protocol. A novel data replication model is introduced, where every peer clusters its shared documents to local clusters (LCs) and each LC summary is published at few SPs to achieve content-based clustering and firework query propagation. Every peer establishes TCP connections with many SPs that maintain its LC summaries. VITAL has no central component and does not require global knowledge, however it requires identifying a set of global cluster (GC) centriods to be disjointly managed by the elected SPs. In addition, CAN zone overloading is seamlessly applied to relief overwhelmed SPs and it provided an extra layer of physical proximity clustering. The scalability analysis shows that peer index requires less than 3 % of extra storage and a query (on average) can be satisfied by visiting 1.6 % of the number of established TCP connections. |
Databáze: | OpenAIRE |
Externí odkaz: |