SMCQL: Secure Querying for Federated Databases
Autor: | Gregory Elliott, Satyender Goel, Abel N. Kho, Jennie Rogers, Craig Eggen, Johes Bater |
---|---|
Rok vydání: | 2016 |
Předmět: |
FOS: Computer and information sciences
SQL Information retrieval Computer Science - Cryptography and Security Computer science Federated databases General Engineering InformationSystems_DATABASEMANAGEMENT Context (language use) Databases (cs.DB) 0102 computer and information sciences 02 engineering and technology 01 natural sciences Information sensitivity Federated database Computer Science - Databases 010201 computation theory & mathematics 020204 information systems Scalability 0202 electrical engineering electronic engineering information engineering Secure multi-party computation Tuple computer Cryptography and Security (cs.CR) computer.programming_language |
DOI: | 10.48550/arxiv.1606.06808 |
Popis: | People and machines are collecting data at an unprecedented rate. Despite this newfound abundance of data, progress has been slow in sharing it for open science, business, and other data-intensive endeavors. Many such efforts are stymied by privacy concerns and regulatory compliance issues. For example, many hospitals are interested in pooling their medical records for research, but none may disclose arbitrary patient records to researchers or other healthcare providers. In this context we propose the Private Data Network (PDN), a federated database for querying over the collective data of mutually distrustful parties. In a PDN, each member database does not reveal its tuples to its peers nor to the query writer. Instead, the user submits a query to an honest broker that plans and coordinates its execution over multiple private databases using secure multiparty computation (SMC). Here, each database's query execution is oblivious , and its program counters and memory traces are agnostic to the inputs of others. We introduce a framework for executing PDN queries named smcql . This system translates SQL statements into SMC primitives to compute query results over the union of its source databases without revealing sensitive information about individual tuples to peer data providers or the honest broker. Only the honest broker and the querier receive the results of a PDN query. For fast, secure query evaluation, we explore a heuristics-driven optimizer that minimizes the PDN's use of secure computation and partitions its query evaluation into scalable slices. |
Databáze: | OpenAIRE |
Externí odkaz: |