Tree Awareness for Relational DBMS Kernels

Autor: Grust, Torsten, van Keulen, Maurice, Blanken, Henk, Grabs, Torsten, Schek, Hans-Jörg, Schenkel, Ralf, Weikum, Gerhard
Přispěvatelé: Databases (Former)
Jazyk: angličtina
Rok vydání: 2003
Předmět:
Zdroj: Intelligent Search on XML Data: Applications, Languages, Models, Implementations, and Benchmarks, 231-245
STARTPAGE=231;ENDPAGE=245;TITLE=Intelligent Search on XML Data
Lecture Notes in Computer Science ISBN: 9783540407683
Intelligent Search on XML Data
ISSN: 0302-9743
DOI: 10.1007/978-3-540-45194-5_16
Popis: Relational database management systems (RDBMSs) derive much of their efficiency from the versatility of their core data structure: tables of tuples. Such tables are simple enough to allow for an efficient representation on all levels of the memory hierarchy, yet sufficiently generic to host a wide range of data types. If one can devise mappings from a data type Tau to tables and from operations on Tau to relational queries, an RDBMS may be a premier implementation alternative. Temporal intervals, complex nested objects, and spatial data are sample instances for such types Tau . The key to efficiency of the relational approach is that the RDBMS is made aware of the specific properties of Tau . Typically, such awareness can be implemented in the form of index structures (e.g., R-trees [7] efficiently encode the inclusion and overlap of spatial objects) or query operators (e.g., the multi-predicate merge join [11] exploits knowledge about containment of nested intervals). This chapter applies this principle to the tree data type with the goal to turn RDBMSs into efficient XML and XPath processors [1]. The database system is supplied with a relational [8] XML document encoding, the XPath accelerator [5]. Encoded documents (1) are represented in relational tables, (2) can be efficiently indexed using index structures native to the RDBMS, namely B-trees, and (3) XPath queries may be mapped to SQL queries over these tables. The resulting purely relational XPath processor is efficient [5] and complete (supports all 13 XPath axes). We will show that an enhanced level of tree awareness, however, can lead to a query speed-up by an order of magnitude. Tree awareness is injected into the database kernel in terms of the staircase join operator, which is tuned to exploit the knowledge that the RDBMS operates over tables encoding treeshaped data. This is a local change to the database kernel: standard B-trees suffice to support the evaluation of staircase join and the query optimizer may treat staircase join much like other native join operators.
Databáze: OpenAIRE