Practical Skew Handling in Parallel Joins (Paper)

1992 | Minds at UW (University of Wisconsin) | Citations: 263

Topics: Enterprise Software; Advanced Database Systems and Queries; Data Management and Algorithms; Algorithms and Data Compression

Abstract

We present an approach to dealing with skew in parallel joins in database systems. Our approach is easily implementable within current parallel DBMS, and performs well on skewed data without degrading the performance of the system on non-skewed data. The main idea is to use multiple algorithms, each specialized for a different degree of skew, and to use a small sample of the relations being joined to determine which algorithm is appropriate. We developed, implemented, and experimented with four new skew-handling parallel join algorithms; one, which we call virtual processor range partitioning, was the clear winner in high skew cases, while traditional hybrid hash join was the clear winner in lower skew or no skew cases. We present experimental results from an implementation of all four algorithms on the Gamma parallel database machine. To our knowledge, these are the first reported skew-handling numbers from an actual implementation.
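The idea of sampling to pick a join algorithm, and of range partitioning over more "virtual" processors than real ones, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the `threshold` rule, the `virtual_per_processor` default, and the round-robin assignment of ranges to processors are all assumptions made for illustration, and the sketch does not handle a single heavy key that spans several ranges.

```python
import bisect
from collections import Counter


def choose_algorithm(sample, num_processors, threshold=1.0):
    """Pick a join algorithm from a sample of join-key values.

    Hypothetical rule: report skew when the most frequent sampled key alone
    would exceed `threshold` times one processor's fair share of the tuples.
    """
    top_share = max(Counter(sample).values()) / len(sample)
    if top_share * num_processors > threshold:
        return "virtual_processor_range_partitioning"
    return "hybrid_hash"


def virtual_processor_splitters(sample, num_processors, virtual_per_processor=4):
    """Derive range boundaries from the sample, one range per virtual processor."""
    ordered = sorted(sample)
    num_virtual = num_processors * virtual_per_processor
    # Boundary i is the sample value at the i/num_virtual quantile.
    return [ordered[(i * len(ordered)) // num_virtual] for i in range(1, num_virtual)]


def route(key, splitters, num_processors):
    """Map a key to its virtual range, then assign ranges round-robin (assumed policy)."""
    virtual_id = bisect.bisect_right(splitters, key)
    return virtual_id % num_processors
```

Using many small ranges per processor means that a dense region of the key space is spread across several processors instead of landing on one, which is the intuition behind the "virtual processor" naming in this sketch.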

Authors (4)

S. Seshadri

Donovan A. Schneider

Jeffrey F. Naughton

David J. DeWitt
