Universal classes of hash functions (Extended Abstract) 论文

1977引用 348
Algorithms and Data CompressionAdvanced Image and Video Retrieval TechniquesOptimization and Search Problems

详细信息

发表日期
1977-01-01
发表年份
1977

关键词

Algorithms and Data CompressionAdvanced Image and Video Retrieval TechniquesOptimization and Search Problems

摘要

This paper gives an input independent average linear time algorithm for storage and retrieval on keys. The algorithm makes a random choice of hash function from a suitable class of hash functions. Given any sequence of inputs the expected time (averaging over all functions in the class) to store and retrieve elements is linear in the length of the sequence. The number of references to the data base required by the algorithm for any input is extremely close to the theoretical minimum for any possible hash function with randomly distributed inputs. We present three suitable classes of hash functions which also may be evaluated rapidly. The ability to analyze the cost of storage and retrieval without worrying about the distribution of the input allows as corollaries improvements on the bounds of several algorithms.