Efficient similarity search and classification via rank aggregation 论文
2003引用 363
Data Management and AlgorithmsAdvanced Image and Video Retrieval TechniquesBayesian Modeling and Causal Inference
摘要
We propose a novel approach to performing efficient similarity search and classification in high dimensional data. In this framework, the database elements are vectors in a Euclidean space. Given a query vector in the same space, the goal is to find elements of the database that are similar to the query. In our approach, a small number of independent "voters" rank the database elements based on similarity to the query. These rankings are then combined by a highly efficient aggregation algorithm. Our methodology leads both to techniques for computing approximate nearest neighbors and to a conceptually rich alternative to nearest neighbors.