Stabilizing the Recall in Similarity Search

Title in Czech Stabilizace kvality v podobnost
Authors

KYSELÁK Martin NOVÁK David ZEZULA Pavel

Type Article in proceedings
Conference Fourth International Conference on Similarity Search and Applications, SISAP 2011
Faculty / MU unit

Faculty of Informatics

Citation
DOI http://dx.doi.org/10.1145/1995412.1995422
Field Informatics
Keywords locality-sensitive hashing; metric space; similarity search; recall; stability
Description Recent techniques for approximate similarity search focus on optimizing answer precision/recall, and they typically improve the average of these measures over a set of sample queries. However, according to our observations, the recall for particular indexes and queries can fluctuate considerably. In order to stabilize the recall, we propose a query-evaluation model that exploits several variants of the search index. This approach is applicable to a significant subset of current approximate methods, with a focus on techniques based purely on metric postulates. Applying this approach to the M-Index structure, we perform extensive measurements on large datasets and show that it has a positive impact on recall stability and suppresses the most unsatisfactory cases. Further, the results indicate that the proposed approach can also increase the overall average recall for given overall search costs.