In this article, an R implementation of locality sensitive hashing will be used for fast approximate nearest neighbor search in images. The idea of document retrieval using LSH appears as one assignment in the Coursera Course Machine Learning Clustering and Retrieval. As kd-tree based implementation of ANN search does not scale well with high dimensional data (such as text data), LSH provides an alternative implementation.

Image Search with LSH

Data File Format

Data File Format

Data File Format

Data File Format

Data File Format

Data File Format

Data File Format

Data File Format