Large Scale Face Recognition with NMSLIB

Non-Metric Space Library, or NMSLIB for short, is an efficient similarity search package. We have already mentioned the similarity search solutions of tech giants: Spotify Annoy and Facebook Faiss. However, this package was developed by just a few PhD students. Amazon recently adopted nmslib in its Elasticsearch distribution. Product and service recommendations and image, document and video search are some use cases for similarity search. In this post, we are going to adapt the nmslib similarity search library to a face recognition task.

Painted Ladies by Burak Arik
Webinar

Google indexes billions of images, yet reverse image search returns responses in just seconds. This is not simply a matter of hardware. You can either continue reading this tutorial or watch the following video; they both cover large scale facial recognition with NMSLIB.


πŸ™‹β€β™‚οΈ You may consider to enroll my top-rated machine learning course on Udemy

Decision Trees for Machine Learning

As Ara Guler put it, if the best photograph were taken by whoever had the best camera, then whoever had the best typewriter would be the best novelist.

Face recognition pipeline

Let's remember the common stages of a modern face recognition pipeline: detection, alignment, representation and verification. We feed a facial image to a CNN model and it returns a representation. The verification step finds the distance between the representations of a face pair and verifies them as the same person if the distance is less than a threshold.
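For instance, face verification with euclidean distance boils down to a few lines. The following is a minimal sketch; the 0.6 threshold is just an illustrative assumption, since tuned thresholds depend on the model and the metric.

import numpy as np

def verify(img1_embedding, img2_embedding, threshold = 0.6): #threshold is an assumption
    #euclidean distance between the two representations
    distance = np.linalg.norm(np.array(img1_embedding) - np.array(img2_embedding))
    #same person if the distance is less than the threshold
    return distance < threshold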

If the number of face pairs is huge, e.g. millions, response times might become a problem in a face recognition pipeline. This post aims to solve this trouble with an approximation method.

Installation

Its Python distribution is stable and available on PyPI. I ran my tests with version 2.0.6 in this study.

#!pip install nmslib==2.0.6
import nmslib
Data set

There are several facial images in the DeepFace package unit test folder.

#Clone images from this repo: https://github.com/serengil/deepface/tree/master/tests/dataset
import os

files = []
for r, d, f in os.walk("deepface/tests/dataset/"):
    for file in f:
        if file.endswith('.jpg'):
            exact_path = os.path.join(r, file)
            files.append(exact_path)

The deepface framework for Python wraps several state-of-the-art face recognition models: VGG-Face, Google FaceNet, OpenFace, Facebook DeepFace, DeepID, Dlib and ArcFace.

I will build a Dlib ResNet face recognition model.

FaceNet, VGG-Face, Dlib and ArcFace outperform the others. Here, you can watch how to determine the best model.





Facial representations

Let's represent the facial images as vectors with the Dlib ResNet model.


from tqdm import tqdm
from deepface import DeepFace

embeddings = []
for index in tqdm(range(0, len(files)), desc='Finding embeddings'):
    img_path = files[index]
    embedding = DeepFace.represent(img_path = img_path, model_name = "Dlib")[0]["embedding"]
    embeddings.append(embedding)
Synthetic data generation

The unit test folder stores just a few facial images. I will create synthetic data to make the problem more challenging.

import random

embedding_size = 128 #Dlib ResNet represents faces as 128 dimensional vectors

for i in tqdm(range(len(embeddings), 1000000), desc='Finding embeddings'):
    key = 'deepface/tests/dataset/synthetic_%d' % (i)
    vector = [random.gauss(-0.35, 0.48) for z in range(embedding_size)]

    embeddings.append(vector)
    files.append(key)

Finally, convert the embeddings list to a numpy array because nmslib expects numpy input.

import numpy as np

embeddings = np.array(embeddings)

The embeddings object is a (1M, 128) shaped numpy array. In other words, it stores 1M 128-dimensional vectors.

Target image

We are going to search the following target image in 1M data set.

Angelina Jolie

Firstly, we will detect and align the face in the target image. Secondly, we will represent the facial image as a 128-dimensional vector with the Dlib ResNet model. Thirdly, we will expand it to a 2D matrix because nmslib expects batch input.

target_representation = DeepFace.represent(img_path = "target.jpg", model_name = "Dlib")[0]["embedding"]

#(128,) to (1, 128)
target_representation = np.expand_dims(target_representation, axis = 0)
Initialization

We can initialize the nmslib index based on the desired similarity metric: euclidean distance or cosine similarity.

metric = 'euclidean'
if metric == 'euclidean':
    index = nmslib.init(space='l2')
elif metric == 'cosine':
    index = nmslib.init(space='cosinesimil')

Initialization lasts about 0.0001 seconds.
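By the way, the durations in this post are wall-clock measurements. A minimal sketch of the pattern, using Python's built-in time module:

import time

tic = time.time()
index = nmslib.init(space='l2')
toc = time.time()
print("initialization lasts", toc - tic, "seconds")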

Adding vectors to index

We then add the vector representations of the faces to the index.

index.addDataPointBatch(embeddings)

Adding data points lasts 0.805 seconds.





Building the index

We should build the index once the data points are added. Then I will be able to search for any vector in this built index. Notice that building is done only once.

#HNSW parameters: M sets graph connectivity, efConstruction trades build time for index quality
index_time_params = {'M': 15, 'indexThreadQty': 4, 'efConstruction': 100}
index.createIndex(index_time_params)

Building lasts 263.95 seconds. This is very slow compared to Annoy and Faiss.

Searching in the index

I can find the most similar vectors in the built index fast.

neighbors, distances = index.knnQueryBatch(target_representation, k = 3, num_threads = 4)[0]

Searching lasts about 0.003 seconds.

Notice that I will run the knn query batch command whenever I search for an entity in my data set, but I don't have to build the index in each call. Building can be planned weekly or monthly.
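The query returns the indexes of the nearest vectors and their distances. A small sketch to map those indexes back to the image files collected earlier:

for neighbor, distance in zip(neighbors, distances):
    #neighbor is an index into the files and embeddings lists
    print(files[neighbor], distance)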

Nearest neighbors
Brute force

The brute force method lasts 7.18 seconds. On the other hand, adding data points lasts 0.805 seconds, creating the index lasts 263 seconds and the knn query lasts 0.003 seconds. Notice that I can store the built index in production and just call the knn query stage. So, nmslib is roughly 2300 times faster than the brute force method over 1M vectors.

distances = []
for embedding in embeddings:
    #plain euclidean distance between the target and each stored vector
    distance = np.linalg.norm(target_representation[0] - embedding)
    distances.append(distance)

idx = np.argmin(distances)
print("Nearest one: ", idx)
Saving and restoring the index

As I mentioned, index building is a costly operation, but you don't have to re-build it every time. That's why you should save the built index and restore it when you need it.

#save
index.saveIndex('index.bin', save_data=False)

#restore
restored_index = nmslib.init(space='l2')
restored_index.loadIndex('index.bin')
restored_index.knnQueryBatch(target_representation, k = 3, num_threads = 4)
Scalability

Spotify Annoy, Facebook Faiss and NMSLIB are amazing libraries that enable us to search very large scale data sets fast. However, they are core libraries, and scaling them in production pipelines might be problematic. Herein, Elasticsearch wraps NMSLIB to perform the approximate nearest neighbor algorithm, and it comes with high scalability by default. In this way, we can run the a-nn algorithm on many clusters easily.
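As a rough illustration, an approximate k-NN enabled index in Open Distro for Elasticsearch is defined with a mapping similar to the sketch below. The index body and the face_vector field name are hypothetical here, and the exact syntax may differ between plugin versions.

#hedged sketch: a k-NN index mapping for Open Distro for Elasticsearch
#the field name "face_vector" is hypothetical
knn_index = {
    "settings": {
        "index": {"knn": True}
    },
    "mappings": {
        "properties": {
            "face_vector": {"type": "knn_vector", "dimension": 128}
        }
    }
}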

Face recognition with deepface

Face recognition can be handled within deepface. It handles all common stages of a face recognition pipeline: detect, align, represent and verify. Also, it applies face verification several times in the background. All you need is to call its find function, and it returns a pandas data frame.

#!pip install deepface
from deepface import DeepFace
 
models = ['VGG-Face', 'Facenet', 'OpenFace', 'DeepFace', 'DeepID', 'Dlib']
 
df = DeepFace.find(img_path = "img.jpg", db_path = "C:/my_db", model_name = models[0])
print(df.head())

It wraps the state-of-the-art VGG-Face, Google FaceNet, OpenFace, Facebook DeepFace, DeepID and Dlib face recognition models.





Here, you can watch how face verification is handled in deepface.

Face recognition requires applying face verification several times.

The other ann packages

Spotify Annoy is not the only approximate nearest neighbor implementation in the open source community. If you like this post, I strongly recommend you read this tutorial:

Picking the right tool for the right job is important. If you build the index once and search it often, then search time is the key performance metric. On the other hand, if your use case requires re-building the index often, then adding embeddings and index building times are important as well. The following table demonstrates the times I measured for those a-nn packages on 1M vectors.

Performance of ann packages
Map Reduce Technology

The approximate nearest neighbor algorithm reduces time complexity dramatically, but it does not guarantee that it always finds the closest ones. If you have million-level data, big data systems and map reduce technology might satisfy your needs when your concern is not discarding important ones. Herein, MongoDB, Cassandra, Redis and Hadoop are the most popular solutions.

The elephant is the iconic mascot of Hadoop
Tech Stack Recommendations

Face recognition is mainly based on representing facial images as vectors. Herein, storing the vector representations is a key factor for building robust facial recognition systems. I summarize the tech stack recommendations in the following video.

Conclusion

So, we've mentioned nmslib to perform the knn algorithm and similarity search on a large scale data set. It seems that nmslib is much slower than Spotify Annoy and Facebook Faiss in index building. Still, it is faster than Faiss but slower than Annoy in the searching step.

I pushed the source code of this study to GitHub. You can support this study by starring ⭐ the repo.



