Conversation
```elixir
%{
  pooled_state: pooled_state,
  hidden_states: Axon.container({input, pooled_state})
}
```
I'm not sure what to do with `hidden_state` and `hidden_states`.
`hidden_states` is a tuple (effectively a list) of intermediate states between the various blocks (see `Bumblebee.Vision.ResNet` for an example). By default we actually prune it and only return it if the user configures `output_hidden_states: true`. We conventionally do this for most models to mirror hf/transformers, but it is not a requirement, and it should be fine not to return it.
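For reference, a minimal sketch of how this looks from the caller's side, assuming the public Bumblebee API (the `microsoft/resnet-50` checkpoint is just an illustrative choice):

```elixir
# Load the spec, opt into hidden states, then load the model with it
{:ok, spec} = Bumblebee.load_spec({:hf, "microsoft/resnet-50"})
spec = Bumblebee.configure(spec, output_hidden_states: true)

{:ok, %{model: model, params: params}} =
  Bumblebee.load_model({:hf, "microsoft/resnet-50"}, spec: spec)

# Dummy input just to show the output shape of the container
input = %{"pixel_values" => Nx.broadcast(0.5, {1, 224, 224, 3})}
output = Axon.predict(model, params, input)

# output.hidden_states is a tuple of intermediate block activations;
# without output_hidden_states: true it would be pruned from the output
tuple_size(output.hidden_states)
```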
This model isn't part of the transformers library, so there's no valid architecture for `config.json` on Hugging Face. Thus I'm not sure if it makes sense to include this in Bumblebee, or if it should live in a separate library?
Hey @zacharydenton! Yeah, so far we've only been adding models that are available in hf/transformers, since the main idea is to be able to search the HF Hub and load pretrained models from there in a standardized format. Given that the model implementation is pretty self-contained, a separate library sounds like a valid approach to me. It's also worth looking around for newer models in hf/transformers that target this specific task.
This is a port of https://github.com/timesler/facenet-pytorch/blob/master/models/inception_resnet_v1.py.
I'm using this to generate facenet512 embeddings for face recognition:
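A hypothetical sketch of the intended usage; `InceptionResnetV1.model/0`, `InceptionResnetV1.load_params/1`, and the weights filename are assumptions for illustration, not the actual API from this PR:

```elixir
# Build the graph and load converted facenet weights (names assumed)
model = InceptionResnetV1.model()
{:ok, params} = InceptionResnetV1.load_params("facenet512.bin")

# A preprocessed 160x160 RGB face crop, batch of one
face = Nx.broadcast(0.5, {1, 160, 160, 3})

# Produces a 512-dimensional embedding suitable for face recognition
embedding = Axon.predict(model, params, face)
```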
I've tested this implementation and compared it to the embeddings generated by the facenet-pytorch library. The embeddings match the PyTorch implementation with 0.9995 average cosine similarity (with the latest Axon; see elixir-nx/axon#622 for details).
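For clarity, this is the comparison metric: cosine similarity between an embedding vector from this implementation and the facenet-pytorch reference. A small self-contained sketch in Nx:

```elixir
defmodule Similarity do
  import Nx.Defn

  # Cosine similarity between two 1-D embedding vectors:
  # dot(a, b) / (||a|| * ||b||), in [-1, 1]
  defn cosine(a, b) do
    Nx.dot(a, b) / (Nx.LinAlg.norm(a) * Nx.LinAlg.norm(b))
  end
end
```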
N.B. I want to add unit tests, but I didn't find any suitable weights in https://huggingface.co/hf-internal-testing. Open to advice!
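One possible direction (an assumption on my part, not tied to hf-internal-testing): initialize random params and assert only on the output shape, so the test doesn't depend on any published checkpoint. `InceptionResnetV1.model/0` is the hypothetical builder from this PR:

```elixir
test "produces a 512-dimensional embedding" do
  model = InceptionResnetV1.model()

  # Build init/predict functions and initialize random params from a template
  {init_fn, predict_fn} = Axon.build(model)
  params = init_fn.(Nx.template({1, 160, 160, 3}, :f32), %{})

  output = predict_fn.(params, Nx.broadcast(0.5, {1, 160, 160, 3}))
  assert Nx.shape(output) == {1, 512}
end
```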