Speed up attribute lookup by preciz · Pull Request #651 · philss/floki

preciz · 2025-12-18T09:33:40Z

Improvements:
exact attribute => ~5% faster, ~50% lower memory usage
attribute present => ~10% faster, ~40% lower memory usage
attribute includes => ~100% faster, ~60% lower memory usage

This change uses more built-in functions instead of Enum and uses String.contains? to check for a match before performing String.split.
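The "check before split" idea described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the module name `IncludesSketch` and the function `includes?/2` are hypothetical, and splitting on whitespace via `String.split/1` is an assumption about how a `[attr~='word']` match might be checked.

```elixir
defmodule IncludesSketch do
  # Hypothetical helper for illustration only; not Floki's implementation.
  #
  # Returns true when `word` is one of the whitespace-separated tokens
  # of `attr_value`, as the `[attr~='word']` selector requires.
  def includes?(attr_value, word) when is_binary(attr_value) and is_binary(word) do
    # Cheap substring test first: if `word` does not occur anywhere in the
    # value, `and` short-circuits and no list of tokens is ever built.
    String.contains?(attr_value, word) and word in String.split(attr_value)
  end
end
```

The substring test alone is not enough, since "wikitable" is contained in "wikitables" without being a separate token, so the split is still needed whenever the first check passes.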

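  # Requires the Benchee and Floki dependencies.
  # Reads and parses an HTML fixture located next to this script.
  read_file = fn name ->
    __ENV__.file
    |> Path.dirname()
    |> Path.join(name)
    |> File.read!()
    |> Floki.parse_document!()
  end

  # Each input is passed as the argument to every benchmark function.
  inputs = %{
    "big" => read_file.("big.html")
  }

  Benchee.run(
    %{
      "exact attribute" => fn doc -> Floki.find(doc, "[class='noprint']") end,
      "attribute present" => fn doc -> Floki.find(doc, "[title]") end,
      "attribute includes" => fn doc -> Floki.find(doc, "[class~='wikitable']") end
    },
    # Seconds spent measuring run time for each scenario.
    time: 10,
    inputs: inputs,
    # Seconds spent measuring memory usage for each scenario.
    memory_time: 2
  )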

philss · 2025-12-19T14:33:20Z

lib/floki/selector/attribute_selector.ex

-  defp get_value(attr_name, attributes) do
-    Enum.find_value(attributes, "", fn
+  defp get_value(attr_name, attributes) when is_list(attributes) do
+    case List.keyfind(attributes, attr_name, 0) do


How much of the improvements came from this change?

If it is not that much, I would prefer to keep Enum.find_value/3, just because it's easier to read and maintain.

preciz · 2025-12-19 (edited)

I just ran the benchmarks back and forth, changing only this part, and it's faster in all cases. The biggest speedup is in the "exact attribute" case, which is ~11% faster. More significantly, it halves the memory usage in the "exact attribute" and "attribute includes" cases.

I believe this library is used heavily by a lot of companies where every speedup has a huge effect on throughput. That is the case for our company.
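For readers weighing the two lookups discussed in this thread, here is a minimal side-by-side sketch. The diff above is truncated, so the function bodies below are assumptions written for illustration, and the module name `LookupSketch` is hypothetical. It assumes attributes are a list of `{name, value}` tuples and that a missing attribute yields `""`, as the `Enum.find_value(attributes, "", ...)` call in the diff suggests.

```elixir
defmodule LookupSketch do
  # Hypothetical module for illustration only; not the code in this PR.

  # Lookup in the style of the removed lines: returns the value of the
  # first tuple whose name matches, or "" when there is no match.
  def with_find_value(attr_name, attributes) do
    Enum.find_value(attributes, "", fn
      {^attr_name, value} -> value
      _other -> nil
    end)
  end

  # Lookup in the style of the added lines: List.keyfind/3 returns the
  # first tuple whose element at position 0 equals attr_name, or nil.
  def with_keyfind(attr_name, attributes) when is_list(attributes) do
    case List.keyfind(attributes, attr_name, 0) do
      {_name, value} -> value
      nil -> ""
    end
  end
end
```

Both functions return the same result for a list of two-element tuples; the trade-off raised in the review is between the readability of the `Enum` version and the measured speed and memory gains of the `List.keyfind/3` version.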

Speed up attribute lookup

68c0e13

philss reviewed Dec 19, 2025

preciz wants to merge 1 commit into philss:main from preciz:speed_up_attribute_lookup_2
