filter_log_to_metrics: Use optimized memory allocations #11414

cosmo0920 · 2026-01-30T07:45:21Z

Currently, filter_log_to_metrics frequently allocates heap memory.
This causes memory fragmentation and take a longer time to allocate memory which corresponds to running period.
Instead, we need to optimize this kind of heap memory allocations and suppress CPU stale for waiting I/O operations for memory.

Enter [N/A] in the box, if an item is not applicable to your change.

Testing
Before we can approve your change; please submit the following in a comment:

Example configuration file for the change
Debug log output from testing the change

Attached Valgrind output that shows no leaks or memory corruption was found

If this is a change to packaging of containers or native binaries then please confirm it works for all targets.

Run local packaging test showing all targets (including any new ones) build.
Set ok-package-test label to test for all targets (requires maintainer to do).

Documentation

Documentation required for this feature

Backporting

Backport to latest stable release.

Fluent Bit is licensed under Apache 2.0, by submitting this pull request I understand that this code will be released under the terms of that license.

Summary by CodeRabbit

New Features
- Increased maximum label capacity from 32 to 128.
- Introduced pre-allocated runtime label structures and pre-created accessors for faster, allocation-free filtering.
- Added emitter aliasing when an explicit emitter name is not provided.
Bug Fixes
- Reworked label setup and validation with overflow/mismatch checks.
- Fixed resource cleanup to free runtime label buffers and accessors, preventing memory leaks.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

…llocations Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

coderabbitai · 2026-01-30T07:45:49Z

📝 Walkthrough

Walkthrough

Refactors log_to_metrics to pre-allocate runtime label structures: introduces label counting and preparation helpers, replaces per-call dynamic label construction with pre-created record accessors and contiguous buffers, updates Kubernetes label handling, and consolidates emitter aliasing and cleanup with stronger validation.

Changes

Cohort / File(s)	Summary
Header Structure Updates `plugins/filter_log_to_metrics/log_to_metrics.h`	Added `<stddef.h>`, raised `MAX_LABEL_COUNT` from 32 to 128, and extended `struct log_to_metrics_ctx` with `label_ras`, `label_values_buf`, and `label_values` for pre-allocated runtime label storage.
Label runtime helpers & init/refactor `plugins/filter_log_to_metrics/log_to_metrics.c`	Added `count_labels()` and `prepare_label_runtime()`; converted `set_labels()` to a two-pass flow; replaced per-call label allocation with pre-allocated buffers and `ctx->label_ras` accessors; created/destroyed record accessors at init/destroy; added overflow/mismatch checks and strengthened validation.
Filter path & emitter wiring `plugins/filter_log_to_metrics/log_to_metrics.c`	Updated `cb_log_to_metrics_init()` and `cb_log_to_metrics_filter()` to use `ctx->label_counter`, `ctx->label_keys`, `ctx->label_accessors`, `ctx->label_ras`, and `ctx->label_values` instead of building labels per-call; introduced emitter aliasing derived from filter name and adjusted emitter validation/configuration.
Removed legacy logic `plugins/filter_log_to_metrics/log_to_metrics.c`	Removed legacy `fill_labels` logic and associated per-call label construction code paths; adjusted teardown to free new runtime buffers and accessors.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Possibly related PRs

log_to_metrics: fix improper handling of record accessor context #10920 — related change moving RA creation/destruction to init and introducing persistent record accessor(s) in log_to_metrics.

Suggested labels

backport to v4.0.x, backport to v4.1.x

Suggested reviewers

edsiper
koleini
fujimotos

Poem

🐰 I hopped through labels, one by one—
Counting, packing, then they’re done.
Buffers ready, accessors born,
No more per-call chaos at dawn,
A tidy harvest — hop, hop, hooray! 🥕🐇

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 27.27% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately reflects the main objective: optimizing memory allocations in the filter_log_to_metrics plugin through pre-allocation strategies and reduced runtime fragmentation.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch cosmo0920-use-optimized-memory-allocations-on-log_to_metrics

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@plugins/filter_log_to_metrics/log_to_metrics.c`:
- Around line 810-821: The code assigns the result of flb_sds_printf directly to
emitter_alias_tmp which can return NULL and cause the original SDS to leak;
instead, call flb_sds_printf into a temporary pointer (e.g., tmp), check if tmp
is NULL, and if so call flb_sds_destroy(emitter_alias_tmp), flb_errno(),
log_to_metrics_destroy(ctx) and return -1; on success assign emitter_alias_tmp =
tmp. This uses the existing symbols emitter_alias_tmp, flb_sds_create_size,
flb_sds_printf, flb_sds_destroy and preserves current error handling via
log_to_metrics_destroy(ctx).

plugins/filter_log_to_metrics/log_to_metrics.c

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

coderabbitai

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

plugins/filter_log_to_metrics/log_to_metrics.c (2)
360-475: ⚠️ Potential issue | 🟡 Minor

Tighten the bounds guard before indexing label arrays.
If the computed total and fill pass ever diverge (e.g., config mutation or unexpected properties), the current > check can still allow one out-of-bounds write before the final mismatch check. Use >= to fail fast before indexing.
🛠️ Proposed fix
-        if (counter > ctx->label_counter) {
+        if (counter >= ctx->label_counter) {
             flb_plg_error(ctx->ins, "internal label counter overflow");
             return -1;
         }
956-1079: ⚠️ Potential issue | 🟠 Major

Confirm thread-safety vulnerability in shared ctx->label_values buffer.

The label_values buffer is allocated once per filter instance and reused across all concurrent invocations from multiple input sources. Since Fluent Bit has input worker threads that independently process chunks and invoke filters (via flb_filter_do), multiple workers can simultaneously call cb_log_to_metrics_filter with the same ctx. The vulnerable window is between writing to label_values (lines 982–999) and passing it to cmt_counter_inc, cmt_gauge_set, or cmt_histogram_observe (lines 1009, 1030, 1051). A concurrent writer can corrupt label values mid-operation.

Locking exists on chunks and tasks but not on filter instances or their context. To fix: either allocate label_values per-invocation (stack or local scope), use thread-local storage, add a mutex around the vulnerable window, or buffer label values before the cmt call completes.

cosmo0920 added 5 commits January 30, 2026 15:01

filter_log_to_metrics: Use pre-allocated slots of memory for labels

c0af3ef

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

filter_log_to_metrics: Migrate two-paths for constructing RA and labels

603fd6e

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

filter_log_to_metrics: Use temporary variables to avoid overwriting a…

7ff9b75

…llocations Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

filter_log_to_metrics: Fix the bondary condition of overflowing

8822092

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

filter_log_to_metrics: Loosen the limit a count of label

b9f17cb

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

cosmo0920 requested a review from edsiper as a code owner January 30, 2026 07:45

github-actions bot added the docs-required label Jan 30, 2026

cosmo0920 temporarily deployed to pr January 30, 2026 07:45 — with GitHub Actions Inactive

coderabbitai bot reviewed Jan 30, 2026

View reviewed changes

plugins/filter_log_to_metrics/log_to_metrics.c Show resolved Hide resolved

cosmo0920 temporarily deployed to pr January 30, 2026 08:04 — with GitHub Actions Inactive

cosmo0920 force-pushed the cosmo0920-use-optimized-memory-allocations-on-log_to_metrics branch from 13732cf to b9f17cb Compare January 30, 2026 08:45

cosmo0920 temporarily deployed to pr January 30, 2026 08:47 — with GitHub Actions Inactive

filter_log_to_metrics: Plug a leak on exception

40a5ffa

Signed-off-by: Hiroshi Hatake <hiroshi@chronosphere.io>

cosmo0920 temporarily deployed to pr January 30, 2026 08:51 — with GitHub Actions Inactive

coderabbitai bot reviewed Jan 30, 2026

View reviewed changes

cosmo0920 temporarily deployed to pr January 30, 2026 09:10 — with GitHub Actions Inactive

cosmo0920 added this to the Fluent Bit v5.0 milestone Jan 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

filter_log_to_metrics: Use optimized memory allocations #11414

filter_log_to_metrics: Use optimized memory allocations #11414

cosmo0920 commented Jan 30, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jan 30, 2026 •

edited

Loading

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

filter_log_to_metrics: Use optimized memory allocations #11414

Are you sure you want to change the base?

filter_log_to_metrics: Use optimized memory allocations #11414

Conversation

cosmo0920 commented Jan 30, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cosmo0920 commented Jan 30, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 30, 2026 •

edited

Loading