Base64 decoding depth assessment #4744

dxa4481 · 2026-02-13T01:27:34Z

Description:

This PR introduces an iterative decoding pipeline, allowing decoders (e.g., Base64, UTF-16) to chain their outputs. Previously, decoders ran independently on the original chunk, missing secrets hidden behind layered encoding (e.g., base64 within UTF-16, or double-base64-encoded values).

The scannerWorker now re-runs decoders on any new output, up to a configurable --max-decode-depth (default 5). This enables detection of secrets like GCP service accounts and private keys found within base64-encoded Docker auth configs, or Artifactory tokens within base64. The pipeline includes an early exit, ensuring negligible performance overhead for higher depths when no new decoded data is produced (typically <5% overhead compared to depth 1).

Checklist:

Tests passing (make test-community)?
Lint passing (make lint this requires golangci-lint)?

Decoders (base64, UTF-16, escaped unicode) now chain iteratively: each decoder's output is fed back through all decoders until no new transformations occur or --max-decode-depth is reached (default: 5). This finds secrets hidden inside layered encodings, e.g. a base64 Docker auth blob containing a GCP private key, or a UTF-16 file with base64-encoded credentials. At depth=1 behavior is identical to the previous implementation. Extra depths exit early when no new data is produced, so the cost is <5% wall time on a large repo scan. Co-authored-by: Dylan Ayrey <dxa4481@rit.edu>

cursor · 2026-02-13T01:27:35Z

Cursor Agent can help with this pull request. Just @cursor in comments and I'll start working on changes in this branch.
_{Learn more about Cursor Agents}

CLAassistant · 2026-02-13T01:27:43Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Base64 decoding depth assessment #4744

Base64 decoding depth assessment #4744

dxa4481 commented Feb 13, 2026

Uh oh!

cursor bot commented Feb 13, 2026

Uh oh!

CLAassistant commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Base64 decoding depth assessment #4744

Are you sure you want to change the base?

Base64 decoding depth assessment #4744

Conversation

dxa4481 commented Feb 13, 2026

Description:

Checklist:

Uh oh!

cursor bot commented Feb 13, 2026

Uh oh!

CLAassistant commented Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants