Potential fix for code scanning alert no. 11: Incomplete multi-character sanitization by EthanThePhoenix38 · Pull Request #12 · ThePhoenixAgency/AI-Pulse

EthanThePhoenix38 · 2026-01-08T19:59:48Z

Potential fix for https://github.com/ThePhoenixAgency/AI-Pulse/security/code-scanning/11

In general, the fix is to stop using an ad‑hoc regex to “remove HTML tags” and instead use a well-tested, HTML‑aware sanitization/parsing approach. For this code, the best low-impact change is to replace the regex-based .replace(/<[^>]*>/g, '') with a call to a dedicated HTML-sanitizing/stripping library that reliably removes or escapes unsafe content, including <script> and other dangerous constructs.

Concretely, in src/aggregator.js:

Add an import for a popular, well-tested sanitizer such as sanitize-html.
Define a small helper, e.g. sanitizeText, that:
- Uses sanitizeHtml with a configuration that strips all tags and attributes, returning only safe text.
- Ensures that multi-character sequences that could re-form tags (e.g. in malformed HTML) are fully neutralized.
Update line 68 to use this helper on article.contentSnippet.
Update line 71 similarly for article.title, replacing the inlined regex with the helper before slicing to 200 chars.

This keeps the existing functionality (getting a plain-text-ish title/summary) but implements it in a robust way that avoids incomplete multi-character sanitization.

You will need:

An import at the top of src/aggregator.js for sanitize-html.
A sanitizeText function defined near the other helpers.
Replacement of the two regex uses on lines 68 and 71 with calls to sanitizeText.

Suggested fixes powered by Copilot Autofix. Review carefully before merging.

…ter sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: f29c13b85f

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-01-08T20:02:59Z

package.json

+    "rss-parser": "^3.13.0",
+    "sanitize-html": "^2.17.0"


Update lockfile for new sanitize-html dependency

This commit adds sanitize-html to package.json but does not update package-lock.json. In any environment that installs dependencies via the lockfile (e.g., npm ci in CI/CD or deployments), sanitize-html will not be installed and the new require('sanitize-html') in src/aggregator.js will throw MODULE_NOT_FOUND at runtime. Please regenerate the lockfile so it includes the new dependency.

Useful? React with 👍 / 👎.

Copilot

Pull request overview

This PR addresses a code scanning alert regarding incomplete multi-character sanitization by replacing regex-based HTML tag removal with a dedicated HTML sanitization library. The fix targets a security vulnerability where the regex pattern /<[^>]*>/g could be bypassed by malformed HTML.

Key changes:

Added sanitize-html library as a new dependency
Implemented a sanitizeText helper function that uses sanitize-html with strict configuration
Replaced regex-based HTML stripping in article.contentSnippet and article.title processing

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
package.json	Added `sanitize-html` version 2.17.0 as a new dependency
src/aggregator.js	Added import for sanitize-html, implemented sanitizeText helper function, and replaced regex-based tag removal with sanitizeText calls

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-08T20:04:32Z

package.json

+    "rss-parser": "^3.13.0",
+    "sanitize-html": "^2.17.0"


The project already includes dompurify and isomorphic-dompurify as dependencies. Adding sanitize-html creates redundancy with three different HTML sanitization libraries. Consider using one of the existing libraries (particularly isomorphic-dompurify which works in both browser and Node.js environments) instead of introducing a third sanitization library. This would reduce bundle size, dependency complexity, and potential security surface.

Suggested change

"rss-parser": "^3.13.0",

"sanitize-html": "^2.17.0"

"rss-parser": "^3.13.0"

Copilot · 2026-01-08T20:04:32Z

src/aggregator.js

 const Parser = require('rss-parser');
 const axios = require('axios');
 const { Octokit } = require('@octokit/rest');
+const sanitizeHtml = require('sanitize-html');


Consider using the existing isomorphic-dompurify library instead of adding sanitize-html. The project already has DOMPurify available, which can strip HTML tags effectively using DOMPurify.sanitize(input, {ALLOWED_TAGS: []}). This would maintain consistency with the sanitization approach used in the HTML files (portfolio.html, reader.html, stats.html) and reduce the number of dependencies.

Potential fix for code scanning alert no. 11: Incomplete multi-charac…

f29c13b

…ter sanitization Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com>

EthanThePhoenix38 marked this pull request as ready for review January 8, 2026 20:01

Copilot AI review requested due to automatic review settings January 8, 2026 20:01

Copilot started reviewing on behalf of EthanThePhoenix38 January 8, 2026 20:01 View session

chatgpt-codex-connector bot reviewed Jan 8, 2026

View reviewed changes

EthanThePhoenix38 merged commit 576ed70 into main Jan 8, 2026
8 of 10 checks passed

EthanThePhoenix38 deleted the alert-autofix-11 branch January 8, 2026 20:03

Copilot AI reviewed Jan 8, 2026

View reviewed changes

Copilot AI mentioned this pull request Jan 8, 2026

Potential fix for code scanning alert no. 12: Incomplete multi-character sanitization #11

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Potential fix for code scanning alert no. 11: Incomplete multi-character sanitization#12

Potential fix for code scanning alert no. 11: Incomplete multi-character sanitization#12
EthanThePhoenix38 merged 1 commit intomainfrom
alert-autofix-11

EthanThePhoenix38 commented Jan 8, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Jan 8, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 8, 2026

Uh oh!

Copilot AI Jan 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	"rss-parser": "^3.13.0",
	"sanitize-html": "^2.17.0"
	"rss-parser": "^3.13.0"

Uh oh!

Conversation

EthanThePhoenix38 commented Jan 8, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants