fix: stripping logic in Update check_fingerprints.py#14

Open
moshderte wants to merge 1 commit into sentient-agi:main from moshderte:patch-1

Conversation

@moshderte

This PR fixes a bug in which bos_token was meant to be stripped from the signature_tokenized sequences, but the processed list was never assigned back, so the original tokens were left unchanged.

The code correctly creates new_signature_tokenized without the bos_token, but then mistakenly keeps using the original signature_tokenized.

This fix ensures that the model's output is compared against the expected signature without the BOS token, as originally intended.

This should improve the accuracy of the backdoor detection evaluation, especially in cases where the presence of the BOS token causes a mismatch.
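To illustrate the pattern described above, here is a minimal sketch, not the actual code from check_fingerprints.py. The helper `strip_bos`, the constant `BOS_ID`, and the token ids are hypothetical and stand in for whatever the tokenizer produces; only the names signature_tokenized and new_signature_tokenized come from this PR.

```python
from typing import List, Optional


def strip_bos(sequences: List[List[int]], bos_token_id: Optional[int]) -> List[List[int]]:
    """Return copies of the sequences with one leading BOS id removed, if present."""
    if bos_token_id is None:
        # Tokenizer has no BOS token: nothing to strip.
        return [list(seq) for seq in sequences]
    return [
        list(seq[1:]) if seq and seq[0] == bos_token_id else list(seq)
        for seq in sequences
    ]


# Hypothetical data: 1 stands in for the BOS token id.
BOS_ID = 1
signature_tokenized = [[1, 42, 43], [1, 44], [45, 46]]

# Pattern described as the bug: the stripped list is built but never used,
# so later comparisons still see the BOS token.
new_signature_tokenized = strip_bos(signature_tokenized, BOS_ID)
buggy_expected = signature_tokenized

# Pattern described as the fix: assign the stripped list back before comparing.
signature_tokenized = new_signature_tokenized
fixed_expected = signature_tokenized
```

With the reassignment in place, any later comparison against signature_tokenized uses the sequences without the leading BOS id.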
