Skip to content

New Tokenizers#4

Merged
cpetersen merged 9 commits intomainfrom
more-strategy-research
Sep 29, 2025
Merged

New Tokenizers#4
cpetersen merged 9 commits intomainfrom
more-strategy-research

Conversation

@cpetersen
Copy link
Member

@cpetersen cpetersen commented Sep 28, 2025

Added the following tokenizers:

  • char_group
  • letter
  • lowercase
  • ngram

Added validation to the edge_ngram tokenizer

@cpetersen cpetersen mentioned this pull request Sep 28, 2025
@cpetersen cpetersen merged commit 24bfe83 into main Sep 29, 2025
1 check passed
@cpetersen cpetersen deleted the more-strategy-research branch September 29, 2025 13:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant