Tokenizer profiles #1
Labels
No Label
bug
duplicate
enhancement
help wanted
invalid
question
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: kj7rrv/micronlp#1
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The tokenizer should give a dictionary, dataclass, or similar object indicating the configuration used. Something dictionary-like would be good to allow
micronlp.tokenizer.tokenize(text, **profile)
.