Treat numbers as words #12
Labels
No Label
api-break
bug
duplicate
enhancement
help wanted
invalid
model-break
question
wait-for-break
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: kj7rrv/gptc#12
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Numbers should be treated as words by the tokenizer. It would be great to include commas and periods within numbers as well, but this might not be feasible.
Input:
Testing 123,456.789.Test
Ideal output:
['testing', '123,456.789', 'test']
Acceptable output:
['testing', '123', '456', '789', 'test']
Numbers should be treated as wordsto Treat numbers as words