Update project and README

Samuel Sloniker 2022-07-05 18:46:29 -07:00
parent 3206d344be
commit 9ed9b82bf7
2 changed files with 17 additions and 1 deletions


@@ -1,3 +1,19 @@
# gptc_scratch
GPTC, ported to Scratch
gptc_scratch is a Scratch project that uses the same algorithm as GPTC.
However, it supports only two categories, which are numbered (1 and 2) rather
than named.
Models are formatted differently than in the Python version of GPTC. A model
consists of two files, each containing the text for a category. gptc_scratch
does not have a tokenizer, so each line must consist of one lowercase word with
no other characters. To load the model, right-click "Category 1 Words" in the
project, click "import," then choose one of the files. Do the same thing again
with "Category 2 Words" and the other file.
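
Because the project has no tokenizer, the word files have to be prepared
outside Scratch. The following is a minimal sketch of one way to do that in
Python. It is not part of gptc_scratch or GPTC: the function names are
hypothetical, and splitting on every character outside a-z (so "it's" becomes
"it" and "s") is an assumption that may differ from how the Python version of
GPTC tokenizes text.

```python
import re
from pathlib import Path


def to_word_lines(text: str) -> str:
    """Return the words in `text`, one lowercase a-z word per line.

    Assumption: anything that is not a letter a-z (digits, punctuation,
    whitespace, accented characters) is treated as a separator and dropped.
    """
    words = re.findall(r"[a-z]+", text.lower())
    return "\n".join(words)


def convert_file(source: str, destination: str) -> None:
    """Read raw text from `source` and write the word-per-line form.

    No trailing newline is written, to avoid producing an empty final line.
    """
    raw = Path(source).read_text(encoding="utf-8")
    Path(destination).write_text(to_word_lines(raw), encoding="utf-8")
```

For example, `convert_file("category1_raw.txt", "category1_words.txt")` (both
file names are placeholders) would produce a file that can then be imported
into "Category 1 Words" as described above.
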
The text to classify must be stored in the same format as the model files.
Follow the same procedure that you used for loading the model files, but with
"Words in text to classify" and the file containing the text to classify. Then,
click "Compile." If the button changes to a lighter color, wait for it to
change back. Next, click "classify." This should update the "Cat1" and "Cat2"
variables with the confidence values for the given text and categories.

Binary file not shown.