attempting to create a more readable evaluation to anime tagger ai systems
.gitignore | ||
config.example.json | ||
LICENSE | ||
main.py | ||
README.md | ||
requirements.txt |
tagger-showdown
attempting to create a more readable evaluation to anime tagger ai systems
idea: take some recent images from danbooru, also include your own
then run x tagger systems against each other
score formula:
(len(tags in ground_truth) - len(tags not in ground_truth)) / len(ground_truth)
then average for all posts
python3 -m venv env
env/bin/pip install -Ur ./requirements.txt
env/bin/python3 ./main.py