atarashi
Make evaluation.py more informative
The evaluation script should:
- Print a comparison table covering all the algorithms supported by atarashi. Examples of such tables can be found in #95 and #65.
- Print a confusion matrix, so that we can easily do error analysis and decide how to improve the current agents or implement new ones.
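
As a starting point for discussion, here is a minimal sketch of both outputs using only the standard library. It does not use atarashi's actual API: the function names, the agent names (`agent_a`, `agent_b`) and the sample labels are all hypothetical, and it assumes evaluation.py can collect, for each agent, a list of predicted licenses aligned with a list of expected licenses.

```python
from collections import Counter


def accuracy(expected, predicted):
    """Fraction of files whose predicted license equals the expected one."""
    if not expected:
        return 0.0
    hits = sum(1 for e, p in zip(expected, predicted) if e == p)
    return hits / len(expected)


def comparison_table(expected, predictions_by_agent):
    """Build a text table with one row per agent: name, correct count, accuracy."""
    lines = [f"{'Agent':<12}{'Correct':>8}{'Accuracy':>10}"]
    for agent, predicted in predictions_by_agent.items():
        correct = sum(1 for e, p in zip(expected, predicted) if e == p)
        acc = accuracy(expected, predicted)
        lines.append(f"{agent:<12}{correct:>8}{acc:>10.2%}")
    return "\n".join(lines)


def confusion_matrix(expected, predicted):
    """Return (labels, matrix); matrix[i][j] counts files whose expected
    license is labels[i] and whose predicted license is labels[j]."""
    counts = Counter(zip(expected, predicted))
    labels = sorted(set(expected) | set(predicted))
    matrix = [[counts[(e, p)] for p in labels] for e in labels]
    return labels, matrix


def format_confusion_matrix(labels, matrix):
    """Render the matrix as text: rows are expected, columns are predicted."""
    width = max(len(label) for label in labels) + 2
    rows = [" " * width + "".join(f"{label:>{width}}" for label in labels)]
    for label, row in zip(labels, matrix):
        rows.append(f"{label:<{width}}" + "".join(f"{n:>{width}}" for n in row))
    return "\n".join(rows)


# Made-up sample data, for illustration only (not real evaluation results).
expected = ["MIT", "GPL-2.0", "MIT", "Apache-2.0"]
predictions_by_agent = {
    "agent_a": ["MIT", "GPL-2.0", "Apache-2.0", "Apache-2.0"],  # hypothetical agent
    "agent_b": ["MIT", "MIT", "MIT", "Apache-2.0"],             # hypothetical agent
}

print(comparison_table(expected, predictions_by_agent))
for agent, predicted in predictions_by_agent.items():
    labels, matrix = confusion_matrix(expected, predicted)
    print(f"\nConfusion matrix for {agent} (rows: expected, columns: predicted)")
    print(format_confusion_matrix(labels, matrix))
```

Keeping the computation (`accuracy`, `confusion_matrix`) separate from the formatting would let the same numbers be reused if another output format is wanted later. With many licenses the full matrix gets wide, so printing only the most frequent misclassified pairs may be a more readable option.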