jokebot-2.0

upload test results

Open rgstephens opened this issue 4 years ago • 10 comments

rgstephens (Jun 19 '21 01:06)

Commit: 3b3eb20d4f721806413077f957d52c63aec2e489 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8996 (0.00) | 0.8615 (0.00) | 0.8864 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.920 | N/A |
| weighted avg | 249 | 0.900 | N/A |
| faq | 88 | 0.911 | affirm(3), stop(3) |
| time_range | 24 | 0.880 | affirm(1), time_from(1) |
| affirm | 21 | 0.818 | stop(2), faq(1) |
| stop | 19 | 0.762 | faq(2), affirm(1) |
| f1_score | 13 | 0.960 | stop(1) |
| deny | 8 | 0.800 | stop(1), faq(1) |
| version | 8 | 1.000 | N/A |
| inform | 8 | 1.000 | N/A |
| debug | 6 | 1.000 | N/A |
| feedback | 6 | 0.923 | N/A |
| breaking_quote | 6 | 1.000 | N/A |
| time_from | 6 | 0.600 | time_range(3) |
| survey | 6 | 1.000 | N/A |
| creed_quote | 5 | 1.000 | N/A |
| ron_quote | 5 | 0.833 | N/A |
| chuck_quote | 5 | 1.000 | N/A |
| trump_quote | 5 | 1.000 | N/A |
| kanye_quote | 5 | 1.000 | N/A |
| inspiring_quote | 5 | 1.000 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8615 | 0.8750 | 0.8485 |
| macro avg | 33 | 0.7818 | 0.8167 | 0.7667 |
| weighted avg | 33 | 0.8617 | 0.8889 | 0.8485 |
| email_addr | 17 | 1.0000 | 1.0000 | 1.0000 |
| feedback | 6 | 0.8000 | 1.0000 | 0.6667 |
| survey | 5 | 0.7273 | 0.6667 | 0.8000 |
| debug | 5 | 0.6000 | 0.6000 | 0.6000 |
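
For anyone reading these reports, the averaged rows in the entity table above can be checked against the per-entity rows. The sketch below assumes the usual definitions (F1 as the harmonic mean of precision and recall, macro avg as the unweighted mean of per-entity F1, weighted avg as the support-weighted mean). The thread does not say how the report is computed, so treat this as a consistency check rather than the reporting code. The names `ROWS`, `f1`, `macro_f1` and `weighted_f1` are made up for illustration.

```python
# Per-entity rows copied from the entity table above (4-decimal values as
# reported): name -> (support, precision, recall, f1).
ROWS = {
    "email_addr": (17, 1.0000, 1.0000, 1.0000),
    "feedback": (6, 1.0000, 0.6667, 0.8000),
    "survey": (5, 0.6667, 0.8000, 0.7273),
    "debug": (5, 0.6000, 0.6000, 0.6000),
}


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    total = precision + recall
    return 0.0 if total == 0 else 2 * precision * recall / total


def macro_f1(rows: dict) -> float:
    """Unweighted mean of the per-entity F1 scores (assumed definition)."""
    return sum(row[3] for row in rows.values()) / len(rows)


def weighted_f1(rows: dict) -> float:
    """Support-weighted mean of the per-entity F1 scores (assumed definition)."""
    total_support = sum(row[0] for row in rows.values())
    return sum(row[0] * row[3] for row in rows.values()) / total_support


if __name__ == "__main__":
    print(f"macro F1:    {macro_f1(ROWS):.4f}")
    print(f"weighted F1: {weighted_f1(ROWS):.4f}")
    # Micro F1 recomputed from the reported micro precision and recall.
    print(f"micro F1:    {f1(0.8750, 0.8485):.4f}")
```

Under these assumptions the recomputed values agree with the reported macro avg (0.7818), weighted avg (0.8617) and micro avg (0.8615) rows to within rounding.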

github-actions[bot] (Jun 19 '21 01:06)

Commit: 441972fd69f713af64a002c46a64a95915c47908 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8715 (0.00) | 0.8571 (0.00) | 0.8977 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.8939 | N/A |
| weighted avg | 249 | 0.8737 | N/A |
| faq | 88 | 0.8941 | stop(5), affirm(3) |
| time_range | 24 | 0.8571 | time_from(3) |
| affirm | 21 | 0.7805 | faq(3), stop(2) |
| stop | 19 | 0.6818 | faq(3), affirm(1) |
| f1_score | 13 | 0.9167 | stop(1), debug(1) |
| version | 8 | 1.0000 | N/A |
| deny | 8 | 0.8571 | stop(2) |
| inform | 8 | 1.0000 | N/A |
| breaking_quote | 6 | 1.0000 | N/A |
| feedback | 6 | 0.9231 | N/A |
| time_from | 6 | 0.3333 | time_range(4) |
| survey | 6 | 1.0000 | N/A |
| debug | 6 | 0.9231 | N/A |
| creed_quote | 5 | 0.9091 | N/A |
| ron_quote | 5 | 1.0000 | N/A |
| inspiring_quote | 5 | 1.0000 | N/A |
| kanye_quote | 5 | 1.0000 | N/A |
| trump_quote | 5 | 1.0000 | N/A |
| chuck_quote | 5 | 0.9091 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8571 | 0.900 | 0.8182 |
| macro avg | 33 | 0.7667 | 0.837 | 0.7167 |
| weighted avg | 33 | 0.8525 | 0.902 | 0.8182 |
| email_addr | 17 | 1.0000 | 1.000 | 1.0000 |
| feedback | 6 | 0.8000 | 1.000 | 0.6667 |
| debug | 5 | 0.6000 | 0.600 | 0.6000 |
| survey | 5 | 0.6667 | 0.750 | 0.6000 |

github-actions[bot] (Jun 19 '21 02:06)

Commit: b731bc3ec65ba81279362ad91d457648720f568c Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8795 (0.00) | 0.8308 (0.00) | 0.8864 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.8897 | N/A |
| weighted avg | 249 | 0.8833 | N/A |
| faq | 88 | 0.9341 | stop(4), affirm(3) |
| time_range | 24 | 0.8571 | time_from(3) |
| affirm | 21 | 0.8095 | stop(3), survey(1) |
| stop | 19 | 0.6512 | affirm(1), faq(1) |
| f1_score | 13 | 0.8696 | debug(2), stop(1) |
| inform | 8 | 1.0000 | N/A |
| version | 8 | 1.0000 | N/A |
| deny | 8 | 0.8000 | stop(2) |
| breaking_quote | 6 | 0.9231 | N/A |
| time_from | 6 | 0.4615 | time_range(3) |
| survey | 6 | 0.9231 | N/A |
| feedback | 6 | 1.0000 | N/A |
| debug | 6 | 0.8571 | N/A |
| creed_quote | 5 | 1.0000 | N/A |
| inspiring_quote | 5 | 1.0000 | N/A |
| ron_quote | 5 | 0.9091 | N/A |
| trump_quote | 5 | 1.0000 | N/A |
| kanye_quote | 5 | 1.0000 | N/A |
| chuck_quote | 5 | 0.9091 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8308 | 0.844 | 0.8182 |
| macro avg | 33 | 0.7364 | 0.775 | 0.7167 |
| weighted avg | 33 | 0.8342 | 0.864 | 0.8182 |
| email_addr | 17 | 1.0000 | 1.000 | 1.0000 |
| feedback | 6 | 0.8000 | 1.000 | 0.6667 |
| debug | 5 | 0.5455 | 0.500 | 0.6000 |
| survey | 5 | 0.6000 | 0.600 | 0.6000 |

github-actions[bot] (Jun 19 '21 03:06)

Commit: feef035d8db38d03ca6ed38dd903734c98c17807 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8916 (0.00) | 0.7937 (0.00) | 0.9205 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.9139 | N/A |
| weighted avg | 249 | 0.8919 | N/A |
| faq | 88 | 0.9249 | affirm(3), stop(3) |
| time_range | 24 | 0.9167 | time_from(2) |
| affirm | 21 | 0.7179 | feedback(2), stop(2) |
| stop | 19 | 0.6829 | faq(3), affirm(1) |
| f1_score | 13 | 0.8889 | stop(1) |
| inform | 8 | 1.0000 | N/A |
| version | 8 | 1.0000 | N/A |
| deny | 8 | 0.8000 | stop(2) |
| time_from | 6 | 0.6667 | time_range(2) |
| breaking_quote | 6 | 1.0000 | N/A |
| survey | 6 | 1.0000 | N/A |
| feedback | 6 | 0.8571 | N/A |
| debug | 6 | 1.0000 | N/A |
| creed_quote | 5 | 1.0000 | N/A |
| ron_quote | 5 | 0.9091 | N/A |
| trump_quote | 5 | 1.0000 | N/A |
| kanye_quote | 5 | 1.0000 | N/A |
| inspiring_quote | 5 | 1.0000 | N/A |
| chuck_quote | 5 | 1.0000 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.7937 | 0.8333 | 0.7576 |
| macro avg | 33 | 0.6489 | 0.7083 | 0.6167 |
| weighted avg | 33 | 0.7811 | 0.8232 | 0.7576 |
| email_addr | 17 | 1.0000 | 1.0000 | 1.0000 |
| feedback | 6 | 0.8000 | 1.0000 | 0.6667 |
| survey | 5 | 0.2500 | 0.3333 | 0.2000 |
| debug | 5 | 0.5455 | 0.5000 | 0.6000 |

github-actions[bot] (Jun 19 '21 03:06)

Commit: 678316c8e2109dbc672eb2a40beef25516ad6280 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8835 (0.00) | 0.8615 (0.00) | 0.9318 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.9048365857833256 | N/A |
| weighted avg | 249 | 0.8826947162339407 | N/A |
| faq | 88 | 0.8953488372093024 | affirm(5), stop(2) |
| time_range | 24 | 0.8571428571428572 | time_from(2), faq(1) |
| affirm | 21 | 0.7441860465116279 | faq(3), stop(2) |
| stop | 19 | 0.7804878048780488 | faq(2), affirm(1) |
| f1_score | 13 | 0.9600000000000001 | stop(1) |
| inform | 8 | 1.0000000000000000 | N/A |
| deny | 8 | 0.8571428571428571 | stop(1), faq(1) |
| version | 8 | 0.9411764705882353 | N/A |
| debug | 6 | 1.0000000000000000 | N/A |
| time_from | 6 | 0.4000000000000000 | time_range(4) |
| breaking_quote | 6 | 1.0000000000000000 | N/A |
| feedback | 6 | 0.9230769230769230 | N/A |
| survey | 6 | 1.0000000000000000 | N/A |
| chuck_quote | 5 | 1.0000000000000000 | N/A |
| ron_quote | 5 | 0.8333333333333333 | N/A |
| creed_quote | 5 | 1.0000000000000000 | N/A |
| kanye_quote | 5 | 1.0000000000000000 | N/A |
| trump_quote | 5 | 1.0000000000000000 | N/A |
| inspiring_quote | 5 | 1.0000000000000000 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8615384615384615 | 0.8750000000000000 | 0.8484848484848485 |
| macro avg | 33 | 0.7818181818181817 | 0.8166666666666667 | 0.7666666666666666 |
| weighted avg | 33 | 0.8617079889807163 | 0.8888888888888888 | 0.8484848484848485 |
| email_addr | 17 | 1.0000000000000000 | 1.0000000000000000 | 1.0000000000000000 |
| feedback | 6 | 0.8000000000000000 | 1.0000000000000000 | 0.6666666666666666 |
| debug | 5 | 0.6000000000000000 | 0.6000000000000000 | 0.6000000000000000 |
| survey | 5 | 0.7272727272727272 | 0.6666666666666666 | 0.8000000000000000 |

github-actions[bot] (Jul 22 '21 20:07)

Commit: 1309403f23bc650ab4c2c71c50856b5dffff59df Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8876 (0.00) | 0.8529 (0.00) | 0.9205 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.9059092785636486 | N/A |
| weighted avg | 249 | 0.8863837046242278 | N/A |
| faq | 88 | 0.9080459770114941 | stop(4), affirm(2) |
| time_range | 24 | 0.8979591836734694 | time_from(2) |
| affirm | 21 | 0.8000000000000000 | stop(2), faq(2) |
| stop | 19 | 0.6829268292682926 | faq(3), affirm(1) |
| f1_score | 13 | 0.9230769230769231 | stop(1) |
| version | 8 | 0.9411764705882353 | N/A |
| inform | 8 | 1.0000000000000000 | N/A |
| deny | 8 | 0.7500000000000000 | stop(1), faq(1) |
| survey | 6 | 1.0000000000000000 | N/A |
| breaking_quote | 6 | 1.0000000000000000 | N/A |
| time_from | 6 | 0.4000000000000000 | time_range(3), faq(1) |
| feedback | 6 | 1.0000000000000000 | N/A |
| debug | 6 | 1.0000000000000000 | N/A |
| inspiring_quote | 5 | 1.0000000000000000 | N/A |
| kanye_quote | 5 | 1.0000000000000000 | N/A |
| trump_quote | 5 | 0.9090909090909091 | N/A |
| chuck_quote | 5 | 1.0000000000000000 | N/A |
| creed_quote | 5 | 1.0000000000000000 | N/A |
| ron_quote | 5 | 1.0000000000000000 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8529411764705883 | 0.8285714285714286 | 0.8787878787878788 |
| macro avg | 33 | 0.7786713286713287 | 0.7812500000000000 | 0.8166666666666667 |
| weighted avg | 33 | 0.8598008052553507 | 0.8674242424242424 | 0.8787878787878788 |
| email_addr | 17 | 1.0000000000000000 | 1.0000000000000000 | 1.0000000000000000 |
| feedback | 6 | 0.8000000000000000 | 1.0000000000000000 | 0.6666666666666666 |
| survey | 5 | 0.5454545454545454 | 0.5000000000000000 | 0.6000000000000000 |
| debug | 5 | 0.7692307692307693 | 0.6250000000000000 | 1.0000000000000000 |

github-actions[bot] (Jul 22 '21 21:07)

Commit: bebaf0152bca324d5b1b982d0e562968b570cf4e Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8996 (0.00) | 0.8571 (0.00) | 0.8977 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.9163354061428515 | N/A |
| weighted avg | 249 | 0.8999093491482560 | N/A |
| faq | 88 | 0.9371428571428572 | affirm(3), stop(3) |
| time_range | 24 | 0.8571428571428572 | time_from(3) |
| affirm | 21 | 0.8095238095238095 | stop(3), faq(1) |
| stop | 19 | 0.6829268292682926 | faq(3), affirm(1) |
| f1_score | 13 | 0.9600000000000001 | stop(1) |
| inform | 8 | 1.0000000000000000 | N/A |
| deny | 8 | 0.7999999999999999 | stop(1), faq(1) |
| version | 8 | 1.0000000000000000 | N/A |
| feedback | 6 | 1.0000000000000000 | N/A |
| survey | 6 | 1.0000000000000000 | N/A |
| time_from | 6 | 0.3636363636363636 | time_range(4) |
| debug | 6 | 1.0000000000000000 | N/A |
| breaking_quote | 6 | 1.0000000000000000 | N/A |
| inspiring_quote | 5 | 1.0000000000000000 | N/A |
| ron_quote | 5 | 1.0000000000000000 | N/A |
| trump_quote | 5 | 1.0000000000000000 | N/A |
| chuck_quote | 5 | 1.0000000000000000 | N/A |
| kanye_quote | 5 | 1.0000000000000000 | N/A |
| creed_quote | 5 | 1.0000000000000000 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8571428571428572 | 0.9000000000000000 | 0.8181818181818182 |
| macro avg | 33 | 0.7666666666666666 | 0.8375000000000000 | 0.7166666666666667 |
| weighted avg | 33 | 0.8525252525252525 | 0.9015151515151515 | 0.8181818181818182 |
| email_addr | 17 | 1.0000000000000000 | 1.0000000000000000 | 1.0000000000000000 |
| feedback | 6 | 0.8000000000000000 | 1.0000000000000000 | 0.6666666666666666 |
| survey | 5 | 0.6666666666666665 | 0.7500000000000000 | 0.6000000000000000 |
| debug | 5 | 0.6000000000000000 | 0.6000000000000000 | 0.6000000000000000 |

github-actions[bot] (Jul 22 '21 21:07)

Commit: 9003528e2773525a45e1f21c96980f0737aa96b4 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8755 (0.00) | 0.8308 (0.00) | 0.8977 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.8827030711470072 | N/A |
| weighted avg | 249 | 0.8779528893200972 | N/A |
| faq | 88 | 0.9285714285714285 | stop(4), affirm(2) |
| time_range | 24 | 0.8260869565217391 | time_from(4), survey(1) |
| affirm | 21 | 0.8000000000000000 | stop(3), feedback(1) |
| stop | 19 | 0.7272727272727272 | affirm(1), deny(1) |
| f1_score | 13 | 0.8333333333333333 | debug(2), stop(1) |
| deny | 8 | 0.7500000000000000 | stop(1), faq(1) |
| inform | 8 | 1.0000000000000000 | N/A |
| version | 8 | 1.0000000000000000 | N/A |
| survey | 6 | 0.9230769230769230 | N/A |
| time_from | 6 | 0.4615384615384615 | time_range(3) |
| feedback | 6 | 0.9230769230769230 | N/A |
| breaking_quote | 6 | 0.9230769230769230 | N/A |
| debug | 6 | 0.8571428571428571 | N/A |
| inspiring_quote | 5 | 1.0000000000000000 | N/A |
| kanye_quote | 5 | 1.0000000000000000 | N/A |
| chuck_quote | 5 | 1.0000000000000000 | N/A |
| trump_quote | 5 | 1.0000000000000000 | N/A |
| ron_quote | 5 | 0.9090909090909091 | N/A |
| creed_quote | 5 | 0.9090909090909091 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8307692307692308 | 0.8437500000000000 | 0.8181818181818182 |
| macro avg | 33 | 0.7363636363636364 | 0.7750000000000000 | 0.7166666666666667 |
| weighted avg | 33 | 0.8341597796143251 | 0.8636363636363636 | 0.8181818181818182 |
| email_addr | 17 | 1.0000000000000000 | 1.0000000000000000 | 1.0000000000000000 |
| feedback | 6 | 0.8000000000000000 | 1.0000000000000000 | 0.6666666666666666 |
| debug | 5 | 0.5454545454545454 | 0.5000000000000000 | 0.6000000000000000 |
| survey | 5 | 0.6000000000000000 | 0.6000000000000000 | 0.6000000000000000 |

github-actions[bot] (Jul 26 '21 04:07)

Commit: dfc0d7f31ff7ce44e1dfe206d315d0ccf14cd473 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8835 (0.00) | 0.8000 (0.00) | 0.9091 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.8924406570245675 | N/A |
| weighted avg | 249 | 0.8857622171165227 | N/A |
| faq | 88 | 0.9349112426035502 | affirm(3), stop(3) |
| time_range | 24 | 0.8163265306122450 | time_from(4) |
| affirm | 21 | 0.8372093023255814 | stop(3) |
| stop | 19 | 0.6666666666666666 | deny(2), faq(2) |
| f1_score | 13 | 0.9600000000000001 | stop(1) |
| version | 8 | 1.0000000000000000 | N/A |
| deny | 8 | 0.6666666666666666 | stop(2), breaking_quote(1) |
| inform | 8 | 1.0000000000000000 | N/A |
| time_from | 6 | 0.3333333333333333 | time_range(4) |
| debug | 6 | 1.0000000000000000 | N/A |
| feedback | 6 | 1.0000000000000000 | N/A |
| breaking_quote | 6 | 0.9230769230769230 | N/A |
| survey | 6 | 1.0000000000000000 | N/A |
| ron_quote | 5 | 0.9090909090909091 | N/A |
| creed_quote | 5 | 0.9090909090909091 | N/A |
| kanye_quote | 5 | 1.0000000000000000 | N/A |
| trump_quote | 5 | 1.0000000000000000 | N/A |
| chuck_quote | 5 | 1.0000000000000000 | N/A |
| inspiring_quote | 5 | 1.0000000000000000 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8000000000000000 | 0.81250000000000000 | 0.7878787878787878 |
| macro avg | 33 | 0.6861111111111111 | 0.73214285714285720 | 0.6666666666666666 |
| weighted avg | 33 | 0.8037037037037037 | 0.83766233766233770 | 0.7878787878787878 |
| email_addr | 17 | 1.0000000000000000 | 1.00000000000000000 | 1.0000000000000000 |
| feedback | 6 | 0.8000000000000000 | 1.00000000000000000 | 0.6666666666666666 |
| debug | 5 | 0.5000000000000000 | 0.42857142857142855 | 0.6000000000000000 |
| survey | 5 | 0.4444444444444445 | 0.50000000000000000 | 0.4000000000000000 |

github-actions[bot] (Jul 26 '21 05:07)

Commit: 319426bc43b6632b568c6fcfb777860472b0fe61 Data: default

| Configuration | Intent Classification Micro F1 | Entity Recognition Micro F1 | Response Selection Micro F1 | Story Recognition Micro F1 |
|---|---|---|---|---|
| config.yml | 0.8795 (0.00) | 0.8615 (0.00) | 0.8864 (0.00) | 0.5000 (0.00) |

Intent Cross-Validation Results

| class | support | f1-score | confused_with |
|---|---|---|---|
| macro avg | 249 | 0.8784366005656136 | N/A |
| weighted avg | 249 | 0.8784381266870133 | N/A |
| faq | 88 | 0.9239766081871345 | stop(3), deny(2) |
| time_range | 24 | 0.8571428571428572 | time_from(2), affirm(1) |
| affirm | 21 | 0.8292682926829269 | faq(2), survey(1) |
| stop | 19 | 0.7500000000000001 | version(1), deny(1) |
| f1_score | 13 | 0.8695652173913044 | debug(2), stop(1) |
| inform | 8 | 1.0000000000000000 | N/A |
| deny | 8 | 0.6250000000000000 | breaking_quote(1), stop(1) |
| version | 8 | 0.8888888888888890 | N/A |
| debug | 6 | 0.8571428571428571 | N/A |
| breaking_quote | 6 | 0.8571428571428571 | N/A |
| feedback | 6 | 1.0000000000000000 | N/A |
| time_from | 6 | 0.4000000000000000 | time_range(4) |
| survey | 6 | 0.9230769230769230 | N/A |
| trump_quote | 5 | 1.0000000000000000 | N/A |
| chuck_quote | 5 | 1.0000000000000000 | N/A |
| creed_quote | 5 | 1.0000000000000000 | N/A |
| inspiring_quote | 5 | 1.0000000000000000 | N/A |
| kanye_quote | 5 | 1.0000000000000000 | N/A |
| ron_quote | 5 | 0.9090909090909091 | N/A |

Entity Cross-Validation Results

| entity | support | f1-score | precision | recall |
|---|---|---|---|---|
| micro avg | 33 | 0.8615384615384615 | 0.8750000000000000 | 0.8484848484848485 |
| macro avg | 33 | 0.7833333333333333 | 0.8303571428571428 | 0.7666666666666666 |
| weighted avg | 33 | 0.8626262626262626 | 0.8971861471861472 | 0.8484848484848485 |
| email_addr | 17 | 1.0000000000000000 | 1.0000000000000000 | 1.0000000000000000 |
| feedback | 6 | 0.8000000000000000 | 1.0000000000000000 | 0.6666666666666666 |
| debug | 5 | 0.6666666666666665 | 0.7500000000000000 | 0.6000000000000000 |
| survey | 5 | 0.6666666666666666 | 0.5714285714285714 | 0.8000000000000000 |

github-actions[bot] (Jul 30 '21 04:07)
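
Since every report above uses the same `config.yml` and default data, the intent micro F1 values can be compared across the ten commits. This is a rough sketch of one way to summarize the run-to-run spread, using only Python's standard library. The list is copied from the summary rows above, and `summarize` is a hypothetical helper, not part of the reporting workflow.

```python
from statistics import mean, stdev

# Intent Classification Micro F1 from the ten reports above, in posting order.
INTENT_MICRO_F1 = [
    0.8996, 0.8715, 0.8795, 0.8916, 0.8835,
    0.8876, 0.8996, 0.8755, 0.8835, 0.8795,
]


def summarize(scores: list[float]) -> dict[str, float]:
    """Return min, max, mean and sample standard deviation of the scores."""
    return {
        "min": min(scores),
        "max": max(scores),
        "mean": mean(scores),
        "stdev": stdev(scores),
    }


if __name__ == "__main__":
    for name, value in summarize(INTENT_MICRO_F1).items():
        print(f"{name:>5}: {value:.4f}")
```

The reported values range from 0.8715 to 0.8996, so a difference of a point or two between two commits may sit within the variation already visible in these runs.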