corpus_process_script
corpus_process_script copied to clipboard
Unable to retrieve strokes - 'NoneType' object has no attribute 'contents'
Hi!
Thanks for the helpful tool. I tried to use it with giga_small.txt you provided. But only the first few characters look correct. Below are the output I got: (stroke) Qihuis-MacBook-Pro:extract_zh_char_stoke qihuixu$ python extract_zh_char_stoke.py --input ./Data/giga_small.txt --output fun Extract Chinese Character Stoke Information fun.TempFound handling with the 1000 line, all -1 lines. Handle Finished read dict. handling with the 11 line, all 1824 lines.From handian: word 人 url https://www.zdic.net/search/ 'NoneType' object has no attribute 'contents' All Finished.
The output file looks like this:
中 丨フ一丨
国 丨フ一一丨一丶一
庆 丶一ノ一ノ丶
假 ノ丨フ一丨一一フ一フ丶
期 一丨丨一一一ノ丶ノフ一一
香 ノ一丨ノ丶丨フ一一
江 丶丶一一丨一
将 丶一丨ノフ丶一丨丶
涌 丶丶一フ丶丨フ一一丨
入 ノ丶
~
~
~
~
~
~
~
~
~
~
~
~
~
Appreciate if you could take a look and let me know how to fix it!