bert
Processing book corpus
Hi team,
I'd like to know how to process BookCorpus for pre-training. I'm confused about how to prepare this data. Should I treat one book as a document containing all of its sentences, or treat each chapter as a separate document?
Thanks.
Same question!
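For context on what "document" means here: the BERT README describes the input to `create_pretraining_data.py` as a plain text file with one sentence per line and an empty line between documents. The sketch below only illustrates that file layout. It does not answer whether a book or a chapter is the better unit; that choice is left to the caller. The helper name `to_pretraining_text` is hypothetical, and sentence splitting is assumed to have been done already.

```python
from typing import Iterable, List


def to_pretraining_text(documents: Iterable[List[str]]) -> str:
    """Render documents as one sentence per line, with a blank line between documents.

    Each element of `documents` is a list of sentences. Whether one element
    is a whole book or a single chapter is up to the caller (hypothetical helper).
    """
    blocks = []
    for sentences in documents:
        # Collapse internal whitespace and newlines so each sentence stays on
        # one line, and drop empty sentences so no stray blank line splits a document.
        cleaned = [" ".join(s.split()) for s in sentences if s.strip()]
        if cleaned:
            blocks.append("\n".join(cleaned))
    # A blank line separates documents; the file ends with a single newline.
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    # Two toy "documents"; replace with per-book or per-chapter sentence lists.
    docs = [
        ["The first sentence.", "The second sentence."],
        ["Another document starts here."],
    ]
    print(to_pretraining_text(docs), end="")
```

Passing one list per book or one list per chapter changes only where the blank lines fall, so both options can be produced from the same sentence-split data and compared.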