novel-first-lines-dataset
novel-first-lines-dataset copied to clipboard
Removed 2500ish duplicates, fixed numerous syntactical errors
Still plenty of duplicates, often due to differences in spelling, typos. Also a great number of lines that are more than one sentence long.