wikiextractor
'{{snd}}' should resolve to '–' or '-'
While leaving a comment at https://github.com/attardi/wikiextractor/issues/130#issuecomment-890390800, I noticed a bug with respect to the following line.
According to:
- Wikipedia rendered:
  (11 August 1848 – 27 June 1934) was an
- Wikipedia source:
  (11 August 1848 – 27 June 1934) was an
- dump XML:
  (11 August 1848{{snd}}27 June 1934) was an
- wikiextractor output (erroneous):
  (11 August 184827 June 1934) was an
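
For illustration, here is a minimal sketch of the expected behaviour, assuming the template is expanded by a simple pre-processing pass before other templates are stripped. The names `expand_snd` and `SND_PATTERN` are hypothetical and not part of wikiextractor; where such a step would actually hook into the extractor is not something this sketch claims to know.

```python
import re

# Hypothetical helper, NOT part of wikiextractor's API.
# Matches {{snd}}, allowing optional whitespace inside the braces and any letter case.
SND_PATTERN = re.compile(r"\{\{\s*snd\s*\}\}", re.IGNORECASE)


def expand_snd(text: str, dash: str = "\u2013") -> str:
    """Replace each {{snd}} template with a dash surrounded by spaces.

    `dash` defaults to an en dash (U+2013); pass "-" for a plain ASCII hyphen.
    """
    return SND_PATTERN.sub(f" {dash} ", text)


if __name__ == "__main__":
    line = "(11 August 1848{{snd}}27 June 1934) was an"
    print(expand_snd(line))       # (11 August 1848 – 27 June 1934) was an
    print(expand_snd(line, "-"))  # (11 August 1848 - 27 June 1934) was an
```

With a substitution like this applied to the dump XML text, the two dates stay separated instead of being fused into `184827`.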