Lemonde.fr headings and "Décryptages" are missing in parsed content

Open • kit-cat opened this issue 5 years ago • 0 comments

  • Platform: Darwin Camille 18.6.0 Darwin Kernel Version 18.6.0: Thu Apr 25 23:16:27 PDT 2019; root:xnu-4903.261.4~2/RELEASE_X86_64 x86_64
  • Mercury Parser Version: 2.2.0

Expected Behavior

All headings and "Décryptages" in Le Monde articles should appear in the parsed content.

Current Behavior

Headings and "Décryptages" are missing from the parsed content.

Steps to Reproduce

  • Use the following Le Monde article: https://www.lemonde.fr/pixels/article/2019/11/08/e-sport-tout-comprendre-a-league-of-legends-dont-les-mondiaux-vont-se-conclure-a-guichets-fermes-a-paris_6018436_4408996.html
  • Feed it to the parser.
  • Look for "Décryptages" and "Qu’est-ce que l’e-sport ?". This text will not be found in the parsed article.
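
The check in the last step can be scripted. Below is a minimal sketch, assuming the parsed article is available as an HTML string (or null). The helper names `toPlainText` and `findMissingPhrases` are hypothetical and not part of Mercury Parser. The tag stripping is deliberately crude and does not decode HTML entities, so a phrase written with entities in the markup could be reported as missing by mistake.

```typescript
// Hypothetical helpers for this report; not part of Mercury Parser.

// Replace tags with spaces and collapse whitespace runs (including
// non-breaking spaces, common before "?" in French text) to single spaces.
function toPlainText(html: string): string {
  return html.replace(/<[^>]*>/g, " ").replace(/\s+/g, " ").trim();
}

// Return the phrases that do not occur in the parsed content.
// `content` may be null when the parser extracted nothing.
function findMissingPhrases(content: string | null, phrases: string[]): string[] {
  const text = toPlainText(content === null ? "" : content);
  return phrases.filter(
    (phrase) => text.indexOf(toPlainText(phrase)) === -1
  );
}
```

With Mercury Parser, the `content` field of the result returned by `Mercury.parse(url)` would be the first argument, and `["Décryptages", "Qu’est-ce que l’e-sport ?"]` the second. For the article above, both phrases are expected to come back as missing.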

kit-cat, Nov 08 '19 13:11