Parsing microdata strips spaces

Open hkdobrev opened this issue 8 years ago • 2 comments

Given the following HTML:

<div itemscope itemtype="http://schema.org/Product"><h1 itemprop="name"><span>Foo</span> Bar</h1></div>

I would expect the library to extract a Product with the name "Foo Bar", but it extracts "FooBar", omitting the space between the two text nodes.

Do you think this would be an easy fix?
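
To illustrate one way this kind of result can arise, here is a minimal sketch. It assumes (this is not confirmed from the library's source) that the parser receives the heading's text in separate chunks and trims each chunk before concatenating them. The `chunks` array and both helper functions are hypothetical and are not part of web-auto-extractor's API.

```javascript
// Hypothetical text chunks a streaming HTML parser might emit for
// <h1 itemprop="name"><span>Foo</span> Bar</h1>
const chunks = ['Foo', ' Bar'];

// Assumed problematic approach: trim every chunk, then join.
// The leading space of ' Bar' is lost before the pieces are combined.
function joinTrimmedChunks(parts) {
  return parts.map((part) => part.trim()).join('');
}

// Alternative: join the raw chunks first, then collapse runs of
// whitespace into a single space and trim only the outer ends.
function joinThenNormalize(parts) {
  return parts.join('').replace(/\s+/g, ' ').trim();
}

console.log(joinTrimmedChunks(chunks)); // FooBar
console.log(joinThenNormalize(chunks)); // Foo Bar
```

If the cause is along these lines, normalizing whitespace once on the combined value rather than on each chunk would keep the space while still removing stray indentation and newlines.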

hkdobrev, Oct 25 '17 00:10

@Vasanth-Indix @addnab Do you think the above is a valid expectation? Do you think you'd be able to address it or point me in the right direction? Thanks!

hkdobrev, Oct 30 '17 09:10

Yes @hkdobrev. It's a valid expectation. We will look into it.

Vasanth-Indix, Oct 30 '17 09:10