Felipe Hertzer

Results 21 comments of Felipe Hertzer

I started the development of JSON-LD parse, the results are good, it will solve half of the author problems that we have.

Hey @adbar I'm having problem with a few publications like huffpost where it is not extracting the metadata correctly. But, if I change the line bellow to `tree = fromstring(htmlobject.encode('utf8'),...

I think it's a good idea to create a function for extracting pdf content, I have several sites that return PDF's to me. Do you have an idea how to...

Sounds good for me, it would be good to run some tests to predict what we need to have on the roadmap

I mean, the number of hits for each brand. I want to know how many hits each group has. group_key brand 1 - 10hits group_key brand 2 - 30hits group_key...

Hey @adbar, I have a similar problem, but with the site [Stuff](https://urlis.net/qqgfdyra), it is only getting half of the content, because they are using the class 'stuff-article', which is very...

@adbar I tested the ```ends-with``` and LXML seems to do not support it, do you want me to include the ```contains(@class, "-article")```?