html-similarity icon indicating copy to clipboard operation
html-similarity copied to clipboard

Fix one error regarding xml code inside html

Open catarinaacsilva opened this issue 6 years ago • 4 comments

Dear matiskay,

I am using your library to compare http and https Portuguese websites. Some of the websites still use XML inside HTML. I added one condition to your code to support lxml.html.HtmlProcessingInstruction (the XML tags were as the previously mentioned class). Can you add this fix to your code?

Thanks, Catarina

catarinaacsilva avatar Jun 01 '19 14:06 catarinaacsilva

Hi @catarinaacsilva, sorry I miss this. I will test this out later today. Thanks

matiskay avatar Jun 10 '19 15:06 matiskay

Hi @catarinaacsilva thanks for the effort on adding feature. I'm working on a new version of the package and I will include fix into it. stay tune.

matiskay avatar Jun 12 '19 04:06 matiskay

@matiskay Just out of curiousity, is this sitll an active project? (Also, thank you for your great work)

ninoseki avatar Apr 26 '21 08:04 ninoseki

@matiskay Thanks for your work. I am also using this package and was wondering if the project is still active.

ivantha avatar Aug 09 '21 04:08 ivantha