html2text icon indicating copy to clipboard operation
html2text copied to clipboard

Problem with <font> tags --> not displaying markdown syntax

Open kcmoffitt opened this issue 12 years ago • 1 comments

Hi, first time poster here. I apologize in advance for not following issue-submission protocol that may exist.

I am working on converting corporate annual reports (default format html, yet no standardized form of html) to text with markdown syntax. HTML2Text works perfectly for and tags, but not for <font...FontWeight: Bold> type tags. In these instances, the text is displayed with no markdown tags. I am a novice Python programmer and I cannot overcome this issue on my own.

This research is very important as it will expose certain companies that were either negligent or incompetent in the years before and surrounding the recent financial meltdown. Any help will be greatly appreciated.

Here is some sample html that exhibits the problem I described above...

https://docs.google.com/document/d/1PUSJWCfnddFCMzb_qiIg7dQYxwyBJpsh-T_cR55oa-A/edit?usp=sharing

kcmoffitt avatar Apr 09 '13 15:04 kcmoffitt

@ordinaryProfessor this is not a good method of sharing HTML (I am afraid Google Docs do some conversion about it). Do you want to say that your example is http://mcepl.fedorapeople.org/tmp/SampleHTML.html ?

mcepl avatar Nov 07 '13 13:11 mcepl