html2text icon indicating copy to clipboard operation
html2text copied to clipboard

The html2text command doesn't grab it from the document, just the headers

Open avar opened this issue 9 years ago • 6 comments

See also the related issue #106.

The command will only use the feedparser if the fetched html is retrieved over HTTP(S), but doesn't pick up documents that have a <meta http-equiv header.

avar avatar Jan 12 '16 12:01 avar

@avar could you give me an example link?

theSage21 avatar Feb 27 '16 20:02 theSage21

@avar I'm not sure I understand this completely. Could you please elaborate or point me to resources which might help?

theSage21 avatar May 27 '16 13:05 theSage21

I filed these related to this mu bug I filed which has an example file. I.e. a HTML file with an encoding="..." in the HTML itself.

avar avatar May 27 '16 13:05 avar

Ah, so when the encoding is present in the file itself it is not grabbed? Am I getting this right?

theSage21 avatar May 27 '16 13:05 theSage21

Yes.

On Fri, May 27, 2016 at 3:15 PM, arjoonn sharma [email protected] wrote:

Ah, so when the encoding is present in the file itself it is not grabbed? Am I getting this right?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/Alir3z4/html2text/issues/107#issuecomment-222143124, or mute the thread https://github.com/notifications/unsubscribe/AACw9baTIuM-h8mNEvNiu8gKQB9RkYbiks5qFu5dgaJpZM4HDHT1 .

avar avatar May 27 '16 13:05 avar

I'm in over my head. @Alir3z4 can you take this one?

theSage21 avatar May 27 '16 15:05 theSage21