email-outlook-message-perl

Handle RTF-encapsulated HTML

Open kosta-beller opened this issue 8 years ago • 5 comments

The attached archive contains .msg files with HTML bodies encoded in base64. The library fails to convert them correctly to .eml messages.

msg.zip

kosta-beller avatar Jan 05 '17 12:01 kosta-beller

@granite-zero I'm pretty sure your issue is different: The problem with @kobeller's messages is that the HTML bodies are actually RTF, and the problem does occur on Linux. Can you open a new ticket please?

mvz avatar Mar 21 '17 14:03 mvz

I also have the same issue on Linux when converting the HTML: the library sees RTF and then adds the HTML part as an RTF attachment. Is there a way to get around this? Perhaps by extracting the HTML from the RTF part?

genericcx avatar Apr 13 '18 15:04 genericcx

These bodies are RTF-encapsulated HTML (who came up with that?), and there's a process for converting it back to HTML.

mvz avatar Aug 31 '20 06:08 mvz

@mvz I know this edge case is weird, but it reached me as a real complaint from a customer. So, should I follow the guide you linked, or do you intend to implement it in the library? Also, any tips for deterministically identifying those messages and distinguishing them from, for example, plain RTF?

kosta-beller avatar Aug 31 '20 15:08 kosta-beller

@kobeller I'm considering implementing it in the library, but I'm still investigating whether there's a suitable RTF reader library in Perl that I can use as a base.
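
On telling these messages apart from plain RTF: as far as I understand the RTF encapsulation format, an encapsulated-HTML body carries the `\fromhtml1` control word in its RTF header, while plain RTF does not. Below is a minimal sketch of that check, in Python for illustration only (this is not code from the library). It assumes the RTF body has already been extracted from the .msg file and decompressed; the function name `is_encapsulated_html` and the 512-byte header window are hypothetical choices, not anything the library defines.

```python
import re

# Matches the \fromhtml1 control word. The negative lookahead keeps it from
# matching a longer numeric parameter such as \fromhtml10.
_FROMHTML = re.compile(rb"\\fromhtml1(?![0-9])")


def is_encapsulated_html(rtf: bytes, header_bytes: int = 512) -> bool:
    """Guess whether decompressed RTF is RTF-encapsulated HTML.

    Only the first `header_bytes` bytes are inspected (an assumed window),
    because the marker is expected in the RTF header, not in the body text.
    """
    head = rtf.lstrip()[:header_bytes]
    if not head.startswith(b"{\\rtf"):
        return False  # not RTF at all
    return _FROMHTML.search(head) is not None


# Made-up sample bodies, for illustration only.
encapsulated = rb"{\rtf1\ansi\ansicpg1252\fromhtml1 \deff0{\fonttbl{\f0 Arial;}}{\*\htmltag64 <html>}}"
plain = rb"{\rtf1\ansi\deff0{\fonttbl{\f0 Arial;}}Hello\par}"

print(is_encapsulated_html(encapsulated))
print(is_encapsulated_html(plain))
```

This is a heuristic rather than a full RTF parse: it does not tokenize the header, so a body that happens to contain the same byte sequence early on would be misclassified.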

mvz avatar Sep 01 '20 07:09 mvz