
Method to return body text in case of html email

Open rawadmarji opened this issue 3 years ago • 7 comments

getBodyPrefix is not returning the full body

rawadmarji avatar Dec 21 '22 12:12 rawadmarji

Method to return body text in case of html email #99

rawadmarji avatar Dec 21 '22 12:12 rawadmarji

getBody() returns the plaintext body, if available. Otherwise, you can use getBodyHTML() with jsoup (https://jsoup.org/) to convert the HTML to plaintext.

mooijtech avatar Dec 21 '22 12:12 mooijtech

Just a final question: when will a new release be delivered?

rawadmarji avatar Dec 22 '22 07:12 rawadmarji

@rawadmarji A new release containing what functionality?

mooijtech avatar Dec 22 '22 08:12 mooijtech

The fix for the seek method when using a byte array, so that the file does not have to be downloaded and saved to local disk.

rawadmarji avatar Dec 22 '22 08:12 rawadmarji

What is the issue you are running into specifically?

If you want to read PST files without fully downloading them to disk beforehand, you will run into massive network performance overhead, for example 10 seconds for a local read compared to 5 minutes over the network.

The trick is to run multiple "S3 Byte-Range Fetches" in parallel (https://docs.aws.amazon.com/whitepapers/latest/s3-optimizing-performance-best-practices/use-byte-range-fetches.html).
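As a rough illustration of the parallel byte-range idea, here is a minimal Java sketch using only the standard library. It is not part of java-libpst: `ParallelRangeFetch`, `RangeReader`, `splitRanges` and `fetchAll` are hypothetical names made up for this example. A real `RangeReader` would issue an HTTP GET with a `Range: bytes=start-end` header against S3; here the remote object is left abstract so the sketch stays self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelRangeFetch {

    /** Hypothetical abstraction: returns bytes start..endInclusive of the remote object. */
    public interface RangeReader {
        byte[] read(long start, long endInclusive) throws Exception;
    }

    /** Splits [0, totalLength) into inclusive {start, end} ranges of at most chunkSize bytes. */
    public static List<long[]> splitRanges(long totalLength, long chunkSize) {
        if (chunkSize <= 0) {
            throw new IllegalArgumentException("chunkSize must be positive");
        }
        List<long[]> ranges = new ArrayList<>();
        for (long start = 0; start < totalLength; start += chunkSize) {
            long end = Math.min(start + chunkSize, totalLength) - 1;
            ranges.add(new long[] {start, end});
        }
        return ranges;
    }

    /** Fetches all ranges on a fixed-size thread pool and reassembles them in order. */
    public static byte[] fetchAll(RangeReader reader, int totalLength,
                                  int chunkSize, int parallelism) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(parallelism);
        try {
            List<long[]> ranges = splitRanges(totalLength, chunkSize);
            List<Future<byte[]>> futures = new ArrayList<>();
            for (long[] range : ranges) {
                // One task per range; tasks run concurrently up to `parallelism`.
                Callable<byte[]> task = () -> reader.read(range[0], range[1]);
                futures.add(pool.submit(task));
            }
            byte[] result = new byte[totalLength];
            for (int i = 0; i < ranges.size(); i++) {
                byte[] chunk = futures.get(i).get(); // blocks until this range is done
                System.arraycopy(chunk, 0, result, (int) ranges.get(i)[0], chunk.length);
            }
            return result;
        } finally {
            pool.shutdown();
        }
    }
}
```

This sketch buffers the whole object in memory, so it only suits files that fit in the heap. Reading a PST without downloading it first would instead fetch ranges on demand as the parser seeks.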

As far as I know, java-libpst was not designed with this use case in mind. It's possible, though.

The fastest way to solve your issue would be to download the file locally. Even better, if you can move your code to where the file is located, the whole network issue would be solved.

Also worth mentioning is mounting an S3 bucket locally using FUSE, but I have not tried it personally, so I don't know how it performs (see https://github.com/s3fs-fuse/s3fs-fuse).

mooijtech avatar Dec 22 '22 08:12 mooijtech

But the issue is currently fixed in the develop branch.

rawadmarji avatar Dec 22 '22 09:12 rawadmarji