PearsonEbookDownloader
PearsonEbookDownloader copied to clipboard
Cannot download ETEXT2_PXE-type books
This issue concerns the inability to download ETEXT2_PXE-type ebooks, the newer epub-based format from Pearson. Although it is possible to download the ePub itself without too much effort, the formatting is messed up and needs lots of fixing. This will hopefully be fixed in the future, but due to a lack of books to test with, it may take a while.
Is it possible that you can just make the new epub version downloadable, I just want to see if it's still usable even with the formatting issues. When it comes to reading I can just open it up on my phone app, but when I need to do the questions on paper it's annoying to look at my phone it would be more convenient if I can use it offline on my laptop.
It appears that they are using xhtml files and contains assets like images and css so it would be a pain in the a.. to properly even convert them to epub
According to my observations, they need two cookies to fetch:
URL: https://etext.pearson.com/eps/pearson-reader/api/item/7cd0d006-bce6-4d38-9ffb-9881840d9bc4/1/file/taylor_simon_dickey_cbcac-9e_v5_SPI_062217/OPS/xhtml/ch04_pg0004.xhtml
Cookie: _ga=GA1.(1 number).(10 numbers).(10 numbers); etext-cdn-token=(redacted)
I currently have access to a textbook in this format and a bit of experience with Python, what can I do to help?
Realistically speaking, this format of ebook would require an entirely new downloader to be made to support it. @scheng123 has made the right observations here, there's unfortunately more that's needed than just a few parameters from a URL too, cookies are also necessary, so if you're up to making a downloader for this type of book, you'd realistically be on your own.