archiveweb.page When recording via extension: Text extraction from LinkedIn site contains a lot of tags which are not text

When recording via extension: Text extraction from LinkedIn site contains a lot of tags which are not text

Open tsemachh opened this issue 1 year ago • 0 comments

Example: https://www.linkedin.com/feed/update/urn:li:activity:7084078817010491393/ Or Profile: https://www.linkedin.com/in/philippejosephcohen/ In the WACZ the text in pages.json contains many markups until we get to see text

Jul 12 '23 13:07 tsemachh

archiveweb.page archiveweb.page copied to clipboard

When recording via extension: Text extraction from LinkedIn site contains a lot of tags which are not text

archiveweb.page
archiveweb.page copied to clipboard