archiveweb.page
When recording via the extension: text extracted from LinkedIn pages contains many tags that are not text
Examples:
Post: https://www.linkedin.com/feed/update/urn:li:activity:7084078817010491393/
Profile: https://www.linkedin.com/in/philippejosephcohen/

In the WACZ, the text in pages.json contains a lot of markup before any actual text appears.
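
To make the problem easier to reproduce, here is a rough sketch for measuring how much of each page's extracted text looks like markup. It assumes the WACZ is a ZIP archive whose page lists are JSON Lines files under pages/ (for example pages/pages.jsonl), with one JSON object per page carrying "url" and "text" fields; the file name in this report may differ from that layout. The helper names markup_ratio, iter_page_texts and report are made up for this example and are not part of the extension.

```python
import json
import re
import zipfile

# Rough pattern for anything that looks like an HTML/XML tag.
TAG_RE = re.compile(r"</?[A-Za-z][^>]*>")


def markup_ratio(text):
    """Return the fraction of characters in `text` that sit inside tag-like spans."""
    if not text:
        return 0.0
    tagged = sum(len(m.group(0)) for m in TAG_RE.finditer(text))
    return tagged / len(text)


def iter_page_texts(wacz):
    """Yield (url, text) for each page entry in the archive's page lists.

    `wacz` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(wacz) as zf:
        for name in zf.namelist():
            # Assumption: page lists are JSON Lines files stored under pages/.
            if not (name.startswith("pages/") and name.endswith(".jsonl")):
                continue
            for raw in zf.read(name).decode("utf-8").splitlines():
                if not raw.strip():
                    continue
                entry = json.loads(raw)
                # Lines without a "url" (such as a header line) are skipped.
                if "url" in entry:
                    yield entry["url"], entry.get("text", "")


def report(wacz):
    """Print the share of tag-like characters in the extracted text of each page."""
    for url, text in iter_page_texts(wacz):
        print(f"{markup_ratio(text):.0%} markup  {url}")
```

Calling report() with the path to the recorded WACZ file should show a high percentage for the LinkedIn pages above if the extracted text really is dominated by tags. The regular expression is only a heuristic and will also count angle-bracketed strings that are legitimate text.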