parser
parser copied to clipboard
📜 Extract meaningful content from the chaos of a web page
That's pretty much it. Currently only choice is between serif/sans-serif, other than that, I can't find a way to chance the font-family.
# ##Mismatched domains Canonical version http://marshsyl.com/ AMP version https:/z/mercury.postlight.com/amp?url=http://www.marshsyl.com/ - **Platform**: - **Mercury Parser Version**: - **Node Version (if a Node bug)**: - **Browser Version (if a browser bug)**: ##...
The build with Rollup was complaining about circular file dependencies. This commit fixes those.
Issue: Incomplete extraction Link [Climate & Security Special Report](https://foreignpolicy.com/2020/04/22/climate-change-security-poverty-food-water-fragile-states-peacegames/) Desktop: Chrome app Mercury version: 4.3.1.0 Browser: Brave | Version 1.25.70 Chromium: 91.0.4472.77 (Official Build) (64-bit)
I don't get what I don't understand here: https://github.com/postlight/mercury-parser/blob/master/src/extractors/custom/README.md#using-transforms Mercury proceeds to return everything as if there's no transforms. My code: ```js accreditations: { selectors: ['#accreditations'], allowMultiple: true, clean: [...
Hi. I used Mercury Reader as a Chrome extension in the past. (Really great!) I removed it for a while b/c I wasn't reading as much at the time. I...
## Platform ## - Windows 10 64bit: - Mercury Reader Chrome Extension 4.3.1.0 - I don't know the version but I get the same result through the mercury-parser-api - Version...
- **Platform**: Darwin 20.4.0 Darwin Kernel Version 20.4.0: Thu Apr 22 21:46:41 PDT 2021; root:xnu-7195.101.2~1/RELEASE_ARM64_T8101 x86_64 - **Mercury Parser Version**: 2.2.0 ## Expected Behavior Should return the correct word count....
How to `clone` the video portion of the HTML page in order to extract and keep it intact? For example: From this url : https://abcnews.go.com/Politics/arizona-gov-doug-ducey-signs-law-purge-voters/story?id=77606533&cid=clicksource_4380645_1_heads_hero_live_hero_image I would like to keep...