article-extractor

Extract the main article from a given URL with Node.js

Results 12 article-extractor issues

This is the error I get while using the extract function. "Error: ENOENT: no such file or directory, open 'S:\tmp\hermes\staging\hermes\cmake\intlDebug\arm64-v8a\lib\InternalBytecode\InternalBytecode.js" Initially, my app wouldn't even open as it couldn't find...

After `npm start`, it shows: `WARNING in ./node_modules/tldts-core/dist/es6/index.js [1] Module Warning (from ./node_modules/source-map-loader/dist/cjs.js): [1] Failed to parse source map from 'D:\projectName\node_modules\tldts-core\index.ts' file: Error: ENOENT: no such file or directory, open...

`Document#toString()` returns the HTML with `linkedom` but not in the browser ![image](https://user-images.githubusercontent.com/19922286/176678132-c8401976-ae37-4020-b626-4c8373a0bc72.png) - [x] Waiting for https://github.com/WebReflection/linkedom/issues/146

Having the ability to pass a proxy and custom headers to the `extract` function would be great.

Pages to test on: * https://www.pinterest.ca/variamsingh87/ * https://www.pinterest.com.au/seriako/ Code: ```js import { extract } from '@extractus/article-extractor' const input = 'https://www.pinterest.ca/variamsingh87/' await extract(input) ``` Error: ``` TypeError: Cannot read properties of...

Hello! `curl http://localhost:3100/?url=https://www.lonelyplanet.fr/article/croatie-croisiere-entre-dubrovnik-mljet-et-korcula` ``` {"error":1,"message":"Expected ',' or '}' after property value in JSON at position 543 (line 23 column 7)","data":null,"meta":{"service":"article-parser","lang":"javascript","server":"express","platform":"node"}} ``` The same happens with Deno. Thanks!

Hello! I opened #368 for the exact opposite reason last week; this time the Deno example works but the Node one does not :upside_down_face: `https://reporterre.net/Arnaud-Begin-l-inspecteur-nucleaire-qu-EDF-veut-faire-taire` getting ``` { "error": 1, "message": "Request...

The Open Graph protocol states that if there are multiple tags, the first one should be preferred in cases of conflicts. See here: https://ogp.me/#array But `article-extractor` prefers the last tag....
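The first-tag-wins rule described in this issue can be illustrated with a minimal sketch. It is not taken from `article-extractor`: `pickFirstMetaValues` is a hypothetical helper, the input is assumed to be a list of `[property, content]` pairs already collected from the page in document order, and the URLs are placeholders.

```js
// Hypothetical helper, not part of article-extractor.
// Collapses [property, content] pairs (assumed to be in document order)
// so that the FIRST occurrence of each property is kept.
function pickFirstMetaValues (pairs) {
  const result = new Map()
  for (const [property, content] of pairs) {
    // Skip later duplicates: the earlier tag takes precedence.
    if (!result.has(property)) {
      result.set(property, content)
    }
  }
  return result
}

// Placeholder data: two conflicting og:image tags.
const tags = [
  ['og:image', 'https://example.com/first.jpg'],
  ['og:image', 'https://example.com/second.jpg'],
  ['og:title', 'Example title']
]

const meta = pickFirstMetaValues(tags)
console.log(meta.get('og:image'))
```

A "last tag wins" result, as reported in the issue, is what falls out of unconditionally overwriting the entry on every pair; the `has` check is the only difference.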

Currently the npm specifiers limit deployment options. It would be nice to have a full Deno release.

Hello, I have a TypeScript application and I use your tool. In my TypeScript file, I import your library dynamically with `import { extract } from "@extractus/article-extractor";` But once...