Vladimir Prelovac

Results 20 comments of Vladimir Prelovac

I guess it was a network issue. What is the purpose of a local install if it still needs to connect to an external server?

Trafilatura 1.2.2 - Python 3.8.10 I get empty result when called from command line (and exception when called from API) > trafilatura -u "https://www.thekitchn.com/pan-fried-chicken-thighs-recipe-23394671" > trafilatura -u "http://thenextweb.com/news/web-inventor-tim-berners-lee-screw-web3-my-decentralized-internet-doesnt-need-blockchain" > Python...

Perhaps a resonable quick 'fix' would be adding a proper error message for download fail. A more ambitious one would be to provide a paramater for alternate URL (for example...

FYI I installed pycurl and it still fails for all the above URLs. I am currently retrying by using a proxy when trafilatura raises an exception and this seems to...

> @vprelovac as for the alternative URL source it exists on the command-line with the argument `--archived`: > https://trafilatura.readthedocs.io/en/latest/usage-cli.html#internet-archive This is exactly what we need :) To clarify, when you...

> As of now this parameter cannot be enabled for `fetch_url()`, I will think about it. "Do or do not, there is no try" :)

1. I agree it is complex. However you already decided to use regex approach for english, and my point is that the regex I provided is higher quality and faster...

@atcbosselut Are you going to release pre-trained weights for Newsroom? Would make evaluating the model much easier! Thanks!

It seems to be a change in Safari. More info here: https://ecosia.zendesk.com/hc/en-us/articles/360023798274-What-is-the-new-Ecosia-Mac-App-Extension-for-Safari- New API does not allow to forward searches directly from address bar. It will effectively kill this extension...

Yes, and it doesn't work. Colab had huge community of ML users and I suggest that you include it as part of testing.