David

Results 21 comments of David

dbt + DLT sounds like very powerful combination. I don't know how feasible this is, but I ❤️ the idea.

The html used for extracting the market_value_history has changed significantly on a recent update in Transfermarkt, and the existing logic for extracting the data does no longer work https://github.com/dcaribou/transfermarkt-scraper/blob/3e4ccb8488df1d843d36c8a8cd5d8bea949ae2d8/tfmkt/spiders/players.py#L120 The...

Ah, and it's an open API 🙌 This is super helpful, even the response format is the same as the scraped `market_value_history` object.

@tvqt - Thanks for the suggestion. I haven't really tried to scrape non EU and lower level leagues, but I'd assume that's possible without changing the scraper by providing appropriate...

Hey @visheugene. There's not an explicit limit on the number of competitions scraped by the `competitions` crawler. However, this crawler does scrape the first page from competitions list in the...

> Hey @dcaribou, > > Would you be able to help me with modifying the scraper so it recurses through the rest of the pages in the competitions list? I'm...

Hi @ScottishWolverine, Can you explain what you have tried and the problem you've found?

I think what you want is to "follow" the link. I'd suggest that you checkout [the docs for following links in scrapy](https://docs.scrapy.org/en/latest/intro/tutorial.html?highlight=follow#following-links) website.

Hey @wsciekly-kredens. Thanks for raising the issue. The reason why Marquinhos is not in the players asset is that Norwich City is now a club from the second league. It...

Most of these references have been removed, however frictionless is still being used to abstract the datapackage management. Is this actually worth it?