transfermarkt-scraper
transfermarkt-scraper copied to clipboard
Collects data from Transfermarkt website
We can add the contract expiry date along with the player_valuation file, which will give another dimension of problem statement to be answered to the dataset. I propose this as...
Add a small section in the README to guide potential contributors on how to extend the functionality on existing scrapers. Things to have into account * How to create local...
There are a few general league attributes in the league index page that would be rather easy to parse and expose in the 'leagues' crawler. This ticket is to parse...
Support recursive backfilling of entity parents. For example ```console > scrapy crawl leagues ... Exception: Backfilling is not yet supported, please provide a 'parents' file ```
In the scenario of a missing lineup information for some of the teams, the logic assigning lineups and substitutes for home/away clubs does not work. This causes some players to...
See https://github.com/dcaribou/transfermarkt-datasets/issues/244
Currently the competitions scraper only chooses countries from the first page of a confederation (25 of them) and then scrapes the competitions from those countries. I've tried to extend the...
I've been joining the data on the FIFA 23 player database, and only about half of the players in the FIFA database are present in the scraped Transfermarkt data. The...
Parent issue → https://github.com/dcaribou/transfermarkt-datasets/issues/215 The attribute `market_value_history` is coming as null in the latest runs, likely due to an upstream change on Transfermarkt side. ```console $ scrapy crawl players -a...
Detected in https://github.com/dcaribou/transfermarkt-datasets/issues/290