soweego
soweego copied to clipboard
Link Wikidata items to large catalogs
updates: - [github.com/pycqa/isort: 5.10.1 → 5.13.2](https://github.com/pycqa/isort/compare/5.10.1...5.13.2) - https://github.com/myint/autoflake → https://github.com/PyCQA/autoflake - [github.com/PyCQA/autoflake: v1.4 → v2.3.1](https://github.com/PyCQA/autoflake/compare/v1.4...v2.3.1)
- [ ] filter rotten URLs - [ ] percent-decode IDs - [ ] double-check - [x] Discogs musician - [x] Discogs band - ~[ ] generate Discogs musical work~...
- [x] blocked by #19 - [ ] run `sync ids` - [ ] upload test edits - [ ] submit a request for permission at https://www.wikidata.org/wiki/Wikidata:Requests_for_permissions/Bot **Note:** we reverted...
- [ ] Make a class for the cache so it's also easier to switch to a new backend if needed. See https://github.com/Wikidata/soweego/pull/419#discussion_r691277529 - [ ] don't call public validation...
[This](https://w.wiki/46iF) SPARQL query returns all the regexes to validate a given ID. - [ ] Make a template out of the query, the first pattern should be a variable -...
Following a conversation with the MusicBrainz team, a _"relationship"_ set as _ended_ in the MusicBrainz database dump stands for a known broken URL: implement this check at import time.
High priority: - [x] Replace Docker with Conda - [x] bump Python to the latest version - [x] bump as many dependencies as possible to their latest version - [...
- [ ] Blocked by #433 - [ ] MusicBrainz musician - [ ] IMDb actor - [ ] IMDb director - [ ] IMDb producer - [ ] IMDb...
- [ ] 9f30cd9213d337e136ae5191b0030748e4dcc88d dramatically decreases results - [ ] dates with `00-00` break the bot - [ ] use hardcoded QIDs for gender (looking up `female` returns `Q43445` female...
Offending command: ``` python -m soweego -l soweego DEBUG importer check_urls musicbrainz ``` Stack trace: ``` 2021-08-16 08:32:54,436 [ERROR] base._finalize_fairy #702 - Exception during reset or similar Traceback (most recent...