yuenhern (yh)

Results 3 issues of yuenhern (yh)

**Describe the bug** The library was unable to decode byte into character. **Affected dataset(s)** - `msmarco-passage/dev/small` **To Reproduce** Steps to reproduce the behavior: 1. Make sure `collectionandqueries.tar.gz` has already been...

bug

**Describe the bug** In `msmarco_passage.py` line 199-204, the `dev/small` dataset was: ``` subsets['dev/small'] = Dataset( collection, TsvQueries(Cache(TarExtract(dlc['collectionandqueries'], 'queries.dev.small.tsv'), base_path/'dev/small/queries.tsv'), namespace='msmarco', lang='en'), TrecQrels(Cache(TarExtract(dlc['collectionandqueries'], 'qrels.dev.small.tsv'), base_path/'dev/small/qrels'), QRELS_DEFS), TrecScoredDocs(Cache(ExtractQidPid(TarExtract(dlc['dev/scoreddocs'], 'top1000.dev')), base_path/'dev/ms.run')), ) ```...

bug

Created `.github/workflows/unit_tests.yml` to run tests on GitHub Actions as mentioned in Issue #16 . Unfortunately the workflow did not complete as there are tests that failed. See below: ``` Run...