Dirk Groeneveld

Results 200 comments of Dirk Groeneveld

Using the slow tokenizer is a good workaround. The fact that the documentation does one thing and the code does another is still a bug though. I doubt that people...

Let's say I have a dataset where the two fields I have for every instance are "question" and "context". I never want to truncate the question. If I truncate the...

> Figure out the culprit pair, and exclude it from the batch With a large enough dataset (or many datasets in my case) this is not possible. For one thing,...

A specific solution would be a way to say "truncate first, then second" or "second, then first". As you noted, it means you have to give it something like a...

I did another run like this. This time only one of them failed. The results table points me to this dataset: https://beaker.org/ds/01GC7PCX5M9B357GX6YJFY7C5R/details. This is clearly incomplete. Its presence will prevent...

Ah! I can search experiments by name, which reveals this error message: ``` 2022-09-05T21:18:13.535899802Z {"name": "root", "msg": "[step trained_model_arc_challenge_bert-base-uncased_2147483647] Uncaught exception", "args": {"py/tuple": []}, "levelname": "ERROR", "levelno": 40, "pathname": "/opt/conda/lib/python3.9/site-packages/tango/common/logging.py",...

I will leave all this up like it is in case it helps with debugging.

The job finishes with (in this case) one failure and a few dependent steps not run. The problems with how this goes down are these: 1. It shouldn't have failed...

Glad you got unblocked. Before you did the version number thing, did you remove the entire directory? Or did you just remove all files in it?