lenicolas
Good afternoon, Same issue over here. Please note that we cannot pull the code directly: we have strict security procedures and have to install from a downloaded zip file. Also,...
Good morning, Found something in the code: it might come from the polib library, which runs in updates_locales.py and fetches the messages according to the local environment. Either it didn't install...
Good afternoon, We investigated further with a Python-fluent colleague ;-) : the update_locales.py only translates locales, so it is not part of the issue. polib is installed and activated, out of the...
> Hello, Would like to insert a line between lines 8 and 9 to support gated-gelu activation: `"gated-gelu": common_spec.Activation.GELU,` Also, gated-gelu does not feature in the "transformers.py" script. Might...
I tried updating CT2 to 4.2.1 and launching a training. It breaks at the validation step with the errors below. At first, I thought it was the "None" values set in...
> Tried pretty much any version of CT4.x and onmt3.5x with compatible torch/cuda. Also tried different data to check this out, and different population method. Do you have any idea?...
I think I've found the bug... it happens while calculating validation metrics at the first validation step. At this point, training repeatedly calls the scoring utils, which manipulate "example" strings....
OK, I'll follow eole to see when we can switch. Too much focus on LLMs in OpenNMT-py lately. As for the transform, well there's no real need for it at...
I tried removing it earlier this morning, then launched a test and went on an errand. You are right, it's something else. I'll remove the filtertoolong all the way,...
Isn't the risk of aborting training on some empty token bigger? I've implemented it, but I keep my fingers crossed... Then, you might want to rebase your PR on version...