Uri Alon
Hi @YsylviaUC , Thank you for your interest in our work and for reporting this! I just fixed that, and set `input_ids` to be the labels when labels are not...
Great! It's important **not** to use `--predict_with_generate` when saving the datastore, because we want the datastore to be saved according to the training labels, not to the random/generated labels. Let...
Hi @YsylviaUC , Good catch! This is a flag that we used to run some experiments on randomly deciding whether or not to perform a kNN search, similarly to...
Hi @YsylviaUC ! Thank you for your interest in our work. I just pushed a commit that sets `dstore_size` automatically for you, according to your training set size, if you...
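To illustrate what sizing a datastore from the training set could look like, here is a minimal sketch. It assumes one datastore entry per training token; the helper `count_datastore_entries` and the toy data are hypothetical, and the exact counting rule in the repository (for example, how padding is handled) may differ.

```python
from typing import Iterable, Sequence


def count_datastore_entries(tokenized_examples: Iterable[Sequence[int]]) -> int:
    """Count tokens, assuming (hypothetically) one datastore entry per training token."""
    return sum(len(token_ids) for token_ids in tokenized_examples)


# Hypothetical tokenized training set: each inner list holds token ids.
train_examples = [[101, 7, 42, 9], [101, 5], [101, 8, 8]]

dstore_size = count_datastore_entries(train_examples)
print(dstore_size)  # 9
```

Computing the size from the data avoids having to keep a manually passed value in sync with the training set.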
It depends on the number of CPU cores: the code will use as many as are available. I think it should take a few hours.
Hi @ngohlong , Thank you for your interest in our work! It says that the GPU is out of memory. Which kind of GPU are you using? By the way,...
The `OOM when allocating tensor` error means that the GPU is out of memory. The model itself is very memory hungry, because it has huge vocabularies, and thus huge embedding matrices....
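As a rough illustration of why large vocabularies translate into large memory use, the sketch below estimates the size of a single dense embedding matrix. The vocabulary size and embedding dimension are hypothetical values chosen only for the arithmetic, not the model's actual configuration, and 32-bit floats are assumed.

```python
def embedding_bytes(vocab_size: int, embedding_dim: int, bytes_per_value: int = 4) -> int:
    """Bytes needed for one dense (vocab_size x embedding_dim) matrix of 32-bit floats by default."""
    return vocab_size * embedding_dim * bytes_per_value


# Hypothetical sizes, for illustration only.
vocab_size = 1_000_000
embedding_dim = 128

size_mib = embedding_bytes(vocab_size, embedding_dim) / 2**20
print(f"{size_mib:.1f} MiB for one embedding matrix")
```

This counts only the weights of one matrix. Gradients, and optimizers that keep per-parameter state such as Adam, add further copies of the same shape, so the real footprint during training is a multiple of this figure.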
Hi @snowlovehang , Thank you for your interest in our work! The preprocessing formats of code2vec and code2seq are similar, but not identical, so just renaming `*.c2s` to `*.c2v` will...
Hi @sleekmike , I'm sure that Vincent will respond soon. As far as I remember, the models were trained on 8 Nvidia RTX 8000 GPUs on a single machine. But...
Thanks a lot @NinedayWang ! Hopefully Vincent will be able to take a look soon. @VHellendoorn what are the next steps? Do we need to push code to `huggingface`,...