Results 177 comments of Uri Alon

Hi @YsylviaUC , Thank you for your interest in our work and for reporting this! I just fixed that, and set `input_ids` to be the labels when labels are not...

Great! It's important **not** to use `--predict_with_generate` when saving the datastore, because we want the datastore to be saved according to the training labels, not to the random/generated labels. Let...

Hi @YsylviaUC , Good catch! This is a flag that we used to run some experiments on randomly deciding on whether or not to perform a kNN search, similarly to...

Hi @YsylviaUC ! Thank you for your interest in our work. I just pushed a commit that sets `dstore_size` automatically for you, according to your training set size, if you...

It depends on the number of CPU cores. If you can use more, the code will use them. I think a few hours.

Hi @ngohlong , Thank you for your interest in our work! It says that the GPU is out of memory. Which kind of GPU are you using? By the way,...

The `OOM when allocating tensor` means that the GPU is Out Of Memory. The model itself is very memory hungry, because it has huge vocabularies, and thus huge embedding matrices....

Hi @snowlovehang , Thank you for your interest in our work! The preprocessing formats of code2vec and code2seq are similar, but not identical, so just renaming `*.c2s` to `*.c2v` will...

Hi @sleekmike , I'm sure that Vincent will respond soon, as far as I remember the models were trained on 8 Nvidia RTX 8000 GPUs on a single machine. But...

Thanks a lot @NinedayWang ! Hopefully Vincent will be able to take a look soon. @VHellendoorn what are the next needed steps? do we need to push code to `huggingface`,...