yzhang-github-pub

Results 15 issues of yzhang-github-pub

I cloned https://github.com/aws-samples/amazon-sagemaker-bert-pytorch.git in SageMaker, and ran jupyter notebook without any modification, and got error as below: "UnexpectedStatusException: Error for Training job pytorch-training-2020-10-27-16-28-37-955: Failed. Reason: AlgorithmError: ExecuteUserScriptError: Command "/opt/conda/bin/python train_deploy.py...

type: question
PyTorch

I used the default xavier method to init params and the model converged as loss dropped from ~4 to

Dear Author, I tried Xtransformer for machine translation task, and got val loss==0.0 at the very first epoch. I don't know where I did it wrong. Please advice. Here is...

I am trying to use Xtransformer for language translation. In the original transformer paper, target input to decoder is masked such that attentions are only to current and past tokens,...

Dear Author, Thanks for your excellent work! I want to try your implementation for language translation related task. I have two questions and I'd appreciate your help very much: 1....

Aladdin, I watched your excellent youtube videos on transformer implementation with pytorch. When I tried to test in google colab, I got NotImplementedError in https://github.com/aladdinpersson/Machine-Learning-Collection/blob/master/ML/Pytorch/more_advanced/Seq2Seq_attention/seq2seq_attention.py line 243: for batch_idx, batch...

Dear Author, I am fine-tuning your pretrained RITA with a protein family data, using run_clm.py script @ huggingface. I tried this format where seq1 & seq2 are protein sequences 1...

Dear Author, Your tool 'isONcorrect' can be very useful to many users. I wonder if you can provide a dockerfile, or better yet, create a docker image with all dependencies...

Instead of generate the best (top 1) sequence from autoregressive generate function, it is useful to generate >1 best sequences. For example, in language translation, human users can have choices...

Tested minimap 2.22 & 2.24 with the options modified from -x splice: -k15 -w5 --splice -g2k -G200k -A1 -B2 -O2,32 -E1,0 -b0 -C9 -z200 -ub --junc-bonus=9 --cap-sw-mem=0 --splice-flank=no I got:...

bug