ALBERT-Pytorch
PyTorch implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations)
If I want to use my own textual data to pre-train an ELECTRA model from scratch, what format should the text be in? Is sentence segmentation alone enough, or is more required? Please help.
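A minimal sketch of the usual layout, assuming this repo follows the original BERT convention (one sentence per line, a blank line between documents); the file name is just an example:

```python
# Sketch of the assumed pretraining corpus layout: one sentence per line,
# blank line between documents. Not confirmed in this repo's docs.
corpus = """\
The first sentence of document one.
The second sentence of document one.

The first sentence of document two.
"""

with open("my_corpus.txt", "w", encoding="utf-8") as f:
    f.write(corpus)

# Sanity check: split the file back into documents on blank lines.
with open("my_corpus.txt", encoding="utf-8") as f:
    docs = [d.splitlines() for d in f.read().split("\n\n") if d.strip()]
print(len(docs), "documents,", sum(len(d) for d in docs), "sentences")
```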
Hey there, sadly I don't clearly understand how the wiki.train.tokens and vocab.txt files are produced. Could you please show me the steps to reproduce the same train.tokens...
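For context, wiki.train.tokens is the training split of the WikiText corpus, and vocab.txt is a BERT-style WordPiece vocabulary. As a hedged sketch, one way to produce a comparable vocab.txt from your own text is HuggingFace's `tokenizers` package; this is an assumption about tooling, not necessarily how the original file was made:

```python
# Hedged sketch: train a BERT-style WordPiece vocab from raw text with the
# HuggingFace `tokenizers` package (pip install tokenizers). One possible
# way to get a vocab.txt, not necessarily the author's.
from tokenizers import BertWordPieceTokenizer

tokenizer = BertWordPieceTokenizer(lowercase=True)
tokenizer.train(files=["my_corpus.txt"], vocab_size=30522)  # BERT-base vocab size
tokenizer.save_model(".")  # writes ./vocab.txt
```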
Is there a bug in there?
What should my mask_alpha and mask_beta values be if my sequence length is about 10-20?
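A rough sketch of the arithmetic at those lengths under the usual 15% masking rate; the exact role of mask_alpha and mask_beta in this repo's n-gram masking is an assumption here, so the point is only how few maskable positions such short sequences offer (long n-gram spans rarely fit):

```python
# Sketch: at the standard 15% masking rate, sequences of length 10-20
# select very few positions for prediction. mask_alpha / mask_beta appear
# to shape the n-gram span distribution in this repo (an assumption); this
# snippet only illustrates the position count.
mask_prob = 0.15
for seq_len in (10, 15, 20):
    n_pred = max(1, int(round(seq_len * mask_prob)))
    print(f"seq_len={seq_len}: ~{n_pred} masked positions")
# seq_len=10: ~2, seq_len=15: ~2, seq_len=20: ~3
```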
I am trying to train the ALBERT model using my own dataset, but I don't know why I got an error like this. Please let me know when you figure it out. Thank you.
I'm running classify on the MRPC dataset. The call trainer.train(get_loss, model_file, True) allows only three arguments, not four, so I can't use the pretrain file. It also runs out of memory,...
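One standard PyTorch workaround, sketched under the assumption that the checkpoint keys roughly match the model: load the pretrained weights into the model yourself before calling trainer.train. The checkpoint path and the placeholder model below are hypothetical:

```python
# Hedged workaround sketch: load a pretrained checkpoint with plain PyTorch
# before calling trainer.train(...). Path and key layout are assumptions.
import torch
import torch.nn as nn

# `model` stands in for the repo's classifier; a tiny placeholder is used
# here so the snippet runs on its own.
model = nn.Linear(768, 2)

state = torch.load("albert_pretrain.pt", map_location="cpu")  # assumed path
missing, unexpected = model.load_state_dict(state, strict=False)
print("missing keys:", missing)
print("unexpected keys:", unexpected)
# For the out-of-memory error, lowering batch_size (and/or max_len) in the
# config is the usual first step.
```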
In ALBERT (Lan et al.), there is no detail about the 80% mask. But n-gram masking (Joshi et al., 2019) does describe the 80/10/10 split: > As in BERT, we also...
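For reference, a small sketch of the BERT-style 80/10/10 rule the quoted passage refers to: of the positions selected for prediction, 80% are replaced with [MASK], 10% with a random vocabulary token, and 10% are left unchanged:

```python
# Sketch of the BERT-style 80/10/10 rule: for each position chosen for
# prediction, replace with [MASK] 80% of the time, a random vocabulary
# token 10% of the time, and keep the original token 10% of the time.
import random

def apply_80_10_10(tokens, positions, vocab, mask_token="[MASK]"):
    out = list(tokens)
    for pos in positions:
        r = random.random()
        if r < 0.8:
            out[pos] = mask_token            # 80%: replace with [MASK]
        elif r < 0.9:
            out[pos] = random.choice(vocab)  # 10%: random token
        # else: 10% keep the original token unchanged
    return out

tokens = ["the", "cat", "sat", "on", "the", "mat"]
print(apply_80_10_10(tokens, positions=[1, 4], vocab=["dog", "ran", "hat"]))
```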