RetroMAE

Codebase for RetroMAE and beyond.

16 RetroMAE issues

I have logged the parameters of the training into wandb (see the [link](https://api.wandb.ai/links/nguyenducnhan-work/j586u538)). ![image](https://github.com/staoxiao/RetroMAE/assets/95571916/41df1593-8032-45c8-8126-ff9ca8d7e0e8)

This is my config:

```python
pretrain.run --output_dir output_merge_data \
    --report_to wandb \
    --data_dir...
```

```
--2024-03-07 12:47:02--  https://msmarco.blob.core.windows.net/msmarcoranking/qidpidtriples.train.full.2.tsv.gz
Resolving msmarco.blob.core.windows.net (msmarco.blob.core.windows.net)... 20.150.34.4
Connecting to msmarco.blob.core.windows.net (msmarco.blob.core.windows.net)|20.150.34.4|:443... connected.
HTTP request sent, awaiting response... 404 The specified resource does not exist.
2024-03-07 12:47:03 ERROR 404: The specified...
```
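
A quick way to confirm that the file is actually gone from the host (rather than a local network issue) is to probe the URL before downloading. A minimal sketch using the `requests` library, not part of this repo:

```python
import requests

# Probe the MS MARCO blob URL before attempting a full download;
# a 404 here confirms the resource no longer exists at this path.
url = ("https://msmarco.blob.core.windows.net/msmarcoranking/"
       "qidpidtriples.train.full.2.tsv.gz")
resp = requests.head(url, allow_redirects=True, timeout=30)
print(resp.status_code, resp.reason)
```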

Can you provide some examples of the data formats for training the pretrain, reranker, and retriever models? I'm not familiar with these formats. Thanks!
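
Not an answer from the maintainers, but as a rough illustration of what such formats often look like, here is a hypothetical sketch of one JSONL record per task. The field names are assumptions and may not match this repo's data loaders:

```python
import json

# Hypothetical records; field names are illustrative assumptions,
# not confirmed against this repo's loaders.
pretrain_example = {"text": "A raw passage used for masked-autoencoder pretraining."}
retriever_example = {
    "query": "what is retromae",
    "pos": ["A passage relevant to the query."],
    "neg": ["An unrelated passage used as a hard negative."],
}
reranker_example = {
    "query": "what is retromae",
    "passage": "A candidate passage scored by the cross-encoder.",
    "label": 1,  # 1 = relevant, 0 = not relevant
}

for record in (pretrain_example, retriever_example, reranker_example):
    print(json.dumps(record))  # one JSON object per line (JSONL)
```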

Great job! Hello, I wonder if you can tell me the training MLM accuracy of the encoder and decoder. I'm training my RetroMAE model now.
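
The repo may not log this metric directly; a minimal sketch of how MLM accuracy can be computed from a head's logits and labels, assuming the Hugging Face convention of `-100` on unmasked positions:

```python
import torch

def mlm_accuracy(logits: torch.Tensor, labels: torch.Tensor) -> float:
    """Token-level MLM accuracy over masked positions only.

    logits: (batch, seq_len, vocab_size)
    labels: (batch, seq_len), with -100 on unmasked positions
            (the Hugging Face convention), which are excluded.
    """
    masked = labels != -100
    preds = logits.argmax(dim=-1)
    correct = (preds[masked] == labels[masked]).sum().item()
    return correct / max(masked.sum().item(), 1)
```

Applying the same function to the encoder's and the decoder's MLM outputs lets you compare the two accuracies over training.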

Hello, I tried to use your checkpoint to fine-tune the RetroMAE_MSMARCO model, but the result is lower than the number in your paper (e.g., the MRR@10 is 0.393 in the paper, ...
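
When reproducing numbers like this, it helps to first rule out metric-definition mismatches. A minimal reference computation of MRR@10 (not the repo's official evaluation script):

```python
def mrr_at_k(first_relevant_ranks, k=10):
    """Mean reciprocal rank at cutoff k.

    first_relevant_ranks: per query, the 1-based rank of the first
    relevant passage in the ranked list, or None if absent.
    Queries with no relevant hit in the top k contribute 0.
    """
    total = 0.0
    for rank in first_relevant_ranks:
        if rank is not None and rank <= k:
            total += 1.0 / rank
    return total / len(first_relevant_ranks)

# Three queries whose first relevant hits land at ranks 1, 4, and nowhere:
print(mrr_at_k([1, 4, None]))  # (1 + 0.25 + 0) / 3 ≈ 0.4167
```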

Hi staoxiao, I wanted to ask more about how the enhanced decoding works. It looks like it generates 256 random possible attention masks and then picks randomly from those...
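
For readers following along, a simplified sketch of the idea being asked about: in enhanced decoding, each decoder position gets its own randomly sampled set of visible context positions, and a position never attends to itself. The sampling strategy and special-position handling below are assumptions, not the repo's exact implementation:

```python
import random
import torch

def sample_decoder_attention_mask(seq_len: int, n_visible: int) -> torch.Tensor:
    """Per-position random visibility for enhanced decoding (simplified).

    Each query position i attends to a random subset of other positions,
    always to position 0 (the sentence embedding), and never to itself.
    """
    mask = torch.zeros(seq_len, seq_len, dtype=torch.bool)
    for i in range(seq_len):
        candidates = [j for j in range(seq_len) if j != i]
        for j in random.sample(candidates, min(n_visible, len(candidates))):
            mask[i, j] = True
        mask[i, 0] = True   # sentence embedding is always visible...
        mask[i, i] = False  # ...but no position may attend to itself
    return mask  # True = attention allowed

print(sample_decoder_attention_mask(8, n_visible=3).int())
```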

Hello, thank you for your work and the provided code! When do you plan to release the code for RetroMAE v2?

```
Traceback (most recent call last):
  File "E:\RetroMAE-master\RetroMAE-master\examples\pretrain\preprocess.py", line 158, in <module>
    wiki = create_wiki_data(args.tokenizer_name, args.max_seq_length, args.short_seq_prob)
  File "E:\RetroMAE-master\RetroMAE-master\examples\pretrain\preprocess.py", line 62, in create_wiki_data
    tokenizer = AutoTokenizer.from_pretrained("F:\bert-base-uncased")
  File "C:\Users\HZY\AppData\Local\Programs\Python\Python39\lib\site-packages\transformers\models\auto\tokenization_auto.py", line 463, in from_pretrained...
```
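
The traceback is truncated, but one likely culprit is visible in the call itself: in a regular Python string literal, the `\b` in `"F:\bert-base-uncased"` is a backspace escape, so the path handed to `from_pretrained` is corrupted. A raw string or forward slashes avoids this:

```python
from transformers import AutoTokenizer

# Use a raw string so "\b" is not interpreted as a backspace escape,
# or use forward slashes, which Windows also accepts.
tokenizer = AutoTokenizer.from_pretrained(r"F:\bert-base-uncased")
# equivalently: AutoTokenizer.from_pretrained("F:/bert-base-uncased")
```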