turkish-bert
How to get all hidden layers' output of pre-trained BERTurk model in HuggingFace Transformers library?
Hi Stefan, I have a problem getting all hidden layers' output of BERTurk. I tried the following:
model = AutoModel.from_pretrained("dbmdz/bert-base-turkish-uncased")
# Convert inputs (length 20) to PyTorch tensors
tokens_tensor = torch.tensor([indexed_tokens])
segments_tensors = torch.tensor([segments_ids])

model.eval()
with torch.no_grad():
    outputs = model(tokens_tensor, segments_tensors)
The output contains two tensors.
print(outputs[0])
print(len(outputs[0][0]))     # 20 entries, one per token of the sentence
print(outputs[0][0][0])       # outputs[0][0][i] is the vector for token i; index 0 is [CLS]
print(len(outputs[0][0][0]))  # 768, the embedding size
I am also not sure whether outputs[0] is the final hidden state.
And outputs[1] is as follows:
print(outputs[1][0])
print(len(outputs[1][0]))  # 768 entries
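For reference, if I understand the BertModel docs correctly, outputs[0] is the last hidden state and outputs[1] is the pooler output (a transformed [CLS] vector). Here is a minimal plain-Python sketch of the two shapes (no model download needed; the zeros are stand-ins for the real values of a 20-token input):

```python
# Illustrative stand-in for a BERTurk forward pass on a 20-token input.
batch_size, seq_len, hidden_size = 1, 20, 768

# outputs[0]: last hidden state, shape (batch, seq_len, hidden)
last_hidden_state = [[[0.0] * hidden_size for _ in range(seq_len)]
                     for _ in range(batch_size)]

# outputs[1]: pooler output, shape (batch, hidden)
pooler_output = [[0.0] * hidden_size for _ in range(batch_size)]

print(len(last_hidden_state[0]))     # 20, one vector per token
print(len(last_hidden_state[0][0]))  # 768
print(len(pooler_output[0]))         # 768
```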
I also tried what is described in https://huggingface.co/transformers/model_doc/bert.html#tfbertmodel, but I got an error when I set output_hidden_states = True.
Maybe I can help you with this issue. Here is my sample code to get the output of every Transformer layer. You should set the output_hidden_states attribute in the config for AutoModel:
config = AutoConfig.from_pretrained("dbmdz/bert-base-turkish-128k-cased", output_hidden_states=True)
model = AutoModel.from_pretrained("dbmdz/bert-base-turkish-128k-cased", config=config)
with torch.no_grad():
    outputs = model(inputs, attention_mask=masks)
    final_hidden_states = outputs[0]  # last layer, shape (batch, seq_len, 768)
    all_hidden_states = outputs[2]    # tuple of all layer outputs (one forward pass is enough)
all_hidden_states is a tuple of length 13 (1 entry for the embedding layer plus 12 for the Transformer layers). For example, the [CLS] output of the 10th layer can be found with all_hidden_states[-3][:, 0, :].
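To make the negative indexing concrete, here is a tiny pure-Python stand-in for the tuple layout (each string represents a tensor of shape (batch, seq_len, 768); no model needed):

```python
# Layout of the hidden-states tuple: embeddings first, then layers 1-12.
embedding_layer = ["embeddings"]
transformer_layers = [f"layer_{i}" for i in range(1, 13)]
all_hidden_states = tuple(embedding_layer + transformer_layers)

print(len(all_hidden_states))  # 13
print(all_hidden_states[-3])   # layer_10 -> all_hidden_states[-3][:, 0, :] is its [CLS] row
print(all_hidden_states[-1])   # layer_12, the same tensor as outputs[0]
```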
I hope it helps.
Thank you so much @ozcangundes , I will try it and let you know if it works for me or not.