evo Limiting attention radius and extracting embeddings

Open george-henderson opened this issue 9 months ago • 0 comments

Hello,

Is it possible to limit the model's attention radius, so that the model only applies attention within a fixed window of the input?
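For concreteness, a limited attention radius is commonly expressed as a banded mask over query/key positions. The sketch below is plain Python for illustration only: `window_mask`, `radius`, and `causal` are hypothetical names, not part of evo's API, and it does not assume the model exposes a way to pass in such a mask.

```python
from __future__ import annotations


def window_mask(seq_len: int, radius: int, causal: bool = True) -> list[list[bool]]:
    """Build a banded attention mask (hypothetical helper, not an evo function).

    mask[i][j] is True when query position i may attend to key position j,
    i.e. when j lies within `radius` positions of i. With causal=True,
    positions after i are additionally blocked.
    """
    if seq_len < 0 or radius < 0:
        raise ValueError("seq_len and radius must be non-negative")

    mask = []
    for i in range(seq_len):
        row = []
        for j in range(seq_len):
            within_window = abs(i - j) <= radius
            not_future = (j <= i) or not causal
            row.append(within_window and not_future)
        mask.append(row)
    return mask


if __name__ == "__main__":
    # Print a small causal mask: '#' = may attend, '.' = masked out.
    for row in window_mask(6, radius=2):
        print("".join("#" if allowed else "." for allowed in row))
```

A mask of this shape would typically be applied to the attention scores before the softmax. Whether that is possible here without modifying the model code is exactly the open question.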

A second question: could you explain how to extract the embeddings from the model? I am using the model out of the box, so the final output is a tensor of shape num_batches x input length x num_tokens, but I'd also like to access the internal latent-space representation of my text.

Thank you!

May 15 '24 17:05 george-henderson
