Sherif Gad

Results 3 comments of Sherif Gad

To see the visual environment, use the `--display` option with `train.py` but this is not recommended during training. What you should do is after finishing training, load the saved policy...

Try to use '.weight' instead of .word_embeddings as per hugging face latest implementation: `class TFBertEmbeddings(tf.keras.layers.Layer): """Construct the embeddings from word, position and token_type embeddings.""" def __init__(self, config: BertConfig, **kwargs): super().__init__(**kwargs)...

@mohdil23 as said above, you shall treat Mamba as similar to an LSTM or GRU cell. To perform classification on top of it, you could add a classification head (e.g.,...