Oscar
Question about tensor size difference in modeling_bert.py
Hi :) Thank you for such great work.
I have a question about a tensor size mismatch error in the modeling_bert.py code.

I would like to know:
- What is the difference between history_state and hidden_state?
- What can I do to fix the error that occurs (as seen in the capture above)?