Seq2seq-Chatbot-for-Keras

limit = l[0][0] - IndexError: index 0 is out of bounds for axis 0 with size 0

stajilov opened this issue 6 years ago • 5 comments

What can be the root cause for this?

for m in range(Epochs):

    # Loop over training batches due to memory constraints:
    for n in range(0, round_exem, step):

        q2 = q[n:n+step]
        print(q2)
        s = q2.shape
        print(s)
        count = 0
        for i, sent in enumerate(a[n:n+step]):
            print("Sentence")
            print(sent)
            l = np.where(sent==3)  # the position of the symbol EOS
            limit = l[0][0]
            count += limit + 1

  File "train_bot.py", line 188, in <module>
    limit = l[0][0]
IndexError: index 0 is out of bounds for axis 0 with size 0

For some reason I don't see any 3 in the sentence:

Sentence
[   1   31    5  640    8 2475    9    8  339    4    2    0    0    0
    0    0    0    0    0    0    0    0    0    0    0    0    0    0
    0    0    0    0    0    0    0    0    0    0    0    0    0    0
    0    0    0    0    0    0    0    0]
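For reference, `np.where` returns a tuple of index arrays, and when the token is absent that array is empty, so `l[0][0]` raises the `IndexError` above. A minimal sketch of a guard (hypothetical, not from the repo) that skips sentences missing the EOS index:

```python
import numpy as np

EOS = 3  # index of the EOS token in the original dictionary

# The sentence from the issue: token 3 never occurs in it
sent = np.array([1, 31, 5, 640, 8, 2475, 9, 8, 339, 4, 2, 0, 0, 0])

l = np.where(sent == EOS)[0]  # positions of EOS; empty array if absent
if l.size == 0:
    # No EOS token in this sentence: skip it instead of crashing
    limit = None
else:
    limit = l[0]

print(limit)  # None here, because 3 never occurs in sent
```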

stajilov avatar Apr 17 '18 09:04 stajilov

Hi, if you changed the dictionary (e.g. you generated another one), you have to look up the indexes of the special tokens "BOS" and "EOS" in your new dictionary and change them in the code, because EOS will not be 3 anymore. I should have used a variable for this. :-) Fortunately, there is a comment like "the position of the symbol EOS" after every line containing a reference to these special tokens, which makes them easy to find and change.
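Assuming the dictionary maps tokens to integer indexes (the mapping below is a made-up example; check the repo's actual vocabulary file and token spellings), the lookup could be as simple as:

```python
# Hypothetical token-to-index mapping; the real one is whatever your
# preprocessing step produced, and its special-token spellings may differ.
vocabulary = {'PAD': 0, 'UNK': 1, 'BOS': 2, 'EOS': 3, 'the': 4}

BOS = vocabulary['BOS']
EOS = vocabulary['EOS']
print(BOS, EOS)  # use these variables instead of the hard-coded literals
```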

oswaldoludwig avatar Apr 17 '18 11:04 oswaldoludwig

@oswaldoludwig thanks, so I just look into the dictionary and find the indexes of the BOS and EOS tokens?

stajilov avatar Apr 18 '18 11:04 stajilov

Yes, the tokens of your new dictionary will be sorted by their frequency in the training dataset, i.e. the EOS (end-of-sentence) token had index 3 because it was the third most frequent token in my original training dataset. The easiest approach may be to move the special tokens to the same positions they have in the original dictionary; that may solve your problem as well.
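A frequency-sorted vocabulary with the special tokens pinned at fixed indexes could be sketched as follows (the token spellings and the order of the special tokens are assumptions, not taken from the repo):

```python
from collections import Counter

def build_vocabulary(sentences, specials=('PAD', 'UNK', 'BOS', 'EOS')):
    """Assign the lowest indexes to the special tokens, then order the
    remaining tokens by descending frequency in the training data."""
    counts = Counter(tok for sent in sentences for tok in sent)
    vocab = {tok: i for i, tok in enumerate(specials)}
    for tok, _ in counts.most_common():
        if tok not in vocab:
            vocab[tok] = len(vocab)
    return vocab

corpus = [['hello', 'world'], ['hello', 'there']]
vocab = build_vocabulary(corpus)
print(vocab)  # EOS stays at index 3; 'hello' gets 4 as the most frequent word
```

Pinning the special tokens this way keeps hard-coded indexes like the 3 in `np.where(sent==3)` valid even after regenerating the dictionary on a new corpus.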

oswaldoludwig avatar Apr 18 '18 11:04 oswaldoludwig

@oswaldoludwig thanks, but which corpus did you use to create the dictionary? Asking because it looks different from the simple_dialogue corpus.

stajilov avatar Apr 19 '18 07:04 stajilov

Yes, I created this dictionary when I was using the Cornell Movie Dialogs Corpus, before collecting data from English courses to compose my own dataset. So I ended up keeping the dictionary I already had.

oswaldoludwig avatar Apr 19 '18 09:04 oswaldoludwig