superzeroT
> Can you check which one is 200d, and which one is 100d in `hiddens * sent_probs`? This could help clarify this issue.

`hiddens` is 200d, `sent_probs` is 100d.
[Image 1]
[Image 2]
> So it seems fine?
>
> sent_probs can be regarded as the attention score of the hidden states for each log event in the log sequence. The multiplication between...
Hi @LeonYang95, I haven't solved the problem yet. I tried to unify the dimensions but it didn't work. Don't worry about the note I added. Since your code is running successfully, I guess it...
Hi @LeonYang95, can I see the shapes of your `sent_probs` and `hiddens` values?
[Screenshot 2023-07-05 222313]
> I got the following error during training and don't know how to fix it.
>
> Traceback (most recent call last):
> File "/Users/heyingying/PycharmProjects/PLELog/approaches/PLELog.py", line 255,
> loss = plelog.forward(tinst.inputs, tinst.targets)
> File "/Users/heyingying/PycharmProjects/PLELog/approaches/PLELog.py", line 56, in forward
> tag_logits = self.model(inputs)
> File "/Users/heyingying/opt/anaconda3/envs/pytorch/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
> return forward_call(*input, **kwargs)
> File "/Users/heyingying/PycharmProjects/PLELog/models/gru.py", line 71, in forward
> represents = hiddens * sent_probs
> RuntimeError: The size of tensor a (200)...
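
For anyone hitting the same error, here is a rough sketch of the broadcasting rule that an element-wise product like `hiddens * sent_probs` has to satisfy. It is plain Python and is not taken from PLELog. The helper name `broadcast_shape` and the batch and sequence sizes (64 and 38) are made up for illustration. Only the 200 and 100 come from this thread. The rule itself is the standard one used by PyTorch and NumPy: shapes are compared from the last dimension backwards, and two sizes are compatible when they are equal or one of them is 1.

```python
from itertools import zip_longest


def broadcast_shape(a, b):
    """Return the broadcast shape of two shapes, or raise ValueError."""
    result = []
    # Walk from the trailing dimension; a missing leading dimension counts as 1.
    pairs = zip_longest(reversed(a), reversed(b), fillvalue=1)
    for i, (x, y) in enumerate(pairs):
        if x == y or y == 1:
            result.append(x)
        elif x == 1:
            result.append(y)
        else:
            dim = max(len(a), len(b)) - 1 - i
            raise ValueError(
                f"size {x} does not match size {y} at dimension {dim}"
            )
    return tuple(reversed(result))


# Hypothetical shapes: a batch of 64 sequences with 38 log events each.
hiddens = (64, 38, 200)    # 200d hidden states, as reported above
bad_probs = (64, 38, 100)  # 100d last dimension, cannot broadcast against 200
ok_probs = (64, 38, 1)     # one score per log event, broadcasts over 200

print(broadcast_shape(hiddens, ok_probs))
try:
    broadcast_shape(hiddens, bad_probs)
except ValueError as err:
    print("mismatch:", err)
```

If `sent_probs` is meant to be one attention score per log event, its last dimension would presumably need to be 1 (or 200) for the product to work, so printing `hiddens.shape` and `sent_probs.shape` right before line 71 of `gru.py` is probably the quickest check.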