BiLSTM-LAN icon indicating copy to clipboard operation
BiLSTM-LAN copied to clipboard

why is very slow about the model on CPU platform?

Open wshzd opened this issue 4 years ago • 7 comments

l run the model in windows10 with CPU, but it will spend 4 hours every epoch, that is, 100 epoches need 400 hour in order to run the whole model. it claims it is faster than biLSTM+CRF, actually,it is not. ok, l run the BERT+biLSTM+CRF on same envirment(windows10 with CPU), it only costs 10 hours, however, it's accuracy is 0.92 Please can you tell me that is why?

wshzd avatar Nov 28 '19 06:11 wshzd

Hi, I am also trying to use the tool. But I am wondering where the dataset is. I saw you run this model. What dataset did you use? Could you please share with me?

chiyuzhang94 avatar Dec 03 '19 00:12 chiyuzhang94

the format of the dataset is normally two columns,one is char, another is corresponding label as follows: 人 o 民 B-pro

wshzd avatar Dec 05 '19 01:12 wshzd

@wshzd Thanks for your reply. Are the two elements split by white space or anything else?

chiyuzhang94 avatar Dec 05 '19 02:12 chiyuzhang94

anything

wshzd avatar Dec 06 '19 06:12 wshzd

I run this model with the WSJ dataset on GPU platform,it seems that each epoch will take 50 minutes:(

wudaoyunqi avatar Jan 21 '20 10:01 wudaoyunqi

I run this model with the WSJ dataset on GPU platform,it seems that each epoch will take 50 minutes:(

Could you kindly share your log with me via email?

Nealcly avatar Jan 21 '20 15:01 Nealcly

I run this model with the WSJ dataset on GPU platform,it seems that each epoch will take 50 minutes:(

Could you kindly share your log with me via email?

Sorry for the late reply, I went through the log and found a parameter setting error. Then I ran the model on a high-performance GPU server, and it was fast. Thanks for your kindly reply.

wudaoyunqi avatar Feb 04 '20 15:02 wudaoyunqi