MlWoo

Results 4 issues of MlWoo

Many friends seem very to be interested in multi-gpus support when training the model. Maybe it is necessary to merge the branch into the master one.

``` sess_list1 = self.apply_ + self.sd_asn sess_list2 = [self.train_step, self.cross_entropy, self.accuracy] result1 = self.sess.run(sess_list1, feed_dict = feed_dict) result2 = self.sess.run(sess_list2, feed_dict = feed_dict) ``` As above, the code will execute...

A new model with a heavy module with a light module could be viewd as a t5 model. so the tp of encoder is more than that of the decoder....