nlp4whp

Results 25 comments of nlp4whp

> model.py line 282: why `assert len(final_dists) == 1`? Shouldn't `final_dists` be the decoded sequence, so how could its length be 1? If it were 1, wouldn't that mean every summary contains only a single token???

The decode stage runs one step at a time: each call emits only one token of the sequence, and only after executing that step can the current state be obtained.

> I followed the instructions to train my models and test them, and in decode mode I set max_dec_step = 1 to run the decoder one step at a time. However...
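As a concrete picture of that one-step-at-a-time decoding, here is a toy loop; `initial_decoder_state` and `decode_onestep` are hypothetical helper names for illustration, not the repository's actual functions.

```python
def decode_greedy(model, enc_states, start_id, stop_id, max_steps=100):
    """Greedy single-step decoding: one token, one distribution, one state per call."""
    state = model.initial_decoder_state(enc_states)   # hypothetical helper
    prev_token, output_ids = start_id, []
    for _ in range(max_steps):
        # a single decoder step returns exactly one distribution over the vocab,
        # which is why len(final_dists) == 1 holds in decode mode
        final_dists, state = model.decode_onestep(prev_token, state, enc_states)
        assert len(final_dists) == 1
        prev_token = int(final_dists[0].argmax())
        if prev_token == stop_id:
            break
        output_ids.append(prev_token)
    return output_ids
```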

> @apurvaasf I found the easiest way to export the original bert model to SavedModel. > > ```python > # load the checkpoint from bert > # create an estimator which...
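The quoted snippet is truncated here, so below is a minimal sketch of that Estimator-based export, assuming TF 1.x and the `modeling` module from google-research/bert; the paths and sequence length are placeholders, and this is my reconstruction rather than the original poster's code.

```python
import tensorflow as tf          # TF 1.x style API
import modeling                  # from the google-research/bert repository

BERT_DIR = "/path/to/uncased_L-12_H-768_A-12"   # placeholder checkpoint dir
MAX_SEQ_LEN = 128                                # placeholder sequence length

def model_fn(features, labels, mode, params):
    # rebuild the BERT graph for inference
    bert_config = modeling.BertConfig.from_json_file(BERT_DIR + "/bert_config.json")
    model = modeling.BertModel(
        config=bert_config,
        is_training=False,
        input_ids=features["input_ids"],
        input_mask=features["input_mask"],
        token_type_ids=features["segment_ids"])
    # load the pretrained checkpoint into the freshly built graph
    tvars = tf.trainable_variables()
    assignment_map, _ = modeling.get_assignment_map_from_checkpoint(
        tvars, BERT_DIR + "/bert_model.ckpt")
    tf.train.init_from_checkpoint(BERT_DIR + "/bert_model.ckpt", assignment_map)
    predictions = {
        "pooled_output": model.get_pooled_output(),
        "sequence_output": model.get_sequence_output(),
    }
    return tf.estimator.EstimatorSpec(mode=mode, predictions=predictions)

def serving_input_fn():
    # raw placeholders that the SavedModel signature will expose
    input_ids = tf.placeholder(tf.int32, [None, MAX_SEQ_LEN], name="input_ids")
    input_mask = tf.placeholder(tf.int32, [None, MAX_SEQ_LEN], name="input_mask")
    segment_ids = tf.placeholder(tf.int32, [None, MAX_SEQ_LEN], name="segment_ids")
    features = {"input_ids": input_ids,
                "input_mask": input_mask,
                "segment_ids": segment_ids}
    return tf.estimator.export.ServingInputReceiver(features, features)

estimator = tf.estimator.Estimator(model_fn=model_fn)
estimator.export_saved_model("./bert_savedmodel", serving_input_fn)
```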

That's weird; `len(attention_heads)` is always 1. Also, `attention_heads` should always have length 1: if `len(attention_heads) > 1`, then after the concat the shape of `attention_output` will be different, and that `attention_output` cannot pass...
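To make the shape argument concrete, here is a small self-contained toy (my own illustration, not the library code itself): in `modeling.py` only one tensor is ever appended to `attention_heads`, so the list length stays 1; if a second entry existed, the concat along the last axis would widen `attention_output` beyond `num_attention_heads * size_per_head`, and it could no longer feed the dense projection back to `hidden_size`.

```python
import tensorflow as tf

batch, seq_len = 2, 8
num_heads, size_per_head = 12, 64
hidden_size = num_heads * size_per_head          # 768

# the usual case: exactly one self-attention output is appended
attention_heads = [tf.random.normal([batch * seq_len, hidden_size])]

if len(attention_heads) == 1:
    attention_output = attention_heads[0]
else:
    attention_output = tf.concat(attention_heads, axis=-1)

print(attention_output.shape)                    # (16, 768): matches hidden_size

# hypothetical second entry: the concat becomes (16, 1536) and would no longer
# match a dense projection built for 768-wide inputs
attention_heads.append(tf.random.normal([batch * seq_len, hidden_size]))
print(tf.concat(attention_heads, axis=-1).shape)
```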

> Just change it to be consistent with version 2.1.3 and pass one argument fewer, dropping `attn_mask`:
>
> def infer(self, inputs, attn_mask):
>     **last_hidden_states = self.ls_bert.infer(inputs, _attn_mask_)**
>     last_hidden_states = torch.Tensor(last_hidden_states).float()
>     pooled_output = self.pooler(last_hidden_states.to("cuda:0"))
>     logits = self.classifier(pooled_output)
>     return logits
>
> But lightseq's performance is still no better than the un-accelerated huggingface version; the GPU is a 1080Ti
>
> ====================END...

> I found that the config file of the newly released small version has some problems: it doesn't match the released model weights and raises an error when loading, hope the author can take a look. ValueError: Shape of variable bert/embeddings/LayerNorm/beta:0 ((384,)) doesn't match with shape of tensor bert/embeddings/LayerNorm/beta ([128]) from checkpoint reader

Yes, I ran into this problem too: the `albert_config_small_google.json` in the zip is not the right config file for this checkpoint. Have you solved it? @KunWangR
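In case it helps anyone hitting the same mismatch, here is a small sketch (my own, with a placeholder checkpoint path) that lists the variable shapes stored in the checkpoint so they can be compared against the json config before building the graph.

```python
import tensorflow as tf  # tf.train.list_variables exists in both TF 1.x and 2.x

ckpt = "/path/to/albert_small_zh_google/albert_model.ckpt"   # placeholder path

# print the shapes of the embedding-related variables stored in the checkpoint
for name, shape in tf.train.list_variables(ckpt):
    if "embeddings" in name:
        print(name, shape)

# compare the printed widths (e.g. 384 vs 128 here) with "embedding_size" /
# "hidden_size" in albert_config_small_google.json before loading the model
```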

> No, that's not it. Take a look at the "downstream task" examples.

Sorry, it was me using the wrong modeling.py. Thanks for the reply.

> Hi, @akanimax > I can't solve the OOV problem either. > My answers to your two questions may be: > > 1. The words that the model doesn't recognize...
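Since the quoted explanation is cut off, here is a minimal self-contained sketch of how pointer-generator style code typically handles OOV article words (my illustration of the general technique, not the exact code under discussion): in-article OOVs get temporary ids beyond the fixed vocabulary so the copy mechanism can still point at them, instead of collapsing them all to UNK.

```python
UNK_ID = 0

def article2ids(article_words, vocab):
    """Map article words to ids; OOV words get temporary extended-vocab ids."""
    ids, oovs = [], []
    for w in article_words:
        if w in vocab:
            ids.append(vocab[w])
        else:
            if w not in oovs:
                oovs.append(w)
            # temporary id = vocab size + index of this OOV within the article
            ids.append(len(vocab) + oovs.index(w))
    return ids, oovs

vocab = {"<unk>": UNK_ID, "the": 1, "cat": 2}
print(article2ids(["the", "quokka", "cat", "quokka"], vocab))
# -> ([1, 3, 2, 3], ['quokka'])
```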

hoping for the big gift m(_ _)m

> Does it really use that much GPU memory? With 80k training samples it won't even run on 16 GB.

Yes: with max_len=100 and batch_size=32 it takes about 9 GB; a 6-layer, 6 * 100-dim Transformer is as large as BERT :)