MerlinWang comments

Results 15 comments of


                                            MerlinWang

trafficstars

关于在您的模型上继续预训练

辛苦问下具体预训练方法，我用自己的数据预训练，但是获取不到相关的records，搞不懂为啥create_pretrain_data.py里面有这个split(',') ![image](https://user-images.githubusercontent.com/26263128/88659388-8e43b680-d107-11ea-876a-0c4d9b8325a9.png)

[Maintenance] re-test the end-to-end performance

Hello, Just re-test the BERTNLU-RuleDST-GDPL-TemplatedNLG policy. ![image](https://user-images.githubusercontent.com/26263128/95296301-1df5a600-08ab-11eb-975e-ad8fe5c204b3.png)

[Maintenance] RL policy training

I am afraid that the evaluation results of PPO maybe not correct. There are two levels of evaluation. The first one is policy/evaluate.py which is action-level, it follows the instructions...

为什么要tagset+2

这个不对吧，tag.TXT里面已经存在start 和 eos，不需要再次进行 + 2 操作

GAIL uses AIRL reward function

I am also confused, what should I do if I just wanna GAIL loss? just reward = - (1 - s).log()

Why the output looks very bad

I found that my output is mostly "\ \", do not know why.

UnicodeError when run example: python -m unittest discover tests/

# packages in environment # # Name Version Build Channel _libgcc_mutex 0.1 main backports-weakref 1.0rc1 pypi_0 pypi bilm 0.1.post5 pypi_0 pypi bleach 1.5.0 pypi_0 pypi ca-certificates 2019.10.16 0 certifi 2018.8.24...

Feature representation of the dataset already computed

Hi, did u already get the dataset? I will be very grateful if u can send me a copy of it.

Add CMB to your paper

Hi, thank you for your incredible work. Here is our new EMNLP 2023 paper about LLM evaluation for in-depth dialogue questions. Feel free to add it to your survey!! Cue-CoT:...