kevinpro
It's obvious that this project is no longer updated (the last update was in 2020.6).
> > it's obvious that this project is no longer update (since last update is 2020.6)
>
> Maybe it's time to fork and create a **better**-better-onetab extension... 😉

hhhhhhhhhh...
I greatly appreciate the discussion above. Has anyone retrained or fine-tuned the model to get results on CNNDM or other datasets? Would that help produce a more precise factuality evaluation? If...
I notice that some papers use FactCC as a metric. If FactCC still has problems, then its results may not be reliable enough to report as a metric in a paper.
> @Ricardokevins, you can take a look at the following two comprehensive surveys on factuality metrics. What's disturbing is that they have very different conclusions. If you're writing a paper,...
> Here is the dataset:
>
> http://kinloch.inf.ed.ac.uk/public/XSUM-EMNLP18-Summary-Data-Original.tar.gz
>
> Please use train, development and test ids from github to split into subsets. Let me know if you have any...
It's obvious that the author didn't fix the bug lol
Any update?
I encounter the same problem, even with DeepSpeed and FSDP. It seems to get stuck when the model saves its weights.
> @Ricardokevins I hypothesise that it's a flash attention issue. It works fine with deepspeed only (for my case) and fsdp only (for @pacman100 ) This issue may be quite...