ankit201
> Could you please share the steps before this line?
>
> It seems like you are trying a different model in some manner; this is indicated by the difference...
newsela
@ojotoxy What are `test.en` and `test.sen` used for? Did you use the pre-trained model? If so, what torch version did you use?
@roeeaharoni The error message is `PID Killed`, where PID is the process id.
@roeeaharoni Please solve the issue mentioned by @008karan.
> It's supported on a "best effort" basis.
>
> I started some work to actually support it, but it means rewriting flash attention (the CUDA version) with added bias,...
> > on implementing dynamic batching for this as it only supports 1 concurrent request for now on AutoModel.
>
> This won't require work once we have flash attention....
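Until dynamic batching lands, a minimal sketch of what "1 concurrent request" means in practice for an AutoModel-backed server is to serialize all generation calls behind a single lock. The model id, function name, and asyncio setup below are illustrative assumptions, not the project's actual serving code:

```python
import asyncio

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical model id, chosen only for illustration.
MODEL_ID = "gpt2"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID).eval()

# One lock serializes every generation call: effectively 1 concurrent request,
# which is the limitation dynamic batching would remove.
_model_lock = asyncio.Lock()

async def generate(prompt: str, max_new_tokens: int = 32) -> str:
    async with _model_lock:
        inputs = tokenizer(prompt, return_tensors="pt")
        with torch.no_grad():
            output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

The blocking `model.generate` call inside the lock keeps the sketch short; a real server would offload it to a worker thread and batch queued prompts together.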
> Here is the non-flash version (as a temporary measure, since modifying the kernel is taking more time than I anticipated): #514
>
> This should enable sharding at...
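For context, a non-flash attention with an additive bias (the kind of fallback the quoted comment refers to, pending a flash kernel that accepts a bias term) can be written directly in PyTorch. The function name and tensor shapes below are assumptions for illustration, not the code from #514:

```python
import math

import torch

def attention_with_bias(q, k, v, bias):
    """Plain (non-flash) scaled dot-product attention with an additive bias.

    Shapes (assumed for illustration):
      q, k, v: [batch, heads, seq_len, head_dim]
      bias:    broadcastable to [batch, heads, seq_len, seq_len]
    """
    scores = torch.matmul(q, k.transpose(-1, -2)) / math.sqrt(q.size(-1))
    scores = scores + bias  # e.g. an ALiBi-style bias or an attention mask
    probs = torch.softmax(scores, dim=-1)
    return torch.matmul(probs, v)
```

This materializes the full `seq_len x seq_len` score matrix, which is exactly the memory cost a flash-attention kernel with built-in bias support would avoid.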