I can't fit the pipeline for RoBerta For Sequence Classification

Open kareemgamalmahmoud opened this issue 2 years ago • 0 comments

Description

When i try to run (HuggingFace in Spark NLP - RoBertaForSequenceClassification) notebook with colab i always face issues with the last fitting cell with pipline

Error 1 : NameError: name 'Pipeline' is not defined And after importing the pipline ( from pyspark.ml import Pipeline )

Then Error 2 :

IllegalArgumentException: requirement failed: Wrong or missing inputCols annotators in REGEX_TOKENIZER_cfae21e0e52f.

Current inputCols: doc ument. Dataset's columns: (column_name=text,is_nlp_annotator=false) (column_name=document,is_nlp_annotator=true,type=document). Make sure such annotators exist in your pipeline, with the right output names and that they have following annotator types: document

I don't know hot to solve !!!

Aug 15 '22 03:08 kareemgamalmahmoud

spark-nlp-workshop spark-nlp-workshop copied to clipboard

I can't fit the pipeline for RoBerta For Sequence Classification

Description

spark-nlp-workshop
spark-nlp-workshop copied to clipboard