Styleformer
Styleformer copied to clipboard
A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Dam...
I am using this model to do inferencing on 1 million data point using `A100 GPU's ` with `4 GPU`. I am launching a `inference.py` code using `Googles vertex-ai Container.`...
OSError: prithivida/parrot_adequacy_on_BART is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models' I'm using the latest version and seeing the following issue. I was wondering...
Can you provide your training code? I want to experiment with different styles.
Love the library! Was hoping to do sentiment transfer but I see that has not yet been integrated. Any pointers towards off the shelf models that can do that?
In the codes, an adequacy model is applied to score and rank the generated sentences. (1) `def _formal_to_casual()`: https://github.com/PrithivirajDamodaran/Styleformer/blob/main/styleformer/styleformer.py#L69-L95 (2) `def _casual_to_formal()`: https://github.com/PrithivirajDamodaran/Styleformer/blob/main/styleformer/styleformer.py#L97-L123 May I know why it is not...
Add `torch_dtype=torch.float32` because "[RuntimeError: "addmm_impl_cpu_" not implemented for 'Half'](https://stackoverflow.com/questions/73530569/pytorch-matmul-runtimeerror-addmm-impl-cpu-not-implemented-for-half)" ```python self.ctf_model = AutoModelForSeq2SeqLM.from_pretrained(ctf_model_tag, use_auth_token=False, torch_dtype=torch.float32) self.ftc_model = AutoModelForSeq2SeqLM.from_pretrained(ftc_model_tag, use_auth_token=False, torch_dtype=torch.float32) self.atp_model = AutoModelForSeq2SeqLM.from_pretrained(atp_model_tag, use_auth_token=False, torch_dtype=torch.float32) self.pta_model = AutoModelForSeq2SeqLM.from_pretrained(pta_model_tag, use_auth_token=False, torch_dtype=torch.float32)...
If possible could you please share the training script with and the dataset, I would like to finetune this on a flan-t5-base by using specific tokens for specific task, this...
it keeps throing the same error(diffrent request id) OSError: There was a specific connection error when trying to load prithivida/informal_to_formal_styletransfer: (Request ID: K9-6-Ks5uMEai7cOcQ3gC)