Preprocess on custom dataset

Open Sushant581 opened this issue 2 months ago • 0 comments

How can we preprocess our own dataset? The README does not explain the preprocessing step, or how to create (or interpret) the files described in the following statement from the README:

"src_dir should contain the following files (using test split as an example): test.source test.source.tokenized test.target test.target.tokenized test.out test.out.tokenized"

We would like to know this so that we can run SimCLS on our dataset.
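
For concreteness, here is a minimal sketch of one way the six files named in the quoted statement could be written for a custom dataset. It is not the repository's own preprocessing script. It assumes that each file holds one sample per line and that the candidate summaries for a sample sit on neighboring lines of `test.out`. The helper names `write_split` and `simple_tokenize` are hypothetical, and the regex tokenizer is only a stand-in for whatever tokenizer the project actually expects.

```python
import re
from pathlib import Path


def simple_tokenize(text):
    # Placeholder tokenizer (assumption): separates words from punctuation.
    # Swap in the tokenizer the project actually requires.
    return " ".join(re.findall(r"\w+|[^\w\s]", text))


def write_split(src_dir, split, sources, targets, candidates):
    """Write <split>.source/.target/.out and their .tokenized variants.

    sources, targets: list of str, one entry per sample.
    candidates: list of lists of str, the candidate summaries per sample.
    Assumed layout: one sample per line, candidates on neighboring lines.
    """
    out_dir = Path(src_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    # Flatten candidates so those of one sample end up on adjacent lines.
    flat_candidates = [c for cands in candidates for c in cands]
    files = {"source": sources, "target": targets, "out": flat_candidates}

    for name, lines in files.items():
        # Collapse internal newlines so each sample stays on a single line.
        clean = [" ".join(line.split()) for line in lines]
        tokenized = [simple_tokenize(line) for line in clean]
        (out_dir / f"{split}.{name}").write_text(
            "\n".join(clean) + "\n", encoding="utf-8")
        (out_dir / f"{split}.{name}.tokenized").write_text(
            "\n".join(tokenized) + "\n", encoding="utf-8")

    return sorted(p.name for p in out_dir.iterdir())
```

Calling `write_split("my_data", "test", sources, targets, candidates)` would create the six `test.*` files under `my_data`. Whether the downstream preprocessing accepts this exact layout is something the maintainers would need to confirm.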

Sushant581, Dec 05 '24 18:12