SimCSE

[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821

Results 14 SimCSE issues

I used the latest release and ran `run_sup_example.sh` to train on my own datasets. But when I loaded the model with SimCSE, I got this hint: Use...

How can I tell if the model is converging or not, and is the value of the loss function indicative? I trained on my own dataset and the loss of...

I'm trying to train SimCSE in a virtual environment (with Python 3.7.0). I tried to run the download_wiki.sh file, but I got an error message saying "/Users/davidsun/Downloads/SimCSE/data/download_wiki.sh: line 1: wget:...
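The error above is truncated, so its cause is not confirmed here. If it turns out that `wget` is simply not installed on the machine (an assumption), one possible workaround is to fetch the file with Python's standard library instead. The sketch below is not part of the SimCSE repository; the `download` helper is hypothetical, and the URL to pass in is whichever one `download_wiki.sh` would have fetched.

```python
import shutil
import urllib.request
from pathlib import Path


def download(url: str, dest: str) -> Path:
    """Stream `url` to `dest` without shelling out to wget (hypothetical helper)."""
    dest_path = Path(dest)
    # Create the target directory if it does not exist yet.
    dest_path.parent.mkdir(parents=True, exist_ok=True)
    # Copy the response body to disk in chunks rather than loading it all into memory.
    with urllib.request.urlopen(url) as response, open(dest_path, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest_path
```

Calling `download(url, "data/some_file.txt")` with the URL taken from the script would then stand in for the `wget` call; the destination path here is only a placeholder.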

I am trying to run SimCSE evaluation in a virtual environment with Python 3.8.6 and the following packages: transformers==4.2.1, scipy==1.5.4, datasets==1.2.1, pandas==1.1.5, scikit-learn==0.24.0, prettytable==2.1.0, gradio, torch, setuptools==49.3.0. I tried running...

I'm working on a search task, and the pre-trained model I'm using is RoBERTa base. I would like to apply SimCSE on top of it. How do I use SimCSE on my own data...

The encode method in tool.py may generate different results on each run. Adding model.eval() may solve the problem. Adding a function to save and reload vectors may also be helpful.
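The suggestion above can be illustrated with a minimal sketch. It does not use the SimCSE code itself: the `encoder` below is a hypothetical stand-in (a tiny network with a dropout layer), and `encode` is an illustrative helper, not the method in tool.py. The assumption is that the run-to-run variation comes from dropout being active in training mode, which `model.eval()` switches off.

```python
import os
import tempfile

import torch
from torch import nn

torch.manual_seed(0)

# Hypothetical stand-in for a sentence encoder; the dropout layer is what
# makes outputs vary between calls while the module is in training mode.
encoder = nn.Sequential(nn.Linear(8, 16), nn.Dropout(p=0.1), nn.Linear(16, 4))
inputs = torch.randn(3, 8)


def encode(model: nn.Module, batch: torch.Tensor) -> torch.Tensor:
    """Encode a batch with dropout disabled and gradient tracking off."""
    model.eval()  # switch dropout (and similar layers) to inference behaviour
    with torch.no_grad():
        return model(batch)


first = encode(encoder, inputs)
second = encode(encoder, inputs)
assert torch.equal(first, second)  # repeated calls now agree

# Save the vectors once and reload them later instead of re-encoding.
path = os.path.join(tempfile.mkdtemp(), "embeddings.pt")
torch.save(first, path)
reloaded = torch.load(path)
assert torch.equal(first, reloaded)
```

Saving with `torch.save` is only one option; any format that round-trips the array exactly would serve the same purpose.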

The command is exactly as the following example, only the model_name_or_path is changed

```shell
python evaluation.py \
    --model_name_or_path princeton-nlp/sup-simcse-bert-base-uncased \
    --pooler cls \
    --task_set sts \
    --mode test
```

The...

Error: setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. The detected shape was (750,) + inhomogeneous part.

'OurTrainingArguments' object has no attribute 'distributed_state'

  File "/public/duzx/project/Path2Vec/SimCSE/train.py", line 548, in main
    trainer = CLTrainer(
  File "/public/duzx/project/Path2Vec/SimCSE/train.py", line 600, in main()
AttributeError: 'OurTrainingArguments' object has no attribute 'distributed_state'