CaLM icon indicating copy to clipboard operation
CaLM copied to clipboard

FileNotFoundError

Open geng007 opened this issue 11 months ago • 1 comments

Dear Developer Team,

I am writing to seek your assistance regarding an issue I encountered while attempting to run the code associated with your paper titled "Codon language embeddings provide strong signals for use in protein engineering". When running the 'training.py' file, I encountered the following traceback error:

Traceback (most recent call last):
  File "training.py", line 141, in <module>
    ckpt_path='production-run/latest-56000.ckpt')
  ...
  FileNotFoundError: [Errno 2] No such file or directory: 'training_data.fasta'

It seems that the 'training_data.fasta' file is not found, leading to this error. I would greatly appreciate it if you could provide some guidance on how to address this issue.

Thank you very much for your time and consideration. I look forward to your valuable guidance.

Sincerely, Geng Hu

geng007 avatar Mar 09 '24 03:03 geng007

if you red the paper, you can find this

Data availability We have made available, in our website, the training set (http://opig.stats. ox.ac.uk/data/downloads/training_data.tar.gz), the heldout set (http:// opig.stats.ox.ac.uk/data/downloads/heldout.tar.gz) and the weights of the trained model (http://opig.stats.ox.ac.uk/data/downloads/ calm_weights.pkl). All datasets used for validation of the models presented in this article are available at https://github.com/oxpig/CaLM.

you need to download the data by hand, the data is not included in this github

ZJL0111 avatar Apr 02 '24 07:04 ZJL0111