CaLM
FileNotFoundError
Dear Developer Team,
I am writing to seek your assistance regarding an issue I encountered while attempting to run the code associated with your paper titled "Codon language embeddings provide strong signals for use in protein engineering". When running the 'training.py' file, I encountered the following traceback error:
Traceback (most recent call last):
  File "training.py", line 141, in <module>
    ckpt_path='production-run/latest-56000.ckpt')
  ...
FileNotFoundError: [Errno 2] No such file or directory: 'training_data.fasta'
It seems that the 'training_data.fasta' file is not found, leading to this error. I would greatly appreciate it if you could provide some guidance on how to address this issue.
Thank you very much for your time and consideration. I look forward to your valuable guidance.
Sincerely, Geng Hu
If you read the paper, you can find this:
Data availability: We have made available, in our website, the training set (http://opig.stats.ox.ac.uk/data/downloads/training_data.tar.gz), the heldout set (http://opig.stats.ox.ac.uk/data/downloads/heldout.tar.gz) and the weights of the trained model (http://opig.stats.ox.ac.uk/data/downloads/calm_weights.pkl). All datasets used for validation of the models presented in this article are available at https://github.com/oxpig/CaLM.
You need to download the data by hand; it is not included in this GitHub repository.
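If you would rather script the download, something like the following should work as a rough sketch. The URL is the one quoted from the paper; the helper name `fetch_and_extract` is my own, and I am assuming (without having checked) that the archive unpacks to a `training_data.fasta` that `training.py` can find in its working directory. Look at the returned names and move the file if it lands somewhere else.

```python
import tarfile
import urllib.request
from pathlib import Path

# URL quoted from the paper's data availability statement.
TRAINING_URL = "http://opig.stats.ox.ac.uk/data/downloads/training_data.tar.gz"


def fetch_and_extract(url, dest_dir="."):
    """Download a .tar.gz archive into dest_dir, unpack it, and return member names.

    Hypothetical helper, not part of the CaLM repository.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    archive = dest / url.rsplit("/", 1)[-1]
    if not archive.exists():  # skip the download if the archive is already present
        urllib.request.urlretrieve(url, str(archive))
    with tarfile.open(archive, "r:gz") as tar:
        names = tar.getnames()
        tar.extractall(dest)  # only do this with archives from a source you trust
    return names


# Example (run from the directory that contains training.py):
#     print(fetch_and_extract(TRAINING_URL))
```

The same function can be pointed at the heldout archive URL; the weights file is a plain `.pkl`, so it only needs a direct download, not extraction.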