SeqVec icon indicating copy to clipboard operation
SeqVec copied to clipboard

model resources cannot be downloaded

Open Jmpax404 opened this issue 1 year ago • 11 comments

It seems that model files on rostlab.org cannot be accessed recently by all links below:

https://rostlab.org/~deepppi/seqvec.zip https://rostlab.org/~deepppi/seqvec_checkpoint.tar.gz http://rostlab.org/~deepppi/embedding_repo/embedding_models/seqvec/options.json http://rostlab.org/~deepppi/embedding_repo/embedding_models/seqvec/weights.hdf5

Thus, both automatic and manual downloads are ineffective. Could you fix it or provide another available download links? : )

Jmpax404 avatar Aug 14 '24 07:08 Jmpax404

sorry for the delayed response; will try to look into this but the source of the problem is the ongoing problem with our internal FTP server. I am trying to recover the files but can you in the meantime maybe try this approach for installing SeqVec? : https://github.com/Rostlab/SeqVec/issues/26#issuecomment-2267991933

mheinzinger avatar Aug 28 '24 15:08 mheinzinger

Thank you for your concern about this issue and thank you for maintaining this project. I have no problem on installing SeqVec python package, but when using SeqVec to get embeddings, it will download pre-trained SeqVec model files on rostlab.org server (the code is below).

Luckily, I found these files from my co-worker. But I still suggest storing files as backups on other free and stable cloud drives, such as Google Drive.

def get_elmo_model(model_dir: Path, cpu: bool) -> ElmoEmbedder:
    weights_path = model_dir / "weights.hdf5"
    options_path = model_dir / "options.json"

    # if no pre-trained model is available, yet --> download it
    if not (weights_path.exists() and options_path.exists()):
        logger.info(
            "No existing model found. Start downloading pre-trained SeqVec (~360MB)..."
        )

        Path.mkdir(model_dir, exist_ok=True)
        repo_link = "http://rostlab.org/~deepppi/embedding_repo/embedding_models/seqvec"
        options_link = repo_link + "/options.json"
        weights_link = repo_link + "/weights.hdf5"
        urllib.request.urlretrieve(options_link, str(options_path))
        urllib.request.urlretrieve(weights_link, str(weights_path))

Jmpax404 avatar Sep 04 '24 13:09 Jmpax404

Oh well, I am so sorry, I misread your issue. We keep experiencing issues with our FTP server and the problem is so bad that I can not even access those weights at the moment by myself ... Lesson learned: always put those things on Zenodo or sth alike. In case you feel like sharing the weights, I can upload them somewhere else. Otherwise, I do so once our server is back up again.

mheinzinger avatar Sep 04 '24 14:09 mheinzinger

I uploaded those weights onto a temporary file sharing platform. They will be removed after 7 days. The url is here, https://filebin.net/l553d2tiek3r8wzq

You can check these files MD5 values.

file name MD5
SeqVec.zip f9664ab720a7d7cd5ea48a7d8b0574e2
options.json 05637bed5b38e68ee17e107648a5f597
weights.hdf5 5f9d3f5fcac5e6bfadc88aebf147ac02

Jmpax404 avatar Sep 05 '24 05:09 Jmpax404

I uploaded those weights onto a temporary file sharing platform. They will be removed after 7 days. The url is here, https://filebin.net/l553d2tiek3r8wzq

You can check these files MD5 values.

file name MD5 SeqVec.zip f9664ab720a7d7cd5ea48a7d8b0574e2 options.json 05637bed5b38e68ee17e107648a5f597 weights.hdf5 5f9d3f5fcac5e6bfadc88aebf147ac02

I have occured the same problem. Would you be kind to share these weights? Many thanks!

fuyw-aisw avatar Sep 21 '24 07:09 fuyw-aisw

@fuyw-aisw, I uploaded them again, will be kept for 7 days.

https://filebin.net/dsofhelf2o1a2fyp

Jmpax404 avatar Oct 14 '24 14:10 Jmpax404

@fuyw-aisw, I uploaded them again, will be kept for 7 days.

https://filebin.net/dsofhelf2o1a2fyp

I'm sorry, but the link you shared has expired again. While running the project, I encountered an SSL connection error. Could you please reshare the materials? Thank you.

sugarlib avatar Oct 23 '24 08:10 sugarlib

@sugarlib I apologize for my delayed response. The files are sourced from @Jmpax404, and you can download them at the following link: https://filebin.net/ja9f1z6xbza53482. I hope you find this helpful!

fuyw-aisw avatar Oct 26 '24 14:10 fuyw-aisw

@sugarlib I apologize for my delayed response. The files are sourced from @Jmpax404, and you can download them at the following link: https://filebin.net/ja9f1z6xbza53482. I hope you find this helpful!

I would like to express my sincere gratitude for sharing the files. The files are sourced from @Jmpax404 and can be downloaded from the following links: https://drive.google.com/file/d/1MxcjvsbyMUTxBs688revSUAVGD-jhd2N/view?usp=sharing

https://pan.baidu.com/s/1Ol17nWNbigNnNGA87ZA1BQ?pwd=8888 (Password: 8888)

sugarlib avatar Oct 28 '24 08:10 sugarlib

That's a great job! Thank you for persisting the model files.

Jmpax404 avatar Nov 05 '24 14:11 Jmpax404

@sugarlib I apologize for my delayed response. The files are sourced from @Jmpax404, and you can download them at the following link: https://filebin.net/ja9f1z6xbza53482. I hope you find this helpful!

I would like to express my sincere gratitude for sharing the files. The files are sourced from @Jmpax404 and can be downloaded from the following links: https://drive.google.com/file/d/1MxcjvsbyMUTxBs688revSUAVGD-jhd2N/view?usp=sharing

https://pan.baidu.com/s/1Ol17nWNbigNnNGA87ZA1BQ?pwd=8888 (Password: 8888)

Thank you for persisting these files. helped me a lot as well!

internationalJoke avatar May 18 '25 19:05 internationalJoke