fusioncatcher icon indicating copy to clipboard operation
fusioncatcher copied to clipboard

Getting genome indices for specific ENSEMBL version

Open zhuchcn opened this issue 2 years ago • 3 comments

We are re-analyzing an cohort of samples done several years ago, and we would like to use the same ENSEMBL/GENCODE version that was used previously. Is there any way to build the references with a specific ENSEMBL version? The version we are interested is ENSMBLE 100. I went across the documentation but did not see an easy solution. Although the '--ftp-ensembl-path' seems related?

zhuchcn avatar Sep 29 '21 18:09 zhuchcn

Hi @zhuchcn

Sorry, the easy way would be to find someone which has those built files corresponding to some older version of FusionCatcher. It is not easy feature to backtrak Ensembl version in FusionCatcher because FusionCatcher integrates other 20+ databases. That means that very likely a parser that works for Ensembl release 100 does not work for Ensembl release 104 due to changes in Ensembl SQL database schema. The amount of work for doing that is big and unfortunatel here are not enough resources to support several Ensembl releases simultaneously.

ndaniel avatar Oct 08 '21 06:10 ndaniel

Thanks for the explanation! Just curious, if I use the fusioncatcher-build.py and specify the --ftp-ensembl-path to 'pub/release-100', is it not going to work? I haven't gotten it to work yet because our cluster don't have FTP access, so I'm currently finding a way to run it.

zhuchcn avatar Oct 09 '21 19:10 zhuchcn

@zhuchen I tried to use --ftp-ensembl-path=/pub/release-105 (I used gencode 39) and set --web=http://dec2021.archive.ensembl.org/info/data/ftp/index.html and it worked.

duaghk avatar Jun 13 '22 04:06 duaghk