
[BUG] Human transcription factors list

Open malosreet opened this issue 1 year ago • 2 comments

The recommended transcription factor (TF) list for pySCENIC (https://resources.aertslab.org/cistarget/tf_lists/allTFs_hg38.txt) includes some gene symbols that are not classified as TFs in Lambert et al., 2018 (https://www.cell.com/cms/10.1016/j.cell.2018.01.029/attachment/ede37821-fd6f-41b7-9a0e-9d5410855ae6/mmc2.xlsx).

Some of the gene symbols in "allTFs_hg38.txt" also don't seem to have any motifs in the corresponding motif annotation table (https://resources.aertslab.org/cistarget/motif2tf/motifs-v10nr_clust-nr.hgnc-m0.001-o0.0.tbl). Examples include NCOR1, UBB, PRNP, and TAF7.
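For anyone who wants to reproduce this check, here is a minimal sketch of the comparison. It assumes the TF list has one gene symbol per line and that the motif annotation table is tab-separated with a `gene_name` column; the helper names and the tiny inputs are made up for illustration and are not taken from the real files.

```python
import csv
import io


def read_symbols(handle):
    """Read one gene symbol per line, skipping blank and '#' comment lines."""
    symbols = set()
    for line in handle:
        symbol = line.strip()
        if symbol and not symbol.startswith("#"):
            symbols.add(symbol)
    return symbols


def read_motif_genes(handle, gene_column="gene_name"):
    """Collect gene symbols from a tab-separated motif annotation table.

    The column name 'gene_name' is an assumption about the .tbl layout.
    """
    reader = csv.DictReader(handle, delimiter="\t")
    return {row[gene_column] for row in reader if row.get(gene_column)}


def tfs_without_motifs(tf_symbols, motif_genes):
    """Return the TF symbols that never appear in the motif table, sorted."""
    return sorted(tf_symbols - motif_genes)


# Made-up inputs standing in for allTFs_hg38.txt and the motif2tf table.
tf_list = io.StringIO("GENE_A\nGENE_B\nGENE_C\n")
motif_tbl = io.StringIO("#motif_id\tgene_name\nm1\tGENE_A\nm2\tGENE_C\n")

missing = tfs_without_motifs(read_symbols(tf_list), read_motif_genes(motif_tbl))
print(missing)  # ['GENE_B']
```

With the real files, the `io.StringIO` objects would be replaced by `open(...)` handles on the downloaded copies. The same set difference could be applied to the Lambert et al. list once its gene symbols are exported to plain text.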

I was referring to the instructions here (https://pyscenic.readthedocs.io/en/latest/installation.html#auxiliary-datasets) to obtain the files required to run pySCENIC.

Based on this issue (https://github.com/aertslab/pySCENIC/issues/377) and this notebook (https://github.com/aertslab/pySCENIC/blob/master/notebooks/pySCENIC%20-%20List%20of%20Transcription%20Factors.ipynb), I assumed that "allTFs_hg38.txt" was derived from Lambert et al., 2018.

Any clarification you can provide is appreciated.

Thanks!

malosreet, Feb 06 '23 14:02