avocado
fix-variable-names
Hey @jmschrei thanks for sharing your code. It's interesting to look through.
A few things:
- I noticed in the download bigWig util that the chromosome_lengths variable doesn't exist. Is it supposed to be chrom_lengths instead? (By the way, what is the purpose of decimating a vector and then assigning it back to a custom chrom_length? I'm assuming any passed-in chrom_length will be much longer than the decimated vector.)
- It looks like the bigWig variable should be bigwig instead.
- When I try to import this function I get the following error:
(avo) kiene:avocado keenan$ python
Python 3.6.7 | packaged by conda-forge | (default, Feb 28 2019, 02:16:08)
>>> from avocado.utils import download_bigWig
Using TensorFlow backend.
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/Users/keenan/Desktop/projects/github/avocado/avocado/utils.py", line 16, in <module>
chroms = range(1, 23) + ['X']
TypeError: unsupported operand type(s) for +: 'range' and 'list'
This must be a Python 2 idiom: there, range() returned a list, so concatenating it with another list worked. Based on the print statements, though, it seems like Avocado is meant to be run with Python 3? Changing the code to list(range(1, 23)) + ['X'] seems to solve this.
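For reference, here is a minimal sketch of the change in isolation. Only the chroms line comes from utils.py; the variable name chroms_alt and the unpacking variant are just illustrations, not something in the repo.

```python
# Python 3: range() returns a lazy range object, not a list, so
# `range(1, 23) + ['X']` raises TypeError. Converting it to a list
# first restores the Python 2 behaviour.
chroms = list(range(1, 23)) + ['X']

# Equivalent Python 3 spelling using iterable unpacking
# (chroms_alt is a hypothetical name used only for this comparison).
chroms_alt = [*range(1, 23), 'X']

print(chroms[:3], chroms[-2:], len(chroms))
```

Either form gives the autosomes 1 through 22 followed by 'X'. The list(...) version is the one I tested in the session below.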
In any case, I'm not sure if it's useful to you, but these changes appear to work for me:
(avo3) kiene:avocado keenan$ python
Python 3.6.7 | packaged by conda-forge | (default, Feb 28 2019, 02:16:08)
[GCC 4.2.1 Compatible Clang 4.0.1 (tags/RELEASE_401/final)] on darwin
Type "help", "copyright", "credits" or "license" for more information.
>>> from avocado.utils import download_bigWig
Using TensorFlow backend.
>>> data = download_bigWig('https://www.encodeproject.org/files/ENCFF289XSX/@@download/ENCFF289XSX.bigWig')
wget -nv -q https://www.encodeproject.org/files/ENCFF289XSX/@@download/ENCFF289XSX.bigWig -P ./
2019-03-12 21:23:55 URL:https://encode-files.s3.amazonaws.com/2019/02/13/9a778a35-5a1a-4b1d-9207-bd2f09af2d07/ENCFF289XSX.bigWig?response-content-disposition=attachment%3B%20filename%3DENCFF289XSX.bigWig&Expires=1552580563&Signature=sD3BmCkh1%2FcYrfPAQC2xulzFyyU%3D&AWSAccessKeyId=ASIATGZNGCNX6HEOCSVV&x-amz-security-token=FQoGZXIvYXdzEBUaDLP8pWCUg42a5OTyKyK3A23C%2B%2B0XcTYGOyH96liuoZiqQVmx280j8Vyc4YsPlBJGprrjtuaJ9HvNWjIN3JpPYoaPJOFVa6XpBTyfhzur8ZNGIwORUQc8o4LszZM8E55YyDxTmXholVFgzowmJfsRBvn6IYJuI5cZojEBXI7g7QuqAapmOkSUfLPPPsRTnkQFnPfpukSEupVDJ4NkT2CxgJn%2B5UzH20DYSGZ65F3GYF1%2BNHC5SJMrkdY0Pwj7MLbbKpYRnoF1%2B9eA7iiMzf%2BZhJPuIAXjIfv0jpDelRF2OIljzasGJo5QM2bd2MBDN54yt%2BKEmonVrPzqbSzACUlbYOiAs4WWsT7EMLiWF1kGV6FHFk5IJZDItH2PZEUqULx1inu%2FzTHk%2B6oO0q7CqYnjQg6DmR70KkT7VO9dqGvjWJBP7OdTTnWhbTIVt5fJGNx5CEnvypPjelylsbDzwtGyewmYh3v4H53ZcdK1s3d7FW5I61cNHsEhXdD%2BD%2BKq6EqdhecyceJ5cgfVmoscQNSP83y%2BYePmfcnY6av2XDgF3Mp9bxp9OZrpsTtJSvHGQoF6HmCC7tDB6G73XVxe3qCT6bFv1ZnqJ6Ao6uuh5AU%3D [577542827/577542827] -> "./ENCFF289XSX.bigWig" [1]
bigWigToBedGraph ./ENCFF289XSX.bigWig ENCFF289XSX.chr1.bedgraph -chrom=chr1
bedgraph_to_dense(ENCFF289XSX.chr1.bedgraph)
100%|█████████████████████████████████████████████████████████████████████████████████████████| 8502512/8502512 [13:53<00:00, 10201.60it/s]
decimate_vector
rm ENCFF289XSX.chr1.bedgraph
rm ./ENCFF289XSX.bigWig
>>> len(data[0])
9957851
Thanks!
Good catch. Thanks for submitting a PR. I threw this up for the imputation challenge, but didn't end up debugging it. I'm going to go through and add a lot more code to wrap Avocado, so hopefully I can fix this as a part of that.