Edoardo Giacopuzzi

Results 21 comments of Edoardo Giacopuzzi

Great, thanks! I didn't realize that. This will do the trick. Now I can iterate and then apply del to unwanted INFO keys.

Hi, I'm testing glow on a local Spark stand-alone implementation (especially we are interested in the GWAS pipeline) and everything else worked fine so far. Essentially, I initialize a SparkSession...

Hi! Apologise for the late reply... In the end I've built a container with all Spark and Python dependencies and it works now! Thanks!

Hi, our main interest is using GLOW to run regenie GWAS algorithm at scale using Spark implementation provided in the GloWGR pipeline. So I've made a container based on the...

This is the Dockerfile I'm using right now. Feel free to improve and/or re-distribute this as long as my contribution is properly acknowledged. [Dockerfile_glow.zip](https://github.com/projectglow/glow/files/8303976/Dockerfile_glow.zip)

Thanks Brent. I've updated the nextflow pipeline to write somalier output to $TMPDIR and then move files back to the working dir. This fixed the issue. A little annoying, but...

Yes, I was suspecting something like that. With larger and larger datasets being generated I'm wondering if we can get an advantage by a new implementation nof the VCF file...

Supplementary reads are not yet implemented in the last release (0.7.1) right? I'm testing it to see how strobealign perform in our current WGS pipeline and I see no supplementary...

Thanks! This clarifies the conflicting options. Additional question: in your experience which is the maximum number of VCF that can be merged by survivor? I will possibly like to scale...

I'm trying now to merge the big dataset. Few question / suggestions: 1. How long can it take to merge a ~50k sample dataset without splitting by chrom? My medium...