Edoardo Giacopuzzi comments

Results: 21 comments of Edoardo Giacopuzzi

Select subset of INFO tags

Great, thanks! I didn't realize that. This will do the trick. Now I can iterate and then apply del to unwanted INFO keys.
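The "iterate, then del" approach mentioned above can be sketched as follows. This is a minimal sketch only: the comment does not say which VCF library is in use, so a plain dict stands in for a record's INFO field, and the helper name `drop_unwanted_info` and the example keys are hypothetical.

```python
def drop_unwanted_info(info, keep):
    """Delete every key of `info` that is not listed in `keep`.

    `info` is assumed to behave like a mutable mapping (a plain dict here,
    standing in for a VCF record's INFO field).
    """
    # Snapshot the keys first: deleting from a dict while iterating
    # over its live key view raises RuntimeError.
    for key in list(info.keys()):
        if key not in keep:
            del info[key]
    return info


# Hypothetical INFO content for one record.
record_info = {"AC": 2, "AF": 0.01, "DP": 154, "CSQ": "missense"}
drop_unwanted_info(record_info, keep={"AF", "DP"})
print(sorted(record_info))
```

Whether a real INFO object supports `del` and key iteration in the same way depends on the library, so this should be checked against its documentation.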

Installing and running glow outside of databricks

Hi, I'm testing glow on a local Spark stand-alone installation (we are especially interested in the GWAS pipeline) and everything else has worked fine so far. Essentially, I initialize a SparkSession...

Installing and running glow outside of databricks

Hi! Apologies for the late reply... In the end I built a container with all the Spark and Python dependencies and it works now! Thanks!

Installing and running glow outside of databricks

Hi, our main interest is using GLOW to run the regenie GWAS algorithm at scale using the Spark implementation provided in the GloWGR pipeline. So I've made a container based on the...

Installing and running glow outside of databricks

This is the Dockerfile I'm using right now. Feel free to improve and/or re-distribute this as long as my contribution is properly acknowledged. [Dockerfile_glow.zip](https://github.com/projectglow/glow/files/8303976/Dockerfile_glow.zip)

somalier ancestry and relate freeze on writing output on BeeGFS filesystem

Thanks Brent. I've updated the nextflow pipeline to write somalier output to $TMPDIR and then move files back to the working dir. This fixed the issue. A little annoying, but...
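The workaround described above (write to local scratch space, then move the results back) can be sketched in Python. This is an illustrative sketch under assumptions, not the actual nextflow change: the helper name `run_in_scratch` is hypothetical, and the command to run is left to the caller.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def run_in_scratch(command, dest_dir, scratch_root=None):
    """Run `command` inside a temporary directory, then move its outputs to `dest_dir`.

    With scratch_root=None, tempfile picks the system default location,
    which honors the TMPDIR environment variable.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    moved = []
    with tempfile.TemporaryDirectory(dir=scratch_root) as scratch:
        # The tool writes its output files into the scratch directory.
        subprocess.run(command, cwd=scratch, check=True)
        # Move every produced file or directory to the final destination.
        for item in sorted(Path(scratch).iterdir()):
            target = dest / item.name
            shutil.move(str(item), str(target))
            moved.append(target)
    return moved
```

Here `command` would be the argument list for the tool being run, with any input paths given as absolute paths because the working directory changes. The cost is one extra copy across filesystems, which matches the "a little annoying" trade-off noted in the comment.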

Echtvar slower on VCF with many samples

Yes, I suspected something like that. With larger and larger datasets being generated, I'm wondering if we could gain an advantage from a new implementation of the VCF file...

Supplementary reads

Supplementary reads are not yet implemented in the latest release (0.7.1), right? I'm testing it to see how strobealign performs in our current WGS pipeline and I see no supplementary...

details of merge options

Thanks! This clarifies the conflicting options. An additional question: in your experience, what is the maximum number of VCFs that can be merged by survivor? I would possibly like to scale...

details of merge options

I'm now trying to merge the big dataset. A few questions / suggestions: 1. How long can it take to merge a ~50k-sample dataset without splitting by chrom? My medium...