Nalini Ganapati issues

Results: 21 issues of Nalini Ganapati

Move to GenomicsDB 1.4.4

Allow multiple input types (BED, etc.) to be ingested into GenomicsDB

Currently, only VCF files can be ingested. Refactor the code to provide an API that ingests from multiple sources. This API should also be callable from the Java/Python/R bindings.
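
One possible shape for such an API, sketched in Python purely as an illustration. The names `Interval`, `IngestSource`, `BedSource`, and `ingest` are hypothetical and are not part of GenomicsDB. The sketch assumes every source only has to yield intervals in a common form, and the BED reader relies on the standard tab-separated chrom/start/end columns with 0-based, half-open coordinates.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Iterable, Iterator, List


@dataclass(frozen=True)
class Interval:
    """Common record that every (hypothetical) source is converted into."""
    contig: str
    start: int  # 0-based, inclusive (BED convention)
    end: int    # exclusive (BED convention)


class IngestSource(ABC):
    """Hypothetical interface: one subclass per input format."""

    @abstractmethod
    def intervals(self) -> Iterator[Interval]:
        ...


class BedSource(IngestSource):
    """Reads the first three columns of BED-formatted lines."""

    def __init__(self, lines: Iterable[str]):
        self._lines = lines

    def intervals(self) -> Iterator[Interval]:
        for line in self._lines:
            line = line.strip()
            # Skip blank lines, comments, and BED header lines.
            if not line or line.startswith(("#", "track", "browser")):
                continue
            chrom, start, end = line.split("\t")[:3]
            yield Interval(chrom, int(start), int(end))


def ingest(sources: Iterable[IngestSource]) -> List[Interval]:
    """Collect intervals from all sources through the shared interface."""
    collected: List[Interval] = []
    for source in sources:
        collected.extend(source.intervals())
    return collected
```

A VCF-backed source would be another `IngestSource` subclass, so callers in any binding would only need to depend on the shared interface.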

PLINK support has introduced an MPI dependency in test_gen.cc

The C unit tests are meant to test the basic functionality exposed by `/src/main/cpp` and `libtiledbgenomicsdb.so`, so they should not have any MPI dependencies. However, `gt_mpi_gather` can be used for testing...

Support MacOS/Linux arm64 architecture

GATK forum users have started requesting support for MacOS and Linux arm64 architectures - see [forum post](https://gatk.broadinstitute.org/hc/en-us/community/posts/5462468688539-Is-ARM64-Linux-MacOS-architecture-officially-supported-?page=1#community_comment_5972596856091).

roadmap

Annotation Service needs to be extended to support multiple alternate alleles

This is something to keep in mind when we start using this functionality. _Originally posted by @jPleyte in https://github.com/GenomicsDB/GenomicsDB/pull/186#discussion_r794222104_

Add ID field and handle duplicate names in INFO/FMT/FILTER fields correctly in vcf2genomicsdb_init

Explore using zstd as an optional alternative to Intel zlib in GenomicsDB

zstd appears to have very good decompression performance, so we should at least investigate it.

GenomicsDB API returns a null list when querying for variant calls with no rows specified

The default is `all rows` when no rows are specified, so the GenomicsDB API calls `query_variants` and `query_variant_calls` should consider all rows. Thanks @jacmarjorie for pointing out this issue.
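
The expected default can be sketched as follows. This is a minimal illustration in Python, not GenomicsDB code: the helper `effective_row_ranges` and its parameters are hypothetical, and it assumes rows are indexed from 0 to `num_rows - 1` and that ranges are inclusive pairs.

```python
from typing import List, Optional, Sequence, Tuple

RowRange = Tuple[int, int]  # inclusive (first_row, last_row)


def effective_row_ranges(
    requested: Optional[Sequence[RowRange]], num_rows: int
) -> List[RowRange]:
    """Return the row ranges a query should scan.

    Hypothetical helper: a missing or empty request is treated as
    "all rows" instead of producing an empty (or null) result.
    """
    if not requested:
        # No rows specified: fall back to the full row span, if any rows exist.
        return [(0, num_rows - 1)] if num_rows > 0 else []
    return list(requested)
```

With this behavior, a query that passes no row ranges scans every row, while an explicit request is passed through unchanged.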

Extend GitHub Actions to run with cloud emulators - Azurite, MinIO, etc.

This could be part of a separate workflow. We could just run run_spark_hdfs.py for pull requests.

Improve (mainly Java) logging in GenomicsDB

This is a placeholder to:
- Hold issues we are seeing with logging currently
- Collect ideas on how to improve logging