goci
Summary statistics file formatter
- PLINK 2.0 has an option to output files in the gwas-ssf standard format.
- Not everyone uses PLINK to generate their files for submission to the GWAS Catalog.
- How do users who are not using PLINK generate their gwas-ssf files?
- What can we do to make it easier for users to generate gwas-ssf?
- Software to convert existing tabular data to gwas-ssf
- What type of user - CLI or GUI?
- Acceptable input formats?
- Limitations? Resource considerations?
- Is this just a typical ETL problem?
- Existing generic tools:
- petl Python package and CLI
- sumstats formatter Python package and CLI
- assess users' (curators') requirements
- make more user friendly, e.g. an interactive CLI
- support a sequence of chained operations, like the petl exec chaining idea (handled by the above)
- deprecate sum-stats-formatter and merge into: https://github.com/EBISPOT/gwas-sumstats-tools
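
To make the "convert existing tabular data to gwas-ssf" idea concrete, here is a minimal sketch using only the Python standard library. It is not the API of petl, the sumstats formatter, or gwas-sumstats-tools. The input column names (`CHR`, `BP`, ...) and the helper names (`rename_columns`, `drop_incomplete`, `convert`) are hypothetical, and the target field names are assumed from the gwas-ssf standard and should be checked against the current specification. The generator steps are chained in the spirit of the "sequence of operations" item above.

```python
import csv
import io

# Hypothetical mapping from one user's column names to gwas-ssf field names.
# The target names are an assumption; verify them against the gwas-ssf spec.
COLUMN_MAP = {
    "CHR": "chromosome",
    "BP": "base_pair_location",
    "A1": "effect_allele",
    "A2": "other_allele",
    "BETA": "beta",
    "SE": "standard_error",
    "FRQ": "effect_allele_frequency",
    "P": "p_value",
}


def rename_columns(rows, column_map):
    """Keep only mapped columns and rename them to the target field names."""
    for row in rows:
        yield {column_map[k]: v for k, v in row.items() if k in column_map}


def drop_incomplete(rows, required):
    """Skip rows where any required field is missing or empty."""
    for row in rows:
        if all(row.get(field) not in (None, "") for field in required):
            yield row


def convert(in_handle, out_handle, column_map=COLUMN_MAP, in_delimiter=","):
    """Stream a delimited table to tab-separated output; return rows written."""
    reader = csv.DictReader(in_handle, delimiter=in_delimiter)
    fields = list(column_map.values())
    # Chain the steps lazily so large files are never held in memory at once.
    steps = drop_incomplete(rename_columns(reader, column_map), fields)
    writer = csv.DictWriter(
        out_handle, fieldnames=fields, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    written = 0
    for row in steps:
        writer.writerow(row)
        written += 1
    return written


if __name__ == "__main__":
    source = io.StringIO(
        "CHR,BP,A1,A2,BETA,SE,FRQ,P\n"
        "1,12345,A,G,0.02,0.01,0.3,0.05\n"
    )
    target = io.StringIO()
    convert(source, target)
    print(target.getvalue())
```

Because each step is a generator, rows are processed one at a time, which speaks to the resource question raised above. A real tool would also need to validate values and handle the other input formats users submit.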