bigstatsr
bigstatsr copied to clipboard
control MAF filtering?
Hi, when running a GWAS with big_spLinReg
I get a message saying that some number of "variables with low/no variation have been removed". I guess this means that some variants have been filtered based on some MAF threshold? Is it possible to control which threshold is applied?
Anni
You must compute the MAFs (e.g. with snp_MAF()
), and then provide those that pass some threshold with ind.col =
.
Ok, thanks! I will use that.
How the variables are filtered when user does not provide any indices for ind.col
? Is there some default threshold for MAF?
There is no MAF threshold since this function is not specific to genetic data; it can be used for any matrices.
IIRC, the function checks for standard deviations lower than 1e-6 and remove those. This would correspond to a very small MAF threshold. Basically it checks that it is different from 0.