lassosum icon indicating copy to clipboard operation
lassosum copied to clipboard

pseudovalidate() with no LD

Open euffelmann opened this issue 2 years ago • 1 comments

Does pseudovalidate() also provide a valid estimate of corr(PRS, Phenotype) when there is no LD? That is, I'm simulating SNP data with no LD, then run a GWAS of a disease phenotype using these SNPs. Using the resulting summary stats, and a test.bfile that also has no LD, will pseudovalidate provide a correct estimate?

euffelmann avatar Oct 27 '22 11:10 euffelmann

Not sure why you want to use pseudovalidate rather than simply correlating the generated phenotype with PRS. In general, pseudovalidation is deprecated in favour of splitvalidation. See the README.md for more details.

tshmak avatar Oct 28 '22 03:10 tshmak

Sorry, above I meant I use ref.bfile without LD. So I would like to use lassosum to get an estimate of R2 from the summary statistics and and a reference sample alone (no test data). As I understand, for this I can use pseudovalidate. splitvalidation requires a test.bfile.

euffelmann avatar Nov 02 '22 12:11 euffelmann

In general, pseudovalidation is not very robust. It's based on a heuristic and its performance on real life datasets is unknown. Hence, I don't recommend it in general. I also don't know what's the impact of LD (or the lack thereof) has on it.

tshmak avatar Nov 03 '22 05:11 tshmak

That's also the feeling I got from my simulations. Thank you!

euffelmann avatar Nov 03 '22 08:11 euffelmann