EPACTS icon indicating copy to clipboard operation
EPACTS copied to clipboard

Emmax test

Open LivUllmann opened this issue 4 years ago • 10 comments

Hello!

I am running emmax gene-based CMC test on the sample 47,000 participants. The input vcf file includes 1824 variants. The number of individuals is the same kinship, vcf and ped file. However, the analysis is taking more than 2 days. So far no file with association results has been generated, only Makefile, .phe, .ind, .grp , .cov and .reml files and a large .eigr.R file. I am using epacts version UKBB.chr1.22.emmaxCMC.AC.eigR Is there any reason why the analysis is taking so long? Thank you for your help!

LivUllmann avatar Apr 07 '21 20:04 LivUllmann

I am using epacts version 3.3.2

LivUllmann avatar Apr 07 '21 21:04 LivUllmann

Is the timestamp on the eigr file recent? Can you check that you are not low on RAM and using swap?

jonathonl avatar Apr 08 '21 18:04 jonathonl

the eigr was created on 07-04 18:19. RAM: MemTotal: 196533944 kB MemFree: 36321224 kB MemAvailable: 159619784 kB

LivUllmann avatar Apr 08 '21 19:04 LivUllmann

Sorry, I was actually looking for the modification timestamp. I'm trying assess whether the reml step is still running and making progress. Is there anything informative in the stderr output?

jonathonl avatar Apr 08 '21 19:04 jonathonl

I have actually cancelled the jog for that file, because it was taking too long. I ran the analysis on chromosome 17, and got and error: NOTICE - Reading eigenvectors NOTICE - Allocating a size 18446744071655423224 bytes

terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc make: *** [chr17.emmaxCMC.AC.no.cov.0.epacts] Aborted

LivUllmann avatar Apr 08 '21 19:04 LivUllmann

Hmm... that is way too much memory for 47,000 individuals. Seems like a bug.

Can you try the latest pre-release https://github.com/statgen/EPACTS/releases or the latest from the develop branch?

jonathonl avatar Apr 08 '21 19:04 jonathonl

I don't think EMMAX works for >20K samples..

Hyun.

Hyun Min Kang, Ph.D. Associate Professor of Biostatistics University of Michigan, Ann Arbor Email : @.***

On Thu, Apr 8, 2021 at 3:41 PM Jonathon LeFaive @.***> wrote:

Hmm... that is way too much memory for 47,000 individuals. Seems like a bug.

Can you try the latest pre-release https://github.com/statgen/EPACTS/releases or the latest from the develop branch?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/statgen/EPACTS/issues/30#issuecomment-816109193, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABPY5OLRHKGACZNLR5EMSNDTHYBGPANCNFSM42RO5TZQ .

hyunminkang avatar Apr 08 '21 19:04 hyunminkang

Thank you! I will try 3.4.2 release

LivUllmann avatar Apr 08 '21 19:04 LivUllmann

@LivUllmann just curious if 3.4.2 release was able to solve your problem? I am working with a vcf file with 61000 samples and 41000 variants. its been 22 days emmaxCMC is running but not output yet. it is consuming only 1 cpu to 100% and 170 gb memory.

smhaider avatar Apr 27 '22 13:04 smhaider

I don't think EMMAX works for >20K samples.. Hyun. ----------------------------------------------------- Hyun Min Kang, Ph.D. Associate Professor of Biostatistics University of Michigan, Ann Arbor Email : @.*** On Thu, Apr 8, 2021 at 3:41 PM Jonathon LeFaive @.***> wrote: Hmm... that is way too much memory for 47,000 individuals. Seems like a bug. Can you try the latest pre-release https://github.com/statgen/EPACTS/releases or the latest from the develop branch? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#30 (comment)>, or unsubscribe https://github.com/notifications/unsubscribe-auth/ABPY5OLRHKGACZNLR5EMSNDTHYBGPANCNFSM42RO5TZQ .

Dear Prof Dr. Kang, just wondering if there is a way to handle 61000 samples in EPACTS?

smhaider avatar Apr 29 '22 11:04 smhaider