Han Altae-Tran

Results 10 comments of Han Altae-Tran

Linclust seemed to fail still with 3.6B sequences. Is the max 2^31-1?

There should be enough RAM, since the usage never tops 40% or so. It is a 2TB memory machine with 160 cores. Usually I get something like this, which happens...

the whole log overwhelms my system, but I captured most of the information that occurs before this bug: ``` clusterer:/mnt/cluster/filt_80_2020-04-19/combined12$ time /custom_install/installations/mmseqs-nonmpi/MMseqs2/build/bin/mmseqs linclust inDB linClu90DB tmp --min-seq-id 0.90 --kme r-per-seq...

I saw there was a potential fix regarding the database size limit on the github. Was anyone able to take a look at this issue? I'm excited to try MMseqs2...

I get this error whenever I try to run dispyscheduler.py as a background process using terminal (& process). I suspect it might be interpreting the & strangely, so I just...

Definitely looking for the same

Could it be because the HHsuite databases may only use the first 100 sequences (or some other number) in the alignment?

Also seeing similar bug during build: [INFO] Including org.apache.hadoop:hadoop-mapreduce-client-jobclient:jar:2.7.1 in the shaded jar. [INFO] Including com.uber.m3:tally-core:jar:0.2.1 in the shaded jar. [INFO] Including org.javassist:javassist:jar:3.18.1-GA in the shaded jar. [INFO] Including org.apache.hadoop:hadoop-yarn-server-common:jar:2.7.1...

For anyone wondering, I was able to switch the maven-shade-plugin to version 3.3 in the pom file and this removed all remaining errors. However, I am not sure if the...