immunarch
immunarch copied to clipboard
Issues with repDiversity
I've been using repDiversity and noticed several issues, which I'm putting all here together for simplicity:
documentation issues
- .max.q default is actually 6, not 5:
.max.q | The max hill number to calculate (default: 5).
- "true diversity" is just the "hill q", is that correct? That is, using hill's formula, if we set the parameter
.q=3
then "true diversity" is equal to the value showed on the hill curve at x=3. If this is correct, it should be better stated in the documentation, possibly.
bugs
- when using
.method="d50"
, inside the plot I actually see:
D51 diversity index D51 diversity index
Number of clonotypes occupying the 51% of repertoires
The same happens if I manually try to specify .perc=50
.
- with
.method="raref"
, the "Sample size" stops too early, I'm not sure why. For example, the samples in my repertoire have 200-300 clonotypes each, summing up to thousands of clones (UMIs, in my case). When runningraref
, the x-axis goes only as far as 30 to 40 and estimate diversity is in the 1-2 range. What is the logic behind this?
Ah, regarding raref
I got it.. it's the normalisation step. To be honest it's not really clear to me what kind of normalisation is being performed and why. Can this be expanded a bit in the documentation? Does the .laplace
parameter matter for rarefaction curves?
Hi Fabio, noted, thank you!