scnym icon indicating copy to clipboard operation
scnym copied to clipboard

Add edge-case testing for "jackpot" cells

Open jacobkimmel opened this issue 3 years ago • 0 comments

Some cell profiles contain a large majority of reads mapping to a single gene (e.g. the Rn45s locus for low quality cells). These cells are usually filtered out during quality control, but in the event they persist in a dataset passed to scnym, they can lead to unstable training dynamics and failures to converge.

We should add simple quality control checks in .api.scnym_api that search for these cells in user provided datasets and throw a warning if they appear.

jacobkimmel avatar Jul 31 '21 21:07 jacobkimmel