Mike Lin

Results 116 issues of Mike Lin

Minor updates to the embedding pipelines in Nov-2025: - update to transcriptformer 0.6.0 - `census_contrib`: - update dependencies - adjust `roundHalfToEven` typing for numpy 2 - tune TileDB consolidation settings

Our regular procedure for generating Census scVI embeddings trains the scVI model with an up-to-date HVG h5ad, but doesn't revisit the [hyperparameters](https://github.com/chanzuckerberg/cellxgene-census/blob/main/tools/models/scvi/scvi-config.yaml) set a couple of years ago, since which...

discovery

Follow-on to #1440 The builder has two different source files named `census_summary.py`. One computes the `census["census_info"]["summary"]` table stored in the tiledbsoma artifacts, and the other computes slightly different statistics shown...

tech

These two procedures seem very similar, maybe one can use the other or we can factor out some shared core.

tech debt

Configure instructions for this repository as documented in [Best practices for Copilot coding agent in your repository](https://gh.io/copilot-coding-agent-tips).