CuratedAtlasQueryR

How to import dataset from collection?

Open YXC33 opened this issue 1 year ago • 7 comments

Hi! I ran into a problem when trying to import a dataset that belongs to a collection. Normally, when the dataset is not from a collection, I can get it with this code:

    curated_seurat_object <- metadata |>
      dplyr::filter(dataset_id == '37b21763-7f0f-41ae-9001-60bad6e2841d') |>
      get_seurat()

However, if I change the dataset_id to a dataset from a collection, I can't get the dataset. For example:

    curated_seurat_object <- metadata |>
      dplyr::filter(dataset_id == '9fcb0b73-c734-40a5-be9c-ace7eea401c9') |>
      get_seurat()

The web page for this dataset is: https://cellxgene.cziscience.com/e/9fcb0b73-c734-40a5-be9c-ace7eea401c9.cxg/

    '9fcb0b73-c734-40a5-be9c-ace7eea401c9' %in% unique(dplyr::pull(metadata, "dataset_id"))
    [1] FALSE

Since this check returned FALSE, I know the failure is not caused by the size of the dataset.

Could you please tell me how to get the dataset from collections?

YXC33 avatar Aug 19 '24 08:08 YXC33

Have you tried the collection_id column?

stemangiola avatar Aug 19 '24 11:08 stemangiola

> have you tried the column collection_id?

Thanks for the suggestion. However, I have already tried collection_id with this code:

    '71f4bccf-53d4-4c12-9e80-e73bfb89e398' %in% unique(dplyr::pull(metadata, "collection_id"))
    [1] FALSE

This is the collection page: https://cellxgene.cziscience.com/collections/71f4bccf-53d4-4c12-9e80-e73bfb89e398

I also checked, and there are only 76 collections in the collection_id column.

YXC33 avatar Aug 19 '24 11:08 YXC33

@myushen ?

stemangiola avatar Aug 19 '24 11:08 stemangiola

Hi @YXC33, it seems that dataset_id 9fcb0b73-c734-40a5-be9c-ace7eea401c9 does not exist in our metadata, so a Seurat object cannot be generated for it.

This can happen if the dataset was published recently and our API hasn't incorporated it yet.
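A minimal sketch of checking for this before calling get_seurat(), so that missing IDs are reported up front. The helper name check_dataset_ids and the toy table are hypothetical and only stand in for the real metadata table; they are not part of CuratedAtlasQueryR.

```r
library(dplyr)

# Hypothetical helper (not part of CuratedAtlasQueryR): splits the requested
# dataset IDs into those present in the metadata table and those missing.
check_dataset_ids <- function(metadata, ids) {
  available <- metadata |>
    dplyr::filter(dataset_id %in% !!ids) |>   # !! inlines the local vector
    dplyr::distinct(dataset_id) |>
    dplyr::pull(dataset_id)
  list(
    found   = intersect(ids, available),
    missing = setdiff(ids, available)
  )
}

# Toy stand-in for the metadata table (assumption: one row per cell,
# with a dataset_id column, as in the queries earlier in this thread).
toy_metadata <- data.frame(
  cell_id    = c("cell_1", "cell_2"),
  dataset_id = rep("37b21763-7f0f-41ae-9001-60bad6e2841d", 2)
)

result <- check_dataset_ids(
  toy_metadata,
  c("37b21763-7f0f-41ae-9001-60bad6e2841d",
    "9fcb0b73-c734-40a5-be9c-ace7eea401c9")
)

if (length(result$missing) > 0) {
  message("Not in metadata (possibly not incorporated yet): ",
          paste(result$missing, collapse = ", "))
}

# With the real metadata table, the found IDs could then be passed on, e.g.:
# metadata |> dplyr::filter(dataset_id %in% !!result$found) |> get_seurat()
```

With the toy table, the first ID is reported as found and the second as missing, which mirrors the situation described in this thread.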

myushen avatar Aug 20 '24 00:08 myushen

We will have the updated CELLxGENE data in a week or so.

stemangiola avatar Aug 20 '24 13:08 stemangiola

May I ask how often the metadata is updated? Also, the metadata only contains 329 unique datasets, but CELLxGENE has 1486 datasets.
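For reference, counts like these can be obtained with a sketch along the following lines. The toy table and its IDs are made up for illustration; with the real metadata table only the column names dataset_id and collection_id are assumed, as used earlier in this thread.

```r
library(dplyr)

# Toy stand-in for the metadata table (made-up IDs, one row per cell).
toy_metadata <- data.frame(
  cell_id       = c("cell_1", "cell_2", "cell_3", "cell_4"),
  dataset_id    = c("dataset-A", "dataset-A", "dataset-B", "dataset-B"),
  collection_id = c("collection-X", "collection-X", "collection-Y", "collection-Y")
)

# Count distinct datasets and collections in one pass.
counts <- toy_metadata |>
  dplyr::summarise(
    n_datasets    = dplyr::n_distinct(dataset_id),
    n_collections = dplyr::n_distinct(collection_id)
  )

print(counts)
```

Comparing these numbers with the totals shown on the CELLxGENE website gives a rough idea of how far the local metadata lags behind.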

YXC33 avatar Aug 30 '24 09:08 YXC33

Hello, we are completing the most recent update. It is probably 2 weeks from publication.

stemangiola avatar Aug 31 '24 12:08 stemangiola