massonix / hcatonsildata Goto Github PK
View Code? Open in Web Editor NEWProvide programmatic access to the tonsil cell atlas datasets
License: Other
Provide programmatic access to the tonsil cell atlas datasets
License: Other
Hey guys,
Congratulation on such amazing work,
I am just having some issues converting the sce to Seurat object
allseurat <- as.Seurat(All)
Error in UseMethod(generic = "as.sparse", object = x) :
no applicable method for 'as.sparse' applied to an object of class "c('DelayedMatrix', 'DelayedArray', 'DelayedUnaryIsoOp', 'DelayedUnaryOp', 'DelayedOp', 'Array', 'RectangularData')"
Would appreciate if you could help with that :D
Thanks a lot
Hi there,
Congrats on a great work, and thanks for developing this package.
I am attempting to pull TCR-seq data for T cells that also have the GEX/CITE-seq Seurat objects.
QUESTION 1. I saw the output of cellranger vdj
deposited at https://zenodo.org/record/6678331#.Y_iuOZPP00R within the CITE-seq folder. Would you mind clarifying what the subfolders (shown below, e.g., BCLLATLAST_XX, ifZOgenn_TpMNTvBa, mLuLpVxi_v0fLyotc, etc.) refer to? They are probably sequencing runs and gem ID.
QUESTION 2. I noticed you used scirpy
for the immune receptor repertoire analysis, and I could pull TCR from the vdj_t folders "barcode" that matched the cell barcodes of the meta.data of the T cell Seurat objects. But, would you mind letting me know whether you might already have the processed sc-VDJ data for T cells as an R/Python object - the ones that have the matched Seurat GEX/CITE-seq objects for T cells?
QUESTION 3. It looks like only the CITE-seq-ed cells have TCR data, but only Seurat object with ADT imputed was deposited. The cell barcodes are not the same between Seurat GEX object and TCR vdj, so I could not match the cells between these 2 datasets. Do you have CITE-seq Seurat object for CD8 T cells?
Thank you again very much for your help!
Hi, could you please inform where can I find the Visium data in your paper? Thanks~
Hi there,
Thanks again for all of your help in leveraging the great dataset you generated.
I subsetted the whole tonsil cite-seq Seurat object to obtain 2 cite-seq objects (one object for B cells expressing CD38 and one object for B cells not expressing CD38).
I followed the codes you kindly deposited to perform WNN analysis of RNA+ADT for each of these 2 objects, including harmony
to integrate by "gem_ID" within each object within each object.
I now hope to merge these 2 B cell objects and cluster them using RNA+ADT.
Would you minding providing some opinion whether I should simply use merge()
or what is the recommended workflow?
Thank you very much!
Hi there,
Sorry for another Q.
I downloaded the Seurat objects from Zenodo because the HCATonsilData did not work (issue #9).
However, I encountered another error when attempting to subset cite-seq Seurat object by protein expression. It only worked for protein names that do not have a parenthesis. Yet, VlnPlot
, AverageExpression
, rownames
works fine for these genes. Asked also in satijalab/seurat#6989
Would you mind advising how to proceed?
Thank you for your help.
Are raw counts available for the scRNA-seq data? From the commands history in the Seurat object, seems like the data in the "RNA" slot under assays is normalized.
library(HCATonsilData)
HCATonsilData(assayType="RNA")
#> snapshotDate(): 2021-10-19
#> Error in .local(x, i, j = j, ...): 'i' must be length 1
Created on 2022-10-23 by the reprex package (v2.0.1)
sessioninfo::session_info()
#> ─ Session info ───────────────────────────────────────────────────────────────
#> setting value
#> version R version 4.1.0 (2021-05-18)
#> os Ubuntu 20.04.5 LTS
#> system x86_64, linux-gnu
#> ui X11
#> language en_GB:en
#> collate en_GB.UTF-8
#> ctype en_GB.UTF-8
#> date 2022-10-23
#> pandoc 2.18 @ /usr/lib/rstudio/bin/quarto/bin/tools/ (via rmarkdown)
#>
#> ─ Packages ───────────────────────────────────────────────────────────────────
#> package * version date (UTC) lib source
#> AnnotationDbi 1.56.2 2021-11-09 [1] Bioconductor
#> AnnotationHub 3.2.2 2022-03-01 [1] Bioconductor
#> assertthat 0.2.1 2019-03-21 [1] CRAN (R 4.1.0)
#> Biobase 2.54.0 2021-10-26 [1] Bioconductor
#> BiocFileCache 2.2.1 2022-01-23 [1] Bioconductor
#> BiocGenerics 0.40.0 2021-10-26 [1] Bioconductor
#> BiocManager 1.30.18 2022-05-18 [1] CRAN (R 4.1.0)
#> BiocVersion 3.14.0 2021-05-19 [1] Bioconductor
#> Biostrings 2.62.0 2021-10-26 [1] Bioconductor
#> bit 4.0.4 2020-08-04 [1] CRAN (R 4.1.0)
#> bit64 4.0.5 2020-08-30 [1] CRAN (R 4.1.0)
#> bitops 1.0-7 2021-04-24 [1] CRAN (R 4.1.0)
#> blob 1.2.3 2022-04-10 [1] CRAN (R 4.1.0)
#> cachem 1.0.6 2021-08-19 [1] CRAN (R 4.1.0)
#> cli 3.4.1 2022-09-23 [1] CRAN (R 4.1.0)
#> crayon 1.5.2 2022-09-29 [1] CRAN (R 4.1.0)
#> curl 4.3.3 2022-10-06 [1] CRAN (R 4.1.0)
#> DBI 1.1.3 2022-06-18 [1] CRAN (R 4.1.0)
#> dbplyr 2.2.1 2022-06-27 [1] CRAN (R 4.1.0)
#> DelayedArray 0.20.0 2021-10-26 [1] Bioconductor
#> digest 0.6.30 2022-10-18 [1] CRAN (R 4.1.0)
#> dplyr 1.0.10 2022-09-01 [1] CRAN (R 4.1.0)
#> ellipsis 0.3.2 2021-04-29 [1] CRAN (R 4.1.0)
#> evaluate 0.16 2022-08-09 [1] CRAN (R 4.1.0)
#> ExperimentHub 2.2.1 2022-01-23 [1] Bioconductor
#> fansi 1.0.3 2022-03-24 [1] CRAN (R 4.1.0)
#> fastmap 1.1.0 2021-01-25 [1] CRAN (R 4.1.0)
#> filelock 1.0.2 2018-10-05 [1] CRAN (R 4.1.0)
#> fs 1.5.2 2021-12-08 [1] CRAN (R 4.1.0)
#> generics 0.1.3 2022-07-05 [1] CRAN (R 4.1.0)
#> GenomeInfoDb 1.30.1 2022-01-30 [1] Bioconductor
#> GenomeInfoDbData 1.2.7 2022-03-14 [1] Bioconductor
#> GenomicRanges 1.46.1 2021-11-18 [1] Bioconductor
#> glue 1.6.2 2022-02-24 [1] CRAN (R 4.1.0)
#> HCATonsilData * 0.0.0.9000 2022-10-23 [1] Github (massonix/HCATonsilData@59187d4)
#> HDF5Array 1.22.1 2021-11-14 [1] Bioconductor
#> highr 0.9 2021-04-16 [1] CRAN (R 4.1.0)
#> htmltools 0.5.3 2022-07-18 [1] CRAN (R 4.1.0)
#> httpuv 1.6.6 2022-09-08 [1] CRAN (R 4.1.0)
#> httr 1.4.4 2022-08-17 [1] CRAN (R 4.1.0)
#> interactiveDisplayBase 1.32.0 2021-10-26 [1] Bioconductor
#> IRanges 2.28.0 2021-10-26 [1] Bioconductor
#> KEGGREST 1.34.0 2021-10-26 [1] Bioconductor
#> knitr 1.39 2022-04-26 [1] CRAN (R 4.1.0)
#> later 1.3.0 2021-08-18 [1] CRAN (R 4.1.0)
#> lattice 0.20-45 2021-09-22 [1] CRAN (R 4.1.0)
#> lifecycle 1.0.3 2022-10-07 [1] CRAN (R 4.1.0)
#> magrittr 2.0.3 2022-03-30 [1] CRAN (R 4.1.0)
#> Matrix 1.4-1 2022-03-23 [1] CRAN (R 4.1.0)
#> MatrixGenerics 1.6.0 2021-10-26 [1] Bioconductor
#> matrixStats 0.62.0 2022-04-19 [1] CRAN (R 4.1.0)
#> memoise 2.0.1 2021-11-26 [1] CRAN (R 4.1.0)
#> mime 0.12 2021-09-28 [1] CRAN (R 4.1.0)
#> pillar 1.8.1 2022-08-19 [1] CRAN (R 4.1.0)
#> pkgconfig 2.0.3 2019-09-22 [1] CRAN (R 4.1.0)
#> png 0.1-7 2013-12-03 [1] CRAN (R 4.1.0)
#> promises 1.2.0.1 2021-02-11 [1] CRAN (R 4.1.0)
#> purrr 0.3.5 2022-10-06 [1] CRAN (R 4.1.0)
#> R.cache 0.16.0 2022-07-21 [1] CRAN (R 4.1.0)
#> R.methodsS3 1.8.2 2022-06-13 [1] CRAN (R 4.1.0)
#> R.oo 1.25.0 2022-06-12 [1] CRAN (R 4.1.0)
#> R.utils 2.12.0 2022-06-28 [1] CRAN (R 4.1.0)
#> R6 2.5.1 2021-08-19 [1] CRAN (R 4.1.0)
#> rappdirs 0.3.3 2021-01-31 [1] CRAN (R 4.1.0)
#> Rcpp 1.0.9 2022-07-08 [1] CRAN (R 4.1.0)
#> RCurl 1.98-1.9 2022-10-03 [1] CRAN (R 4.1.0)
#> reprex 2.0.1 2021-08-05 [1] CRAN (R 4.1.0)
#> rhdf5 2.38.1 2022-03-10 [1] Bioconductor
#> rhdf5filters 1.6.0 2021-10-26 [1] Bioconductor
#> Rhdf5lib 1.16.0 2021-10-26 [1] Bioconductor
#> rlang 1.0.6 2022-09-24 [1] CRAN (R 4.1.0)
#> rmarkdown 2.14 2022-04-25 [1] CRAN (R 4.1.0)
#> RSQLite 2.2.18 2022-10-04 [1] CRAN (R 4.1.0)
#> rstudioapi 0.13 2020-11-12 [1] CRAN (R 4.1.0)
#> S4Vectors 0.32.4 2022-03-24 [1] Bioconductor
#> sessioninfo 1.2.2 2021-12-06 [1] CRAN (R 4.1.0)
#> shiny 1.7.2 2022-07-19 [1] CRAN (R 4.1.0)
#> SingleCellExperiment 1.16.0 2021-10-26 [1] Bioconductor
#> stringi 1.7.8 2022-07-11 [1] CRAN (R 4.1.0)
#> stringr 1.4.0 2019-02-10 [1] CRAN (R 4.1.0)
#> styler 1.7.0 2022-03-13 [1] CRAN (R 4.1.0)
#> SummarizedExperiment 1.25.2 2021-11-08 [1] Github (Bioconductor/SummarizedExperiment@439eeff)
#> tibble 3.1.8 2022-07-22 [1] CRAN (R 4.1.0)
#> tidyselect 1.2.0 2022-10-10 [1] CRAN (R 4.1.0)
#> utf8 1.2.2 2021-07-24 [1] CRAN (R 4.1.0)
#> vctrs 0.5.0 2022-10-22 [1] CRAN (R 4.1.0)
#> withr 2.5.0 2022-03-03 [1] CRAN (R 4.1.0)
#> xfun 0.32 2022-08-10 [1] CRAN (R 4.1.0)
#> xtable 1.8-4 2019-04-21 [1] CRAN (R 4.1.0)
#> XVector 0.34.0 2021-10-26 [1] Bioconductor
#> yaml 2.3.6 2022-10-18 [1] CRAN (R 4.1.0)
#> zlibbioc 1.40.0 2021-10-26 [1] Bioconductor
#>
#> [1] /home/user/miniconda3/envs/r-4.1/lib/R/library
#>
#> ──────────────────────────────────────────────────────────────────────────────
Dear Ramon and colleagues,
Great work and thank you for making this data available. I am trying to access the data via level 4 (R) using the following command:
devtools::install_github("massonix/HCATonsilData", build_vignettes = TRUE)
However I get the following error message:
**Quitting from lines 117-119 (HCATonsilData.Rmd)
Error: processing vignette 'HCATonsilData.Rmd' failed with diagnostics:
HDF5. File accessibility. Unable to open file.
--- failed re-building 'HCATonsilData.Rmd'
SUMMARY: processing the following file failed:
'HCATonsilData.Rmd'
Error: Vignette re-building failed.
Execution halted**
Apologies if I am doing something wrong - can you please help?
Many thanks
BW
Amit
Hi there,
Thanks for the tool and the dataset.
I downloaded the Seurat object for CD8 T cells (CD8_T_seurat_obj) from https://zenodo.org/record/6340174#.Y_zdmbTP3ao.
The object shows the ADT
assay is available, but when access the protein expression levels for any of the proteins, the expression was 0 across cells.
Would you mind providing the cite-seq
Seurat object for CD8 T cells?
And, why is the protein expression in the available Seurat object = 0?
Thank you.
Hi Ramon,
Thanks for the great package and dataset - I'm getting an error trying to install:
devtools::install_github("massonix/HCATonsilData", build_vignettes = TRUE) Downloading GitHub repo massonix/HCATonsilData@HEAD ✔ checking for file ‘/private/var/folders/yh/ls2hbq6j7039g1b2ldmrkvbh0000gp/T/Rtmp5lxC9o/remotes9c5c5c953635/massonix-HCATonsilData-59187d4/DESCRIPTION’ (411ms) ─ preparing ‘HCATonsilData’: ✔ checking DESCRIPTION meta-information ... ─ installing the package to build vignettes ----------------------------------- ─ installing *source* package ‘HCATonsilData’ ... ** using staged installation ** R Error in parse(outFile) : /private/var/folders/yh/ls2hbq6j7039g1b2ldmrkvbh0000gp/T/RtmpG6nFZf/Rbuild9c7a4c45bdde/HCATonsilData/R/HCATonsilData.R:52:33: unexpected input 51: "pca", "harmony", "umap") 52: filePaths <- sapply(suffixes, \ ^ ERROR: unable to collate and parse R files for package ‘HCATonsilData’ ─ removing ‘/private/var/folders/yh/ls2hbq6j7039g1b2ldmrkvbh0000gp/T/RtmpG6nFZf/Rinst9c7a76b460f0/HCATonsilData’ ----------------------------------- ERROR: package installation failed Error: Failed to install 'HCATonsilData' from GitHub: System command 'R' failed, exit status: 1, stdout + stderr (last 10 lines): E> ** R E> Error in parse(outFile) : E> /private/var/folders/yh/ls2hbq6j7039g1b2ldmrkvbh0000gp/T/RtmpG6nFZf/Rbuild9c7a4c45bdde/HCATonsilData/R/HCATonsilData.R:52:33: unexpected input E> 51: "pca", "harmony", "umap") E> 52: filePaths <- sapply(suffixes, \ E> ^ E> ERROR: unable to collate and parse R files for package ‘HCATonsilData’ E> * removing ‘/private/var/folders/yh/ls2hbq6j7039g1b2ldmrkvbh0000gp/T/RtmpG6nFZf/Rinst9c7a76b460f0/HCATonsilData’ E> ----------------------------------- E> ERROR: package installation failed
Here's my sessionInfo()
`sessionInfo()
R version 4.0.3 (2020-10-10)
Platform: x86_64-apple-darwin19.6.0 (64-bit)
Running under: macOS Big Sur 10.16
Matrix products: default
BLAS: /usr/local/Cellar/r/4.0.3/lib/R/lib/libRblas.dylib
LAPACK: /usr/local/Cellar/r/4.0.3/lib/R/lib/libRlapack.dylib
locale:
[1] en_GB.UTF-8/en_GB.UTF-8/en_GB.UTF-8/C/en_GB.UTF-8/en_GB.UTF-8
attached base packages:
[1] stats graphics grDevices utils datasets methods base
loaded via a namespace (and not attached):
[1] ps_1.5.0 prettyunits_1.1.1 withr_2.5.0 rprojroot_2.0.2
[5] crayon_1.5.1 R6_2.5.1 lifecycle_1.0.1 magrittr_2.0.3
[9] rlang_1.0.3 cachem_1.0.6 cli_3.3.0 curl_4.3.2
[13] remotes_2.4.2 fs_1.5.2 callr_3.7.0 ellipsis_0.3.2
[17] devtools_2.4.3 tools_4.0.3 glue_1.6.2 purrr_0.3.4
[21] pkgload_1.3.0 fastmap_1.1.0 compiler_4.0.3 processx_3.5.2
[25] pkgbuild_1.2.0 sessioninfo_1.2.2 memoise_2.0.1 usethis_2.1.6`
Any ideas?
Thanks
Dan
Installing GCBC with processedCounts =TRUE gives the following error:
gcbc <- HCATonsilData(assayType = "RNA", cellType = "GCBC", processedCounts = TRUE)
snapshotDate(): 2022-04-26
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
see ?HCATonsilData and browseVignettes('HCATonsilData') for documentation
loading from cache
Error in H5Fopen(filepath, flags = "H5F_ACC_RDONLY") :
HDF5. File accessibility. Unable to open file.
Which doesn't happen with processedCounts = FALSE. Need to look in depth for future versions
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.