morris-lab / biddyetalworkflow Goto Github PK

View Code? Open in Web Editor NEW

27.0 27.0 11.0 14.97 MB

This repository contains our CellTag workflow, as deployed in our 2018 Biddy et al., Nature paper.

Awk 3.85% R 96.15%

human-cell-atlas

biddyetalworkflow's People

Contributors

Stargazers

Watchers

Forkers

asimazia leighnd hidekkobonba vynaveen jaymgrayson wli90 qsbsc sivanola akmazad theresa-chen kaveh8

biddyetalworkflow's Issues

custom celltag reference for alignment

Hi!
Thanks for the great workflow! For the custom reference used in the cellranger count function, do you add the entire celltag plasmid sequence or only a small portion containing the celltag barcode and perhaps the GFP sequence?

Thanks!

rows which exceeds .Machine$integer.max Error generating cell barcode x cell tag matrix

Hello!

I've created the parsed celltag list for both v1 and v3 libraries. When I try to run matrix.count.celltags.R, I get the following error for the v1 CellTag library, but not the V3:

Error in CJ(1:178363, 1:35378) : 
  Cross product of elements provided to CJ() would result in 6310126214 rows which exceeds .Machine$integer.max == 2147483647
Calls: dcast -> dcast.data.table -> do.call -> CJ

The v1 parsed file is much larger than the v3, and given the error, I suspect it has to do with the size? I have two samples, one with ~18,000 cells and one with ~6,000 cells and they both throw this error for the V1 library.

Have you seen this error with larger datasets?

In case it is relevant, I also get this warning about data.table:

Warning message:
package ‘data.table’ was built under R version 3.5.2

I'm using R 3.5.1.

Thanks!

Sincerely,
Lauren

why "CellTagV1_2" was selected in drawSubnet

Hi,

I noticed that in the drawSubnet function, you've chosen to use tag = "CellTagV1_2". Was there a specific reason for this choice? Additionally, does the figure in the paper also use "CellTagV1_2"?

drawSubnet(tag = "CellTagV1_2", overlay = "Cluster", linkList = linkList, Nodes = Nodes )

From what I understand, "CellTagV1_1" is the largest and most diverse clone. I'm curious as to why "CellTagV1_2" was selected instead.

Thank you.

Error in dcast.data.table(data = celltagCounts, formula = Cell.BC ~ Cell.Tag, : Can not cast an empty data.table

I followed the protocol step by step and got blocked by this error, Can somebody help me overcome it ?

Is it necessary to use the cellTag mapped with other chromosome?

Hi
I found when the parsing the output of bam, you mentioned you also output the other gene info

With the CellTag reads extracted we use a custom gawk script to parse the file and retain only the information we need. This scripts identifies and extracts the CellTag, Cell Barcode, and UMI sequences associated with each CellTag read. Furthermore, the read ID, read sequence, and any genes the read aligned to are extracted as well. This allows us to accurately quantify each CellTag and associate the CellTags with the correct cells. The output of this script is a tab delimited file with the following collumns: Read.ID, Read.Seq, Cell.BC, UMI, Cell.Tag, Gene. We will use this file to quantify and filter the CellTag data. Note that the regular expression identifying CellTagMEF has two bases added to the beginning. This is a "stricter" CellTag motif which helps filter out some erroneous CellTag reads.

But in my thinking, the only useful reads for celltag is in the CellTag.UTR or GFP chromosome and not other chromosome.
(Please correct me if I misunderstand something).
So is it necessary to keep these reads from other chromosome for downstream analysising ?

Best wishes
Guandong Shang

Running the CellTag workflow with just V1 dataset

Hello,

I have a single-cell RNAseq dataset run with the CellTag-V1 library. I am running the Workflow and am able to go through it successfully till the Clone Calling step. However, I run into an error at the Lineage and Network Visualization step for which the input Celltag data should be Nx3 matrix. Since I only have results for V1, how should I proceed at this step?

morris-lab / biddyetalworkflow Goto Github PK

biddyetalworkflow's People

Contributors

Stargazers

Watchers

Forkers

biddyetalworkflow's Issues

custom celltag reference for alignment

rows which exceeds .Machine$integer.max Error generating cell barcode x cell tag matrix

why "CellTagV1_2" was selected in drawSubnet

Error in dcast.data.table(data = celltagCounts, formula = Cell.BC ~ Cell.Tag, : Can not cast an empty data.table

Is it necessary to use the cellTag mapped with other chromosome?

Running the CellTag workflow with just V1 dataset

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent