Dear Dr. Gonçalves, I really like your Crispy paper, and I am trying

The input of Crispy are pandas dataframes for <code c

Crispy copy number input file and CRISPR screen input file format question about crispy HOT 3 CLOSED

emanuelgoncalves commented on June 22, 2024

Crispy copy number input file and CRISPR screen input file format question

from crispy.

Comments (3)

EmanuelGoncalves commented on June 22, 2024

This was an incomplete merge, thanks for flagging. I've updated the example and included the missing files.

Update your installation pip install -U cy

In the example, I've also added a few more details on the file requirements and you can consult the pandas dataframes for a running example.

from crispy.

EmanuelGoncalves commented on June 22, 2024

The input of Crispy are pandas dataframes for copy_number and library and pandas series for sgrna_fc

You have an example on how the data should be structured.

from crispy.

EmanuelGoncalves commented on June 22, 2024

You should have all that in the example in the initial page of the project github, have you tried to run it? Below for your convenience:

import crispy as cy
import matplotlib.pyplot as plt
from crispy.CRISPRData import ReadCounts, Library

"""
Import sample data
"""
rawcounts, copynumber = cy.Utils.get_example_data()

"""
Import CRISPR-Cas9 library

Important:
      Library has to have the following columns: "Chr", "Start", "End", "Approved_Symbol"
      Library and segments have to have consistent "Chr" formating: "Chr1" or "chr1" or "1"
      Gurantee that "Start" and "End" columns are int
"""
lib = Library.load_library("Yusa_v1.1.csv.gz")

lib = lib.rename(
    columns=dict(start="Start", end="End", chr="Chr", Gene="Approved_Symbol")
).dropna(subset=["Chr", "Start", "End"])

lib["Chr"] = "chr" + lib["Chr"]

lib["Start"] = lib["Start"].astype(int)
lib["End"] = lib["End"].astype(int)

"""
Calculate fold-change
"""
plasmids = ["ERS717283"]
rawcounts = ReadCounts(rawcounts).remove_low_counts(plasmids)
sgrna_fc = rawcounts.norm_rpm().foldchange(plasmids)

"""
Correct CRISPR-Cas9 sgRNA fold changes
"""
crispy = cy.Crispy(
    sgrna_fc=sgrna_fc.mean(1), copy_number=copynumber, library=lib.loc[sgrna_fc.index]
)

# Fold-changes and correction integrated funciton.
# Output is a modified/expanded BED formated data-frame with sgRNA and segments information
#   n_sgrna: represents the minimum number of sgRNAs required per segment to consider in the fit.
#            Recomended default values range between 4-10.
bed_df = crispy.correct(n_sgrna=10)
print(bed_df.head())

# Gaussian Process Regression is stored
crispy.gpr.plot(x_feature="ratio", y_feature="fold_change")
plt.show()

from crispy.

Crispy copy number input file and CRISPR screen input file format question about crispy HOT 3 CLOSED

Comments (3)

Related Issues (4)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent