Git Product home page Git Product logo

dasper's Introduction

dasper

Lifecycle: maturing R build status Codecov test coverage BioC status DOI

The aim of dasper is to detect aberrant splicing events from RNA-seq data. By comparing patient RNA-seq data to a set of controls, dasper will score each splicing event in the patient representing the degree to which that splicing event looks abnormal. For a detailed guide on the usage of dasper, check out the vignette here.

Installation instructions

Get the latest stable R release from CRAN. Then install dasper from Bioconductor using the following code:

if (!requireNamespace("BiocManager", quietly = TRUE)) {
    install.packages("BiocManager")
}

BiocManager::install("dasper")

And the development version from GitHub with:

BiocManager::install("dzhang32/dasper")

Citation

Below is the citation output from using citation('dasper') in R. Please run this yourself to check for any updates on how to cite dasper.

print(citation("dasper"), bibtex = TRUE)
#> 
#> dzhang32 (2021). _Detecting abberant splicing events from
#> RNA-sequencing data_. doi: 10.18129/B9.bioc.dasper (URL:
#> https://doi.org/10.18129/B9.bioc.dasper),
#> https://github.com/dzhang32/dasper - R package version 1.3.8, <URL:
#> http://www.bioconductor.org/packages/dasper>.
#> 
#> A BibTeX entry for LaTeX users is
#> 
#>   @Manual{,
#>     title = {Detecting abberant splicing events from RNA-sequencing data},
#>     author = {{dzhang32}},
#>     year = {2021},
#>     url = {http://www.bioconductor.org/packages/dasper},
#>     note = {https://github.com/dzhang32/dasper - R package version 1.3.8},
#>     doi = {10.18129/B9.bioc.dasper},
#>   }
#> 
#> dzhang32 (2020). "Detecting abberant splicing events from
#> RNA-sequencing data." _bioRxiv_. doi: 10.1101/TODO (URL:
#> https://doi.org/10.1101/TODO), <URL:
#> https://www.biorxiv.org/content/10.1101/TODO>.
#> 
#> A BibTeX entry for LaTeX users is
#> 
#>   @Article{,
#>     title = {Detecting abberant splicing events from RNA-sequencing data},
#>     author = {{dzhang32}},
#>     year = {2020},
#>     journal = {bioRxiv},
#>     doi = {10.1101/TODO},
#>     url = {https://www.biorxiv.org/content/10.1101/TODO},
#>   }

Please note that the dasper was only made possible thanks to many other R and bioinformatics software authors, which are cited either in the vignettes and/or the paper(s) describing this package.

Code of Conduct

Please note that the dasper project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

Development tools

For more details, check the dev directory.

In particular, I am very grateful to Leo for his time and advice throughout the development of dasper. The transition of dasper Bioconductor-friendly package was made possible thanks to his biocthis package.

dasper's People

Contributors

aleighbrown avatar dzhang32 avatar hpages avatar lcolladotor avatar nturaga avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

dasper's Issues

[BUG] ImportError: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.29' not found

When running outlier_detect, I encountered this error

Error: BiocParallel errors
1 remote errors, element index: 1
1 unevaluated and other errors
first remote error:
Error in py_module_import(module, convert = convert): ImportError: /lib64/libstdc++.so.6: version `GLIBCXX_3.4.29' not found (required by /home/chualec/.cache/R/basilisk/1.10.2/dasper/1.7.0/env_sklearn/lib/python3.8/site-packages/scipy/spatial/_ckdtree.cpython-38-x86_64-linux-gnu.so)

I tried a few things to solve it, including updating the conda environment and installing gcc, but it didn't help.

[BUG] megadepth fails with paths containing spaces

Hello
Thanks for developing dasper.
I have an issue with coverage_norm function, where it fails with megadepth.
Essentially, the quotes around the path of the input bigwig file are stripped, so space characters in that path are now interpreted as breaks in the command, and it fails.

Although I submit my list of bigwig files using properly formed paths (with quotes), these quotes are stripped from the megadepth command that is run.

There are a few space characters in the path. My working directory is on OneDrive with a path I cannot change. I have no issues with running other R commands at that location, as a matter of fact the said bigwig files are made with GenomicAlignemnts::coverage and saved there with rtracklayer::export.bw.

I should also mention that I took the megadepth command echoed in the error message, I added the quotes that had been stripped, and I ran that in a command prompt window and it ran fine, yielding a temp file with name ending in _coverage.annotation.tsv

So my questions are:
-what would be the fix to avoid the stripping of the quotes? Where is the line of code that forms this command?
-I don't mind issuing the megadepth commands separately, but how would I go about picking up from there afterwards?

Thanks in advance for your help.

Alex

Cannot update repo

Hi David,

I tried fixing lcolladotor/derfinder#42 and updating the GHA workflow to use biocthis 1.3.6 but I ran into:

$ git remote -v
origin	[email protected]:dzhang32/dasper.git (fetch)
origin	[email protected]:dzhang32/dasper.git (push)
13:15 dasper $ git push origin
ERROR: Permission to dzhang32/dasper.git denied to lcolladotor.
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.

Best,
Leo

Plotting with mean coverage

Hi,

Thanks for the awesome tool!

I'm trying to plot coverage and coverage plotting functions supports a single case sample at a time. Can you please suggest on how I could plot mean coverage from multiple bigwigs?

Error during junction loading

Hey there,
I'm trying to load my SJ.out.tab files produced using DRAGEN RNA Pipeline. The file produced by this pipeline is pre-filtered, using these rules declared on Illumina support:

  1. SJ is a noncanonical motif and is only supported by < 3 unique mappings.
  2. SJ of length > 50000 and is only supported by < 2 unique mappings.
  3. SJ of length > 100000 and is only supported by < 3 unique mappings.
  4. SJ of length > 200000 and is only supported by < 4 unique mappings.
  5. SJ is a noncanonical motif and the maximum spliced alignment overhang is < 30.
  6. SJ is a canonical motif and the maximum spliced alignment overhang is < 12.

However, as I try to load my files, here comes the error:

Error in .junction_merge(junctions_all, junctions) : No strands should be left as NA after processing

Do you know how to deal with it?

Best,

DB

Issue with coverage processing

Hello,
I tried to run the coverage_norm function. However, I get the following error:

[1] "2023-08-04 10:50:08 - Obtaining exonic and intronic regions to load coverage from..."
[1] "2023-08-04 10:50:08 - Obtaining regions to use to normalise coverage..."
[1] "2023-08-04 10:50:08 - Importing gtf/gff3 as a TxDb..."
Import genomic features from the file as a GRanges object ... OK
Prepare the 'metadata' data frame ... OK
Make the TxDb object ... OK
1429 genes were dropped because they have exons located on both strands of the same
reference sequence or on more than one reference sequence, so cannot be represented by a
single genomic range.
Use 'single.strand.genes.only=FALSE' to get all the genes in a GRangesList object, or use
suppressMessages() to suppress this message.
Error in .Call2("C_solve_user_SEW0", start, end, width, PACKAGE = "IRanges") :
In range 240: at least two out of 'start', 'end', and 'width', must
be supplied.

Could you please help me with this? Thank you!
Best,
Salwa.

Can't install in R4.0.2

Hello,
I tried the following to install dasper but got errors:

if (!requireNamespace("BiocManager", quietly = TRUE)) {
    install.packages("BiocManager")
}

BiocManager::install("dasper")

which gave me this error:

Warning message:
package ‘dasper’ is not available (for R version 4.0.2) 

Then I tried

BiocManager::install("dzhang32/dasper")

and got:

ERROR: dependency ‘megadepth’ is not available for package ‘dasper’
* removing ‘/Library/Frameworks/R.framework/Versions/4.0/Resources/library/dasper’
Error: Failed to install 'dasper' from GitHub:
  (converted from warning) installation of package ‘/var/folders/6b/fk6yw91n0gn9r5sr99yc88x5_r9zrk/T//Rtmp2WW3yy/file1165c7f39b482/dasper_1.1.6.tar.gz’ had non-zero exit status

I tried to install megadepth but got error that it is not available for R 4.0.2 BiocManager::install("megadepth")

Warning message:
package ‘megadepth’ is not available (for R version 4.0.2)

Not sure what to try next. Thank you

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.