
bioinformatics-cafe's Introduction


Unsorted scripts for bioinformatics

A miscellany of scripts for bioinformatics. Many of these scripts do something very specific and don't deserve their own repository. The quality of these programs varies a lot: some are reasonably well designed and tested, others are just stubs that I started and never finished.

To download a given script or file without cloning or downloading the entire repository, use:

wget "https://github.com/dariober/bioinformatics-cafe/blob/master/<PATH>/<TO>/<MY.SCRIPT>?raw=true" -O "<MY.SCRIPT>"

where <PATH>/<TO>/<MY.SCRIPT> are just placeholders. For example, download fastaRegexFinder.py and make it executable:

wget "https://github.com/dariober/bioinformatics-cafe/blob/master/fastaRegexFinder/fastaRegexFinder.py?raw=true" -O fastaRegexFinder.py
chmod a+x fastaRegexFinder.py

bioinformatics-cafe's People

Contributors

dariober


bioinformatics-cafe's Issues

xacct.py sort by numeric field

xacct.py -v
xacct.py 0.3.0

Sorting by MaxRSS gives:

xacct.py -s MaxRSS
Traceback (most recent call last):
  File "/home/db291g/bin/xacct.py", line 341, in <module>
    sacct_mrg= sorted(sacct_mrg, key=lambda k: k[s], reverse= rev)
TypeError: '<' not supported between instances of 'str' and 'int'

This is probably due to a mix of numbers and strings (such as the "." placeholder) in the sort field:

xacct.py
64377  WW00295.chr14.pindelChrom  node25    9027    48000   8          COMPLETED  Mon 25 15:44:36  02:29:10  
64413  WW00300.chr4.pindelChrom   node25    .       48000   8          RUNNING    Mon 25 15:44:36  00:07:03  
64561  WW00284.chr9.pindelChrom   .         .       48000   0          PENDING    Mon 25 15:44:39  00:00:00  
64339  WW00275.chr4.pindelChrom   node07    10753   48000   8          COMPLETED  Mon 25 15:50:45  05:52:27  
64414  WW00300.chr20.pindelChrom  node07    .       48000   8          RUNNING    Mon 25 15:50:45  00:00:54  
64562  WW00307.chr5.pindelChrom   .         .       48000   0          PENDING    Mon 25 15:51:11  00:00:00  
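
One possible way around the TypeError is to sort with a key that never compares a number directly with a string. The sketch below is not taken from xacct.py: the helper numeric_sort_key, the dictionary keys, and the sample rows are hypothetical and only loosely modelled on the output above. It assumes a missing value shows up as a non-numeric string such as ".".

```python
def numeric_sort_key(value):
    """Hypothetical helper: numbers sort first, in numeric order;
    anything that cannot be parsed as a number sorts after them as text."""
    try:
        return (0, float(value), "")
    except (TypeError, ValueError):
        return (1, 0.0, str(value))


# Hypothetical rows; the real structure of sacct_mrg may differ.
rows = [
    {"JobID": 64377, "MaxRSS": 9027},
    {"JobID": 64413, "MaxRSS": "."},
    {"JobID": 64339, "MaxRSS": 10753},
    {"JobID": 64561, "MaxRSS": "."},
]

s = "MaxRSS"
rev = False
# Same call shape as the line in the traceback, with the key wrapped.
sacct_mrg = sorted(rows, key=lambda k: numeric_sort_key(k[s]), reverse=rev)
print([r["JobID"] for r in sacct_mrg])  # [64377, 64339, 64413, 64561]
```

With rev set to True the placeholder rows come first, so a different tuple layout may be preferable if missing values should always sort last.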

Minor documentation issue for catInterleaveFastq.sh

Hi,

Thank you for providing your useful tools.

There is a minor issue with the documentation (output from -h) in the script catInterleaveFastq.sh:

The usage instructions say:

USAGE
catInterleaveFastq.sh -1 R1.L001.fq.gz R1.L002.fq.gz ... \\
                      -2 R2.L001.fq.gz R2.L002.fq.gz ...
First and second arguments are comma separated lists of first-in-pair and 
second-in-pair fastq file(s), respectively.

Clearly, the first and second arguments are not comma-separated lists but whitespace-separated lists, just as correctly shown in the example.

Cheers
Johan

fastaRegexFinder

Hi, perhaps this is not an issue but my lack of understanding of the syntax, but I am not able to perform a fuzzy regex search with your script. The syntax I am using is:
-r "(ATGC){e<=1}"

It is not reporting any hits, whereas both exact and fuzzy matches exist in my input. Exact match without {e<=1} works just fine. Thank you.
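
To confirm independently of the script that approximate matches really are present in the input, a small standalone check can help. The sketch below is not how fastaRegexFinder.py works, and its function name and test sequence are hypothetical. It only counts substitutions, which is narrower than {e<=1}: in the third-party Python regex module, whose fuzzy syntax the pattern above resembles, e also allows insertions and deletions.

```python
def fuzzy_hits(seq, motif, max_subs=1):
    """Hypothetical helper: return (start, window, mismatches) for every
    window of seq that differs from motif by at most max_subs substitutions."""
    seq, motif = seq.upper(), motif.upper()
    hits = []
    for i in range(len(seq) - len(motif) + 1):
        window = seq[i:i + len(motif)]
        # Hamming distance between the window and the motif.
        mismatches = sum(a != b for a, b in zip(window, motif))
        if mismatches <= max_subs:
            hits.append((i, window, mismatches))
    return hits


# Made-up sequence: one exact ATGC and one copy with a single substitution.
for start, window, mismatches in fuzzy_hits("TTATGCAAATCCGG", "ATGC"):
    print(start, window, mismatches)
# 2 ATGC 0
# 8 ATCC 1
```

Positions are 0-based and only the forward strand is scanned.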
