volkamerlab / ratar Goto Github PK
View Code? Open in Web Editor NEWRead-across the targetome
Home Page: http://gepris.dfg.de/gepris/projekt/391684253?language=en
License: MIT License
Read-across the targetome
Home Page: http://gepris.dfg.de/gepris/projekt/391684253?language=en
License: MIT License
Molecules contain atoms belonging not to standard amino acids:
For calculations including z-scales, all atoms that do not belong to a standard amino acids are removed.
What about other encoding methods such as pdbqt?
Here is a list of the packages that are used in the PR refactoring #1 , queried using https://github.com/volkamerlab/ratar/search?p=1&q=import
biopandas
pandas
numpy
seaborn
pymol
Testing or env scripts:
pytest
yaml
Used but not necessary to put in conda env, since already in python (see here):
How robust are reference points in binding site?
Binding site definition/size varies between datasets:
What size is needed for good performance of encoding method?
Can we compare performance on these datasets with each other?
First three moments of distribution:
Hi @AndreaVolkamer,
I think the paths that you mentioned in your email are only at places where the code is using input or output data paths.
The inputs are not part of this repo but must be on your lab's harddrive. Note:
Examples that I found:
Line 43 in 197d3f4
Line 7 in 197d3f4
The folder structures on the harddrive should be untouched, hence updating the path prefix /home/dominique/Documents
to wherever you copy the data should be enough. Happy to help here if you send me a directory tree once you found the data on the harddrive.
One big note: We used to work in the lab until 2019 (I think) still with pickle files which is not helpful when you do not have fixed versions of a package. So you likely won't be able to load any of the pickle files anymore. As we stopped development on ratar
shortly after, we did not update that output file format.
Write class method from_path
, analogous to from_molecule
. Current problem: files can contain multiple molecules, thus from_path
would return a list of molecule objects instead of a molecule object as in case of from_molecule
.
Differing behaviour here will not work well downstream, right? Check this.
Updates needed for this code base:
See PR #14 for details.
similarity
modulesimilarity
module - refactoring needed?pymol
dependency (not on conda-forge
; currently installed from tpeulen
)flatten-dict
dependency (only pip
-installable)from_path
class method to all ratar.encoding
classes: Write class method from_path
, analogous to from_molecule
. Current problem: files can contain multiple molecules, thus from_path
would return a list of molecule objects instead of a molecule object as in the case of from_molecule
.logging.conf
file to fine-grain our logging. Include back into the package if of interest.ratar
environment - enable conda
packagingThe installation instructions in the README only work for <3.12.
I recommend the following two steps:
python=3.8
after downloading the package and follow the instructions.Update setup.py Python version from >=3.8 to >=3.12
Update versioneer.py so that it is compatible with Python 3.12
@AndreaVolkamer can you please ping your student here?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.