Comments (3)
Hi,
I want to make an update on the signalling script. I sorted the signalling arguments but they took very long to proceed so I have skipped this script because I have limited runtime. I just ran the signal chunks straight to model 1 and combine the result afterwards. I wonder does it impact the downstream performance?
Second, I have an issue with the model 2 dimension issue in m5C predictions. My input file is 122G and I broke it down into the first 50000 lines for processing and it is still not working. I have checked the format and there are 3 columns: The transcript and the cytosine sites, a series of probability and the replicate tag. I double-checked there are no extra columns or data that is not aligned properly. The model ran smoothly for the first few minutes, but it stopped in the middle and reported the dimension issue:
"W tensorflow/core/framework/op_kernel.cc:1828] OP_REQUIRES failed at conv_ops_fused_impl.h:761 : INVALID_ARGUMENT: convolution input must be 4-dimensional: [1,32,99]"
I have tried adding a fourth column with '1's to fulfill the requirement, and removing the replicate tag column as the test data only has 2 columns, but the wrong format error will come up. Can you try to resolve it, please?
Dom
from cheui.
Hi,
Sorry about the run time issue. We are working on the faster preprocessing version and will have the update soon.
I think its ok to pass chunks to model1 and combine output at the end as you mentioned. Sort them before passing to model2.
i think the issue you are getting regarding the dimension is because of TensorFlow version. What version are you using ?
Either please install the recommended version TF 2.4.1
Or
please try replacing the below line in the CHEUI_predict_model2.py code
lr_probs = model.predict(prob_vectors)
to:
lr_probs = model.predict(prob_vectors.reshape(prob_vectors.shape[0], 99, 1))
It should be line number 199 and 220 in CHEUI_predict_model2.py.
Please let us know if it helps.
Thanks,
Akanksha
from cheui.
Hi Akanksha,
Thank you so much for your reply.
That is absolutely fine and totally understandable that it takes time to develop the new preprocessing script and I will finger-cross it for the CHEUI team.
That is great! I just want to make sure the result is not disrupted even if I am not following the combined script. Regarding model 2, I added the reshaped part for the Ir_probs. I think it is working properly and does generate much longer results now.
Bw,
Dominic
from cheui.
Related Issues (20)
- about reference transcriptome file and explanation of cheui output HOT 2
- multi-threading in running model 1 HOT 1
- Regarding output of the m6A probability and stoichiometry command HOT 3
- CHEUI model 2: prediction of stoichiometry and modification probability at transcriptomic sites : Executed with Error HOT 5
- Column header _position_ in site_level_prediction HOT 6
- Regarding CHEUI tool HOT 6
- CHEUI preprocess C++ compiling issue
- Low m5C modification site number overlapped with m5c-atlas database HOT 1
- SQK-RNA004 HOT 2
- Regarding prediction of stoichiometry
- Do CHEUI models work on Bacterial RNA modification detection? HOT 1
- Understanding of the read-level prediction file output HOT 1
- pickle data was truncated HOT 4
- Do CHEUI models work on SARS-CoV-2 RNA modification detection? HOT 1
- 2 Questions about model 2 and python/c++ preprocess HOT 1
- Compatibility with the new direct RNA seq kit HOT 2
- CHEUI_preprocess_m5C.py give error HOT 17
- Problem with preprocessm6A HOT 1
- stoichiometry and probability columns in the solo output HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cheui.