Comments (2)
The following pipeline currently fails with an unhelpful error, which commenting out discard_per_amino_acid_embeddings: true
fixes:
global:
sequences_file: fasta.fa
prefix: simple
seqvec_embeddings:
type: embed
protocol: seqvec
reduce: True
#discard_per_amino_acid_embeddings: True
weights_file: /mnt/nfs/models/seqvec/weights_file
options_file: /mnt/nfs/models/seqvec/options_file
annotations_from_seqvec:
type: extract
protocol: seqvec_from_publication
depends_on: seqvec_embeddings
Output:
80%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████▊ | 4/5 [00:03<00:00, 1.24it/s]
Traceback (most recent call last):
File "/home/kschuetze/bio_embeddings/.venv/bin/bio_embeddings", line 33, in <module>
sys.exit(load_entry_point('bio-embeddings', 'console_scripts', 'bio_embeddings')())
File "/home/kschuetze/bio_embeddings/bio_embeddings/utilities/cli.py", line 24, in main
parse_config_file_and_execute_run(arguments.config_path[0], overwrite=arguments.overwrite)
File "/home/kschuetze/bio_embeddings/bio_embeddings/utilities/pipeline.py", line 202, in parse_config_file_and_execute_run
execute_pipeline_from_config(config, **kwargs)
File "/home/kschuetze/bio_embeddings/bio_embeddings/utilities/pipeline.py", line 169, in execute_pipeline_from_config
stage_output_parameters = stage_runnable(**stage_parameters)
File "/home/kschuetze/bio_embeddings/bio_embeddings/extract/pipeline.py", line 158, in run
return PROTOCOLS[kwargs["protocol"]](**kwargs)
File "/home/kschuetze/bio_embeddings/bio_embeddings/extract/pipeline.py", line 31, in seqvec_from_publication
return predict_annotations_using_basic_models("seqvec_from_publication", **kwargs)
File "/home/kschuetze/bio_embeddings/bio_embeddings/extract/pipeline.py", line 48, in predict_annotations_using_basic_models
check_required(kwargs, ['embeddings_file', 'mapping_file', 'remapped_sequences_file'])
File "/home/kschuetze/bio_embeddings/bio_embeddings/utilities/helpers.py", line 32, in check_required
", ".join(missing), params
bio_embeddings.utilities.exceptions.MissingParameterError: Missing required parameters: embeddings_file
Given: {'sequences_file': 'simple/sequences_file.fasta', 'prefix': 'simple', 'simple_remapping': False, 'mapping_file': 'simple/mapping_file.csv', 'remapped_sequences_file': 'simple/remapped_sequences_file.fasta', 'type': 'extract', 'protocol': 'seqvec_from_publication', 'reduce': True, 'discard_per_amino_acid_embeddings': True, 'weights_file': '/mnt/nfs/models/seqvec/weights_file', 'options_file': '/mnt/nfs/models/seqvec/options_file', 'stage_name': 'annotations_from_seqvec', 'start_time': '2020-09-05 22:24:30.492982+02:00', 'max_amino_acids': 15000, 'reduced_embeddings_file': 'simple/seqvec_embeddings/reduced_embeddings_file.h5', 'end_time': '2020-09-05 22:24:30.490852+02:00', 'elapsed_time': '0:00:26.489258', 'depends_on': 'seqvec_embeddings'}
from bio_embeddings.
Documenting this extensively is hard.
I tried to address parts of this textually via the updated examples README in PR 42 on GitLab.
The solution to this is to read the first few lines of each protocol (in the code). There, neccessary files (and parameters) are check_required
. This is also why that's one of the first lines: fail soon and fix problems before wasting someone's day.
P.S.: also documented in the parameters_blueprint.yml
file!
from bio_embeddings.
Related Issues (20)
- It may be better to let the user define whether the same redundant is allowed or not HOT 1
- AssertionError: Failed in nopython mode pipeline (step: native lowering) Storing i64 to ptr of i32 ('dim'). FE type int32
- I want to know the configuration of the people who run successfully HOT 2
- CUDA out of memory. HOT 2
- Add support for ESM-2 and ESMFold HOT 3
- Update jax-unirep dependency version
- Protocol prottrans_t5_xl_u50: URLError: <urlopen error [Errno 113] No route to host> HOT 1
- OSError: Unable to open file (truncated file: eof = 63504384, sblock->base_addr = 0, stored_eof = 374434776)
- Custom embeddings
- Docker containers shutting down within a few seconds of starting
- Can not install bio-embedding in wsl HOT 2
- Can not install bioembedings on ubuntu? Please help HOT 1
- Hard times trying to run the bindEmbed21 example.
- Error during first step - greenlet size changed, may indicate binary incompatibility. HOT 1
- 3D Protein Embeddings
- AttributeError: 'dict' object has no attribute '__NUMPY_SETUP__'
- Cant install bio_embeddings in colab HOT 2
- Tensor size issue
- Protocol prottrans_t5_xl_u50: PermissionError: [Errno 13] Permission denied: 'C:\\Users\\user\\AppData\\Local\\Temp\\tmpk7e1m0jg' HOT 1
- from where comes the models in "bio_embeddings/utilities /defaults.yml", where is docs, parameters, dataset ?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bio_embeddings.