Comments (4)
PDB code 2lwk can be viewed at https://files.rcsb.org/view/2LWK.pdb and contains:
HEADER RNA 01-AUG-12 2LWK
TITLE SOLUTION STRUCTURE OF SMALL MOLECULE-INFLUENZA RNA COMPLEX
COMPND MOL_ID: 1;
COMPND 2 MOLECULE: RNA (32-MER);
COMPND 3 CHAIN: A;
COMPND 4 ENGINEERED: YES
...
DBREF 2LWK A 1 32 PDB 2LWK 2LWK 1 32
SEQRES 1 A 32 G A G U A G A A A C A A G
SEQRES 2 A 32 G C U U C G G C C U G C U
SEQRES 3 A 32 U U U G C U
HET 0EC A 101 40
...
Confirming behaviour:
$ wget https://files.rcsb.org/view/2LWK.pdb
$ python -c "import sys; from Bio import SeqIO; SeqIO.convert('2LWK.pdb','pdb-seqres', sys.stdout, 'fasta')"
>2LWK:A PDB:2LWK 2LWK
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
The SEQRES lines here list nucleotides, not amino acid residues. As per the COMPND lines, this is an RNA chain, not protein.
This is as documented, although perhaps an error or warning would help for non-protein chains:
- pdb-seqres - Reads a Protein Data Bank (PDB) file to determine the complete protein sequence as it appears in the header (no dependencies).
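The all-X output above is what one would expect if each SEQRES residue name is translated through a three-letter amino acid table, with X as the fallback for names that are not found. A minimal sketch of that idea follows; the table `PROTEIN_3TO1` is a hypothetical four-entry subset and the function `to_protein_letters` is made up for illustration, so this is not Biopython's actual code.

```python
# Hypothetical subset of a three-letter to one-letter amino acid table.
PROTEIN_3TO1 = {"ALA": "A", "GLY": "G", "CYS": "C", "SER": "S"}


def to_protein_letters(residue_names, unknown="X"):
    """Translate residue names to one-letter codes, using `unknown` for misses."""
    return "".join(PROTEIN_3TO1.get(name, unknown) for name in residue_names)


# Residue names from the first SEQRES line of 2LWK (an RNA chain).
rna_names = "G A G U A G A A A C A A G".split()

# None of the one-letter nucleotide names is a key in the protein table,
# so every position falls back to the unknown symbol.
print(to_protein_letters(rna_names))
```

Under that assumption, the nucleotide names G, A, U and C never match a three-letter amino acid code, which is why the whole chain comes out as X.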
Closing as a duplicate of #3125.
Thank you for your clarification.
Indeed, I am seeking to retrieve the complete RNA sequence utilized in the experiment. Given that the structure consists of nucleotides and represents an RNA chain, I am interested in exploring if there is a method within Biopython or any alternative approach to obtain the complete RNA sequence from the PDB file.
Your guidance on how to accomplish this using Biopython or any other relevant tools would be greatly appreciated.
Thank you for your assistance.
Well, you could parse the SEQRES lines from the header (via the PDB parser), either with DIY code, or you might consider extending the code here to handle DNA and RNA:
https://github.com/biopython/biopython/blob/biopython-183/Bio/SeqIO/PdbIO.py#L113
I think it should look at the COMPND lines to tell if a chain is expected to be protein, DNA or RNA...
Or, you can look at the 3D structure but that would be more complicated.
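For the DIY route, here is a minimal sketch that reads the SEQRES records directly with the standard library. It assumes whitespace-separated fields with a non-blank chain ID, and that standard nucleotides are named A, C, G, U (RNA) or DA, DC, DG, DT (DNA). The function name `seqres_nucleotides` and the choice to write any other residue name as N are illustrative assumptions, not part of Biopython.

```python
from collections import defaultdict

# Residue names assumed for standard nucleotides in SEQRES records.
NUCLEOTIDES = {
    "A": "A", "C": "C", "G": "G", "U": "U",      # RNA
    "DA": "A", "DC": "C", "DG": "G", "DT": "T",  # DNA
}


def seqres_nucleotides(lines):
    """Return {chain_id: sequence} built from the SEQRES records in `lines`."""
    chains = defaultdict(list)
    for line in lines:
        if not line.startswith("SEQRES"):
            continue
        # Fields: record name, serial number, chain ID, chain length, residues...
        fields = line.split()
        chain_id = fields[2]
        for name in fields[4:]:
            # Anything that is not a standard nucleotide is written as N.
            chains[chain_id].append(NUCLEOTIDES.get(name, "N"))
    return {chain_id: "".join(seq) for chain_id, seq in chains.items()}


# The SEQRES records quoted earlier in this thread for 2LWK.
SAMPLE = """\
SEQRES 1 A 32 G A G U A G A A A C A A G
SEQRES 2 A 32 G C U U C G G C C U G C U
SEQRES 3 A 32 U U U G C U
"""

for chain_id, seq in seqres_nucleotides(SAMPLE.splitlines()).items():
    print(f">2LWK:{chain_id}")
    print(seq)
```

To run it on the downloaded file, pass an open file handle instead of `SAMPLE.splitlines()`. A more careful version would slice the fixed PDB columns rather than split on whitespace, and would check the COMPND lines or the residue names before deciding that a chain is nucleic acid.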
Closing as a duplicate of #3125 (I evidently clicked the wrong button last time)