Oh Dear,
I am experiencing some crashes and thanks to python configured during install using "configure --with-pydebug" and thanks to https://pypi.python.org/pypi/faulthandler I have much better stacktraces in gdb and on STDERR.
Looks the route took me again to legacy BLASTN and to old bug in blast. NCBI asnwred to me in the past they won't fix legacy blastn. So, we have to fix biopython blastn parser, and now it even seems expat/biopython is crashing.
It will take me a while to get through all of the output but unless the bug is elsewhere the below stacktrace shpuld be enough . Most likely this is the bug I saw already in biopython-1.59 but right now have biopython-1.62b (pre-release-beta) installed, see the line numbers below.
Fatal Python error: Segmentation fault
Current thread 0x00007f9316072700:
File "/usr/lib64/python2.7/site-packages/Bio/Blast/NCBIXML.py", line 106 in endElement
File "/mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/pyexpat.c", line 618 in EndElement
File "/usr/lib64/python2.7/site-packages/Bio/Blast/NCBIXML.py", line 654 in parse
File "blah.py", line 19469 in parse_blastn_XML_and_write_csv
...
(gdb) where
#0 0x00007f9315810acb in raise () from /lib64/libpthread.so.0
#1 0x00007f93149365f6 in faulthandler_fatal_error (signum=11) at faulthandler.c:321
#2
#3 0x00007f9315bc6e40 in visit_decref (op=<unknown at remote 0x46966a0>, data=0x0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/gcmodule.c:360
#4 0x00007f9315abc37c in list_traverse (o=0x6998150, visit=0x7f9315bc6e02 <visit_decref>, arg=0x0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/listobject.c:2362
#5 0x00007f9315bc6f32 in subtract_refs (containers=0x7f9315e789c0 <generations+96>) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/gcmodule.c:385
#6 0x00007f9315bc7fb3 in collect (generation=2) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/gcmodule.c:925
#7 0x00007f9315bc830c in collect_generations () at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/gcmodule.c:1050
#8 0x00007f9315bc8fc3 in _PyObject_GC_Malloc (basicsize=408) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/gcmodule.c:1511
#9 0x00007f9315bc9064 in _PyObject_GC_NewVar (tp=0x7f9315e4f120 <PyFrame_Type>, nitems=1) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/gcmodule.c:1531
#10 0x00007f9315aae3e3 in PyFrame_New (tstate=0x20000a0, code=0x77d9d50, globals=
{'xml': <module at remote 0x448b268>, 'BlastParser': <classobj at remote 0x4d549e0>, '__builtins__': {'bytearray': <type at remote 0x7f9315e43520>, 'IndexError': <type at remote 0x7f9315e49a00>, 'all': <built-in function all>, 'help': <_Helper at remote 0x22091b0>, 'vars': <built-in function vars>, 'SyntaxError': <type at remote 0x7f9315e49380>, 'unicode': <type at remote 0x7f9315e60be0>, 'UnicodeDecodeError': <type at remote 0x7f9315e4a320>, 'memoryview': <type at remote 0x7f9315e54640>, 'isinstance': <built-in function isinstance>, 'copyright': <_Printer(_Printer__data='Copyright (c) 2001-2013 Python Software Foundation.\nAll Rights Reserved.\n\nCopyright (c) 2000 BeOpen.com.\nAll Rights Reserved.\n\nCopyright (c) 1995-2001 Corporation for National Research Initiatives.\nAll Rights Reserved.\n\nCopyright (c) 1991-1995 Stichting Mathematisch Centrum, Amsterdam.\nAll Rights Reserved.', _Printer__lines=None, _Printer__name='copyright', _Printer__dirs=(), _Printer__files=(...)) at remote 0x21bdc30>, 'NameError'...(truncated), locals=
{'self': <BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') at remote 0x77d0ed0>, _descr=<Description(e=<float at remote 0x68fa3c0>, title=u'gnl|BL_ORD_ID|14 poly_A'...(truncated)) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/frameobject.c:682
I suspect the bug happens because either of:
positives=(None, None)
identities=(None, None)
strand=(None, None)
I know these funny tuples were already reported for gaps and identities if I remember right .... so there may be more? :(
https://redmine.open-bio.org/issues/3363
https://redmine.open-bio.org/issues/3354
Why the expat crash on "Hsp_bit-score" (see rows #23 and #26 from gdb below)?
Nevertheless, I think biopython should sanitize its values if XML entry is crap. If you find why expat crashes than its only good. ;-)
#13 0x00007f9315baca72 in run_mod (mod=0x29b97b8, filename=0x7f9315c0fdd5 "",
globals={'xml': <module at remote 0x448b268>, 'BlastParser': <classobj at remote 0x4d549e0>, '__builtins__': {'bytearray': <type at remote 0x7f9315e43520>, 'IndexError': <type at remote 0x7f9315e49a00>, 'all': <built-in function all>, 'help': <_Helper at remote 0x22091b0>, 'vars': <built-in function vars>, 'SyntaxError': <type at remote 0x7f9315e49380>, 'unicode': <type at remote 0x7f9315e60be0>, 'UnicodeDecodeError': <type at remote 0x7f9315e4a320>, 'memoryview': <type at remote 0x7f9315e54640>, 'isinstance': <built-in function isinstance>, 'copyright': <_Printer(_Printer__data='Copyright (c) 2001-2013 Python Software Foundation.\nAll Rights Reserved.\n\nCopyright (c) 2000 BeOpen.com.\nAll Rights Reserved.\n\nCopyright (c) 1995-2001 Corporation for National Research Initiatives.\nAll Rights Reserved.\n\nCopyright (c) 1991-1995 Stichting Mathematisch Centrum, Amsterdam.\nAll Rights Reserved.', _Printer__lines=None, _Printer__name='copyright', _Printer__dirs=(), _Printer__files=(...)) at remote 0x21bdc30>, 'NameError'...(truncated),
locals={'self': <BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') at remote 0x77d0ed0>, _descr=<Description(e=<float at remote 0x68fa3c0>, title=u'gnl|BL_ORD_ID|14 poly_A'...(truncated), flags=0x7fff7c311d50, arena=0x4137e40)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/pythonrun.c:1365
#14 0x00007f9315bac923 in PyRun_StringFlags (str=0x77d0fc4 "self._end_Hsp_bit_score()", start=258,
globals={'xml': <module at remote 0x448b268>, 'BlastParser': <classobj at remote 0x4d549e0>, '__builtins__': {'bytearray': <type at remote 0x7f9315e43520>, 'IndexError': <type at remote 0x7f9315e49a00>, 'all': <built-in function all>, 'help': <_Helper at remote 0x22091b0>, 'vars': <built-in function vars>, 'SyntaxError': <type at remote 0x7f9315e49380>, 'unicode': <type at remote 0x7f9315e60be0>, 'UnicodeDecodeError': <type at remote 0x7f9315e4a320>, 'memoryview': <type at remote 0x7f9315e54640>, 'isinstance': <built-in function isinstance>, 'copyright': <_Printer(_Printer__data='Copyright (c) 2001-2013 Python Software Foundation.\nAll Rights Reserved.\n\nCopyright (c) 2000 BeOpen.com.\nAll Rights Reserved.\n\nCopyright (c) 1995-2001 Corporation for National Research Initiatives.\nAll Rights Reserved.\n\nCopyright (c) 1991-1995 Stichting Mathematisch Centrum, Amsterdam.\nAll Rights Reserved.', _Printer__lines=None, _Printer__name='copyright', _Printer__dirs=(), _Printer__files=(...)) at remote 0x21bdc30>, 'NameError'...(truncated),
locals={'self': <BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') at remote 0x77d0ed0>, _descr=<Description(e=<float at remote 0x68fa3c0>, title=u'gnl|BL_ORD_ID|14 poly_A'...(truncated), flags=0x7fff7c311d50)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/pythonrun.c:1328
#15 0x00007f9315b658b5 in builtin_eval (self=0x0, args=(u'self._end_Hsp_bit_score()',)) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/bltinmodule.c:695
#16 0x00007f9315ad5006 in PyCFunction_Call (func=, arg=(u'self._end_Hsp_bit_score()',), kw=0x0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/methodobject.c:81
#17 0x00007f9315b7b1d4 in call_function (pp_stack=0x7fff7c311f90, oparg=1) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/ceval.c:4021
#18 0x00007f9315b75cd8 in PyEval_EvalFrameEx (
f=Frame 0x3edea80, for file /usr/lib64/python2.7/site-packages/Bio/Blast/NCBIXML.py, line 106, in endElement (self=<BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') a...(truncated), throwflag=0)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/ceval.c:2666
#19 0x00007f9315b7870e in PyEval_EvalCodeEx (co=0x50a4bf0,
globals={'xml': <module at remote 0x448b268>, 'BlastParser': <classobj at remote 0x4d549e0>, '__builtins__': {'bytearray': <type at remote 0x7f9315e43520>, 'IndexError': <type at remote 0x7f9315e49a00>, 'all': <built-in function all>, 'help': <_Helper at remote 0x22091b0>, 'vars': <built-in function vars>, 'SyntaxError': <type at remote 0x7f9315e49380>, 'unicode': <type at remote 0x7f9315e60be0>, 'UnicodeDecodeError': <type at remote 0x7f9315e4a320>, 'memoryview': <type at remote 0x7f9315e54640>, 'isinstance': <built-in function isinstance>, 'copyright': <_Printer(_Printer__data='Copyright (c) 2001-2013 Python Software Foundation.\nAll Rights Reserved.\n\nCopyright (c) 2000 BeOpen.com.\nAll Rights Reserved.\n\nCopyright (c) 1995-2001 Corporation for National Research Initiatives.\nAll Rights Reserved.\n\nCopyright (c) 1991-1995 Stichting Mathematisch Centrum, Amsterdam.\nAll Rights Reserved.', _Printer__lines=None, _Printer__name='copyright', _Printer__dirs=(), _Printer__files=(...)) at remote 0x21bdc30>, 'NameError'...(truncated), locals=0x0, args=0x4172718, argcount=2, kws=0x0, kwcount=0, defs=0x0, defcount=0,
closure=0x0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/ceval.c:3253
#20 0x00007f9315ab0f2e in function_call (func=<function at remote 0x67ce8e8>,
arg=(<BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') at remote 0x77d0ed0>, _descr=<Description(e=<float at remote 0x68fa3c0>, title=u'gnl|BL_ORD_ID|14 poly_A', access...(truncated), kw=0x0)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/funcobject.c:526
#21 0x00007f9315a6f840 in PyObject_Call (func=<function at remote 0x67ce8e8>,
arg=(<BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') at remote 0x77d0ed0>, _descr=<Description(e=<float at remote 0x68fa3c0>, title=u'gnl|BL_ORD_ID|14 poly_A', access...(truncated), kw=0x0)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/abstract.c:2529
#22 0x00007f9315a8ba6f in instancemethod_call (func=<function at remote 0x67ce8e8>,
arg=(<BlastParser(_parser=<ExpatParser(_namespaces=0, _parser=None, _external_ges=0, _source=<InputSource(_InputSource__charfile=None, _InputSource__bytefile=None, _InputSource__public_id=None, _InputSource__system_id=None, _InputSource__encoding=None) at remote 0x6998600>, _bufsize=65516, _cont_handler=<...>, _dtd_handler=<DTDHandler() at remote 0x6998e70>, _entity_stack=[], _err_handler=<ErrorHandler() at remote 0x6998858>, _lex_handler_prop=None, _parsing=0, _ent_handler=<EntityResolver() at remote 0x6998948>, _interning=None) at remote 0x6998b28>, _mult_al=<MultipleAlignment(alignment=[]) at remote 0x77d0f40>, _debug=0, _hsp=<HSP(sbjct_end=None, sbjct='', bits=None, frame=(), query_end=None, score=None, gaps=(None, None), expect=None, query='', sbjct_start=None, positives=(None, None), align_length=None, num_alignments=None, identities=(None, None), query_start=None, strand=(None, None), match='') at remote 0x77d0ed0>, _descr=<Description(e=<float at remote 0x68fa3c0>, title=u'gnl|BL_ORD_ID|14 poly_A', access...(truncated), kw=0x0)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/classobject.c:2602
#23 0x00007f9315a6f840 in PyObject_Call (func=<instancemethod at remote 0x415f460>, arg=(u'Hsp_bit-score',), kw=0x0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/abstract.c:2529
#24 0x00007f9315b7a900 in PyEval_CallObjectWithKeywords (func=<instancemethod at remote 0x415f460>, arg=(u'Hsp_bit-score',), kw=0x0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/ceval.c:3890
#25 0x00007f930c412705 in call_with_frame (c=0x67d5bf0, func=<instancemethod at remote 0x415f460>, args=(u'Hsp_bit-score',), self=0x50a2ba0) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/pyexpat.c:355
#26 0x00007f930c4135bd in my_EndElementHandler (userData=0x50a2ba0, name=0x5367d60 "Hsp_bit-score") at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/pyexpat.c:616
#27 0x00007f930c1ef2d2 in doContent () from /usr/lib64/libexpat.so.1
#28 0x00007f930c1f01b4 in contentProcessor () from /usr/lib64/libexpat.so.1
#29 0x00007f930c1eae2a in XML_ParseBuffer () from /usr/lib64/libexpat.so.1
#30 0x00007f930c416199 in xmlparse_Parse (self=0x50a2ba0,
args=('n>\n <Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>\n <Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq>\n <Hsp_midline>|||||||||| |||||||||| ||||||||| ||</Hsp_midline>\n </Hsp>\n <Hsp>\n <Hsp_num>728</Hsp_num>\n <Hsp_bit-score>49.9773</Hsp_bit-score>\n <Hsp_score>54</Hsp_score>\n <Hsp_evalue>2.68758e-09</Hsp_evalue>\n <Hsp_query-from>97</Hsp_query-from>\n <Hsp_query-to>130</Hsp_query-to>\n <Hsp_hit-from>728</Hsp_hit-from>\n <Hsp_hit-to>761</Hsp_hit-to>\n <Hsp_query-frame>1</Hsp_query-frame>\n <Hsp_hit-frame>1</Hsp_hit-frame>\n <Hsp_identity>31</Hsp_identity>\n <Hsp_positive>31</Hsp_positive>\n <Hsp_align-len>34</Hsp_align-len>\n <Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>\n <Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq',...(truncated)) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Modules/pyexpat.c:902
#31 0x00007f9315ad5006 in PyCFunction_Call (func=<built-in method Parse of pyexpat.xmlparser object at remote 0x50a2ba0>,
arg=('n>\n <Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>\n <Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq>\n <Hsp_midline>|||||||||| |||||||||| ||||||||| ||</Hsp_midline>\n </Hsp>\n <Hsp>\n <Hsp_num>728</Hsp_num>\n <Hsp_bit-score>49.9773</Hsp_bit-score>\n <Hsp_score>54</Hsp_score>\n <Hsp_evalue>2.68758e-09</Hsp_evalue>\n <Hsp_query-from>97</Hsp_query-from>\n <Hsp_query-to>130</Hsp_query-to>\n <Hsp_hit-from>728</Hsp_hit-from>\n <Hsp_hit-to>761</Hsp_hit-to>\n <Hsp_query-frame>1</Hsp_query-frame>\n <Hsp_hit-frame>1</Hsp_hit-frame>\n <Hsp_identity>31</Hsp_identity>\n <Hsp_positive>31</Hsp_positive>\n <Hsp_align-len>34</Hsp_align-len>\n <Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>\n <Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq',...(truncated), kw=0x0)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Objects/methodobject.c:81
#32 0x00007f9315b7b1d4 in call_function (pp_stack=0x7fff7c312a70, oparg=2) at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/ceval.c:4021
#33 0x00007f9315b75cd8 in PyEval_EvalFrameEx (
f=Frame 0x58dab40, for file /usr/lib64/python2.7/site-packages/Bio/Blast/NCBIXML.py, line 654, in parse (handle=<file at remote 0x533db80>, debug=0, expat=<module at remote 0x67d2130>, BLOCK=1024, MARGIN=10, XML_START='<?xml', text='n>\n <Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>\n <Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq>\n <Hsp_midline>|||||||||| |||||||||| ||||||||| ||</Hsp_midline>\n </Hsp>\n <Hsp>\n <Hsp_num>728</Hsp_num>\n <Hsp_bit-score>49.9773</Hsp_bit-score>\n <Hsp_score>54</Hsp_score>\n <Hsp_evalue>2.68758e-09</Hsp_evalue>\n <Hsp_query-from>97</Hsp_query-from>\n <Hsp_query-to>130</Hsp_query-to>\n <Hsp_hit-from>728</Hsp_hit-from>\n <Hsp_hit-to>761</Hsp_hit-to>\n <Hsp_query-frame>1</Hsp_query-frame>\n <Hsp_hit-frame>1</Hsp_hit-frame>\n <Hsp_identity>31</Hsp_identity>\n ...(truncated), throwflag=0)
at /mnt/1TB/var/tmp/portage/dev-lang/python-2.7.5-r2/work/Python-2.7.5/Python/ceval.c:2666
It seems it crashed because there were TWO broken XML entries in the XML stream while and on the third (non-bogus) it crashed but deeply in it on <Hsp_num>728</Hsp_num> ...:
<Iteration>
<Iteration_iter-num>5195</Iteration_iter-num>
<Iteration_query-ID>lcl|5195_0</Iteration_query-ID>
<Iteration_query-def>EYI1BW404I60E4 length=245 xy=3653_1102 region=4 run=R_2007_11_06_15_29_46_</Iteration_query-def>
<Iteration_query-len>253</Iteration_query-len>
<Iteration_stat>
<Statistics>
<Statistics_db-num>30</Statistics_db-num>
<Statistics_db-len>20176</Statistics_db-len>
<Statistics_hsp-len>0</Statistics_hsp-len>
<Statistics_eff-space>0</Statistics_eff-space>
<Statistics_kappa>0.41</Statistics_kappa>
<Statistics_lambda>0.625</Statistics_lambda>
<Statistics_entropy>0.78</Statistics_entropy>
</Statistics>
</Iteration_stat>
<Iteration_message>No hits found</Iteration_message>
</Iteration>
<Iteration>
<Iteration_iter-num>5196</Iteration_iter-num>
<Iteration_query-ID>lcl|5196_0</Iteration_query-ID>
<Iteration_query-def>EYI1BW404I5AGB length=255 xy=3633_2713 region=4 run=R_2007_11_06_15_29_46_</Iteration_query-def>
<Iteration_query-len>259</Iteration_query-len>
<Iteration_stat>
<Statistics>
<Statistics_db-num>30</Statistics_db-num>
<Statistics_db-len>20176</Statistics_db-len>
<Statistics_hsp-len>0</Statistics_hsp-len>
<Statistics_eff-space>0</Statistics_eff-space>
<Statistics_kappa>0.41</Statistics_kappa>
<Statistics_lambda>0.625</Statistics_lambda>
<Statistics_entropy>0.78</Statistics_entropy>
</Statistics>
</Iteration_stat>
<Iteration_message>No hits found</Iteration_message>
</Iteration>
<Iteration>
<Iteration_iter-num>5197</Iteration_iter-num>
<Iteration_query-ID>lcl|5197_0</Iteration_query-ID>
<Iteration_query-def>EYI1BW404IB6HP length=88 xy=3302_0331 region=4 run=R_2007_11_06_15_29_46_</Iteration_query-def>
<Iteration_query-len>166</Iteration_query-len>
<Iteration_hits>
<Hit>
<Hit_num>1</Hit_num>
<Hit_id>gnl|BL_ORD_ID|14</Hit_id>
<Hit_def>poly_A</Hit_def>
<Hit_accession>14</Hit_accession>
<Hit_len>960</Hit_len>
<Hit_hsps>
<Hsp>
<Hsp_num>1</Hsp_num>
<Hsp_bit-score>49.9773</Hsp_bit-score>
<Hsp_score>54</Hsp_score>
<Hsp_evalue>2.68758e-09</Hsp_evalue>
<Hsp_query-from>97</Hsp_query-from>
<Hsp_query-to>130</Hsp_query-to>
<Hsp_hit-from>1</Hsp_hit-from>
<Hsp_hit-to>34</Hsp_hit-to>
<Hsp_query-frame>1</Hsp_query-frame>
<Hsp_hit-frame>1</Hsp_hit-frame>
<Hsp_identity>31</Hsp_identity>
<Hsp_positive>31</Hsp_positive>
<Hsp_align-len>34</Hsp_align-len>
<Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>
<Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq>
<Hsp_midline>|||||||||| |||||||||| ||||||||| ||</Hsp_midline>
</Hsp>
plenty matches and finally
<Hsp>
<Hsp_num>728</Hsp_num>
<Hsp_bit-score>49.9773</Hsp_bit-score>
<Hsp_score>54</Hsp_score>
<Hsp_evalue>2.68758e-09</Hsp_evalue>
<Hsp_query-from>97</Hsp_query-from>
<Hsp_query-to>130</Hsp_query-to>
<Hsp_hit-from>728</Hsp_hit-from>
<Hsp_hit-to>761</Hsp_hit-to>
<Hsp_query-frame>1</Hsp_query-frame>
<Hsp_hit-frame>1</Hsp_hit-frame>
<Hsp_identity>31</Hsp_identity>
<Hsp_positive>31</Hsp_positive>
<Hsp_align-len>34</Hsp_align-len>
<Hsp_qseq>AAAAAAAAAACAAAAAAAAAANAAAAAAAAACAA</Hsp_qseq>
<Hsp_hseq>AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA</Hsp_hseq>
<Hsp_midline>|||||||||| |||||||||| ||||||||| ||</Hsp_midline>
</Hsp>
Do you want "bt full" output from gdb instead. ;-)))))) This is likely the longest bug report I ever wrote and 4 A.M.