我正在尝试使用BCBio GFF解析器解析一个GFF文件,我得到了以下错误。有人能帮我解决这个问题吗?
回溯(最近一次调用):
File "gff_parse.py", line 6, in <module>
for rec in GFF.parse(in_handle):
File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 709, in parse
File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 299, in parse_in_parts
File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 320, in parse_simple
File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 603, in _gff_process
File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 634, in _lines_to_out_info
File "build/bdist.linux-x86_64/egg/BCBio/GFF/GFFParser.py", line 183, in _gff_line_map
ValueError: invalid literal for int() with base 10: 'New Start'下面是我的代码:
from BCBio import GFF
in_file = "infile.gff"
in_handle = open(in_file)
for rec in GFF.parse(in_handle):
print rec
in_handle.close()谢谢图利卡
发布于 2011-04-21 18:28:16
您是如何生成GFF文件的?它似乎至少包含一个无效的行。第四列应该包含一个特征的起始坐标的整数;错误信息指出它包含值'New Start‘。
GFF3 specification page有一些有效的GFF示例,online validator可以帮助调试像这样的格式化问题。
https://stackoverflow.com/questions/5742379
复制相似问题