首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >如何将CoNLL格式转换为空间格式

如何将CoNLL格式转换为空间格式
EN

Stack Overflow用户
提问于 2021-07-26 05:04:56
回答 1查看 1.8K关注 0票数 3

我目前正在研究一个NER模型。我有一堆以CoNLL格式存储的数据,需要转换成Spacy格式。在CoNLL中,句子的每个单词旁边都有一个标签。在Spacy中,标记只显示在具有实际标记的单词中。如何从以下(CoNLL)格式转换

代码语言:javascript
复制
From    O
2001    B-DateTime
to  I-DateTime
2004    I-DateTime
,   O
I   O
was O
a   O
stagehand   O
for O
Hartford    B-Company
Stage   I-Company
Company O
.   O

到以下格式(Spacy)

代码语言:javascript
复制
TRAIN_DATA = [('what is the price of polo?', {'entities': [(21, 25, 'PrdName')]}), 
              ('what is the price of ball?', {'entities': [(21, 25, 'PrdName')]}), 
              ('what is the price of jegging?', {'entities': [(21, 28, 'PrdName')]}), 
              ('what is the price of t-shirt?', {'entities': [(21, 28, 'PrdName')]}), 
              ('what is the price of jeans?', {'entities': [(21, 26, 'PrdName')]}), 
              ('what is the price of bat?', {'entities': [(21, 24, 'PrdName')]}), 
              ('what is the price of shirt?', {'entities': [(21, 26, 'PrdName')]}), 
              ('what is the price of bag?', {'entities': [(21, 24, 'PrdName')]}), 
              ('what is the price of cup?', {'entities': [(21, 24, 'PrdName')]}), 
              ('what is the price of jug?', {'entities': [(21, 24, 'PrdName')]}), 
              ('what is the price of plate?', {'entities': [(21, 26, 'PrdName')]}), 
              ('what is the price of glass?', {'entities': [(21, 26, 'PrdName')]}),
              ('what is the price of watch?', {'entities': [(21, 26, 'PrdName')]})]
EN

回答 1

Stack Overflow用户

发布于 2021-07-27 05:07:13

只需使用空间变换

代码语言:javascript
复制
spacy convert input.conll -c conll ./output/

注意,默认情况下,这会生成一个二进制.spacy文件。JSON格式在v3中不受欢迎,对它没有多大帮助。

票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/68524723

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档