我正在努力学习TFIDF。但是我不能从文件中找到这些字。
代码:
docA = open("/home/user/Desktop/da/doca","r")
print(docA.read())
bowA = docA.split(" ")错误:
AttributeError
Traceback (most recent call last)
<ipython-input-32-06e07f9dd975> in <module>
----> 1 bowA = docA.split(" ")
AttributeError: '_io.TextIOWrapper' object has no attribute 'split'`
Can anyone help me solve this?发布于 2020-02-27 18:25:11
我假设你是这个意思:
docA = open("/home/user/Desktop/da/doca","r")
# print(docA.read())
bowA = docA.read().split(" ") # or just split() will do
docA.close()当您调用read()时,读游标读取整个文件,将读游标留在最后。因此再次调用read()将返回空字符串。因此,如果您想要打印内容,可以将内容赋给一个变量,打印它并根据需要使用它:
docA = open("/home/user/Desktop/da/doca","r")
data = docA.read()
print(data)
bowA = data.split()
docA.close()或者简单地说
with open("/home/user/Desktop/da/doca","r") as docA:
data = docA.read()
print(data)
bowA = data.split()发布于 2020-02-27 18:16:16
您希望使用返回的字符串而不是文件句柄:
docA = open("/home/user/Desktop/da/doca","r")
document_string = docA.read()
bowA = document_string.split()您可以只调用split,默认情况下它会拆分空格
https://stackoverflow.com/questions/60430667
复制相似问题