文章/答案/技术大牛

发布

社区首页 >问答首页 >无法从tabula或Camelot提取表

问无法从tabula或Camelot提取表
EN

Stack Overflow用户

提问于 2022-11-14 09:23:33

回答 1查看 31关注 0票数 0

尝试使用Tabula提取下表，但它返回的是空数据。对于其他类似的桌子，它也很好用。

也试过使用卡梅洛特，但效果不太好。对我如何提取这些有什么建议吗？

附加了我的密码

from tabula import read_pdf 
from tabulate import tabulate
from tabula import read_pdf
import pandas as pd
# from tabula.io import read_pdf

Page_No = 1
tables = read_pdf('/content/page1.pdf',pages=Page_No,multiple_tables=True)
df1 = pd.DataFrame(tables[0])
df1

import camelot

tables2=camelot.read_pdf('page1.pdf', flavor='lattice', pages='1')
tables2

python

dataframe

python-camelot

tabula-py

回答 1

Stack Overflow用户

回答已采纳

发布于 2022-11-14 13:53:46

在小报中加入风味=“guess=False”和“流”后，这个问题就得到了解决。

from tabula import read_pdf 
from tabulate import tabulate
from tabula import read_pdf
import pandas as pd
# from tabula.io import read_pdf

Page_No = 1
tables = read_pdf('/content/page1.pdf',pages=Page_No,guess=False,stream=True)
df1 = pd.DataFrame(tables[0])
df1

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/74429395

复制

相似问题

问无法从tabula或Camelot提取表
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问无法从tabula或Camelot提取表EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问无法从tabula或Camelot提取表
EN