I tried using Tabula to extract the table below, but it returns empty data. It works fine for other, similar tables.

I also tried Camelot, but it didn't work well either. Any suggestions on how I can extract these tables?
My code is attached:
import pandas as pd
from tabula import read_pdf
# from tabula.io import read_pdf

Page_No = 1
# multiple_tables=True returns a list of DataFrames, one per detected table
tables = read_pdf('/content/page1.pdf', pages=Page_No, multiple_tables=True)
df1 = pd.DataFrame(tables[0])
df1

import camelot
# lattice mode relies on ruling lines drawn between the cells
tables2 = camelot.read_pdf('page1.pdf', flavor='lattice', pages='1')
tables2

Answered on 2022-11-14 13:53:46
The issue was resolved after passing guess=False and stream=True to Tabula's read_pdf.
import pandas as pd
from tabula import read_pdf
# from tabula.io import read_pdf

Page_No = 1
# stream=True detects columns from whitespace instead of ruling lines;
# guess=False turns off automatic detection of the table area
tables = read_pdf('/content/page1.pdf', pages=Page_No, guess=False, stream=True)
df1 = pd.DataFrame(tables[0])
df1

https://stackoverflow.com/questions/74429395
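If it is not obvious in advance which extraction mode suits a given page, one option is to try several option sets in order and keep the first one that yields a non-empty table. The following is only a rough sketch of that idea, not part of the original answer: the helper name first_nonempty_tables and the particular list of option sets are assumptions made for illustration. The helper takes the reader function as an argument, so it can be used with tabula's read_pdf (whose lattice, stream, guess and pages parameters are real) or with any other callable that returns a list of tables.

```python
from typing import Any, Callable, Dict, List, Optional, Sequence, Tuple


def first_nonempty_tables(
    read: Callable[..., Sequence[Any]],
    path: str,
    option_sets: Sequence[Dict[str, Any]],
) -> Tuple[Optional[Dict[str, Any]], List[Any]]:
    """Hypothetical helper: try each option set in order and return the
    first one that produces at least one table with rows.

    `read` is any reader called as read(path, **options) that returns a
    sequence of tables (for example, a list of pandas DataFrames).
    """
    for options in option_sets:
        # len() of a DataFrame is its row count, so empty tables are dropped
        tables = [t for t in read(path, **options) if len(t) > 0]
        if tables:
            return options, tables
    # No option set produced any rows
    return None, []


# Example usage with tabula-py (requires Java and the PDF from the question):
#
#   from tabula import read_pdf
#   options, tables = first_nonempty_tables(
#       read_pdf,
#       '/content/page1.pdf',
#       [
#           {'pages': 1, 'lattice': True},
#           {'pages': 1, 'stream': True, 'guess': False},
#       ],
#   )
#   print(options)
```

Trying lattice first and stream second is an arbitrary choice here. For a table without ruling lines, as appears to be the case in this question, the stream option set is the one expected to return data.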