我需要编写将图像转换为pdfs并将tchem合并为一个的脚本。
我尝试过使用img2pdf和PYPDF2,但是我遇到了错误。谁能看看,告诉我我做错了什么。
import img2pdf
import os
from PyPDF2 import PdfFileReader, PdfFileMerger, PdfFileWriter
merger = PdfFileMerger()
path = input()
for root,dir,files in os.walk(path):
for eachfile in files:
if "pdf" not in eachfile:
os.chdir(root)
PDFfile = img2pdf.convert((eachfile,), dpi=None, x=None, y=None)
merger.append(fileobj=PDFfile)
merger.write(open("out.pdf", "wb"))错误
Traceback (most recent call last):
File "C:/Users/ms/Desktop/Desktop/test.py", line 13, in <module>
merger.append(fileobj=PDFfile)
File "C:\Python34\lib\site-packages\PyPDF2\merger.py", line 203, in append
self.merge(len(self.pages), fileobj, bookmark, pages, import_bookmarks)
File "C:\Python34\lib\site-packages\PyPDF2\merger.py", line 133, in merge
pdfr = PdfFileReader(fileobj, strict=self.strict)
File "C:\Python34\lib\site-packages\PyPDF2\pdf.py", line 1065, in __init__
self.read(stream)
File "C:\Python34\lib\site-packages\PyPDF2\pdf.py", line 1660, in read
stream.seek(-1, 2)
AttributeError: 'bytes' object has no attribute 'seek'发布于 2015-12-17 12:26:35
img2pdf.convert返回对应pdf文件的字节(作为字符串?),而不是文件处理程序。如果您阅读help(merger.append),您将看到您需要传递一个文件处理程序或路径到该PDF文件。这里有一个可能的解决办法。也可能不生成所有的中间PDF文件。
import img2pdf
import os
from PyPDF2 import PdfFileReader, PdfFileMerger, PdfFileWriter
merger = PdfFileMerger()
path = "/tmp/images"
for root,dir,files in os.walk(path):
for eachfile in files:
if "pdf" not in eachfile:
os.chdir(root)
pdfbytes = img2pdf.convert((eachfile,), dpi=None, x=None, y=None)
pdfname = eachfile.split('.')[0]+".pdf"
f = open(pdfname, 'wb+')
f.write(pdfbytes)
merger.append(fileobj=f)
f.close()
f = open("out.pdf", "wb")
merger.write(f)
f.close()顺便说一句,使用常规工具(如转换、长春花、pdftk )也要简单得多。
https://stackoverflow.com/questions/34329968
复制相似问题