文章/答案/技术大牛

发布

社区首页 >问答首页 >如何使用python将.docx文件转换为html？

问如何使用python将.docx文件转换为html？
EN

Stack Overflow用户

提问于 2017-10-30 13:27:30

回答 3查看 10.5K关注 0票数 5

import mammoth

f = open("D:\filename.docx", 'rb')
document = mammoth.convert_to_html(f)

我无法获得一个.html文件，而我运行这段代码，请帮我得到它，当我转换成.html文件时，我没有得到图像插入到word文件到.html文件，你能帮我如何获得图像到.html从.docx？

python-2.7

回答 3

Stack Overflow用户

发布于 2017-10-30 13:52:18

试试这个：

import mammoth

f = open("path_to_file.docx", 'rb')
b = open('filename.html', 'wb')
document = mammoth.convert_to_html(f)
b.write(document.value.encode('utf8'))
f.close()
b.close()

票数 7

Stack Overflow用户

发布于 2017-10-30 13:34:47

我建议您尝试以下代码

    import mammoth
    with open("document.docx", "rb") as docx_file:
    result = mammoth.convert_to_html(docx_file)
    html = result.value

票数 2

Stack Overflow用户

发布于 2021-07-27 13:48:50

现在回答这个问题可能太晚了，但是如果有人仍然在寻找答案，那么下面的答案将会有所帮助，因为在转换为html之后，单词"tables/images/“应该保持不变。

import win32com.client as win32
# Open MS Word
word = win32.gencache.EnsureDispatch('Word.Application')

doc = word.Documents.Open("D:\filename.docx")
# change to a .html
txt_path = word_file.split('.')[0] + '.html'

# wdFormatFilteredHTML has value 10
# saves the doc as an html
doc.SaveAs(txt_path, 10)

doc.Close()
# noinspection PyBroadException
try:
    word.ActiveDocument()
except Exception:
    word.Quit()

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/47008593

复制

相似问题

问如何使用python将.docx文件转换为html？
EN

回答 3

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何使用python将.docx文件转换为html？EN

回答 3

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何使用python将.docx文件转换为html？
EN