问Pdf2htmlEx: html包含图像，我怎么能用图形而不是图像作为输出？
EN

Stack Overflow用户

提问于 2018-10-25 15:20:48

回答 1查看 553关注 0票数 1

我已经尝试了文档中的每一个命令，我怎么能只得到文本部分作为输出，而不是所有的图像？

https://github.com/coolwanglu/pdf2htmlEX/wiki/Command-Line-Options。

pdf2htmlex

回答 1

Stack Overflow用户

发布于 2018-10-31 17:52:02

我不确定你想要达到什么目的，因为问题主题和细节似乎是矛盾的，但有一些选项可以将图形和文本分割成单独的文件：

--embed <string>
   --embed-css <0|1> (Default: 1)
   --embed-font <0|1> (Default: 1)
   --embed-image <0|1> (Default: 1)
   --embed-javascript <0|1> (Default: 1)
   --embed-outline <0|1> (Default: 1)
          Specify which elements should be embedded into the  output  HTML
          file.

          If  switched  off,  separated files will be generated along with
          the HTML file for the corresponding elements.

          --embed accepts a string as argument. Each letter of the  string
          must  be  one  of  `cCfFiIjJoO`, which corresponds to one of the
          --embed-*** switches. Lower case letters for 0  and  upper  case
          letters  for  1.  For  example,  `--embed  cFIJo` means to embed
          everything but CSS files and outlines.

   --split-pages <0|1> (Default: 0)
          If turned on, the content of each page is stored in a  separated
          file.

          This  switch is useful if you want pages to be loaded separately
          & dynamically -- a supporting server might be necessary.

          Also see --page-filename.

因此，如果使用--split-pages 1和--embed-image 0选项，则每个PDF页面都有一个HTML页面，其中不包括嵌入的图像。

如果这不是您想要的，请在您的问题中包含更多信息。

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/52983549

复制

相似问题

问Pdf2htmlEx: html包含图像，我怎么能用图形而不是图像作为输出？
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问Pdf2htmlEx: html包含图像，我怎么能用图形而不是图像作为输出？EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问Pdf2htmlEx: html包含图像，我怎么能用图形而不是图像作为输出？
EN