
The above issue occurs only in WebKit-based browsers such as Chrome and Safari, which provide support for ligatures, whereas browsers like Firefox do not.

<blockquote>
 A <a href="https://en.wikipedia.org/wiki/Typographic_ligature" rel="nofollow noreferrer">ligature</a> is a combination of two or more letters joined as a single
 glyph 
</blockquote>

Root cause

The missing characters are due to the ligature support provided by these modern browsers. Let me explain how:

1. During conversion, the tool uses poppler to convert characters to glyphs for rendering. When these browsers come across character sequences such as tt, tf, ti, ff, or fi, they treat them as ligatures and search for a single glyph corresponding to tt rather than two separate t glyphs.

2. Since the corresponding ligature glyphs are not available, the browsers simply skip those characters and render the rest, which is why the characters appear to be missing.

This could be solved by:

Disabling ligatures in these browsers by embedding CSS in the generated content.
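As a rough sketch of what such CSS might look like (the universal selector and the exact combination of properties are my assumptions, and I have not verified this against pdf2htmlEX output), a rule like the following could be added to a style block in the generated page:

```css
/* Assumption: applied to every element of the generated page. */
* {
  /* Prefixed form for older WebKit builds; disables common ligatures. */
  -webkit-font-variant-ligatures: no-common-ligatures;
  /* Standard property; turns off all ligature substitution. */
  font-variant-ligatures: none;
  /* Lower-level fallback: switch off the OpenType "liga" and "clig" features. */
  font-feature-settings: "liga" 0, "clig" 0;
}
```

If this helps, it is probably worth narrowing the selector to the text elements the converter emits rather than keeping `*`.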

For more details, please refer to:

<ul>
<li><a href="https://stackoverflow.com/questions/19591746/prevent-ligatures-in-safari-mavericks-ios7-via-css">Prevent ligatures in Safari (Mavericks/iOS7) via CSS</a></li>
</ul>

Please correct me if I am wrong.

FONT ISSUES WITH PDF TO HTML CONVERSION

<ol>
<li>All "ti", "fi", and "tt" characters are missing</li>
</ol>

<a href="http://i.stack.imgur.com/8EGmm.png" rel="nofollow">SAMPLE SCREENSHOT</a>

<ol start="2">
<li>Font overlapping issue</li>
</ol>

<a href="http://i.stack.imgur.com/NTHuM.png" rel="nofollow">SAMPLE SCREENSHOT</a>

<ul>
<li>NOTE: I don't get this issue with Firefox. I get the above issues in the Chrome and Safari browsers.</li>
</ul>

I AM USING

<ul>
<li>Version 0.13.6 of pdf2htmlEX</li>
<li>The following command to convert PDF to HTML:</li>
</ul>

<blockquote>
 pdf2htmlEX --split-pages 1 --zoom 3 --fit-width 920 --correct-text-visibility 1 --dest-dir $1 $2 2>&amp;1
</blockquote>

TRIED

Using the --fallback 1 option solves all of the above problems, but:

<ol>
<li>The fallback option reduces the clarity of the document.</li>
<li>The table on the page disappears and is replaced with empty space.</li>
</ol>

QUESTIONS

<blockquote>
 <ol>
 <li>Could you please explain a bit more about the fallback option?</li>
 <li>I have tried the approach above (using fallback). Please suggest a different approach to solving these font problems if you prefer one.</li>
 </ol>
</blockquote>

I get the above issues in Chrome and Safari, whereas in Firefox it works fine.

Font misalignment during pdf to html conversion using pdf2htmlEx tool
