文章/答案/技术大牛

发布

社区首页 >问答首页 >在JPedal中的某一点突出显示word

问在JPedal中的某一点突出显示word
EN

Stack Overflow用户

提问于 2012-09-26 14:23:41

回答 1查看 1.4K关注 0票数 2

我想实现一个功能，允许用户双击以突出显示使用JPedal库的PDF文档中的单词。如果我可以获得单词的边界矩形并查看MouseEvent位置是否在其中，这将是一件很简单的事情；下面的代码片段演示了如何突出显示一个区域：

private void highlightText() {
    Rectangle highlightRectangle = new Rectangle(firstPoint.x, firstPoint.y,
            secondPoint.x - firstPoint.x, secondPoint.y - firstPoint.y);
    pdfDecoder.getTextLines().addHighlights(new Rectangle[]{highlightRectangle}, false, currentPage);
    pdfDecoder.repaint();
}

然而，我只能在文档中找到纯文本提取示例。

java

pdf

jpedal

回答 1

Stack Overflow用户

回答已采纳

发布于 2012-09-27 16:06:34

在看了Mark的例子之后，我设法让它工作起来。有几个怪癖，所以我将解释它是如何工作的，以防它对其他人有帮助。关键方法是extractTextAsWordlist，当给定要从中提取的区域时，它将返回{word1, w1_x1, w1_y1, w1_x2, w1_y2, word2, w2_x1, ...}形式的List<String>。下面列出了分步说明。

首先，您需要将MouseEvent的组件/屏幕坐标转换为PDF页面坐标，并更正缩放：

/**
 * Transforms Component coordinates to page coordinates, correcting for 
 * scaling and panning.
 * 
 * @param x Component x-coordinate
 * @param y Component y-coordinate
 * @return Point on the PDF page
 */
private Point getPageCoordinates(int x, int y) {
    float scaling = pdfDecoder.getScaling();
    int x_offset = ((pdfDecoder.getWidth() - pdfDecoder.getPDFWidth()) / 2); 
    int y_offset = pdfDecoder.getPDFHeight();
    int correctedX = (int)((x - x_offset + viewportOffset.x) / scaling);
    int correctedY = (int)((y_offset - (y + viewportOffset.y))  / scaling);
    return new Point(correctedX, correctedY);
}

接下来，创建一个框来扫描文本。我选择将其设置为页面宽度，垂直方向设置为+/- 20个页面单位(这是一个相当随意的数字)，以MouseEvent为中心

/**
 * Scans for all the words located with in a box the width of the page and 
 * 40 points high, centered at the supplied point.
 * 
 * @param p Point to centre the scan box around
 * @return  A List of words within the scan box
 * @throws PdfException
 */
private List<String> scanForWords(Point p) throws PdfException {
    List<String> result = Collections.emptyList();
    if (pdfDecoder.getlastPageDecoded() > 0) {
        PdfGroupingAlgorithms currentGrouping = pdfDecoder.getGroupingObject();
        PdfPageData currentPageData = pdfDecoder.getPdfPageData();
        int x1 = currentPageData.getMediaBoxX(currentPage);
        int x2 = currentPageData.getMediaBoxWidth(currentPage) + x1;
        int y1 = p.y + 20;
        int y2 = p.y - 20;
        result = currentGrouping.extractTextAsWordlist(x1, y1, x2, y2, currentPage, true, "");
    }
    return result;
}

然后，我将其解析为一系列Rectangle：

/**
 * Parse a String sequence of:
 *   {word1, w1_x1, w1_y1, w1_x2, w1_y2, word2, w2_x1, ...}
 *   
 * Into a sequence of Rectangles.
 * 
 * @param wordList Word list sequence to parse
 * @return A List of Rectangles
 */
private List<Rectangle> parseWordBounds(List<String> wordList) {
    List<Rectangle> wordBounds = new LinkedList<Rectangle>();
    Iterator<String> wordListIterator = wordList.iterator();
    while(wordListIterator.hasNext()) {
        // sequences are: {word, x1, y1, x2, y2}  
        wordListIterator.next(); // skip the word
        int x1 = (int) Float.parseFloat(wordListIterator.next());
        int y1 = (int) Float.parseFloat(wordListIterator.next());
        int x2 = (int) Float.parseFloat(wordListIterator.next());
        int y2 = (int) Float.parseFloat(wordListIterator.next());
        wordBounds.add(new Rectangle(x1, y2, x2 - x1, y1 - y2)); // in page, not screen coordinates
    }
    return wordBounds;
}

然后确定MouseEvent属于哪个Rectangle：

/**
 * Finds the bounding Rectangle of a word located at a Point.
 * 
 * @param p Point to find word bounds
 * @param wordBounds List of word boundaries to search
 * @return A Rectangle that bounds a word and contains a point, or null if 
 *         there is no word located at the point
 */
private Rectangle findWordBoundsAtPoint(Point p, List<Rectangle> wordBounds) {
    Rectangle result = null;
    for (Rectangle wordBound : wordBounds) {
        if (wordBound.contains(p)) {
            result = wordBound;
            break;
        }
    }
    return result;
}

由于某些原因，仅仅将此矩形传递给突出显示方法并不起作用。在进行了一些修补之后，我发现将Rectangle每一端缩小一个点就解决了问题：

/**
 * Contracts a Rectangle to enable it to be highlighted.
 * 
 * @return A contracted Highlight Rectangle
 */
private Rectangle contractHighlight(Rectangle highlight){
    int x = highlight.x + 1;
    int y = highlight.y + 1;
    int width = highlight.width -2;
    int height = highlight.height - 2;
    return new Rectangle(x, y, width, height);
}

然后我把它传递给这个方法来增加亮点：

/**
 * Highlights text on the document
 */
private void highlightText(Rectangle highlightRectangle) {
    pdfDecoder.getTextLines().addHighlights(new Rectangle[]{highlightRectangle}, false, currentPage);
    pdfDecoder.repaint();
}

最后，上面的所有调用都被打包到这个方便的方法中：

/**
 * Highlights the word at the given point.
 * 
 * @param p Point where word is located
 */
private void highlightWordAtPoint(Point p) {
    try {
        Rectangle wordBounds = findWordBoundsAtPoint(p, parseWordBounds(scanForWords(p)));
        if (wordBounds != null) {
            highlightText(contractHighlight(wordBounds));
        }
    } catch (PdfException e) {
        // TODO Auto-generated catch block
        e.printStackTrace();
    }
}

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/12595703

复制

相似问题

问在JPedal中的某一点突出显示word
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在JPedal中的某一点突出显示wordEN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问在JPedal中的某一点突出显示word
EN