谁能告诉我如何在windows和Netbeans上使用boilerpipe?如果你能给我一些开始的java代码,我将不胜感激。
发布于 2012-04-09 20:20:37
试着看看他们的Wiki和QuickStart。下面是示例代码...
public static void main(final String[] args) throws Exception {
URL url;
url = new URL("http://www.example.com/some-location/index.html");
// NOTE We ignore HTTP-based character encoding in this demo...
final InputStream urlStream = url.openStream();
final InputSource is = new InputSource(urlStream);
final BoilerpipeSAXInput in = new BoilerpipeSAXInput(is);
final TextDocument doc = in.getTextDocument();
urlStream.close();
// You have the choice between different Extractors
// System.out.println(DefaultExtractor.INSTANCE.getText(doc));
System.out.println(ArticleExtractor.INSTANCE.getText(doc));
}https://stackoverflow.com/questions/10072902
复制相似问题