首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >从Scrubyt过渡到Nokogiri-写入XML或Hash?

从Scrubyt过渡到Nokogiri-写入XML或Hash?
EN

Stack Overflow用户
提问于 2009-08-19 06:22:32
回答 4查看 1.1K关注 0票数 1

我正在尝试将这一小段代码从scrubyt转换到nokogiri,并且在尝试将结果写入散列或xml时被卡住了。在scrubyt中,它看起来如下所示:

代码语言:javascript
复制
require 'rubygems'
  require 'scrubyt'

  result_data = Scrubyt::Extractor.define do
    fetch "http://www.amazon.com/gp/offer-listing/0061673730"
    results "//div[@class='resultsset']" do
       item "//tbody/tr" do
          condition "//div[@class = 'Condition']"
          price "//span[@class = 'price']"
          shipping "//span[@class = 'price_shipping']"
       end
   end
  end
  @description = result_data.to_xml
  return @description
end

使用nokogiri我可以解析出我想要的信息,但是似乎没有一种快速的方法来返回散列或xml文档中的项。这是我在nokogiri的所有东西。

代码语言:javascript
复制
require 'rubygems'
require 'nokogiri'
require 'open-uri'
doc = Nokogiri::HTML(open('http://www.amazon.com/gp/offer-listing/0061673730'))

doc.css('div.condition, span.price, span.price_shipping ').each do |item|
puts item.content
end 

如何将项目信息返回到xml或哈希?

EN

回答 4

Stack Overflow用户

回答已采纳

发布于 2009-08-19 08:26:23

您可以使用Builder来构建XML。

代码语言:javascript
复制
builder = Nokogiri::XML::Builder.new do |xml|
  xml.root {
    xml.items {
      doc.css('div.condition, span.price, span.price_shipping').each do |o|
        xml.item_content = o
      end
    }
  }
end

puts builder.to_xml
票数 4
EN

Stack Overflow用户

发布于 2009-08-19 23:43:44

弄清楚了..。

代码语言:javascript
复制
require 'rubygems'
require 'nokogiri'
require 'open-uri'

doc = Nokogiri::HTML(open('http://www.amazon.com/gp/offer-listing/0061673730'))
builder = Nokogiri::XML::Builder.new do |xml|
xml.root {
doc.xpath('//tbody[@class="result"]').each do |res|
    xml.result {
    res.css('span.price').each do |p|
      xml.price = p.content
    end
    res.css('span.price_shipping').each do |s|
      xml.ship = s.content
    end
    }
    end
}
end
puts builder.to_xml

结果:

代码语言:javascript
复制
<?xml version="1.0"?>
<root>
  <result>
    <price=>$6.09</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$6.48</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.12</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.31</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.52</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.52</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$11.53</price=>
  </result>
  <result>
    <price=>$7.56</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.61</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.61</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.95</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$7.95</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$8.59</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$8.99</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$10.05</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$10.32</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$10.32</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$10.55</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$10.56</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$11.42</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$11.59</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$11.90</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$11.95</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$12.07</price=>
    <ship=>+ $3.99</ship=>
  </result>
  <result>
    <price=>$12.35</price=>
    <ship=>+ $3.99</ship=>
  </result>
</root>
票数 1
EN

Stack Overflow用户

发布于 2009-08-19 22:15:05

谢谢!这正是我需要的。不过,我在正确循环时遇到了问题。

代码语言:javascript
复制
require 'rubygems'
require 'nokogiri'
require 'open-uri'
doc = Nokogiri::HTML(open('http://www.amazon.com/gp/offer-listing/0061673730'))
builder = Nokogiri::XML::Builder.new do |xml|
  xml.root {
    xml.item {
      doc.css('span.price').each do |o|
        xml.price = o
      doc.css('span.price_shipping').each do |o|

      end
      end
    }
  }
end

puts builder.to_xml

这将返回以下内容:

代码语言:javascript
复制
<?xml version="1.0"?>
<root>
  <item>
    <price=>&lt;span class="price"&gt;$6.09&lt;/span&gt;</price=>
    <price=>&lt;span class="price"&gt;$6.48&lt;/span&gt;</price=>
    <price=>&lt;span class="price"&gt;$11.95&lt;/span&gt;</price=>
    <ship=>&lt;span class="price_shipping"&gt;+ $3.99&lt;/span&gt;</ship=>
    <ship=>&lt;span class="price_shipping"&gt;+ $3.99&lt;/span&gt;</ship=>
    <ship=>&lt;span class="price_shipping"&gt;+ $3.99&lt;/span&gt;</ship=>

  </item>
</root>

我如何重写我的代码来返回类似下面这样的内容:

代码语言:javascript
复制
<?xml version="1.0"?>
<root>
  <item>
    <price=>&lt;span class="price"&gt;$6.09&lt;/span&gt;</price=>
    <ship=>&lt;span class="price_shipping"&gt;+ $3.99&lt;/span&gt;</ship=>
  </item>
  <item>
    <price=>&lt;span class="price"&gt;$6.48&lt;/span&gt;</price=>
    <ship=>&lt;span class="price_shipping"&gt;+ $3.99&lt;/span&gt;</ship=>
  </item>
  <item>
    <price=>&lt;span class="price"&gt;$11.95&lt;/span&gt;</price=>
    <ship=>&lt;span class="price_shipping"&gt;+ $3.99&lt;/span&gt;</ship=>
  </item>  
</root>
票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/1298080

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档