我正在尝试从<select>和<option>元素中获取一些元素。
这是我的代码:
for(int td = 1; td <= 1; td++)
{
using (WebClient wc = new WebClient())
{
string pagina = wc.DownloadString("http://www.serebii.net/attackdex-xy/");
HtmlDocument doc = new HtmlDocument();
doc.LoadHtml(pagina);
string attacco;
//var prova = doc.DocumentNode.SelectSingleNode("/html/body/table[2]/tr[2]/td[2]/div[2]/table/tr/td[1]/form/select/option");
foreach (HtmlNode node in doc.DocumentNode.SelectNodes(String.Format("/html/body/table[2]/tr[2]/td[2]/font/div[2]/table/tr/td[{0}]/select/option", td)))
{
attacco = node.NextSibling.InnerText;
if(attacco != "AttackDex: A - G\n" && attacco != "AttackDex: H - R\n" && attacco != "AttackDex: S - Z\n")
{
var url = string.Format("http://www.serebii.net/attackdex-xy/{0}.shtml", attacco.ToLower().Replace(" ", ""));
string attackPage = wc.DownloadString(url);
HtmlDocument doc2 = new HtmlDocument();
doc2.LoadHtml(attackPage);
var category = doc.DocumentNode.SelectSingleNode("/html/body/table[2]/tr[2]/td[2]");
}
}
}
}这是html页面的代码:
<select name="SelectURL" onchange="document.location.href=document.nav.SelectURL.options[document.nav.SelectURL.selectedIndex].value" style="color:#383838; font-size: 8pt; background:#CEBC77" size="1">
<option>AttackDex: A - G
</option><option value="/attackdex-xy/absorb.shtml">Absorb</option>
<option value="/attackdex-xy/acid.shtml">Acid</option>
<option value="/attackdex-xy/acidarmor.shtml">Acid Armor</option>
<option value="/attackdex-xy/acidspray.shtml">Acid Spray</option>
<option value="/attackdex-xy/acrobatics.shtml">Acrobatics</option>
<option value="/attackdex-xy/acupressure.shtml">Acupressure</option>
<option value="/attackdex-xy/aerialace.shtml">Aerial Ace</option>
<option value="/attackdex-xy/aeroblast.shtml">Aeroblast</option>
[...]当我运行程序时,node的值(在foreach块中)是空的。
发布于 2015-05-12 05:58:20
我选择了文档中的所有选项,然后只过滤出那些拥有祖先select标签且名称为'SelectURL‘的选项,从而使其正常工作。
doc.DocumentNode.SelectNodes("//option[ancestor::select[@name='SelectURL']]")https://stackoverflow.com/questions/30177924
复制相似问题