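I have a script that uses cURL to fetch a page from a site. I'm trying to cut the page up row by row, extract what I need from each row, and then delete that row. The problem is that it sometimes gives me the correct position of an item and sometimes doesn't, depending on the row; it alternates.
One of the pages I'm trying this on: http://www.gametracker.com/search/
The code I'm using is: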
while (strpos($contents, '<tr>'))
{
    $start = strpos($contents, '<tr>');
    $end = strpos($contents, '</tr>', $start) + 5;
    $rstring = substr($contents, $start, $end - $start);
    $contents = str_replace($rstring, '', $contents);
    // name
    $start = strpos($rstring, '[clantag]');
    $end = strpos($rstring, '</a>', $start);
    $name = trim(substr($rstring, $start, $end - $start));
    echo $name . '<br/>';
    // players
    $search = '<td class="c05 item_bgcolor_even">';
    $start = (strpos($rstring, $search) + strlen($search));
    $end = strpos($rstring, "</td>", $start);
    $players = trim(substr($rstring, $start, ($end - $start)));
    echo $players . '<br />';
    $map;
    $curPlayers;
    $maxPlayers;
    $ip;
}
It gets the server name correctly, but when it comes to the players, on one row I get them and on the next row I get lor_odd"> 1230.
Posted on 2011-06-19 00:23:38
With a DOM parser, things get much simpler:
libxml_use_internal_errors(true);
$dom = new DOMDocument;
$dom->loadHTMLFile('http://www.gametracker.com/search/');
$xpath = new DOMXPath($dom);
// Get the rows from the search list
$rows = $xpath->query('//table[@class="table_lst table_lst_srs"]/tr');
for ($i=1; $i<$rows->length-1; $i++) {
    $row = $rows->item($i);
    // Get the columns for a row
    $cols = $row->getElementsByTagName('td');
    // Get the server name (3rd column)
    echo 'Name: '.trim($cols->item(2)->textContent).PHP_EOL;
    // Get the player count (5th column)
    echo 'Players: '.trim($cols->item(4)->textContent).PHP_EOL.PHP_EOL;
}
Output:
Name: NoobonicPlague.com | 24/7 RP, FastDL, Custom Mods
Players: 55/65
Name: RUS Coop-20 Semantic #1 (2.0.7.0)
Players: 20/20
Name: TAW.net -(Dallas)
Players: 32/32
Name: Dedicated
Players: 0/12
...
Posted on 2011-06-19 00:10:23
You've been writing the "wrong" kind of code for this sort of processing. It is hard to understand and maintain, let alone debug.
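To see why this style is hard to debug, here is a minimal sketch of what probably happens in the question. It assumes that rows alternate between the classes `item_bgcolor_even` and `item_bgcolor_odd` (inferred from the `lor_odd">` fragment) and uses made-up sample rows rather than the real page. On a row without the searched class, `strpos()` returns `false`, which counts as 0 in the addition, so the slice starts at `strlen($search)` and lands in the middle of the markup. `naivePlayers()` and `safePlayers()` are hypothetical helper names.

```php
<?php
// Hypothetical sample rows: the real GameTracker markup is assumed, not copied.
$rows = [
    '<tr><td class="c05 item_bgcolor_even"> 55/65 </td></tr>',
    '<tr><td class="c05 item_bgcolor_odd"> 12/30 </td></tr>',
];

$search = '<td class="c05 item_bgcolor_even">';

// Same arithmetic as in the question: when $search is missing, strpos()
// returns false, false is treated as 0, and the slice starts at strlen($search).
function naivePlayers(string $row, string $search): string
{
    $start = strpos($row, $search) + strlen($search);
    $end = strpos($row, '</td>', $start);
    return trim(substr($row, $start, $end - $start));
}

// Safer variant: accept either class suffix and report failure explicitly.
function safePlayers(string $row): ?string
{
    $pattern = '~<td class="c05 item_bgcolor_(?:even|odd)">(.*?)</td>~s';
    if (preg_match($pattern, $row, $m)) {
        return trim($m[1]);
    }
    return null; // cell not found in this row
}

foreach ($rows as $row) {
    // On the odd row the naive slice contains leftover markup.
    echo naivePlayers($row, $search), ' | ', safePlayers($row), PHP_EOL;
}
```

Comparing the result of `strpos()` against `false` before using it avoids this silent misread. The same applies to the `while (strpos(...))` loop condition, which also stops when `<tr>` is found at offset 0.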
If you're willing to give up manual HTML parsing in favor of XPath queries, your job will become much easier:
$doc = new DOMDocument();
$doc->loadHTML($contents); // $contents = the HTML that curl returns
$xpath = new DOMXPath($doc);
// You can now use as many XPath queries as you like; they look like this:
$nodes = $xpath->query(''); // your own query expression here
foreach ($nodes as $node) {
    $v = $node->nodeValue; // now do something with $v
}
https://stackoverflow.com/questions/6397205
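As one possible way to fill in the empty query above, the sketch below selects the player-count cells by their column class alone. The `c05` class is taken from the question; the sample HTML is a made-up stand-in for what cURL returns, so the real page may need a different expression.

```php
<?php
// Hypothetical stand-in for the HTML that cURL returns; the real page differs.
$contents = <<<HTML
<table>
  <tr><td class="c05 item_bgcolor_even"> 55/65 </td></tr>
  <tr><td class="c05 item_bgcolor_odd"> 12/30 </td></tr>
</table>
HTML;

// Collect libxml warnings internally instead of printing them.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
$doc->loadHTML($contents);
$xpath = new DOMXPath($doc);

// Match on the column class token only, so the even/odd suffix does not matter.
$query = '//td[contains(concat(" ", normalize-space(@class), " "), " c05 ")]';

$players = [];
foreach ($xpath->query($query) as $node) {
    $players[] = trim($node->nodeValue);
}

print_r($players);
```

Matching on a single class token keeps the query independent of the alternating row styling, which is the detail the string-based version tripped over.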