我不能用xpath在特定的网站上打印//*@class=“足球主内容”,谢谢!
# -*- coding:UTF-8 -*-
from pyvirtualdisplay import Display
import sys
from bs4 import BeautifulSoup
from selenium import webdriver
display = Display(visible=0, size=(1024, 768))
display.start()
driver = webdriver.Firefox()
driver.get("url")
soup = BeautifulSoup(driver.page_source.encode('utf-8'),'html.parser')
lines = soup.find_elements_by_xpath('//*[@class="footballmaincontent"]/tr')
print lines
driver.close()
display.stop()发布于 2018-06-05 09:25:15
尝试下面的代码以获得所需的输出:
lines = ['; '.join([j.text for j in i.find_elements_by_xpath('./td/div | ./td/span/span | ./td/span[not(span)]') if j.text]) for i in driver.find_elements_by_xpath('//*[@class="footballmaincontent"]//tr[position()>2]') if i.text]
for line in lines:
print linehttps://stackoverflow.com/questions/50696073
复制相似问题