运行我的脚本已经意识到需要存储或检索For是一个常见的变量,以后将用于其他目的。如果有人知道如何将结果print(link.get('href'))存储在一个公共变量中。
代码:
import requests
from bs4 import BeautifulSoup
Url='https://br.answers.search.yahoo.com/search?p=porque+escravizam+os+indios&guccounter=1&guce_referrer=aHR0cHM6Ly9ici5hbnN3ZXJzLnlhaG9vLmNvbS8&guce_referrer_sig=AQAAAJ667A0mUDEjLVx6pfks8w7FSsMqUwTnuRTXpDTkZoxPO9jjP-olDl37KqLbL0t-AG7H-7xs-rP4ddCp9NhDEAxBvSvSqmbx0L9EW2pLBKHJ1gyislcYuWHaHCijW1VunXB2Ih5o-47LSJRerSSxgWMosYe_rm6xWKQ3X2aJ_qPu'
r=requests.get(Url)
soup=BeautifulSoup(r.content,'html.parser')
for link in soup.select('h3 > a'):
print(' \033[7;31;10mLINK \033[m')
***print(link.get('href'))***发布于 2019-12-01 15:29:28
不确定“它必须在print命令中处于其原始状态”是什么意思。通常,它是一个常规属性,因此您可以随意访问它:
import requests
from bs4 import BeautifulSoup
Url='https://br.answers.search.yahoo.com/search?p=porque+escravizam+os+indios&guccounter=1&guce_referrer=aHR0cHM6Ly9ici5hbnN3ZXJzLnlhaG9vLmNvbS8&guce_referrer_sig=AQAAAJ667A0mUDEjLVx6pfks8w7FSsMqUwTnuRTXpDTkZoxPO9jjP-olDl37KqLbL0t-AG7H-7xs-rP4ddCp9NhDEAxBvSvSqmbx0L9EW2pLBKHJ1gyislcYuWHaHCijW1VunXB2Ih5o-47LSJRerSSxgWMosYe_rm6xWKQ3X2aJ_qPu'
r=requests.get(Url)
all_links = []
soup=BeautifulSoup(r.content,'html.parser')
for link in soup.select('h3 > a'):
print(' \033[7;31;10mLINK \033[m')
print(link.get('href'))
all_links.append(link.get('href'))https://stackoverflow.com/questions/59126797
复制相似问题