我从这里尝试了这个教程:https://www.youtube.com/watch?v=XQgXKtPSzUI&list=WL&index=93
这就是我试图摘录的一篇文章的脚本:

from urllib.request import urlopen as uReq
from bs4 import BeautifulSoup as soup
my_url = 'https://steemit.com/test/@bitcoinfree/test-4'
uClient = uReq(my_url)
page_html = uClient.read()
uClient.close()
page_soup = soup(page_html,'html.parser')
print(page_soup.prettify("utf-8"))目前,该代码输出的是胡言乱语。
我不知道如何获得纯html源代码。我做错了什么?
发布于 2017-08-02 21:15:35
明白了。
import requests
from bs4 import BeautifulSoup
url = 'https://steemit.com/test/@bitcoinfree/test-4'
r = requests.get(url)
soup = BeautifulSoup(r.content, "html.parser")
print(soup.prettify())https://stackoverflow.com/questions/45461385
复制相似问题