我需要为大学下载一些股票数据,并获得日期的NaN值。有人能帮忙吗?
header = {
"User-Agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/50.0.2661.75 Safari/537.36",
"X-Requested-With": "XMLHttpRequest"
}
def house_stock_trading():
url = 'https://www.quiverquant.com/sources/housetrading'
r = requests.get(url, headers=header)
df = pd.read_html(r.text)[0]
df.to_excel("data/house_stock_trading.xlsx" , index = False)表看起来不错,但我仍然可以得到日期的NaN值。有什么想法吗?
发布于 2022-02-02 18:13:25
尝试在flavor="html5lib"方法中指定.read_html()属性:
import requests
import pandas as pd
header = {
"User-Agent": "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/50.0.2661.75 Safari/537.36",
"X-Requested-With": "XMLHttpRequest",
}
def house_stock_trading():
url = "https://www.quiverquant.com/sources/housetrading"
r = requests.get(url, headers=header)
df = pd.read_html(
r.text,
flavor="html5lib",
)[0]
return df
print(house_stock_trading().head())指纹:
Stock * Date Disclosed Rep. Purchase / Sale Amount District
0 SQ 2/1/2022 Donald Sternoff Beyer Jr. Sale $1,001-$15,000 VA08
1 PEP 2/1/2022 Donald Sternoff Beyer Jr. Purchase $1,001-$15,000 VA08
2 PEP 2/1/2022 Donald Sternoff Beyer Jr. Purchase $1,001-$15,000 VA08
3 CYRX 2/1/2022 Donald Sternoff Beyer Jr. Sale $1,001-$15,000 VA08
4 BBH 2/1/2022 Donald Sternoff Beyer Jr. Sale $1,001-$15,000 VA08https://stackoverflow.com/questions/70960338
复制相似问题