通过使用scikit学习中的K-方法,我将我的数据聚集到了3个集群中。我想要绘制一个线条图,它将有多个颜色,这取决于它所在的集群。当集群中没有数据时,我该如何弥补空白呢?
kmeans = KMeans(n_clusters=3)
kmeans.fit(nocutoff_diffdf_windowed2d)
y_kmeans = kmeans.predict(nocutoff_diffdf_windowed2d)
plt.scatter(nocutoff_diffdf_windowed2d[:, 0], nocutoff_diffdf_windowed2d[:, 1], c=y_kmeans)
centers = kmeans.cluster_centers_
plt.scatter(centers[:, 0], centers[:, 1], c='black', s=200, alpha=0.5);标绘线图
fig = go.Figure()
for i in range(0, 3):
df = X_nocutoff_diffdf_windowed[y_kmeans == i]
fig.add_trace(go.Scatter(
x=df.index,
y=df['total'],
))
# fig = px.line(X_nocutoff_diffdf_windowed, x=X_nocutoff_diffdf_windowed.index, y="total", title='linegraph of total over time', color=y_kmeans)
fig.show()


发布于 2022-04-16 03:46:47
基于这个链接:https://plotly.com/python/line-charts/#connect-data-gaps
如果您的NaN值不应该是群集值,默认情况下,它将不会连接间隙。我建议重采样().asfreq()来重采样看起来是日期时间索引(?)并插入NaNs,以确保缺口没有连接。
https://stackoverflow.com/questions/71890352
复制相似问题