
Nice question! No standard distance on R^n (Euclidean, Manhattan, or Minkowski in general) applied to those time series can achieve the result you want, because those metrics are invariant when the same permutation is applied to the coordinates of both vectors in R^n, whereas time is strictly ordered, and that ordering is exactly the phenomenon you want to capture.
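To make the permutation point concrete, here is a minimal sketch with made-up random data (the series `a` and `b` and the seed are assumptions, not taken from the question). It shuffles the time axis of both series in the same way and compares the Euclidean distance before and after.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two hypothetical series of equal length (illustrative data only).
a = rng.normal(size=50)
b = rng.normal(size=50)

# Apply the same random reordering of the time axis to both series.
perm = rng.permutation(50)

d_original = np.linalg.norm(a - b)
d_shuffled = np.linalg.norm(a[perm] - b[perm])

# The two values agree up to floating-point rounding: the metric never
# looks at the order of the time steps, only at the pointwise differences.
print(d_original, d_shuffled)
```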

A simple trick that does what you ask is to use the cumulated version of each time series (the running sum of its values over time) and then apply a standard metric. With the Manhattan metric, the distance between two time series becomes the area between their cumulated versions.
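A minimal sketch of the cumulated-series trick, assuming equal-length series sampled at the same time steps. The helper names `cumulative_manhattan` and `pulse` and the pulse data are hypothetical, chosen only to show the effect.

```python
import numpy as np

def cumulative_manhattan(x, y):
    """Manhattan (L1) distance between the running sums of two equal-length series."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return float(np.abs(np.cumsum(x) - np.cumsum(y)).sum())

def pulse(position, length=20):
    """Hypothetical test series: zeros with a single unit pulse at `position`."""
    s = np.zeros(length)
    s[position] = 1.0
    return s

sx = pulse(10)
near = pulse(11)  # pulse one step later than sx
far = pulse(17)   # pulse seven steps later than sx

# The plain Manhattan distance cannot tell the two candidates apart.
plain_near = np.abs(sx - near).sum()
plain_far = np.abs(sx - far).sum()

# On the cumulated series, the distance grows with the time shift.
cum_near = cumulative_manhattan(sx, near)
cum_far = cumulative_manhattan(sx, far)

print(plain_near, plain_far, cum_near, cum_far)
```

In this toy case the plain distance is the same for both candidates, while the cumulated version ranks the nearby pulse as closer, because the gap between the two running sums stays open for as long as the pulses are apart.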


Another approach is to use <a href="https://en.wikipedia.org/wiki/Dynamic_time_warping" rel="nofollow noreferrer">DTW</a> (dynamic time warping), an algorithm that measures the similarity between two temporal sequences. Full disclosure: I wrote a Python package for this purpose called <code>trendypy</code>, which you can install via pip (<code>pip install trendypy</code>). <a href="http://www.doganaskan.com/trendypy/source/seeinaction.html" rel="nofollow noreferrer">Here</a> is a demo of how to use the package. You are basically computing the total minimum distance for different combinations to set the cluster centers.
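For readers who want to see what DTW computes, here is a plain dynamic-programming sketch. It is not the <code>trendypy</code> API; the function name `dtw_distance` and the absolute-difference step cost are assumptions made for illustration.

```python
import numpy as np

def dtw_distance(x, y):
    """Classic O(n*m) dynamic time warping cost with |x_i - y_j| as the step cost."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, m = len(x), len(y)

    # cost[i, j] = minimal accumulated cost of aligning x[:i] with y[:j].
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(x[i - 1] - y[j - 1])
            # Extend the cheapest of the three allowed predecessor alignments.
            cost[i, j] = step + min(cost[i - 1, j],      # repeat y[j-1]
                                    cost[i, j - 1],      # repeat x[i-1]
                                    cost[i - 1, j - 1])  # advance both
    return float(cost[n, m])

print(dtw_distance([0, 0, 1, 2], [0, 1, 2, 2]))
print(dtw_distance([0, 0, 0], [1, 1, 1]))
```

Note that the first pair differs only by a one-step shift and gets a cost of zero: DTW is designed to absorb time shifts, which may or may not be what you want when the timing itself should separate clusters.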


What about using the standard <a href="https://docs.scipy.org/doc/scipy-0.14.0/reference/generated/scipy.stats.pearsonr.html" rel="nofollow noreferrer">Pearson correlation coefficient</a>? You can then assign the new point to the cluster whose centroid has the highest coefficient.

<code>correlation, p_value = scipy.stats.pearsonr(&lt;new time series&gt;, &lt;centroid&gt;)</code>
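A small sketch of the assignment step, assuming all series have the same length. The helper `assign_by_correlation` and the three toy centroids are hypothetical; only `scipy.stats.pearsonr`, which returns the coefficient together with a p-value, comes from the answer.

```python
import numpy as np
from scipy.stats import pearsonr

def assign_by_correlation(new_series, centroids):
    """Return the index of the centroid with the highest Pearson r, plus all coefficients."""
    coefficients = []
    for centroid in centroids:
        r, _p_value = pearsonr(new_series, centroid)
        coefficients.append(r)
    return int(np.argmax(coefficients)), coefficients

# Hypothetical centroids: rising ramp, falling ramp, one sine period.
t = np.linspace(0.0, 1.0, 50)
centroids = [t, 1.0 - t, np.sin(2 * np.pi * t)]

# A new series that is a scaled and offset copy of the rising ramp.
sx = 2.0 * t + 0.5

best, coefficients = assign_by_correlation(sx, centroids)
print(best, coefficients)
```

Keep in mind that Pearson correlation ignores scale and offset, so two series with the same shape but very different magnitudes end up in the same cluster.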


Pietro P's answer is just a special case of applying a convolution to your time series.
If I gave the kernel:
<pre><code>[1,1,...,1,1,1,0,0,0,0,...0,0]
</code></pre>
I would get a cumulative series.
Adding a convolution works because it gives each data point information about its neighbours, so the result is now order dependent.
It might be interesting to try a Gaussian convolution or other kernels.
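A minimal NumPy sketch of both ideas. The example series, the kernel half-width, and `sigma` are arbitrary assumptions. With a kernel of ones as long as the series, the first `n` outputs of the full convolution are exactly the running sum.

```python
import numpy as np

# Hypothetical series (illustrative data only).
series = np.array([0.0, 1.0, 0.0, 2.0, 0.0, 0.0, 3.0, 0.0, 1.0, 0.0])
n = len(series)

# Kernel of n ones: truncating the full convolution to n samples gives the
# cumulative series, the same result as np.cumsum(series).
cumulative = np.convolve(series, np.ones(n))[:n]

# Gaussian kernel, normalized to sum to 1 (sigma and width are assumptions).
sigma = 1.5
offsets = np.arange(-3, 4)
gaussian = np.exp(-0.5 * (offsets / sigma) ** 2)
gaussian /= gaussian.sum()

# mode="same" keeps the output aligned with, and as long as, the input
# (the kernel here is shorter than the series).
smoothed = np.convolve(series, gaussian, mode="same")

print(cumulative)
print(smoothed)
```

Any standard metric can then be applied to the convolved series instead of the raw ones. The kernel width controls how far apart in time two events can be and still count as similar.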

To cluster a set of time series, I'm looking for a smart distance metric.
I've tried some well-known metrics, but none of them fits my case.

Example: let's assume that my clustering algorithm extracts these three centroids [s1, s2, s3]:
<a href="https://i.stack.imgur.com/07OJY.png" rel="noreferrer"><img src="https://i.stack.imgur.com/07OJY.png" alt="enter image description here"></a>

I want to put this new example [sx] in the most similar cluster:

<a href="https://i.stack.imgur.com/owugh.png" rel="noreferrer"><img src="https://i.stack.imgur.com/owugh.png" alt="enter image description here"></a>

The most similar centroid is the second one, so I need to find a distance function d that gives me <code>d(sx, s2) &lt; d(sx, s1)</code> and <code>d(sx, s2) &lt; d(sx, s3)</code>.

edit

Here are the results with the metrics [cosine, Euclidean, Minkowski, dynamic time warping]:
<a href="https://i.stack.imgur.com/OR6so.png" rel="noreferrer"><img src="https://i.stack.imgur.com/OR6so.png" alt="enter image description here"></a>

edit 2

User Pietro P suggested applying the distances to the cumulated version of the time series.
The solution works; here are the plots and the metrics:
<a href="https://i.stack.imgur.com/ivhyt.png" rel="noreferrer"><img src="https://i.stack.imgur.com/ivhyt.png" alt="enter image description here"></a>

Time series distance metric
