blocks|key|994271|text|我认为分层聚类是一个很好的选择。看看这里，聚类算法|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|994272|entityMap|0|LINK|mutability|MUTABLE|url|http://home.deib.polimi.it/matteucc/Clustering/tutorial_html/^0|L|4|0|0^^$0|@$1|2|3|4|5|6|7|L|8|@]|9|@$A|M|B|N|1|O]]|C|$]]|$1|D|3|-4|5|6|7|P|8|@]|9|@]|C|$]]]|E|$F|$5|G|H|I|C|$J|K]]]]

I think hierarchical clustering is a good choice. Have a look here <a href="http://home.deib.polimi.it/matteucc/Clustering/tutorial_html/" rel="nofollow">Clustering Algorithms</a>

blocks|key|103427|text|比较简单的聚类方法是通过kmeans算法进行聚类。如果您的所有属性都是数字属性，那么这是最简单的集群方法。即使它们不是，你也必须找到一种距离度量来衡量毛毛虫属性或名词属性，但是kmeans仍然是一个不错的选择。Kmeans是一种分区聚类算法.在这种情况下，我不会使用分层聚类。但这也取决于你想做什么。您需要评估是否要在集群中找到集群，或者它们必须完全分开，而不是相互包含在一起。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|103428|保重。|103429|entityMap^0|0|0^^$0|@$1|2|3|4|5|6|7|F|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|G|8|@]|9|@]|A|$]]|$1|D|3|-4|5|6|7|H|8|@]|9|@]|A|$]]]|E|$]]

The more simple way to do clustering is by kmeans algorithm. If all of your attributes are numerical, then this is the easiest way of doing the clustering. Even if they are not, you would have to find a distance measure for caterogical or nominal attributes, but still kmeans is a good choice. Kmeans is a partitional clustering algorithm... i wouldn't use hierarchical clustering for this case. But that also depends on what you want to do. you need to evaluate if you want to find clusters within clusters or they all have to be totally apart from each other and not included on each other. 

Take care.

blocks|key|2036516|text|1)首先，尝试使用k-方法。如果这能满足你的要求，那就是了。播放不同数量的集群(由参数k控制)。k-means有许多实现，如果您有良好的编程技能，您可以实现自己的版本。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|2036517|如果数据看起来像圆形/球形的话，K-的意思通常是很好的。这意味着数据中存在一些高斯性(数据来自于高斯分布)。|2036518|2)如果k手段不能满足你的期望，那么是时候多读书多想了。然后我建议阅读一份好的调查报告。最常见的技术是用几种编程语言和数据挖掘框架实现的，其中许多技术可以免费下载和使用。|offset|length|2036519|3)如果应用先进的聚类技术还不够，那么就应该设计一种新的聚类技术。然后你可以自己思考，也可以与机器学习专家联系。|2036520|entityMap|0|LINK|mutability|MUTABLE|url|http://dl.acm.org/citation.cfm?id=331504^0|0|0|Z|8|0|0|0^^$0|@$1|2|3|4|5|6|7|R|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|S|8|@]|9|@]|A|$]]|$1|D|3|E|5|6|7|T|8|@]|9|@$F|U|G|V|1|W]]|A|$]]|$1|H|3|I|5|6|7|X|8|@]|9|@]|A|$]]|$1|J|3|-4|5|6|7|Y|8|@]|9|@]|A|$]]]|K|$L|$5|M|N|O|A|$P|Q]]]]

1) First, try with k-means. If that fulfills your demand that's it. Play with different number of clusters (controlled by parameter k). There are a number of implementations of k-means and you can implement your own version if you have good programming skills.

K-means generally works well if data looks like a circular/spherical shape. This means that there is some Gaussianity in the data (data comes from a Gaussian distribution).

2) if k-means doesn't fulfill your expectations, it is time to read and think more. Then I suggest reading <a href="http://dl.acm.org/citation.cfm?id=331504" rel="nofollow noreferrer">a good survey paper</a>. the most common techniques are implemented in several programming languages and data mining frameworks, many of them are free to download and use.

3) if applying state-of-the-art clustering techniques is not enough, it is time to design a new technique. Then you can think by yourself or associate with a machine learning expert.

blocks|key|103477|text|由于大多数数据是连续的，并且合理地假定能源消耗和发电是正态分布的，所以我会使用统计方法进行聚类。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|103478|例如：|103479|高斯混合模型|unordered-list-item|offset|length|103480|贝叶斯层次聚类|103481|这些方法相对于基于度量的聚类算法(例如k均值)的优点是，我们可以利用我们正在处理的平均值这一事实，并对计算这些平均值的分布进行假设。|style|BOLD|103482|entityMap|0|LINK|mutability|MUTABLE|url|http://scikit-learn.org/stable/modules/mixture.html|1|http://www2.stat.duke.edu/~kheller/bhcnew.pdf^0|0|0|0|6|0|0|0|7|1|0|15|3|0^^$0|@$1|2|3|4|5|6|7|Y|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|Z|8|@]|9|@]|A|$]]|$1|D|3|E|5|F|7|10|8|@]|9|@$G|11|H|12|1|13]]|A|$]]|$1|I|3|J|5|F|7|14|8|@]|9|@$G|15|H|16|1|17]]|A|$]]|$1|K|3|L|5|6|7|18|8|@$G|19|H|1A|M|N]]|9|@]|A|$]]|$1|O|3|-4|5|6|7|1B|8|@]|9|@]|A|$]]]|P|$Q|$5|R|S|T|A|$U|V]]|W|$5|R|S|T|A|$U|X]]]]

Since most of your data is continuous, and it reasonable to assume that energy consumption and generation are normally distributed, I would use statistical methods for clustering.

Such as:

<ul>
<li><a href="http://scikit-learn.org/stable/modules/mixture.html" rel="nofollow noreferrer">Gaussian Mixture Models</a></li>
<li><a href="http://www2.stat.duke.edu/~kheller/bhcnew.pdf" rel="nofollow noreferrer">Bayesian Hierarchical Clustering</a> </li>
</ul>

The advantage of these methods over metric-based clustering algorithms (e.g. k-means) is that we can take advantage of the fact that we are dealing with averages, and we can make assumptions on the distributions from which those average were calculated.

I have a data set which consists of data points having attributes like:

<ul>
<li>average daily consumption of energy</li>
<li>average daily generation of energy</li>
<li>type of energy source</li>
<li>average daily energy fed in to grid</li>
<li>daily energy tariff</li>
</ul>

I am new to clustering techniques.

So my question is which clustering algorithm will be best for such kind of data to form clusters ?

Clustering Algorithm for average energy measurements

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

教程

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云智能顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云AI代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

功能1上新10个字符

功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符。

功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符。

功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符

功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符

功能4上新

文章&问答评论现已支持表情

全新交互，全新视觉，新增快捷键、悬浮工具栏、高亮块等功能并同时优化现有功能，全面提升创作效率和体验

社区富文本编辑器全新改版！诚邀体验～ 

精选全网热门MCP server，让你的AI更好用 🚀

💥开发者 MCP广场重磅上线！

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

聚焦“写作效率、视觉美观与运行性能”三方面进行全面升级，为您提供更高效、稳定的创作环境

社区富文本&Markdown编辑器全新改版上线，欢迎大家体验!

诚挚邀请您参与本次调研，分享您的真实使用感受与建议。您的反馈至关重要，感谢您的支持与参与！

社区新版编辑器体验调研

我有一个数据集，它由具有如下属性的数据点组成：平均日能耗平均日发电量能源类型给电网的平均日能量日能源电价我对聚类技术很陌生。那么，我的问题是，哪种聚类算法最适合于这样的数据形成集群？

问平均能量测量的聚类算法
EN

回答 4

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问平均能量测量的聚类算法EN

回答 4

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问平均能量测量的聚类算法
EN