blocks|key|1852973|text|实际上，您可以将度量标准提供给任何收件人:实现自己的MetricsSink并配置hadoop来使用它。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|1852974|或者，您可以使用已经与Hadoop发行版捆绑在一起的MetricsSink，如GraphiteSink，并在Graphite中获取度量标准。|1852975|注意，在作业完成之前，某些计数器是不可用的(成功与否)。|1852976|另外，选项2也有可能使HistoryServer陷入麻烦(当您使用大量映射器进行投票以获得一份工作时，它可能是OOM)。|1852977|entityMap|0|LINK|mutability|MUTABLE|url|https://hadoop.apache.org/docs/r2.7.0/api/org/apache/hadoop/metrics2/MetricsSink.html|1|https://hadoop.apache.org/docs/r2.7.0/api/org/apache/hadoop/metrics2/sink/GraphiteSink.html^0|Q|B|0|0|13|C|1|0|0|0^^$0|@$1|2|3|4|5|6|7|T|8|@]|9|@$A|U|B|V|1|W]]|C|$]]|$1|D|3|E|5|6|7|X|8|@]|9|@$A|Y|B|Z|1|10]]|C|$]]|$1|F|3|G|5|6|7|11|8|@]|9|@]|C|$]]|$1|H|3|I|5|6|7|12|8|@]|9|@]|C|$]]|$1|J|3|-4|5|6|7|13|8|@]|9|@]|C|$]]]|K|$L|$5|M|N|O|C|$P|Q]]|R|$5|M|N|O|C|$P|S]]]]

You can feed your metrics to whatever recipient actually: implement your own <a href="https://hadoop.apache.org/docs/r2.7.0/api/org/apache/hadoop/metrics2/MetricsSink.html" rel="nofollow">MetricsSink</a> and configure hadoop to use it.

Or you can use a MetricsSink already bundled with Hadoop distro, like <a href="https://hadoop.apache.org/docs/r2.7.0/api/org/apache/hadoop/metrics2/sink/GraphiteSink.html" rel="nofollow">GraphiteSink</a> and get your metrics in Graphite.

Note that some counters are not available until the job has finished (successfully or not).

Also, option 2 is also a risk to get HistoryServer into trouble (when you poll for a job with a jillion of mappers, it might OOM).

I'm looking for a way of gathering all the counters and metrics of individual hadoop jobs in an event-driven way to store all this data within elasticsearch for later troubleshooting and analysis.

Currently I found few methods which could have seemed to fit the requirements:

<ol>
<li>Using metric exporters, especially, <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/monitor/ContainerMetrics.java" rel="nofollow">ContainerMetrics</a> that allows to obtain per-container memory and cpu usage and <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/metrics/MRAppMetrics.java" rel="nofollow">MRAppMetrics</a>, but that one aggregates all the metrics for all the jobs.</li>
<li>Polling MR History Server with its <a href="https://hadoop.apache.org/docs/r2.7.1/hadoop-mapreduce-client/hadoop-mapreduce-client-hs/HistoryServerRest.html" rel="nofollow">REST API</a> that is pretty straightforward, but requires a lot of HTTP calls to gather all the counters for jobs, tasks and their attempts.</li>
<li>Plugging an additional custom <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/event/EventHandler.java" rel="nofollow">EventHandler</a> into <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java" rel="nofollow">MRAppMaster</a>'s event <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java#L194" rel="nofollow">dispatcher</a>, but <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/MRAppMaster.java" rel="nofollow">MRAppMaster</a> does not have corresponding mechanisms to register custom event handlers.</li>
<li>Using black magic of javaagents (java instrumentation api), bytecode modifications and aop-like functionality to intercept all the executions of <a href="https://github.com/apache/hadoop/blob/release-2.7.1/hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/src/main/java/org/apache/hadoop/yarn/event/EventHandler.java#L34" rel="nofollow">EventHandler#handle(T)</a> method. That way should be able to solve all the requirements, but needs additional configuration of MR-jobs, javaagent development and registration and generally seems to be pretty complex.</li>
</ol>

So, I'd like to ask whether there are any more simple ways to collect metrics and counters of individual hadoop jobs?

Gathering counters and metrics of individual hadoop jobs

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

教程

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云智能顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

EdgeOne AI 安全实战专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云AI代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

功能1上新10个字符

功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符。

功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符。

功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符

功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符

功能4上新

文章&问答评论现已支持表情

全新交互，全新视觉，新增快捷键、悬浮工具栏、高亮块等功能并同时优化现有功能，全面提升创作效率和体验

社区富文本编辑器全新改版！诚邀体验～ 

精选全网热门MCP server，让你的AI更好用 🚀

💥开发者 MCP广场重磅上线！

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

聚焦“写作效率、视觉美观与运行性能”三方面进行全面升级，为您提供更高效、稳定的创作环境

社区富文本&Markdown编辑器全新改版上线，欢迎大家体验!

诚挚邀请您参与本次调研，分享您的真实使用感受与建议。您的反馈至关重要，感谢您的支持与参与！

社区新版编辑器体验调研

我正在寻找一种以事件驱动的方式收集单个hadoop作业的所有计数器和度量的方法，以便在elasticsearch中存储所有这些数据，以便以后进行故障排除和分析。目前，我发现了一些似乎符合要求的方法：使用度量导出器，特别是允许获取每个容器内存和cpu使用量以及的，但它可以聚合所有作业的所有指标。使用历史服务器的进行轮询非常简单，但需要大量的HTTP调用来收集作业、任务及其尝试的所有计数器。将额外的自

问收集单个hadoop作业的计数器和度量
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问收集单个hadoop作业的计数器和度量EN