
This depends heavily on domain knowledge. A general approach is to substitute one of the following for the null values:

<ol>
<li>A multiple of the worst or average consistency at each circuit $c$, i.e. $(1 + m)\text{max}(\sigma_c)$ or $(1 + m)\text{avg}(\sigma_c)$ respectively, for the null values at that circuit, or</li>
<li>A multiple of the worst or average consistency of each driver $d$, i.e. $(1 + m)\text{max}(\sigma_d)$ or $(1 + m)\text{avg}(\sigma_d)$ respectively, for that driver's unfinished races, or</li>
<li>A multiple of the mean of the driver's and the circuit's average consistencies, i.e. $(1 + m)[\text{avg}(\sigma_d) + \text{avg}(\sigma_c)]/2$, for an unfinished race of driver $d$ at circuit $c$, or some other combination.</li>
</ol>
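
The three options above could be sketched in pandas roughly as follows. This is a minimal illustration, assuming a DataFrame with one row per driver and one column per circuit, where NaN marks an unfinished race. The function name `penalize_nulls`, the `mode` labels, and the sample values are hypothetical, not part of the original question.

```python
import numpy as np
import pandas as pd

def penalize_nulls(sigma: pd.DataFrame, m: float = 0.1,
                   mode: str = "circuit_max") -> pd.DataFrame:
    """Fill NaN cells with (1 + m) times a reference consistency.

    `mode` (hypothetical labels) selects the reference:
    circuit_max / circuit_avg  -> option 1 (per column)
    driver_max / driver_avg    -> option 2 (per row)
    combined_avg               -> option 3 (mean of row and column averages)
    """
    ones = np.ones(sigma.shape)
    if mode == "circuit_max":
        ref = ones * sigma.max(axis=0).to_numpy()            # broadcast per column
    elif mode == "circuit_avg":
        ref = ones * sigma.mean(axis=0).to_numpy()
    elif mode == "driver_max":
        ref = ones * sigma.max(axis=1).to_numpy()[:, None]   # broadcast per row
    elif mode == "driver_avg":
        ref = ones * sigma.mean(axis=1).to_numpy()[:, None]
    elif mode == "combined_avg":
        driver_avg = sigma.mean(axis=1).to_numpy()[:, None]
        circuit_avg = sigma.mean(axis=0).to_numpy()[None, :]
        ref = (driver_avg + circuit_avg) / 2
    else:
        raise ValueError(f"unknown mode: {mode}")

    # pandas skips NaN in max/mean by default, so the references are
    # computed from finished races only.
    penalty = pd.DataFrame((1 + m) * ref, index=sigma.index, columns=sigma.columns)
    return sigma.fillna(penalty)

# Made-up lap-time standard deviations; NaN = did not finish.
sigma = pd.DataFrame(
    {"Monza": [0.30, 0.50, np.nan],
     "Spa": [0.40, np.nan, 0.60],
     "Suzuka": [0.20, 0.40, 0.80]},
    index=["driver_a", "driver_b", "driver_c"],
)

filled = penalize_nulls(sigma, m=0.1, mode="circuit_max")
season_consistency = filled.mean(axis=1)  # lower = more consistent
```

Note that a driver with no finished races at all would have no per-driver reference, so the driver-based modes would leave that row as NaN; such cases need a separate rule.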

Whichever approach you choose, the coefficient $m$ affects the final ranking. It can be determined either

<ol>
<li>Subjectively, by looking at the rankings from an expert's point of view and selecting the value whose ranking makes the most sense, or</li>
<li>By trying a range of values such as $m \in \{-0.2, -0.1, 0, 0.1, 0.2, \ldots, 0.5\}$ and averaging the consistencies $\sigma_d$ or rankings $R_d$ for each driver $d$. An advantage of this approach is that it exposes sensitivity: when a driver's rank has low variance across values of $m$, the rank is insensitive to the choice of $m$ and is therefore less controversial; when the rank changes a lot with $m$, the average rank is more controversial.</li>
</ol>
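
The second option can be sketched as a small sensitivity analysis. The example below is only an illustration: it assumes the circuit-maximum penalty $(1 + m)\text{max}(\sigma_c)$, assumes the elided values of $m$ continue in steps of 0.1, and uses a hypothetical function name (`rank_sensitivity`) and made-up data.

```python
import numpy as np
import pandas as pd

def rank_sensitivity(sigma: pd.DataFrame, m_values) -> pd.DataFrame:
    """Rank drivers for each m, then summarise each driver's rank across m."""
    ranks = {}
    for m in m_values:
        # Circuit-max penalty: fill each column's NaN with (1 + m) * column max.
        filled = sigma.fillna((1 + m) * sigma.max())
        season = filled.mean(axis=1)            # season consistency per driver
        ranks[m] = season.rank(method="min")    # rank 1 = most consistent
    ranks = pd.DataFrame(ranks)
    return pd.DataFrame({
        "mean_rank": ranks.mean(axis=1),  # average rank R_d over all m
        "rank_var": ranks.var(axis=1),    # low variance = insensitive to m
    })

# Made-up lap-time standard deviations; NaN = did not finish.
sigma = pd.DataFrame(
    {"Monza": [0.30, 0.50, np.nan, 0.45],
     "Spa": [0.40, np.nan, 0.60, 0.55],
     "Suzuka": [0.20, 0.40, 0.80, 0.53]},
    index=["driver_a", "driver_b", "driver_c", "driver_d"],
)

# Assumed grid: the original list is abbreviated; steps of 0.1 are a guess.
m_values = [-0.2, -0.1, 0.0, 0.1, 0.2, 0.3, 0.4, 0.5]
summary = rank_sensitivity(sigma, m_values)
print(summary.sort_values("mean_rank"))
```

In this toy data `driver_a` and `driver_c` keep the same rank for every $m$ (zero variance), while `driver_b` and `driver_d` swap places once the penalty grows, so their average ranks are the contestable ones.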

I have to calculate the consistency of racing drivers over a whole season. My DataFrame has 10 columns (one per circuit), and each column holds the standard deviation of the lap times the driver posted at that circuit, in other words how consistent the driver is from lap to lap. For races the driver did not finish, the field is blank.

So far I have calculated each driver's average season consistency by averaging all 10 columns. However, not finishing a race should affect a driver's consistency negatively, and I do not know how to implement that.

How to penalize for empty fields in a DataFrame?


