blocks|key|681922|text|让我们暂时假设您想要包含“corn”的元素的数量：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|681923|length(grep("corn",+dataset))
[1]+3|code-block|syntax|javascript|681924|在更好地理解R的基础知识之后，您可能想看看"tm“包。|681925|编辑:我知道这一次你想要任何-“玉米”，但在未来你可能想要“玉米”这个词。比尔·邓拉普在r-help上指出了一种更紧凑的grep模式，用于收集完整的单词：|681926|grep("\\<corn\\>",+dataset)|681927|entityMap^0|0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|O|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|P|8|@]|9|@]|A|$E|F]]|$1|G|3|H|5|6|7|Q|8|@]|9|@]|A|$]]|$1|I|3|J|5|6|7|R|8|@]|9|@]|A|$]]|$1|K|3|L|5|D|7|S|8|@]|9|@]|A|$E|F]]|$1|M|3|-4|5|6|7|T|8|@]|9|@]|A|$]]]|N|$]]

Let's for the moment assume you wanted the number of element containing "corn":

<pre><code>length(grep("corn", dataset))
[1] 3
</code></pre>

After you get the basics of R down better you may want to look at the "tm" package.

EDIT: I realize that this time around you wanted any-"corn" but in the future you might want to get word-"corn". Over on r-help Bill Dunlap pointed out a more compact grep pattern for gathering whole words:

<pre><code>grep("\\&lt;corn\\&gt;", dataset)
</code></pre>

blocks|key|4514761|text|另一种非常方便和直观的方法是使用stringr包的str_count函数：|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|4514762|library(stringr)
dataset+<-+c("corn",+"cornmeal",+"corn+on+the+cob",+"meal")

#+for+mere+occurences+of+the+pattern:
str_count(dataset,+"corn")
#+[1]+1+1+1+0

#+for+occurences+of+the+word+alone:
str_count(dataset,+"\\bcorn\\b")
#+[1]+1+0+1+0

#+summing+it+up
sum(str_count(dataset,+"corn"))
#+[1]+3|code-block|syntax|javascript|4514763|entityMap^0|G|7|P|9|0|0^^$0|@$1|2|3|4|5|6|7|M|8|@$9|N|A|O|B|C]|$9|P|A|Q|B|C]]|D|@]|E|$]]|$1|F|3|G|5|H|7|R|8|@]|D|@]|E|$I|J]]|$1|K|3|-4|5|6|7|S|8|@]|D|@]|E|$]]]|L|$]]

Another quite convenient and intuitive way to do it is to use the <code>str_count</code> function of the <code>stringr</code> package:

<pre><code>library(stringr)
dataset &lt;- c("corn", "cornmeal", "corn on the cob", "meal")

# for mere occurences of the pattern:
str_count(dataset, "corn")
# [1] 1 1 1 0

# for occurences of the word alone:
str_count(dataset, "\\bcorn\\b")
# [1] 1 0 1 0

# summing it up
sum(str_count(dataset, "corn"))
# [1] 3
</code></pre>

blocks|key|4796792|text|您还可以执行以下操作：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|4796793|length(dataset[which(dataset=="corn")])|code-block|syntax|javascript|4796794|entityMap^0|0|0^^$0|@$1|2|3|4|5|6|7|I|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|J|8|@]|9|@]|A|$E|F]]|$1|G|3|-4|5|6|7|K|8|@]|9|@]|A|$]]]|H|$]]

You can also do something like the following:

<pre><code>length(dataset[which(dataset=="corn")])
</code></pre>

blocks|key|681963|text|我只是用字符串除法来做，比如：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|681964|library(roperators)

dataset+<-+c("corn",+"cornmeal",+"corn+on+the+cob",+"meal")

#+for+each+vector+element:
dataset+%25s/%25+'corn'

#+for+everything:
sum(dataset+%25s/%25+'corn')+|code-block|syntax|javascript|681965|entityMap^0|0|0^^$0|@$1|2|3|4|5|6|7|I|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|J|8|@]|9|@]|A|$E|F]]|$1|G|3|-4|5|6|7|K|8|@]|9|@]|A|$]]]|H|$]]

I'd just do it with string division like:

<pre><code>library(roperators)

dataset &lt;- c("corn", "cornmeal", "corn on the cob", "meal")

# for each vector element:
dataset %s/% 'corn'

# for everything:
sum(dataset %s/% 'corn') 
</code></pre>

blocks|key|681984|text|您可以使用str_count包中的stringr函数来获取与给定字符向量匹配的关键字数量。|type|unstyled|depth|inlineStyleRanges|offset|length|style|BOLD|entityRanges|data|681985|str_count函数的模式参数接受可用于指定关键字的正则表达式。|681986|正则表达式语法非常灵活，允许匹配整个单词和字符模式。|681987|例如，下面的代码将计算字符串"corn“的所有出现次数，并返回3：|681988|sum(str_count(dataset,+regex("corn")))|code-block|syntax|javascript|681989|要匹配完整的单词，请使用：|681990|sum(str_count(dataset,+regex("\\bcorn\\b")))|681991|"\b"用于指定单词边界。使用str_count函数时，单词边界的默认定义包括撇号。因此，如果您的数据集包含字符串"corn's"，则将匹配它并将其包括在结果中。|681992|这是因为在默认情况下，撇号被视为单词边界。要防止对包含撇号的单词进行计数，请使用带有参数uword+=+T的regex函数。这将导致正则表达式引擎使用unicode+TR+29定义的单词边界。参见http://unicode.org/reports/tr29/tr29-4.html。此定义不将撇号视为单词边界。|681993|下面的代码将给出单词"corn“出现的次数。诸如“玉米的”之类的词将不会被包括在内。|681994|sum(str_count(dataset,+regex("\\bcorn\\b",+uword+=+T)))|681995|entityMap|0|LINK|mutability|MUTABLE|url|http://unicode.org/reports/tr29/tr29-4.html^0|5|9|H|7|0|C|2|0|0|0|0|0|0|0|4|0|2Q|17|0|0|0|0^^$0|@$1|2|3|4|5|6|7|1A|8|@$9|1B|A|1C|B|C]|$9|1D|A|1E|B|C]]|D|@]|E|$]]|$1|F|3|G|5|6|7|1F|8|@$9|1G|A|1H|B|C]]|D|@]|E|$]]|$1|H|3|I|5|6|7|1I|8|@]|D|@]|E|$]]|$1|J|3|K|5|6|7|1J|8|@]|D|@]|E|$]]|$1|L|3|M|5|N|7|1K|8|@]|D|@]|E|$O|P]]|$1|Q|3|R|5|6|7|1L|8|@]|D|@]|E|$]]|$1|S|3|T|5|N|7|1M|8|@]|D|@]|E|$O|P]]|$1|U|3|V|5|6|7|1N|8|@$9|1O|A|1P|B|C]]|D|@]|E|$]]|$1|W|3|X|5|6|7|1Q|8|@]|D|@$9|1R|A|1S|1|1T]]|E|$]]|$1|Y|3|Z|5|6|7|1U|8|@]|D|@]|E|$]]|$1|10|3|11|5|N|7|1V|8|@]|D|@]|E|$O|P]]|$1|12|3|-4|5|6|7|1W|8|@]|D|@]|E|$]]]|13|$14|$5|15|16|17|E|$18|19]]]]

You can use the str_count function from the stringr package to get the number of keywords that match a given character vector.
The pattern argument of the str_count function accepts a regular expression that can be used to specify the keyword.
The regular expression syntax is very flexible and allows matching whole words as well as character patterns.
For example the following code will count all occurrences of the string &quot;corn&quot; and will return 3:
<pre><code>sum(str_count(dataset, regex(&quot;corn&quot;)))
</code></pre>
To match complete words use:
<pre><code>sum(str_count(dataset, regex(&quot;\\bcorn\\b&quot;)))
</code></pre>
The &quot;\b&quot; is used to specify a word boundary. When using str_count function, the default definition of word boundary includes apostrophe. So if your dataset contains the string &quot;corn's&quot;, it would be matched and included in the result.
This is because apostrophe is considered as a word boundary by default. To prevent words containing apostrophe from being counted, use the regex function with parameter uword = T. This will cause the regular expression engine to use the unicode TR 29 definition of word boundaries. See <a href="http://unicode.org/reports/tr29/tr29-4.html" rel="nofollow noreferrer">http://unicode.org/reports/tr29/tr29-4.html</a>. This definition does not consider apostrophe as a word boundary.
The following code will give the number of time the word &quot;corn&quot; occurs. Words such as &quot;corn's&quot; will not be included.
<pre><code>sum(str_count(dataset, regex(&quot;\\bcorn\\b&quot;, uword = T)))
</code></pre>

Is there a function for counting the number of times a particular keyword is contained in a dataset?

For example, if <code>dataset &lt;- c("corn", "cornmeal", "corn on the cob", "meal")</code> the count would be 3.

Count word occurrences in R

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

教程

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云智能顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云AI代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

功能1上新10个字符

功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符功能2描述100个字符。

功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符功能2上新100个字符。

功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符功能5描述100个字符

功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符功能5上新100个字符

功能4上新

文章&问答评论现已支持表情

全新交互，全新视觉，新增快捷键、悬浮工具栏、高亮块等功能并同时优化现有功能，全面提升创作效率和体验

社区富文本编辑器全新改版！诚邀体验～ 

精选全网热门MCP server，让你的AI更好用 🚀

💥开发者 MCP广场重磅上线！

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

聚焦“写作效率、视觉美观与运行性能”三方面进行全面升级，为您提供更高效、稳定的创作环境

社区富文本&Markdown编辑器全新改版上线，欢迎大家体验!

诚挚邀请您参与本次调研，分享您的真实使用感受与建议。您的反馈至关重要，感谢您的支持与参与！

社区新版编辑器体验调研

有没有一个函数可以计算特定关键字在数据集中出现的次数？例如，如果为dataset <- c("corn", "cornmeal", "corn on the cob", "meal")，则计数将为3。

问统计R中的单词出现次数
EN

回答 5

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问统计R中的单词出现次数EN

回答 5

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问统计R中的单词出现次数
EN