Regex is usually not the best tool for this job, but if your case is as specific as your example, you could use:

<pre><code>foo:((?&gt;\d+))(?!&lt;/a&gt;)
</code></pre>

Your first expression didn't work because <code>\d+</code> backtracks until <code>(?!&lt;/a&gt;)</code> matches: it gives up the last digit, and the lookahead then sees a digit instead of <code>&lt;/a&gt;</code>. This can be fixed by not allowing <code>\d+</code> to backtrack, as above with the help of an atomic (non-backtracking) group, or you could make the lookahead fail whenever <code>\d+</code> backtracks, like:

<pre><code>foo:(\d+)(?!&lt;/a&gt;|\d)
</code></pre>

Although that is not as efficient.
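To compare the plain lookahead with the two fixes, here is a minimal C# sketch. The class name <code>LookaheadDemo</code>, the helper <code>Digits</code> and the constant names are only illustrative; the third pattern is the digit-guard lookahead written without the atomic group.

```csharp
using System.Linq;
using System.Text.RegularExpressions;

public static class LookaheadDemo
{
    // Plain lookahead: \d+ may give back its last digit to satisfy (?!</a>).
    public const string Naive = @"foo:(\d+)(?!</a>)";

    // Atomic group: once \d+ has matched, it cannot give digits back.
    public const string Atomic = @"foo:((?>\d+))(?!</a>)";

    // Lookahead that also rejects a following digit, so backtracking never helps.
    public const string DigitGuard = @"foo:(\d+)(?!</a>|\d)";

    // Returns group 1 (the digits) of every match of pattern in input.
    public static string[] Digits(string pattern, string input) =>
        Regex.Matches(input, pattern)
             .Cast<Match>()
             .Select(m => m.Groups[1].Value)
             .ToArray();
}
```

On the anchored line from the question, <code>Digits(LookaheadDemo.Naive, ...)</code> captures <code>12345</code> (the last digit is dropped), while the other two patterns find nothing. On a bare <code>foo:123456</code> all three capture <code>123456</code>.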

If you want to start analysing HTML like this, then you probably want to actually parse the HTML instead of using regular expressions. The <a href="http://htmlagilitypack.codeplex.com/" rel="nofollow">HTML Agility Pack</a> is the usual first port of call. With regular expressions it becomes hard to deal with things like <code>&lt;a&gt;&lt;/a&gt;foo:123456&lt;a&gt;&lt;/a&gt;</code>, which of course should pull out the middle bit, but it's extremely hard to write a regex that will do that.

I should add that I am assuming you do in fact have a block of HTML rather than just individual short strings like each of your lines above. I ruled that out partly because matching it when it is the only thing on the line is pretty easy, so I figured you'd have got it if you wanted that. :)
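As a rough sketch of the parsing route, assuming the HtmlAgilityPack NuGet package is referenced in the project: select the text nodes that have no <code>&lt;a&gt;</code> ancestor and run a simple regex on each of them. The class and method names here are made up for illustration.

```csharp
using System.Collections.Generic;
using System.Text.RegularExpressions;
using HtmlAgilityPack; // third-party NuGet package, assumed to be installed

public static class AnchorAwareSearch
{
    // Returns the digits of every foo:<digits> found in text outside anchors.
    public static List<string> FindOutsideAnchors(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        var results = new List<string>();

        // Text nodes without an <a> ancestor. SelectNodes returns null
        // (not an empty collection) when nothing matches.
        HtmlNodeCollection textNodes =
            doc.DocumentNode.SelectNodes("//text()[not(ancestor::a)]");
        if (textNodes == null)
        {
            return results;
        }

        foreach (HtmlNode node in textNodes)
        {
            foreach (Match m in Regex.Matches(node.InnerText, @"foo:(\d+)"))
            {
                results.Add(m.Groups[1].Value);
            }
        }
        return results;
    }
}
```

With this approach the <code>&lt;a&gt;&lt;/a&gt;foo:123456&lt;a&gt;&lt;/a&gt;</code> case needs no special handling, because the middle text node simply has no anchor ancestor.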

Note that lookbehind does not support variable-length patterns in many regex engines (.NET is an exception), so you may want to solve it differently, for example:

<ol>
<li>Find and mark every <code>foo</code> that is contained in an anchor</li>
<li>Find all the other occurrences and do what you need with them</li>
<li>Remove the marks</li>
</ol>
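The three steps above could be sketched in C# roughly as follows. The marker character <code>\u0001</code> is an assumption (any string that never occurs in your input will do), the anchor regex is deliberately naive, and the names are hypothetical.

```csharp
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class MarkAndSweep
{
    // Hypothetical marker: a control character assumed never to occur in the input.
    private const string Mark = "\u0001";

    public static List<string> FindOutsideAnchors(string html, out string restored)
    {
        // 1. Mark every "foo:" inside an anchor element by breaking the token apart.
        string marked = Regex.Replace(
            html,
            @"<a\b[^>]*>.*?</a>",
            m => m.Value.Replace("foo:", "foo" + Mark + ":"),
            RegexOptions.Singleline | RegexOptions.IgnoreCase);

        // 2. Process the remaining occurrences; marked ones no longer match.
        var found = new List<string>();
        foreach (Match m in Regex.Matches(marked, @"foo:(\d+)"))
        {
            found.Add(m.Groups[1].Value);
        }

        // 3. Remove the marks to get the original text back.
        restored = marked.Replace(Mark, "");
        return found;
    }
}
```

If step 2 rewrites the text instead of only reading it, apply the replacement to <code>marked</code> and strip the marks from the result.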

This is probably a long-winded way of doing it, but you could simply bring back all occurrences of <code>foo:</code> followed by some digits and then exclude the unwanted ones afterwards.

<pre><code>// Every foo:&lt;digits&gt;, including a trailing "&lt;" when a tag follows directly.
string pattern = @"foo:\d+&lt;?";
</code></pre>

Then collect the matches into a <code>MatchCollection</code>:

<pre><code>MatchCollection m0 = Regex.Matches(file, pattern, RegexOptions.Singleline);
</code></pre>

Then loop through each occurrence:

<pre><code>foreach (Match m in m0)
{
    // Exclude the matches that contain the "&lt;", i.e. those followed by a tag.
    if (m.Value.Contains("&lt;")) continue;

    // ... use m.Value here
}
</code></pre>

I would use LINQ and treat the HTML like XML, for example:

<pre><code>var query = MyHtml.Descendants().ToArray();
foreach (XElement result in query)
{
    if (Regex.IsMatch(result.Value, @"foo:123456") &amp;&amp; result.Name.ToString() != "a")
    {
        // do something...
    }
}
</code></pre>

Perhaps there's a better way, but I don't know it... this seems pretty straightforward to me :P
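One thing to watch with the element-based check: <code>XElement.Value</code> concatenates the text of all descendants, so a parent of the anchor would still match. A possible variant, assuming the markup is well-formed XML without namespaces, works on the text nodes instead. The names below are only illustrative.

```csharp
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;
using System.Xml.Linq;

public static class XmlTextSearch
{
    // Returns the digits of every foo:<digits> in text that is not inside an <a>.
    public static List<string> FindOutsideAnchors(string xhtml)
    {
        // XElement.Parse throws XmlException if the input is not well-formed XML.
        XElement root = XElement.Parse(xhtml);

        return root.DescendantNodes()
                   .OfType<XText>()                       // text nodes only
                   .Where(t => !t.Ancestors("a").Any())   // skip text inside anchors
                   .SelectMany(t => Regex.Matches(t.Value, @"foo:(\d+)").Cast<Match>())
                   .Select(m => m.Groups[1].Value)
                   .ToList();
    }
}
```

Real-world HTML is often not valid XML, in which case <code>XElement.Parse</code> will throw and a dedicated HTML parser is the safer choice.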

I'm looking to match all text in the format foo:12345 that is not contained within an HTML anchor. For example, I'd like to match lines 1 and 3 from the following:

<code>foo:123456</code>

<code>&lt;a href="http://www.google.com"&gt;foo:123456&lt;/a&gt;</code>

<code>foo:123456</code>

I've tried these regexes with no success:

Negative lookahead attempt (matches incorrectly, but without the last digit)

<code>foo:(\d+)(?!&lt;/a&gt;)</code>

Negative lookahead with non-capturing grouping

<code>(?:foo:(\d+))(?!&lt;/a&gt;)</code>

Negative lookbehind attempt (wildcards don't seem to be supported)

<code>(?&lt;!&lt;a[^&gt;]&gt;)foo:(\d+)</code>

c# regex to match specific text
