
<pre><code>$ awk -F';' 'NR==FNR{a[$3]++;next}a[$3]&gt;1' file file|sort -t";" -k3
test;group;10.10.10.10;action2
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test2;group;10.10.13.11;action1
test6;group4;10.10.13.11;action8
</code></pre>

<ul>
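<li><code>awk</code> reads the file twice: the first pass counts each value of <code>$3</code>, the second prints every line whose <code>$3</code> was seen more than once</li>
<li><code>sort</code> orders the result by IP (the third field)</li>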
</ul>
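The two-pass idiom above can be spelled out as a small script. This is only a sketch: the sample data is copied from the question, `dup_by_ip` is a hypothetical helper name, and `sort -s` (stable sort) is an assumption that holds for GNU and BSD `sort` but is not required by POSIX.

```shell
#!/bin/sh
# Sample data copied from the question.
sample=$(mktemp)
cat > "$sample" <<'EOF'
test;group;10.10.10.10;action2
test2;group;10.10.13.11;action1
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test5;group2;10.10.10.12;action5
test6;group4;10.10.13.11;action8
EOF

# dup_by_ip is a hypothetical helper name used only for this illustration.
dup_by_ip() {
    # The file is passed twice, so awk reads it twice.
    awk -F';' '
        NR == FNR { count[$3]++; next }   # first pass: count each IP
        count[$3] > 1                     # second pass: print lines with a repeated IP
    ' "$1" "$1" | sort -s -t';' -k3,3     # order by IP, keep file order inside a group
}

dup_by_ip "$sample"
```

`NR == FNR` is only true while the first copy of the file is being read, which is what separates the counting pass from the printing pass.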


You can also try this solution using <code>grep</code>, <code>cut</code>, <code>sort</code>, <code>uniq</code>, and a process substitution in the middle:

<pre><code>grep -f &lt;(cut -d ';' -f3 file | sort | uniq -d) file | sort -t ';' -k3
</code></pre>

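It is not really elegant (I actually prefer the <code>awk</code> answer given above), but I think it is worth sharing, since it accomplishes what you want. Be aware that <code>grep -f</code> treats each extracted IP as a regular expression and matches it anywhere in the line, so <code>.</code> matches any character; adding <code>-F</code> makes the patterns literal strings.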
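To see what each stage of that pipeline contributes, here is a hedged step-by-step sketch. It swaps the process substitution for a temporary file so that it also runs in a plain POSIX `sh`, and it adds `-F` so the IPs are matched literally; `dup_ips` and `dup_lines` are hypothetical helper names, and `sort -s` is assumed to be available (GNU and BSD `sort`).

```shell
#!/bin/sh
# Sample data copied from the question.
sample=$(mktemp)
cat > "$sample" <<'EOF'
test;group;10.10.10.10;action2
test2;group;10.10.13.11;action1
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test5;group2;10.10.10.12;action5
test6;group4;10.10.13.11;action8
EOF

# Step 1 (hypothetical helper): list the IPs that occur more than once.
# uniq -d only detects adjacent repeats, hence the sort before it.
dup_ips() {
    cut -d';' -f3 "$1" | sort | uniq -d
}

# Step 2 (hypothetical helper): keep the lines containing one of those IPs.
dup_lines() {
    patterns=$(mktemp)
    dup_ips "$1" > "$patterns"
    # -F: treat each pattern as a fixed string, not a regular expression.
    grep -F -f "$patterns" "$1" | sort -s -t';' -k3,3
    rm -f "$patterns"
}

dup_ips "$sample"
dup_lines "$sample"
```

Even with `-F` the match is still a substring match on the whole line, so an IP that is a prefix of another one (or that appears in a different column) would also be selected.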


This is very similar to Kent's answer, but with a single pass through the file. The tradeoff is memory: you need to store the lines to keep. This uses GNU awk for the PROCINFO variable.

<pre><code>awk -F';' '
    {count[$3]++; lines[$3] = lines[$3] $0 ORS}
    END {
        PROCINFO["sorted_in"] = "@ind_str_asc"
        for (key in count)
            if (count[key] &gt; 1)
                printf "%s", lines[key]
    }
' file
</code></pre>

The equivalent in Perl:

<pre><code>perl -F';' -lane '
    $count{$F[2]}++; push @{$lines{$F[2]}}, $_
  } END {
    print join $/, @{$lines{$_}}
        for sort grep {$count{$_} &gt; 1} keys %count
' file
</code></pre>
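For `awk` implementations that lack `PROCINFO["sorted_in"]` (it is a GNU awk feature), the same single-pass idea can be sketched by letting `awk` emit the groups in whatever order it likes and handing the ordering to `sort`. This is an illustration under assumptions, not the answer's own code: `dup_single_pass` is a hypothetical name and `sort -s` is assumed to be supported.

```shell
#!/bin/sh
# Sample data copied from the question.
sample=$(mktemp)
cat > "$sample" <<'EOF'
test;group;10.10.10.10;action2
test2;group;10.10.13.11;action1
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test5;group2;10.10.10.12;action5
test6;group4;10.10.13.11;action8
EOF

# dup_single_pass is a hypothetical helper name used only for this illustration.
dup_single_pass() {
    awk -F';' '
        # Read the file once: count each IP and buffer its lines.
        { count[$3]++; lines[$3] = lines[$3] $0 ORS }
        END {
            # Iteration order is unspecified here; sort fixes it afterwards.
            for (key in count)
                if (count[key] > 1)
                    printf "%s", lines[key]
        }
    ' "$1" | sort -s -t';' -k3,3
}

dup_single_pass "$sample"
```

The memory tradeoff is unchanged: every line is held in the `lines` array until the `END` block runs.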


Here is another <code>awk</code>-assisted pipeline:

<pre><code>$ awk -F';' '{print $0 "\t" $3}' file | sort -sk2 | uniq -Df1 | cut -f1

test;group;10.10.10.10;action2
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test2;group;10.10.13.11;action1
test6;group4;10.10.13.11;action8
</code></pre>

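Single pass through the file, so no special caching is needed; the stable sort (<code>-s</code>) also keeps the original order within each IP group. This assumes that tabs (or, more generally, whitespace) do not appear in the fields.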


<code>awk</code> + <code>sort</code> + <code>uniq</code> + <code>cut</code>:

<pre><code>$ awk -F ';' '{print $0,$3}' &lt;file&gt; | sort -k2 | uniq -D -f1 | cut -d' ' -f1
</code></pre>

<code>sort</code> + <code>awk</code>:

<pre><code>$ sort -t';' -k3,3 &lt;file&gt; | awk -F ';' '($3==k){c++;b=b"\n"$0}($3!=k){if (c&gt;1) print b;c=1;k=$3;b=$0}END{if(c&gt;1)print b}'
</code></pre>

<code>awk</code> only:

<pre><code>$ awk -F ';' '{b[$3"_"++k[$3]]=$0; }
      END{for (i in k) if(k[i]&gt;1) for(j=1;j&lt;=k[i];j++) print b[i"_"j] }' &lt;file&gt;
</code></pre>

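This buffers the full file (same as <code>sort</code> does) and uses the array <code>k</code> to keep track of how many times each key appears. At the end, if a key appears more than once, the full set of its lines is printed. The order of the groups is not specified, because <code>for (i in k)</code> iterates in arbitrary order: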

<pre><code>test2;group;10.10.13.11;action1
test6;group4;10.10.13.11;action8
test;group;10.10.10.10;action2
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
</code></pre>
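If the arbitrary group order is a problem but sorting is not wanted either, one option is to remember the order in which each key first appears. The following is a sketch of that variation, not part of the original answer; `dup_first_seen` and the `order` array are names introduced here for illustration.

```shell
#!/bin/sh
# Sample data copied from the question.
sample=$(mktemp)
cat > "$sample" <<'EOF'
test;group;10.10.10.10;action2
test2;group;10.10.13.11;action1
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test5;group2;10.10.10.12;action5
test6;group4;10.10.13.11;action8
EOF

# dup_first_seen is a hypothetical helper name used only for this illustration.
dup_first_seen() {
    awk -F';' '
        !($3 in k) { order[++n] = $3 }   # record each key the first time it shows up
        { b[$3 "_" ++k[$3]] = $0 }       # buffer the line under key_occurrence
        END {
            # Walk the keys in first-seen order instead of for (i in k).
            for (i = 1; i <= n; i++) {
                key = order[i]
                if (k[key] > 1)
                    for (j = 1; j <= k[key]; j++)
                        print b[key "_" j]
            }
        }
    ' "$1"
}

dup_first_seen "$sample"
```

This uses only standard `awk` features, so it does not depend on GNU awk.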

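If you want it sorted (this uses <code>asorti</code>, a GNU awk extension, and loops over its result by index so the order is guaranteed):

<pre><code>$ awk -F ';' '{b[$3"_"++k[$3]]=$0; }
      END{ n=asorti(k,l);
           for (i=1;i&lt;=n;i++) if(k[l[i]]&gt;1) for(j=1;j&lt;=k[l[i]];j++) print b[l[i]"_"j] }' &lt;file&gt;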
</code></pre>

I have a file with fields separated by <code>;</code>, like this:

<pre><code>test;group;10.10.10.10;action2
test2;group;10.10.13.11;action1
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test5;group2;10.10.10.12;action5
test6;group4;10.10.13.11;action8
</code></pre>

I would like to identify all lines with non-unique IP addresses (3rd column). For the example above, the output should be:

<pre><code>test;group;10.10.10.10;action2
test3;group3;10.10.10.10;action3
tes4;group;10.10.10.10;action4
test2;group;10.10.13.11;action1
test6;group4;10.10.13.11;action8
</code></pre>

Sorted by IP address (3rd column).

Using simple commands like <code>cat</code>, <code>uniq</code>, <code>sort</code>, <code>awk</code> (no Perl, no Python, only shell).

Any ideas?

uniq sort parsing
