首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >如何在awk脚本中使用正则表达式?

如何在awk脚本中使用正则表达式?
EN

Stack Overflow用户
提问于 2013-09-30 16:48:40
回答 2查看 166关注 0票数 1

嗨,我正在做一个脚本,给我一个新闻组活动的摘要。到目前为止,除了我尝试使用匹配运算符来查看这个$6字段是否与表达式匹配外,它大部分都能工作。我想把所有的戒指都放在一段下面。我的脚本就是这样的:

newsread.awk:

代码语言:javascript
复制
BEGIN{
print "\t\t\tNews Reader Summary\n\n"
printf("               %-15s%-15s%-15s%-15s\n\n","lonestar","runner","ringer","rings"); 
articles[4];
groups[4];
times[4];
cs2413[4];cs2413d[4];
}

NR == 1 {date1 = $1 " " $2 " " $3}

$6 == "lonestar.jpl.utsa.edu"{
    if ($7=="group"){
        articles[1]+=$9;
        if ($8=="utsa.cs.2413"){
            cs2413[1]+=$9;
        }
        if ($8=="utsa.cs.2413.d"){
            cs2413d[1]+=$9;
        }
    }else if ($7 == "exit"){
        articles[1]+=$9;
        groups[1]+=$11;
    }else {
        times[1]+=$13;
    }
}

$6 == "runner.jpl.utsa.edu"{
    if ($7=="group"){
                articles[2]+=$9;
         if ($8=="utsa.cs.2413"){
                        cs2413[2]+=$9;
                }
                if ($8=="utsa.cs.2413.d"){ 
                        cs2413d[2]+=$9;
                }

        }else if ($7 == "exit"){
                articles[2]+=$9;
                groups[2]+=$11;
        }else {
                times[2]+=$13;
        }

}

$6 == "ringer.cs.utsa.edu"{
    if ($7=="group"){
                articles[3]+=$9;
         if ($8=="utsa.cs.2413"){
                        cs2413[3]+=$9;
                }
                if ($8=="utsa.cs.2413.d"){ 
                        cs2413d[3]+=$9;
                }

        }else if ($7 == "exit"){
                articles[3]+=$9;
                groups[3]+=$11;
        }else {
                times[3]+=$13;
        }

}

$6 ~ "/ring??.cs.utsa.edu/"{
    if ($7=="group"){
                articles[4]+=$9;
         if ($8=="utsa.cs.2413"){
                        cs2413[4]+=$9;
                }
                if ($8=="utsa.cs.2413.d"){ 
                        cs2413d[4]+=$9;
                }

        }else if ($7 == "exit"){
                articles[4]+=$9;
                groups[4]+=$11;
        }else {
                times[4]+=$13;
        }

}
END{
    date2 = $1 " " $2 " " $3
    printf("Articles:      %-15d%-15d%-15d%-15d\n",articles[1],articles[2],articles[3],articles[4]); 
    printf("Groups:        %-15d%-15d%-15d%-15d\n",groups[1],groups[2],groups[3],groups[4]); 
    printf("Cs2413:        %-15d%-15d%-15d%-15d\n",cs2413[1],cs2413[2],cs2413[3],cs2413[4]); 
    printf("Cs2413.d:      %-15d%-15d%-15d%-15d\n",cs2413d[1],cs2413d[2],cs2413d[3],cs2413d[4]); 
    printf("User Time:     %-15d%-15d%-15d%-15d\n",times[1],times[2],times[3],times[4]);  
    printf("\nStart Time = %s\tEnd Time = %s\n",date1,date2); 

}

这是news.notice外观的一个片段:

代码语言:javascript
复制
Feb 13 21:27:14 ringer nnrpd[11474]: lonestar.jpl.utsa.edu group alt.education.distance 19
Feb 13 21:27:14 ringer nnrpd[11474]: lonestar.jpl.utsa.edu exit articles 19 groups 1
Feb 13 21:27:14 ringer nnrpd[11474]: lonestar.jpl.utsa.edu times user 0.470 system 0.930 elapsed 4.766
Feb 13 21:27:49 ringer nnrpd[11462]: ring42.cs.utsa.edu exit articles 0 groups 2
Feb 13 21:27:49 ringer nnrpd[11462]: ring42.cs.utsa.edu times user 2.020 system 1.430 elapsed 45.114
Feb 13 21:28:00 ringer nnrpd[11482]: lonestar.jpl.utsa.edu group utsa.lonestar 7
Feb 13 21:28:00 ringer nnrpd[11482]: lonestar.jpl.utsa.edu exit articles 7 groups 1
Feb 13 21:28:00 ringer nnrpd[11482]: lonestar.jpl.utsa.edu times user 0.520 system 0.890 elapsed 48.286
Feb 13 21:28:38 ringer innd: ME running
Feb 13 21:28:43 ringer nnrpd[11344]: lonestar.jpl.utsa.edu unrecognized NOOP
Feb 13 21:29:01 ringer nnrpd[11601]: lonestar.jpl.utsa.edu connect
Feb 13 21:29:01 ringer nnrpd[11601]: lonestar.jpl.utsa.edu exit articles 0 groups 0
Feb 13 21:29:01 ringer nnrpd[11601]: lonestar.jpl.utsa.edu times user 0.470 system 0.770 elapsed 1.456
Feb 13 21:29:03 ringer nnrpd[11602]: lonestar.jpl.utsa.edu connect
Feb 13 21:29:03 ringer nnrpd[11472]: ring29.cs.utsa.edu exit articles 0 groups 0
Feb 13 21:29:03 ringer nnrpd[11472]: ring29.cs.utsa.edu times user 1.360 system 0.790 elapsed 114.771
Feb 13 21:29:03 ringer nnrpd[11602]: lonestar.jpl.utsa.edu exit articles 0 groups 0
Feb 13 21:29:03 ringer nnrpd[11602]: lonestar.jpl.utsa.edu times user 0.530 system 0.650 elapsed 1.524
Feb 13 21:29:25 ringer nnrpd[11615]: lonestar.jpl.utsa.edu connect

我正在使用这个命令:

代码语言:javascript
复制
awk -f newsread.awk news.notice > newsread.summary

下面是newsread.summary:

代码语言:javascript
复制
            News Reader Summary


               lonestar       runner         ringer         rings          

Articles:      144686         25066          2              0              
Groups:        5282           8344           19             0              
Cs2413:        0              0              0              0              
Cs2413.d:      40             25             0              0              
User Time:     266197         83377          128            0              

Start Time = Feb 13 21:27:14    End Time = Feb 14 20:56:49

而且它必须是一个awk脚本。

EN

回答 2

Stack Overflow用户

回答已采纳

发布于 2013-09-30 16:54:44

首先,去掉引号,也就是说,不要这样:

代码语言:javascript
复制
$6 ~ "/ring??.cs.utsa.edu/"

但这一点:

代码语言:javascript
复制
$6 ~ /ring??.cs.utsa.edu/

引号分隔字符串,斜杠分隔常量REs。

现在,我怀疑您的RE是错误的,因为??是每个POSIX的未定义行为,因为它是背对背的重复元字符,而.的意思是“任何单个字符”。这是一个正则表达式,而不是shell珠-不同含义的元字符。

你可能想要这样做:

代码语言:javascript
复制
$6 ~ /^ring..\.cs\.utsa\.edu$/
票数 3
EN

Stack Overflow用户

发布于 2013-09-30 16:53:26

失去双引号。

代码语言:javascript
复制
$6 ~ /regex/

代码语言:javascript
复制
$6 ~ "/regex/"
票数 1
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/19099532

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档