我是C#新手,我试图从这个问题中重用这个正则表达式,但不幸的是,它并没有给我任何结果。C#代码如下:
string pattern = @"^(((ht|f)tp(s?))\:\/\/)?(www.|[a-zA-Z].)[a-zA-Z0-9\-\.]+\.(com|edu|gov|mil|net|org|biz|info|name|museum|us|bd|ca|uk)(\:[0-9]+)*(\/($|[a-zA-Z0-9\.\,\;\?\'\\\+&%\$#\=~_\-]+))*$";
var linkParser = new Regex(pattern, RegexOptions.Compiled | RegexOptions.IgnoreCase);
var rawString = @"www.antilogvacations.com bhxbldsa
https://www.sample.com
http://www.sample.com/xyz
www.sample.com
www.sample.com/xyz/#/xyz
sample.com
blah www.sample.com blah mofiz.com";
MatchCollection matches = linkParser.Matches(rawString);
foreach (Match m in matches)
Console.WriteLine(m.Value);
}同样的regex也能很好地工作:https://regex101.com。
我不明白我可能做错了什么。
编辑1: regex也无法捕获文本中的URL,例如rawString中的最后一行。
发布于 2018-03-15 11:19:04
RegexOptions.Multiline^foo与启动12个字符的foo不匹配。因此:
string pattern = @"^((ftp|http|https):\/\/)?(www.)?(?!.*(ftp|http|https|www.))[a-zA-Z0-9_-]+(\.[a-zA-Z]+)+((\/)[\w#]+)*(\/\w+\?[a-zA-Z0-9_]+=\w+(&[a-zA-Z0-9_]+=\w+)*)?(\r)?$";
var linkParser = new Regex(pattern, RegexOptions.Compiled | RegexOptions.IgnoreCase
| RegexOptions.Multiline);
var rawString = @"www.antilogvacations.com bhxbldsa
https://www.sample.com
http://www.sample.com/xyz
www.sample.com
www.sample.com/xyz/#/xyz
sample.com
www.sample.com
mofiz.com"; // NO spaces to the left of thesehttps://stackoverflow.com/questions/49298047
复制相似问题