Extracting parts of a string using a regular expression

Posted by Max Makarochkin in Dev

Max Makarochkin

Here is a string: http://news.ycombinator.com/page?vasya=pupkin&b=b news.ycombinator.com/page news.ycombinator.com/page.php news.ycombinator.com/page

I am extracting hosts together with their pages, so I wrote the following regex:

([a-zA-Z0-9\.]*[a-zA-Z0-9]+[^\/][\.][a-zA-Z0-9\/\.]+)

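This is what it returns (matches were marked in bold):

http://news.ycombinator.com/page?vasya=pupkin&b=b news.ycombinator.com/page news.ycombinator.com/page.php news.ycombinator.com/page

This is not really what I need. For the string http://news.ycombinator.com/page?vasya=pupkin&b=b, the regex should not find a host with a page at all, because that string is a link and should be treated differently.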
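The behaviour described above can be reproduced with a short sketch. The question does not say which regex engine is in use, so Python's `re` module is an assumption here, and the names `ORIGINAL`, `TEXT` and `original_matches` are only illustrative. The pattern itself is copied unchanged from the question.

```python
import re

# The asker's original pattern, copied unchanged.
ORIGINAL = re.compile(r"([a-zA-Z0-9\.]*[a-zA-Z0-9]+[^\/][\.][a-zA-Z0-9\/\.]+)")

# The sample string from the question.
TEXT = (
    "http://news.ycombinator.com/page?vasya=pupkin&b=b "
    "news.ycombinator.com/page "
    "news.ycombinator.com/page.php "
    "news.ycombinator.com/page"
)

# Collect (start offset, matched text) for every match.
original_matches = [(m.start(), m.group(1)) for m in ORIGINAL.finditer(TEXT)]

for start, text in original_matches:
    print(start, text)
```

The first match starts at offset 7, directly after `http://`, which shows that the pattern picks up the host and page from inside the link instead of skipping the link as a whole.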

Should be rejected:

"http://news.ycombinator.com/page?vasya=pupkin&b=b", "http://news.ycombinator.com/page", "http://news.ycombinator.com/","http://news.ycombinator.com".

Should not be rejected:

"news.ycombinator.com/page","news.ycombinator.com/page.php", "news.ycombinator.com/page/index", "news.ycombinator.com/page/index.php"

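How can I improve this regex so that it selects only those parts of the string that have no word characters next to them?

Username

I'm not sure which regex flavor you are using, but you have effectively solved your own problem already: you just need the regex to match whole words. The details depend on the program you use, but here is a guideline (POSIX-style regex):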

([[:space:]][a-zA-Z0-9\.]*[a-zA-Z0-9]+[^\/][\.][a-zA-Z0-9\/\.]+[[:space:]])

or maybe ([[:space:]]([a-zA-Z0-9]*[\.\/])+[a-zA-Z0-9]+[[:space:]])

In the second one, make sure the inner group is a non-capturing group.
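As a rough illustration of the whole-word idea, here is a minimal sketch in Python, which is an assumption since the thread names no language. Python's `re` module does not support POSIX classes such as `[[:space:]]`, so this sketch swaps in the lookarounds `(?<!\S)` and `(?!\S)`, which require whitespace or a string boundary on each side without consuming it. The inner group is written as `(?:...)` to make it non-capturing. The name `HOST_WITH_PAGE` and the test lists are illustrative only.

```python
import re

# Adaptation of the answer's second pattern (not the answer's exact regex):
# lookarounds stand in for the whitespace delimiters, and the inner
# group is non-capturing.
HOST_WITH_PAGE = re.compile(r"(?<!\S)(?:[a-zA-Z0-9]*[./])+[a-zA-Z0-9]+(?!\S)")

TEXT = (
    "http://news.ycombinator.com/page?vasya=pupkin&b=b "
    "news.ycombinator.com/page "
    "news.ycombinator.com/page.php "
    "news.ycombinator.com/page"
)

# The lists from the question.
REJECT = [
    "http://news.ycombinator.com/page?vasya=pupkin&b=b",
    "http://news.ycombinator.com/page",
    "http://news.ycombinator.com/",
    "http://news.ycombinator.com",
]
ACCEPT = [
    "news.ycombinator.com/page",
    "news.ycombinator.com/page.php",
    "news.ycombinator.com/page/index",
    "news.ycombinator.com/page/index.php",
]

# With no capturing groups, findall returns the full matched tokens.
found = HOST_WITH_PAGE.findall(TEXT)
print(found)

all_rejected = all(HOST_WITH_PAGE.search(s) is None for s in REJECT)
all_accepted = all(HOST_WITH_PAGE.fullmatch(s) is not None for s in ACCEPT)
print(all_rejected, all_accepted)
```

The pattern is deliberately loose: it accepts any whitespace-delimited token built from alphanumerics, dots and slashes, so a stricter host check would be needed if inputs such as `./x` must be excluded.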


Edited on 2021-02-06
