Regex, get char after char before char

Question:

Sample:

[Foo][Bar]Foo bar foo bar: foo; bar: foo bar foo bar __
[Foo][Bar]Foo; bar: foo bar __ foo bar foo bar
[Foo]Foo bar foo bar foo bar: foo __ bar; foo bar __ foo bar
[Bar]Foo; bar; foo

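For example, I have strings in the format shown above.

My question is: how do I get the letters that come after a semicolon (;), skipping any whitespace, but before the first colon (:)?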

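If possible, I would like to mark the letters in a single step using a regex.

The letters I want to get are marked in bold.

*As additional information, I want to change those letters to uppercase.

regex vb.net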

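^ - start-of-line anchor
(?: - start of a non-capturing group
- [^;:]* - match any character except ; and :, zero or more times
- ; - match a literal ;
- \s* - match whitespace zero or more times
- (.) - capture one character. If a : must not be captured, replace it with ([^:])
) - end of the non-capturing group
+ - match the non-capturing group one or more times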

Demo

(?<=              # Match something preceded by
  ^[^:]*;\s*      # the start of the line, 0 or more non-colons, a semicolon and any whitespaces
)                 # that is
[^\s:]            # not a colon and not a whitespace
(?=               # which must be followed by
  [^:]*(?:$|:)    # 0 or more non-colons, then either the end of the line or the first colon.
)                 #

Try it on regex101.com.
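If the commented layout is worth keeping in the source code, .NET can probably take it as is: RegexOptions.IgnorePatternWhitespace ignores unescaped whitespace in the pattern and treats # as the start of a comment. A minimal sketch under that assumption (the module name VerbosePatternSketch is made up for illustration):

```vbnet
Imports System
Imports System.Text.RegularExpressions

Module VerbosePatternSketch
    Sub Main()
        ' The commented pattern, one piece per line, joined with line feeds
        ' so that each # comment ends at the end of its own line.
        Dim pattern As String = String.Join(vbLf,
            "(?<=              # Match something preceded by",
            "  ^[^:]*;\s*      # the start of the line, 0 or more non-colons, a semicolon and any whitespace",
            ")                 # that is",
            "[^\s:]            # not a colon and not a whitespace",
            "(?=               # which must be followed by",
            "  [^:]*(?:$|:)    # 0 or more non-colons, then either the end of the line or the first colon",
            ")")

        Dim regex As New Regex(pattern,
            RegexOptions.Multiline Or RegexOptions.IgnorePatternWhitespace)

        ' Prints each matched character on its own line.
        For Each m As Match In regex.Matches("[Bar]Foo; bar; foo")
            Console.WriteLine(m.Value)
        Next
    End Sub
End Module
```

For the sample line used here, the matches should be the b of "bar" and the f of "foo".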

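[^:], ;, \s and [^\s:] never match a colon, so the colon in the lookahead matches the first colon of the line. If there is no colon, we simply fall back to the end of the line, which still allows the main expression to match.

The regex needs the multiline modifier ((?m) / RegexOptions.Multiline). I don't know VB.NET, but the following snippet seems to work: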

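Imports System
Imports System.Text.RegularExpressions

Module Program
  Sub Main()
    ' Multiline makes ^ and $ match at line boundaries, not only at the ends of the input
    Dim regex As New Regex("(?<=^[^:]*;\s*)[^\s:](?=[^:]*(?:$|:))", RegexOptions.Multiline)
    Dim input As String =
      "[Foo][Bar]Foo bar foo bar: foo; bar: foo bar foo bar __" & vbCrLf &
      "[Foo][Bar]Foo; bar: foo bar __ foo bar foo bar" & vbCrLf &
      "[Foo]Foo bar foo bar foo bar: foo __ bar; foo bar __ foo bar" & vbCrLf &
      "[Bar]Foo; bar; foo"

    ' Replace every matched character with its uppercase form
    Console.WriteLine(regex.Replace(input, AddressOf ConvertToUppercase))
  End Sub

  ' MatchEvaluator callback: Groups(0) is the whole match, i.e. the single matched character
  Function ConvertToUppercase(match As Match) As String
    Return match.Groups(0).Value.ToUpper()
  End Function
End Module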

Try it on ideone.com.
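The answer mentions (?m) as the inline form of the multiline modifier. As a hedged variation on the snippet above, the same replacement might be written with the modifier inside the pattern and a lambda in place of the named callback. The module name InlineModifierSketch and the shortened two-line input are assumptions made for this sketch only.

```vbnet
Imports System
Imports System.Text.RegularExpressions

Module InlineModifierSketch
    Sub Main()
        ' (?m) at the start of the pattern switches on multiline mode inline,
        ' so no RegexOptions argument is passed to the constructor.
        Dim regex As New Regex("(?m)(?<=^[^:]*;\s*)[^\s:](?=[^:]*(?:$|:))")

        Dim input As String =
            "[Foo][Bar]Foo; bar: foo bar __ foo bar foo bar" & vbLf &
            "[Bar]Foo; bar; foo"

        ' A lambda used as the MatchEvaluator instead of a named function.
        Dim result As String =
            regex.Replace(input, Function(m As Match) m.Value.ToUpper())

        Console.WriteLine(result)
    End Sub
End Module
```

Whether the inline modifier or the RegexOptions argument reads better is a matter of taste; both should configure the same matching behaviour.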

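I don't know. All alternatives that work one way or another are good; perhaps there is a subtle difference between them that makes one work better for the OP. I think it is best to use a regular VB function to uppercase the letters. I couldn't find anything in the .NET regex engine that does this automatically, but perhaps it is there somewhere.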

0 votes Espada 10/10/2023

@InSync After 2 days of searching, finally... The regex and the VB.NET code work well, just as I expected. Thank you very much :-)
