使用^运算符时，如何使正则表达式模式在行首前考虑逗号？-解网

问：

import re

#example 1  with a  ,  before capture group
input_text = "Hello how are you?, dfdfdfd fdfdfdf other text. hghhg"

#example 2 without a  , (or \.|,|;|\n) before capture group
input_text = "dfdfdfd fdfdfdf other text. hghhg"

#No matter what position you place ^ within the options, it always considers it first, ignoring the others.
fears_and_panics_match = re.search(
                                    r"(?:\.|,|;|\n|^)\s*(?:(?:for|by)\s*me|)\s*(.+?)\s*(?:other\s*text)\s*(?:\.|,|;|\n)", 
                                    #r"(?:\.|,|;|\n)\s*(?:(?:for|by)\s*me|)\s*(.+?)\s*(?:other\s*text)\s*(?:\.|,|;|\n|$)", 
                                    input_text, flags = re.IGNORECASE)


if fears_and_panics_match: print(fears_and_panics_match.group(1))

为什么我使用这个模式捕获，无论你把 . 我需要您评估找到逗号然后找到行首逗号的可能性r"(?:\.|,|;|\n|^)\s*(?:(?:for|by)\s*me|)\s*(.+?)\s*(?:other\s*text)\s*(?:\.|,|;|\n)"Hello how are you?, dfdfdfd fdfdfdf^,^

每种情况下的正确输出：

#for example 1
"dfdfdfd fdfdfdf"

#for example 2
"dfdfdfd fdfdfdf"

python-3.x 正则表达式字符串 regex-group

strs = [
  'dfdfdfd fdfdfdf other text. hghhg',
  'Hello how are you?, dfdfdfd fdfdfdf other text.hghhg',
  'for me a word other text',
  'A semicolon first; then some words before other text'
]
regex = r'(?:.*?[.,;])?\s*(?:(?:for|by)\s*me\s*)?(\w.*?)(?=\s*other\s*text)'
for s in strs:
    print(re.match(regex, s).group(1))

输出：

dfdfdfd fdfdfdf
dfdfdfd fdfdfdf
a word
then some words before

上一个：将数据导出为字符串时，DataFrame 格式化失败

下一个：请调试此代码。代码未显示所需的输出。这是骆驼案例 4 问题的解决方案

使用^运算符时，如何使正则表达式模式在行首前考虑逗号？

How to make a regular expression pattern consider a comma before the start of line when using the ^ operator?

评论