python 的字符串查找方法返回 -1 的问题-解网

问：

我正在尝试编写一个接受字符串并返回标记位置的函数。该函数在以下情况下工作正常，但是如果我尝试使用字符串 lower 方法，如下面的代码所示，我的第一个元组返回 as 而不是所需的输出tokens = query_string.split()[(-1, 2), (5, 6), (8, 8), (10, 13)][(0, 3), (5, 6), (8, 8), (10, 13)]

我用于测试的字符串是“This is a test”。

def token_position_list(query_string):
    """
    :param query_string: a string representing a query
    :return: a list of tuples, where each tuple holds the start and end positions of each token
    """
    token_positions = []
    tokens = query_string.lower().split()
    current_position = 0
    for token in tokens:
        start_position = query_string.find(token, current_position)
        end_position = start_position + len(token) - 1
        token_positions.append((start_position, end_position))
        current_position = end_position + 1
    return token_positions

谁能向我解释为什么加低会这样做以及我如何解决这个问题？

python 字符串查找标记化小写

欢迎使用 Stack Overflow。请阅读如何提问。我们在这里不写“找到错误”的答案;我们需要一个具体的问题 - 这将是你理解和定位特定问题的最佳尝试，并在一个最小的可重复的例子中展示它。适合 Stack Overflow 的问题是，你已经弄清楚了代码中执行与预期不同的操作的特定部分（您应该具体期望某些内容），并且不明白为什么。

0赞 Karl Knechtel 11/8/2023

“为什么添加 lower 会这样做”——好吧，您是否尝试一步一步地检查代码运行时会发生什么？例如，对于给定的输入，您认为应该是什么结果？如果你明确检查，你会得到什么结果？这是你所期望的吗？你期望第一次通过循环的价值是什么？如果你试图在字符串中找到字符串，你认为应该发生什么？为什么？This is a testquery_string.lower().split()tokenfor token in tokens:thisThis is a test

答：

0赞 Barmar 11/8/2023 #1

您的所有令牌都是小写的，但仍然是混合大小写的。因此，如果原始字符串在该标记中包含任何大写字母，则它不会找到该标记。query_string

您应该转换为小写并对其进行处理。query_string

def token_position_list(query_string):
    """
    :param query_string: a string representing a query
    :return: a list of tuples, where each tuple holds the start and end positions of each token
    """
    token_positions = []
    query_string = query_string.lower()
    tokens = query_string.split()
    current_position = 0
    for token in tokens:
        start_position = query_string.find(token, current_position)
        end_position = start_position + len(token) - 1
        token_positions.append((start_position, end_position))
        current_position = end_position + 1
    return token_positions

python 的字符串查找方法返回 -1 的问题

Issue with python's string find method returning -1

评论

评论