在 Python 中从给定的 html 中获取所有 xpath 列表的最佳方法是什么？-解网

问：

我希望从 Python 中给定的 html 中获取所有 xpath 的列表。我当前的实现仅使用 lxml 库为我提供了相对 xpath。我需要 xpaths 来使用 ids 和其他属性，这样我就可以在另一个应用程序的 Java Selenium 中使用这些 xpath。

    for element in html.iter():
        try:
            self.listOfXpathsFound.append(tree.getelementpath(element))
        except ValueError as val:
            count = count + 1
            print("ValueError: " + str(val))
            self.errorsDict["ValueError " + str(count)] = str(val)

我无法弄清楚如何在没有相对的情况下获得 xpath。有什么想法吗？

例：

使用 lxml etree 给出的 Xpath： //body//p//

必需的 xpath：//@id=“para-one”

python xpath lxml 元素树

from lxml import html

# Parse your HTML document
html_content = "<your HTML content here>"
tree = html.fromstring(html_content)

# Get all elements with an "id" attribute
elements_with_id = tree.xpath('//*[@id]')

absolute_xpaths = []
for element in elements_with_id:
    # Construct the XPath with @id
    xpath = f'//*[@id="{element.get("id")}"]'
    absolute_xpaths.append(xpath)

for xpath in absolute_xpaths:
    print(xpath)

在 Python 中从给定的 html 中获取所有 xpath 列表的最佳方法是什么？

What is the best way to get a list of all xpaths from given html in Python?

评论

评论