How to Scrape the Content of a span with a Python Crawler

百变鹏仔, 5 months ago (01-15) #Python

Article tags: crawler

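To scrape span content with a Python crawler, parse the HTML document with the BeautifulSoup library, then locate the span element and its content with a CSS selector or a regular expression.

How a Python crawler scrapes the content of a span

Method:

Parse the HTML document with Python's BeautifulSoup library, then locate the span element and its content using either a CSS selector or a regular expression:

Steps: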

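from bs4 import BeautifulSoup

# Sample HTML document containing the target span
html_doc = """
<html>
  <body>
    <span id="my-span">This is a span.</span>
  </body>
</html>
"""

# Parse the document, then locate the span by id with a CSS selector
soup = BeautifulSoup(html_doc, 'html.parser')
span_element = soup.select_one('#my-span')

# Text content of the span
span_text = span_element.text

# Full HTML of the span. A Tag has no .html attribute (it would look up
# a child <html> tag and return None), so convert the tag with str().
span_html = str(span_element)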
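The BeautifulSoup steps above handle a single span that is known to exist. As a minimal sketch of how they might generalize, the example below collects the text of every span matching a selector and returns a default when nothing matches. The helper names `extract_span_texts` and `extract_first_text`, the `price` class, and the sample markup are assumptions made for illustration; they do not come from the original article.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup; the class name "price" is an assumption
html_doc = """
<html>
  <body>
    <span class="price"> 19.99 </span>
    <span class="price">5.00</span>
    <span id="note">In <b>stock</b></span>
  </body>
</html>
"""


def extract_span_texts(html, selector):
    """Return the stripped text of every element matching the CSS selector."""
    soup = BeautifulSoup(html, "html.parser")
    # The " " separator keeps words apart when the span contains nested tags
    return [tag.get_text(" ", strip=True) for tag in soup.select(selector)]


def extract_first_text(html, selector, default=None):
    """Return the text of the first match, or default if nothing matches."""
    soup = BeautifulSoup(html, "html.parser")
    tag = soup.select_one(selector)  # select_one returns None on no match
    if tag is None:
        return default
    return tag.get_text(" ", strip=True)


prices = extract_span_texts(html_doc, "span.price")
note = extract_first_text(html_doc, "span#note")
missing = extract_first_text(html_doc, "span#absent", default="")
```

Checking for `None` matters because `select_one` returns `None` when the selector matches nothing, and reading `.text` on that result would raise an `AttributeError`.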

import re

# Capture the text between the opening and closing span tags
pattern = r'<span id="my-span">(.+?)</span>'
matches = re.findall(pattern, html_doc)
span_text = matches[0]
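The regular expression above only matches when the opening tag is written exactly as `<span id="my-span">` and the content sits on one line. The sketch below shows one way the pattern could be loosened to tolerate extra attributes and multi-line content, using only the standard library. The function name `span_text_by_id` and the sample markup are hypothetical. It assumes the target span contains no nested span; for nested or malformed markup an HTML parser such as BeautifulSoup is the safer choice.

```python
import html
import re

# Hypothetical sample markup for illustration
html_doc = """
<html>
  <body>
    <span class="label" id="my-span">Tom &amp; Jerry</span>
    <span id="other">
      Second span
    </span>
  </body>
</html>
"""


def span_text_by_id(document, span_id):
    """Return the inner text of the span with the given id, or None if absent."""
    pattern = re.compile(
        # Allow other attributes before and after the id attribute
        r'<span\b[^>]*\bid="' + re.escape(span_id) + r'"[^>]*>(.*?)</span>',
        re.DOTALL | re.IGNORECASE,  # DOTALL lets the content span several lines
    )
    match = pattern.search(document)
    if match is None:
        return None
    # Decode entities such as &amp; and trim surrounding whitespace
    return html.unescape(match.group(1)).strip()


first = span_text_by_id(html_doc, "my-span")
second = span_text_by_id(html_doc, "other")
absent = span_text_by_id(html_doc, "no-such-id")
```

Returning `None` for a missing span avoids the `IndexError` that indexing an empty `re.findall` result would raise.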
