Superman in the Sunshine, Learning Under the Sun: Xiaolei's Study Room, Shi Xiaolei's lesson-plan site

March 8, 2022, 10:40

Andrew Coyne

2. HTML Text Parsing

Tool: BeautifulSoup

https://www.crummy.com/software/BeautifulSoup/bs4/doc/index.zh.html
Installation

pip install beautifulsoup4

Parser:

pip install lxml
Import

from bs4 import BeautifulSoup
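
The parser installed above is selected through the second argument of the BeautifulSoup constructor. The following is a minimal sketch, assuming you want to prefer lxml but still run on a machine where it is missing; the helper name make_soup and the sample markup are hypothetical and are not part of the original lesson.

```python
from bs4 import BeautifulSoup, FeatureNotFound

# Hypothetical sample markup, used only for illustration
html = "<html><body><p class='title'><a href='/posts/1/'>Hello</a></p></body></html>"


def make_soup(markup):
    """Parse with lxml when it is installed, otherwise use the built-in html.parser."""
    try:
        return BeautifulSoup(markup, "lxml")
    except FeatureNotFound:
        # bs4 raises FeatureNotFound when the requested parser is unavailable
        return BeautifulSoup(markup, "html.parser")


soup = make_soup(html)
print(soup.a.get_text())
```

Both parsers give the same tree for well-formed markup like this; they can differ in how they repair broken HTML, so it helps to name the parser explicitly rather than rely on the default.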

Simple example

import requests
from bs4 import BeautifulSoup

# Fetch the page
url = "http://www.shixiaolei.com/posts/1/"
r = requests.get(url)
r.text  # the raw HTML as a string

# Parse the HTML text into a BeautifulSoup object
soup = BeautifulSoup(r.text, 'lxml')
soup

# CSS selector: every <a> inside an element with class "title"
data = soup.select(".title a")
for d in data:
    print(d.get_text())
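
The same selector can be tried without a network connection. This is a small sketch on an inline HTML string; the markup below is an assumption made for illustration and may not match the structure of the real page. It also shows reading the href attribute and using select_one to get only the first match.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for the fetched page
html = """
<div class="post">
  <h2 class="title"><a href="/posts/1/">First post</a></h2>
  <h2 class="title"><a href="/posts/2/">Second post</a></h2>
  <p class="summary">Not a title</p>
</div>
"""

# html.parser ships with Python, so this runs without lxml
soup = BeautifulSoup(html, "html.parser")

# select() returns a list of every matching tag
links = [(a.get_text(strip=True), a.get("href")) for a in soup.select(".title a")]
for text, href in links:
    print(text, href)

# select_one() returns only the first match, or None when nothing matches
first = soup.select_one(".title a")
missing = soup.select_one(".no-such-class")
```

Because select_one returns None when nothing matches, check the result before calling get_text() on it.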
