一、豆瓣网站信息爬取

最简单的代码

import requests
from bs4 import BeautifulSoup

url = 'https://www.douban.com/'
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/126.0.0.0 Safari/537.36 Edg/126.0.0.0'
}
response = requests.get(url,headers=headers)
print(response.text)

抓取效果:

image-20240719234130043