当前位置:网站首页>爬虫练习题(三)
爬虫练习题(三)
2022-07-06 23:52:00 【InfoQ】
'''
1.分析网页:
https://www.6pian.cn/
https://www.6pian.cn/xq.html
https://www.6pian.cn/xq/1/0.html
https://www.6pian.cn/xq/2/0.html
https://www.6pian.cn/xq/3/0.html
'''
import urllib.request
start = int(input("请输入起始页"))
end = int(input("请输入结束页"))
headers = {
'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/103.0.5060.66 Safari/537.36 Edg/103.0.1264.44'
}
for n in range(start, end + 1):
url = 'https://www.6pian.cn/xq/{}/0.html'.format(n)
print(url)
q = urllib.request.Request(url,headers=headers)
response = urllib.request.urlopen(q)
with open(f'第{n}页.html','w',encoding='utf-8')as f:
f.write(response.read().decode('utf-8'))
边栏推荐
- 不同网段之间实现GDB远程调试功能
- Mybaits multi table query (joint query, nested query)
- Photo selector collectionview
- How digitalization affects workflow automation
- Digital innovation driven guide
- Design, configuration and points for attention of network specified source multicast (SSM) simulation using OPNET
- 导航栏根据路由变换颜色
- Leakage relay jd1-100
- 淘宝店铺发布API接口(新),淘宝oAuth2.0店铺商品API接口,淘宝商品发布API接口,淘宝商品上架API接口,一整套发布上架店铺接口对接分享
- 消息队列:如何确保消息不会丢失
猜你喜欢
[论文阅读] A Multi-branch Hybrid Transformer Network for Corneal Endothelial Cell Segmentation
Common skills and understanding of SQL optimization
K6EL-100漏电继电器
Photo selector collectionview
Initial experience of annotation
Unity keeps the camera behind and above the player
Cve-2021-3156 vulnerability recurrence notes
The year of the tiger is coming. Come and make a wish. I heard that the wish will come true
消息队列:如何确保消息不会丢失
集群、分布式、微服务的区别和介绍
随机推荐
The year of the tiger is coming. Come and make a wish. I heard that the wish will come true
Leakage relay llj-100fs
Mybaits multi table query (joint query, nested query)
Digital innovation driven guide
消息队列:消息积压如何处理?
Tablayout modification of customized tab title does not take effect
Leetcode: maximum number of "balloons"
利用OPNET进行网络单播(一服务器多客户端)仿真的设计、配置及注意点
JVM(二十) -- 性能监控与调优(一) -- 概述
【oracle】简单的日期时间的格式化与排序问题
async / await
Flink SQL realizes reading and writing redis and dynamically generates hset key
Jhok-zbg2 leakage relay
Vector and class copy constructors
淘宝店铺发布API接口(新),淘宝oAuth2.0店铺商品API接口,淘宝商品发布API接口,淘宝商品上架API接口,一整套发布上架店铺接口对接分享
DOM-节点对象+时间节点 综合案例
1. AVL tree: left-right rotation -bite
Sorry, I've learned a lesson
JSP setting header information export to excel
利用OPNET进行网络仿真时网络层协议(以QoS为例)的使用、配置及注意点