当前位置:网站首页>Final part of web crawler: send directional messages to 100000 Netease cloud users
Final part of web crawler: send directional messages to 100000 Netease cloud users
2022-06-26 21:32:00 【Romantic data analysis】
The goal of this article :
In the last article, we got the comment users ID And home page address . This article can conduct some data analysis and market operation based on these data . I learned the method of this article theoretically , You can send advertising messages on any web page , This article has the possibility of being used by bad people , Therefore, charges are set , And this set of crawler tutorials , If you find online classes in Netease cloud class , Tuition fees 1200 yuan . The windfall profits of online classes are still huge .The ultimate goal is achieved :
1、 Through popular singers , Grab songs ID.
2、 Through songs ID, Grab comment users ID.
3、 By commenting on users ID, Send a directed push message .
The last two articles have completed the steps 1、 step 2, This article completes the steps 3.
Conclusion :requests and selenium The difference between :requests No page method to get songs ID, It's quite fast , But you can only get some public web pages without login , If user login and authentication are required ,requests Will not be able to .
selenium Its advantage is that it completely imitates the operation of opening a web page , It's like you hired an assistant to do things for you , Very intuitive , It will not be forbidden to visit . And for interfaces that require user login ( Such as microblog ), use selenium It can easily skip the troublesome part of verification .
In the first part, we use MYSQL Store and crawl the user's home page information , This article will support error redoing , Each time a record is processed, a processing flag bit will be marked Y, Similar to our production system .
step 1: Query the user lD And home page tables
We need to check u
边栏推荐
- Netease Yunxin officially joined the smart hospital branch of China Medical Equipment Association to accelerate the construction of smart hospitals across the country
- [serial] shuotou O & M monitoring system 01 overview of monitoring system
- [leetcode]- linked list-2
- [protobuf] some pits brought by protobuf upgrade
- leetcode刷题:字符串05(剑指 Offer 58 - II. 左旋转字符串)
- 【贝叶斯分类3】半朴素贝叶斯分类器
- How to create an OData service with the graphical modeler on the sap BTP platform
- C language simple login
- Y48. Chapter III kubernetes from introduction to mastery -- pod status and probe (21)
- Leetcode question brushing: String 02 (reverse string II)
猜你喜欢

【贝叶斯分类2】朴素贝叶斯分类器
![[leetcode]- linked list-2](/img/f7/9d4b01285fd6f7fa9f3431985111b0.png)
[leetcode]- linked list-2

Leetcode: String 04 (reverse the words in the string)

The source code that everyone can understand (I) the overall architecture of ahooks

宝藏又小众的覆盖物PBR多通道贴图素材网站分享

Comment installer la base de données MySQL 8.0 sous Windows? (tutoriel graphique)

诗尼曼家居冲刺A股:年营收近12亿 红星美凯龙与居然之家是股东

Leetcode(452)——用最少数量的箭引爆气球

2022年,中轻度游戏出海路在何方?

leetcode刷题:字符串06(实现 strStr())
随机推荐
0 basic C language (2)
Sword finger offer 12 Path in matrix
DAST 黑盒漏洞扫描器 第五篇:漏洞扫描引擎与服务能力
【protobuf 】protobuf 昇級後帶來的一些坑
SAP Commerce Cloud 项目 Spartacus 入门
windows系统下怎么安装mysql8.0数据库?(图文教程)
【连载】说透运维监控系统01-监控系统概述
JWT operation tool class sharing
聊聊我的远程工作体验 | 社区征文
GEE:计算image区域内像素最大最小值
[Shandong University] information sharing for the first and second examinations of postgraduate entrance examination
Gee: calculate the maximum and minimum values of pixels in the image area
Leetcode(763)——划分字母区间
Treasure and niche cover PBR multi-channel mapping material website sharing
不同的子序列问题I
协同过滤进化版本NeuralCF及tensorflow2实现
线性模型LN、单神经网络SNN、深度神经网络DNN与CNN测试对比
12个MySQL慢查询的原因分析
Stop being a giant baby
API管理之利剑 -- Eolink