当前位置:网站首页>咱就是说 随便整几千个表情包为我所用一下
咱就是说 随便整几千个表情包为我所用一下
2022-07-01 02:52:00 【小熊猫爱恰饭】
前言
在如今这个人人配一台手机得时代,很多人聊天都是通过网上,但是,怎么样快速缓解尴尬进入快乐而愉快得聊天呢~
刚认识的朋友丢几个表情包出去分分钟拉进关系,女朋友生闷气了整两个表情包开心一下,也可以化解尴尬,没时间打字整两张表情包,礼貌而不失尴尬。不知道回啥时,那更是丢张表情包分分钟钟解决
现在年轻人聊天,不带点表情包都不好意思说自己是年轻人, 表情包已然成为人与人聊天中不可缺少的部分。

准备工作 (https://jq.qq.com/?_wv=1027&k=Ap5XvyNN)
准备工作很重要,先知道我们要干啥,用什么来做,怎么做,再去一步步实时,稳扎稳打。
开发环境配置
- Python 3.6
- Pycharm
如果有没安装得小伙伴可以左侧主页领取或者私信我领取安装教程~
模块安装配置
- requests
- parsel
- re
第三方模块,我们需要安装: (https://jq.qq.com/?_wv=1027&k=Ap5XvyNN)
win + R 输入cmd 输入安装命令 pip install 模块名 如果出现爆红 可能是因为 网络连接超时 切换国内镜像源
在pycharm中点击Terminal(终端) 输入安装命令
如何配置pycharm里面的python解释器?
选择file(文件) >>> setting(设置) >>> Project(项目) >>> python interpreter(python解释器)
点击齿轮, 选择add
添加python安装路径
pycharm如何安装插件?
- 选择file(文件) >>> setting(设置) >>> Plugins(插件)
- 点击 Marketplace 输入想要安装的插件名字 比如:翻译插件 输入 translation / 汉化插件 输入 Chinese
- 选择相应的插件点击 install(安装) 即可
- 安装成功之后 是会弹出 重启pycharm的选项 点击确定, 重启即可生效

代码
目标:fabiaoqing
地址前面后面大家自己补全一下,包括后面代码里的,这应该没有不会的吧。
导入模块
import requests
import parsel
python学习交流群:660193417 ###
import re
import time
请求网址
url = f'fabiaoqing/biaoqing/lists/page/{page}.html'
请求头
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/84.0.4147.105 Safari/537.36'
}
返回网页源代码 (https://jq.qq.com/?_wv=1027&k=Ap5XvyNN)
response = requests.get(url=url, headers=headers)
获取数据
# 第一次提取 提取所有的div标签内容
divs = selector.css('#container div.tagbqppdiv') # css 根据标签提取内容
# 通过标签内容提取他的图片url地址
img_url = div.css('img::attr(data-original)').get()
# 提取标题
title = div.css('img::attr(title)').get()
# 获取图片的后缀名
name = img_url.split('.')[-1]
保存数据
new_title = change_title(title)
对表情包图片发送请求 获取它二进制数据
img_content = requests.get(url=img_url, headers=headers).content
保存数据
def save(title, img_url, name):
img_content = get_response(img_url).content
try:
with open('img\\' + title + '.' + name, mode='wb') as f:
# 写入图片二进制数据
f.write(img_content)
print('正在保存:', title)
except:
pass
# 替换标题中的特殊字符python学习交流群:660193417 ###
# 因为文件命名不明还有特殊字符,所以我们需要通过正则表达式替换掉特殊字符。
def change_title(title):
new_title = re.sub(mode, "_", title)
return new_title
记录时间
time_2 = time.time()
use_time = int(time_2) - int(time_1)
print(f'总共耗时:{use_time}秒')
多线程 (https://jq.qq.com/?_wv=1027&k=Ap5XvyNN)
import requests
import parsel
import re
import time
import concurrent.futures
def change_title(title):
new_title = re.sub(mode, "_", title)
return new_title
def get_response(html_url):
headers = {
'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/84.0.4147.105 Safari/537.36'
}
repsonse = requests.get(url=html_url, headers=headers)
return repsonse
def save(title, img_url, name):
img_content = get_response(img_url).content
try:
with open('img\\' + title + '.' + name, mode='wb') as f:
f.write(img_content)
print('正在保存:', title)
except:
pass
def main(html_url):
html_data = get_response(html_url).text
selector = parsel.Selector(html_data)
divs = selector.css('#container div.tagbqppdiv')
for div in divs:
img_url = div.css('img::attr(data-original)').get()
title = div.css('img::attr(title)').get()
name = img_url.split('.')[-1]
new_title = change_title(title)
save(new_title, img_url, name)
if __name__ == '__main__':
time_1 = time.time()
exe = concurrent.futures.ThreadPoolExecutor(max_workers=10)
for page in range(1, 201):
url = f'fabiaoqing/biaoqing/lists/page/{page}.html'
exe.submit(main, url)
exe.shutdown()
time_2 = time.time()
use_time = int(time_2) - int(time_1)
print(f'总共耗时:{use_time}秒')


我是小熊猫,咱下篇文章见啦(*◡‿◡)

边栏推荐
- 通信协议——分类及其特征介绍
- Detailed data governance knowledge system
- Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and
- 联想X86服务器重启管理控制器(XClarity Controller)或TSM的方法
- Mouse over effect VI
- Restcloud ETl数据通过时间戳实现增量数据同步
- 查看 jvm 参数
- 记一次服务部署失败问题排查
- import tensorflow.contrib.slim as slim报错
- 鼠标悬停效果七
猜你喜欢

旷世轻量化网络ShuffulNetV2学习笔记

The operation efficiency of the park is improved, and the application platform management of applet container technology is accelerated

Pulsar的Proxy支持和SNI路由

基于Pytorch完整的训练一个神经网络并进行验证

Restcloud ETL WebService data synchronization to local

【EXSI】主机间传输文件

js中的图片预加载

Pulsar Geo Replication/灾备/地域复制
![[applet project development -- JD mall] uni app commodity classification page (first)](/img/6c/5b92fc1f18d58e0fdf6f1896188fcd.png)
[applet project development -- JD mall] uni app commodity classification page (first)

Detailed explanation of pointer array and array pointer (comprehensive knowledge points)
随机推荐
PHP batch Excel to word
Lavaweb [first understanding the solution of subsequent problems]
Const and the secret of pointers
Analysis and solution of anr problems
Mouse over effect IV
Sampling Area Lights
鼠标悬停效果六
LeetCode_ Stack_ Difficulties_ 227. basic calculator (excluding multiplication and division)
Evaluation of the entry-level models of 5 mainstream smart speakers: apple, Xiaomi, Huawei, tmall, Xiaodu, who is better?
Prototype and prototype chain in JS
If I am in Beijing, where is a better place to open an account? In addition, is it safe to open a mobile account?
PCB defect detection based on OpenCV and image subtraction
Gartner研究:在中国,混合云的采用已成为主流趋势
大橙子疯博客搬家通知
鼠标悬停效果三
[small program project development -- Jingdong Mall] the home page commodity floor of uni app
【微信小程序開發】樣式匯總
Catch 222222
An article explaining the publisher subscriber model and the observer model
js中的图片预加载