当前位置:网站首页>How to identify fake reptiles?
How to identify fake reptiles?
2022-07-31 21:11:00 【oHuangBing】
When we examine website logs, we often encounter various crawlers.Some are normal crawlers, for example: search engine crawlers (Baidu search engine crawler, Google Search Engine Crawler, Bing Search Engine Crawler, YandexBot, etc.), and some crawlers with various functions, which can be viewed here: list crawlers.
However, not all crawlers on the Internet are beneficial, and some crawlers try to hide themselves, so they will learn some characteristics of real crawlers.There are also fake crawlers, that is, crawlers that fake those search engines, and will crawl the data of your website. Although the User-agent looks the same as the search engine, the IP does not belong to the search engine. At this timeWe need to accurately identify the IP addresses of these fake crawlers.
Through Crawler IP Query Tool, we can easily identify fake crawlers, for example:
34.68.229.128 Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/85.0.4183.121 Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
This is my simplified log record. The front is the IP address, and the back is the User-agent that accesses the crawler. Through the User-agent, we can see that he is a spider of the Google search engine.
By querying, we can see that this is a fake Google spider, the screenshot is as follows:

We only need to enter the IP address of the fake crawler, and we can see some information about the Crawler.In this way, whether it is true or false Li Kui (true and false reptiles) can not escape our eyes.
And if we want to see more fake bots, we can go here: listcrawlers fake bot, sort out the common fake bots on the Internet.
Summary
By introducing what is fake crawler, and how to query this tool by crawler IP, to accurately identify fake reptiles.
边栏推荐
- Book of the Month (202207): The Definitive Guide to Swift Programming
- 架构师04-应用服务间加密设计和实践
- <artifactId>ojdbc8</artifactId>「建议收藏」
- 顺序表的实现
- Shell script quick start to actual combat -02
- linux view redis version command (linux view mysql version number)
- 【AcWing】第 62 场周赛 【2022.07.30】
- STM32 full series development firmware installation guide under Arduino framework
- Financial profitability and solvency indicators
- rj45 to the connector Gigabit (Fast Ethernet interface definition)
猜你喜欢

The old music player WinAmp released version 5.9 RC1: migrated to VS 2019, completely rebuilt, compatible with Win11

【公开课预告】:超分辨率技术在视频画质增强领域的研究与应用

Realization of character makeup

The whole network is on the verge of triggering, and the all-round assistant for content distribution from media people - Rongmeibao
![leetcode: 6135. The longest ring in the graph [inward base ring tree + longest ring board + timestamp]](/img/91/284de3dcbb8d143d85775b314dd41c.png)
leetcode: 6135. The longest ring in the graph [inward base ring tree + longest ring board + timestamp]

老牌音乐播放器 WinAmp 发布 5.9 RC1 版:迁移到 VS 2019 完全重建,兼容 Win11

How can we improve the real yourself, become an excellent architect?

Arduino框架下STM32全系列开发固件安装指南

统计UTF-8字符串中的字符函数

Unity 之 音频类型和编码格式介绍
随机推荐
ThreadLocal
ReentrantLock原理(未完待续)
Redis综述篇:与面试官彻夜长谈Redis缓存、持久化、淘汰机制、哨兵、集群底层原理!...
[PIMF] OpenHarmony Thesis Club - Inventory of the open source Hongmeng tripartite library [3]
Mobile web development 02
linux view redis version command (linux view mysql version number)
A few permanent free network transmission, convenient and simple (Intranet through tutorials)
【AcWing】第 62 场周赛 【2022.07.30】
高通cDSP简单编程例子(实现查询高通cDSP使用率、签名),RK3588 npu使用率查询
How can we improve the real yourself, become an excellent architect?
The principle of ReentrantLock (to be continued)
matplotlib ax bar color Set the color, transparency, label legend of the ax bar
Made with Flutter and Firebase!counter application
利用反射实现一个管理对象信息的简单框架
ojdbc8 "Recommended Collection"财务盈利、偿债能力指标
Basic Grammar Introduction of Carbon Tutorial (Tutorial)
【PIMF】OpenHarmony 啃论文俱乐部—盘点开源鸿蒙三方库【3】
Architect 04 - Application Service Encryption Design and Practice
Chapter VII