How to view and interpret the robots protocol
2022-06-25 20:13:00 【Cheng Xiaoqi】
The first thing to learn about web crawlers is what must not be crawled. So let's get to know the robots protocol.
Where is the robots protocol?
Append /robots.txt directly to the target site's address and you can see it. Taking CSDN as an example,
visit https://www.csdn.net/robots.txt to get the following:
User-agent: *
Disallow: /scripts
Disallow: /public
Disallow: /css/
Disallow: /images/
Disallow: /content/
Disallow: /ui/
Disallow: /js/
Disallow: /scripts/
Disallow: /article_preview.html*
Disallow: /tag/
Disallow: /?
Disallow: /link/
Sitemap: https://www.csdn.net/sitemap-aggpage-index.xml
Sitemap: https://www.csdn.net/article/sitemap.txt
User-agent: * means that all crawlers should follow the rules below.
Disallow: marks paths that crawlers are not allowed to fetch.
Correspondingly, there is also Allow, which marks paths that may be fetched.
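The rules above can also be checked programmatically. Below is a minimal sketch using Python's standard urllib.robotparser, assuming a subset of the CSDN rules shown earlier is pasted in as text (the wildcard and /? rules are left out to keep the example simple). The crawler name ExampleBot and the tested page URLs are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# A subset of the rules listed above, pasted in as plain text.
# (Assumption: rules are copied by hand instead of being downloaded.)
ROBOTS_TXT = """\
User-agent: *
Disallow: /scripts
Disallow: /css/
Disallow: /tag/
Disallow: /link/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a made-up crawler name; it falls under "User-agent: *".
AGENT = "ExampleBot"

# Hypothetical page URLs, used only to exercise the rules.
blocked = parser.can_fetch(AGENT, "https://www.csdn.net/tag/python")
allowed = parser.can_fetch(AGENT, "https://www.csdn.net/nav/python")

print("tag page allowed:", blocked)
print("nav page allowed:", allowed)
```

A path is refused when it starts with one of the Disallow prefixes; anything that matches no rule is allowed by default.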
Other websites may have all kinds of interesting robots rules of their own. Go and have a look!
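To look up another site's rules, the robots.txt address can be derived from any page URL on that site. This is a small sketch of that step; the helper name robots_url and the sample page URL are assumptions made for illustration, not part of any library.

```python
from urllib.parse import urlsplit, urlunsplit


def robots_url(page_url: str) -> str:
    """Return the robots.txt address for the site that hosts page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host, replace the path, drop query and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


# Hypothetical page URL on the site used in this article.
example = robots_url("https://www.csdn.net/tag/python?page=2")
print(example)
```

The resulting address can then be opened in a browser, exactly as done for CSDN above.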