当前位置:网站首页>Chapter 2 Introduction to key technologies
Chapter 2 Introduction to key technologies
2022-06-27 04:26:00 【H`924】
Catalog
This project is through Python Language to design and implement . The main technologies used in the project are Python Reptiles ,Python Data analysis, etc . The data comes from www.fangjia.com.
2.1 Python Reptiles
Python Reptiles It is a common tool for collecting Internet data , In recent years, with the development of the Internet, it has developed rapidly . Using web crawlers to crawl network data, we must first understand the concept and main classification of the network , System structure of various reptiles 、 How it works , Common strategies , And the main application scenarios , meanwhile , For the sake of copyright and data security , We also need to understand the current application legitimacy of crawlers and the agreements that need to be observed when crawling websites . at present , Most websites allow the data crawled by crawlers to be used for personal use or scientific research . But if the crawled data is used for other purposes , Especially reprint or commercial use , Serious will violate the law or cause civil disputes . The following two kinds of data cannot be crawled , Not for commercial use . Personal privacy data : Such as name 、 Phone number 、 Age 、 Blood type 、 Marital status, etc , Crawling such data will violate the personal information protection law . Data that is explicitly prohibited from being accessed by others : For example, the user has set permission controls such as account and password , Encrypted content . Attention should also be paid to copyright related issues , Copyrighted content signed by the author is not allowed to be reproduced or used for commercial purposes after crawling .
2.2 Python Data analysis
Data analysis refers to the analysis of a large number of collected data with appropriate analytical methods , Extract useful information and form conclusions , The process of studying and summarizing the data in detail . Data mining in broad sense includes data analysis and data mining in narrow sense . In a narrow sense, data analysis refers to data analysis according to the purpose of analysis , Comparative analysis 、 Group analysis 、 Cross analysis and regression analysis , Process and analyze the collected data , Extract valuable information , Play the role of data , The process of obtaining the result of a characteristic statistic . Data mining is from a large number of 、 Not completely 、 Noisy 、 Vague 、 In random practical application data , By applying clustering model 、 Classification model 、 Regression and association rules , The process of tapping potential value . At present, the mainstream data analysis languages are Python、R、Matlab These three . among ,Python Has a rich and powerful library , It's often called glue language , To be able to make various modules in other languages ( In especial c and c++) Easily connected together , It's easier to learn 、 More rigorous programming language .R Language is used for statistical analysis 、 Drawing language and operating environment . It belongs to GNU A freedom of the system 、 free 、 Open source software .Matlab Is used to perform matrix operations 、 Plotting functions and data 、 Implementation algorithm 、 Create user interfaces and programs that connect to other programming languages , It is mainly used in engineering calculation 、 Control design 、 Signal processing and communication 、 The image processing 、 Signal detection 、 Financial modeling, design and analysis .
Python Data analysis mainly includes the following five advantages .
- The grammar is simple and concise
- There are many powerful Libraries
- Powerful
- Not only for research and prototyping , It also applies to building production systems
- Python It's a glue language , Can be easily bonded to components in other languages in a variety of ways .
边栏推荐
- 014 C语言基础:C字符串
- MATLAB | 三个趣的圆相关的数理性质可视化
- 2022-06-26:以下golang代码输出什么?A:true;B:false;C:编译错误。 package main import “fmt“ func main() { type
- Système de collecte des journaux
- Learn crypto from Buu (Zhou Geng)
- 微服务系统设计——分布式缓存服务设计
- Cultural tourism night tour | stimulate tourists' enthusiasm with immersive visual experience
- math_ Number set (number set symbol) and set theory
- 009 basics of C language: C loop
- 008 C语言基础:C判断
猜你喜欢

math_ Number set (number set symbol) and set theory

微服务系统设计——API 网关服务设计

Ledrui ldr6035 usb-c interface device supports rechargeable OTG data transmission scheme.
![[station B up dr_can learning notes] Kalman filter 1](/img/18/ee21d31f6a118e4e4ad466b55361cc.gif)
[station B up dr_can learning notes] Kalman filter 1
![Golang Hello installation environment exception [resolved]](/img/30/bddba695e4c0059102e86de346b58d.png)
Golang Hello installation environment exception [resolved]

Cultural tourism light show breaks the time and space constraints and shows the charm of night tour in the scenic spot

Installation of low code development platform nocobase

Fplan powerplan instance

LDR6028 手机设备一边充电一边OTG传输数据方案

Qchart note 2: add rollover display
随机推荐
微服务系统设计——服务熔断和降级设计
golang hello 安装环境异常【已解决】
清华大学开源软件镜像站网址
016 C语言基础:C语言枚举类型
笔记本电脑没有WiFi选项 解决办法
018 C语言基础:C文件读写
为什么 C# 访问 null 字段会抛异常?
微服务系统设计——微服务监控与系统资源监控设计
022 C语言基础:C内存管理与C命令行参数
Kotlin compose custom compositionlocalprovider compositionlocal
Microservice system design -- service registration, discovery and configuration design
Log collection system
从某种意义来讲,互联网业已成为了一个孵化器,一个母体
Microservice system design -- distributed transaction service design
nignx配置单ip限流
微服务系统设计——消息缓存服务设计
009 C语言基础:C循环
[station B up dr_can learning notes] Kalman filter 1
Building lightweight target detection based on mobilenet-yolov4
缓存综合项目--秒杀架构