Training a deep neural network (CNN) with TensorFlow for image recognition, case 2
2022-07-03 13:27:00 【Haibao 7】
Continued from the previous post: https://blog.csdn.net/dongbao520/article/details/125456950
Convolutional neural networks
• Visual cortex and receptive fields: some neurons respond to lines, some respond to the orientation of lines, and some have larger receptive fields that combine the lower-level patterns
• In 1998, Yann LeCun et al. proposed the LeNet-5 architecture, which is widely used for handwritten digit recognition. It contains fully connected layers and sigmoid activation functions, as well as convolutional layers and pooling layers
Convolutional Neural Networks (CNN) are a class of feedforward neural networks (Feedforward Neural Networks) that involve convolution computations and have a deep structure. They are one of the representative algorithms of deep learning [1-2]. A CNN has representation learning ability and can perform shift-invariant classification of its input according to its hierarchical structure, so it is also called a "Shift-Invariant Artificial Neural Network (SIANN)".
CNNs are modeled on the biological mechanism of visual perception and can be trained with both supervised and unsupervised learning. Because convolution kernel parameters are shared within a hidden layer and the connections between layers are sparse, a CNN can learn grid-like topology features, such as pixels and audio, with relatively little computation. It gives stable results and places no additional feature engineering requirements on the data. For the complete theory, see the reference.
Receptive field:
For pretrained networks: reusing TensorFlow models
The most important building block of a CNN is the convolutional layer
• Neurons in the first convolutional layer are not connected to every pixel of the input image, only to the pixels in their receptive field. In the same way, each neuron in the second convolutional layer is connected only to the neurons located within a small rectangle of the first convolutional layer
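The local connectivity described above can be made concrete with a small sketch. The helper name `receptive_field` and the kernel and stride values are illustrative assumptions, and the sketch assumes no padding offset, so the window of output neuron (0, 0) starts at input position (0, 0).

```python
def receptive_field(i, j, kernel_h, kernel_w, stride_h=1, stride_w=1):
    """Return the rows and columns of the previous layer seen by neuron (i, j).

    Assumes no padding offset: the window of neuron (0, 0) starts at (0, 0).
    """
    rows = range(i * stride_h, i * stride_h + kernel_h)
    cols = range(j * stride_w, j * stride_w + kernel_w)
    return rows, cols


# Neuron at row 1, column 2 with a 3x3 receptive field and a stride of 2
rows, cols = receptive_field(1, 2, kernel_h=3, kernel_w=3, stride_h=2, stride_w=2)
print(list(rows), list(cols))  # [2, 3, 4] [4, 5, 6]
```

With a stride larger than 1, neighbouring neurons look at windows that are further apart, so the layer output is smaller than its input.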
Convolution layer diagram
Convolution example:
With a stride of 2, the result is:
Filters (convolution kernels)
• Vertical line filter: the middle column is 1, the other columns are 0
• Horizontal line filter: the middle row is 1, the other rows are 0
• 7*7 matrix
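A minimal, dependency-free sketch of the two 7*7 filters described above, applied to a toy image. The function `conv2d_valid` and the 9*9 test image are assumptions made for illustration. It computes the sliding sum of products that CNN layers use (stride 1, no padding); in TensorFlow this role is played by `tf.nn.conv2d`.

```python
def conv2d_valid(image, kernel):
    """Slide the kernel over the image (stride 1, no padding), summing products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + a][c + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


SIZE = 7
MID = SIZE // 2
# Vertical line filter: middle column is 1, everything else 0
vertical = [[1 if c == MID else 0 for c in range(SIZE)] for r in range(SIZE)]
# Horizontal line filter: middle row is 1, everything else 0
horizontal = [[1 if r == MID else 0 for c in range(SIZE)] for r in range(SIZE)]

# Toy 9x9 image containing one bright vertical line in column 4
image = [[1 if c == 4 else 0 for c in range(9)] for r in range(9)]

v_response = conv2d_valid(image, vertical)
h_response = conv2d_valid(image, horizontal)
print(v_response)  # [[0, 7, 0], [0, 7, 0], [0, 7, 0]]
print(h_response)  # [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
```

The vertical filter responds strongly (7) only where it is centred on the line, while the horizontal filter gives a weak, uniform response.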
Within a feature map, all neurons share the same parameters (weights and bias). This is called weight sharing
• Different feature maps have different parameters
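To see why weight sharing matters, the parameter counts can be compared with a fully connected layer of the same output size. This is only a back-of-the-envelope sketch: the 28*28 grayscale input, the 3*3 kernel, and the 32 feature maps are assumed values, not taken from the post.

```python
def conv_layer_params(kernel_h, kernel_w, in_channels, n_feature_maps):
    """Each feature map owns one shared kernel plus one shared bias."""
    return n_feature_maps * (kernel_h * kernel_w * in_channels + 1)


def dense_layer_params(n_inputs, n_outputs):
    """Every output neuron owns one weight per input plus one bias."""
    return n_outputs * (n_inputs + 1)


# Assumed setup: 28x28 grayscale input, 32 feature maps, 3x3 kernels
conv_params = conv_layer_params(3, 3, in_channels=1, n_feature_maps=32)

# A dense layer producing the same number of outputs (28 * 28 * 32 neurons)
dense_params = dense_layer_params(n_inputs=28 * 28, n_outputs=28 * 28 * 32)

print(conv_params)   # 320
print(dense_params)  # 19694080
```

The number of convolutional parameters does not depend on the image size, only on the kernel size, the input depth, and the number of feature maps.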
Convolution training process
Padding modes
• VALID
• Does not apply zero padding, so the right or bottom edge of the image may be ignored, depending on the stride setting
• SAME
• Adds zero padding if necessary. In this case the number of output neurons equals the number of input neurons divided by the stride, rounded up, e.g. ceil(13/5)=3
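The output sizes for both padding modes can be worked out with a few lines of arithmetic. The sketch below assumes TensorFlow's usual conventions for VALID and SAME along one dimension. The input width 13 and stride 5 come from the example above, while the kernel width of 6 is an assumed value needed to complete the calculation.

```python
import math


def conv_output_size(n_in, kernel, stride, padding):
    """Number of output neurons along one dimension."""
    if padding == "SAME":
        return math.ceil(n_in / stride)
    if padding == "VALID":
        return math.ceil((n_in - kernel + 1) / stride)
    raise ValueError("padding must be 'SAME' or 'VALID'")


def same_padding(n_in, kernel, stride):
    """Zeros added (before, after) the input so that SAME covers every pixel."""
    n_out = math.ceil(n_in / stride)
    total = max((n_out - 1) * stride + kernel - n_in, 0)
    return total // 2, total - total // 2


# Input width 13, stride 5 (from the text); kernel width 6 is an assumption
print(conv_output_size(13, 6, 5, "SAME"))   # 3
print(conv_output_size(13, 6, 5, "VALID"))  # 2
print(same_padding(13, 6, 5))               # (1, 2)
```

With VALID, the two windows cover input positions 0-5 and 5-10, so the last two pixels on the right are ignored. With SAME, three zeros are added in total so that a third window fits.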
Pooling
• The goal is to downsample (subsample, shrink) the input, which reduces the computational load, the memory usage, and the number of parameters (and can also help prevent overfitting)
• Reducing the size of the input image also lets the neural network tolerate small image shifts, so it is less sensitive to position
• As in convolutional layers, each neuron in a pooling layer is connected to the outputs of the neurons in the previous layer that lie within a small receptive field. We have to define the size, the stride, and the padding type
• Pooling neurons have no weights. They just aggregate their inputs using the maximum or the mean
• With a 2*2 pooling kernel, a stride of 2, and no padding, only the maximum value in each window is passed on. The output is half as tall and half as wide, so its area is 4 times smaller and 75% of the input values are dropped
• In general, a pooling layer works on each input channel independently, so the output depth is the same as the input depth
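The 2*2 max pooling case described above can be sketched without any libraries. The function name `max_pool_2d` and the 4*4 input values are made up for illustration, and the sketch handles a single channel with no padding.

```python
def max_pool_2d(feature_map, size=2, stride=2):
    """Max pooling over one channel, without padding."""
    out_h = (len(feature_map) - size) // stride + 1
    out_w = (len(feature_map[0]) - size) // stride + 1
    return [
        [
            max(feature_map[r * stride + a][c * stride + b]
                for a in range(size) for b in range(size))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


feature_map = [
    [1, 3, 2, 0],
    [4, 2, 1, 1],
    [0, 1, 5, 6],
    [2, 2, 7, 8],
]
pooled = max_pool_2d(feature_map)
print(pooled)  # [[4, 2], [2, 8]]
```

The 16 input values are reduced to 4, so 75% of them are dropped. Pooling has no weights, so the function takes no learned parameters.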
CNN architecture
• A typical CNN architecture stacks several convolutional layers:
• Usually a convolutional layer is followed by a ReLU layer, then a pooling layer, then more convolution+ReLU layers, then another pooling layer. As the image passes through the network it gets smaller and smaller, but also deeper and deeper, i.e. it has more feature maps
• Finally, a regular feedforward neural network is added on top, made of a few fully connected layers + ReLU layers, ending with the output layer that makes the prediction, e.g. a softmax layer that outputs the predicted class probabilities
• A common mistake is to use convolution kernels that are too large. A large kernel such as 9*9 can usually be replaced by a stack of 3*3 kernels with much the same effect (two stacked 3*3 layers cover a 5*5 receptive field, four cover 9*9). The advantage is fewer parameters and less computation.
To be continued..
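The kernel-size trade-off in the last bullet can be checked with simple arithmetic. The sketch below assumes stride-1 layers, ignores biases, and uses 64 input and output channels as a purely illustrative value. Both function names are hypothetical.

```python
def stacked_receptive_field(kernel, n_layers):
    """Receptive field of n stacked stride-1 layers with square kernels."""
    # Each extra layer widens the field by (kernel - 1) pixels
    return 1 + n_layers * (kernel - 1)


def conv_weights(kernel, channels, n_layers=1):
    """Weight count (biases ignored) with channels in == channels out."""
    return n_layers * kernel * kernel * channels * channels


CHANNELS = 64  # assumed depth, for illustration only

print(stacked_receptive_field(3, 2))  # 5
print(stacked_receptive_field(3, 4))  # 9

print(conv_weights(5, CHANNELS))              # 102400
print(conv_weights(3, CHANNELS, n_layers=2))  # 73728
print(conv_weights(9, CHANNELS))              # 331776
print(conv_weights(3, CHANNELS, n_layers=4))  # 147456
```

Under these assumptions, a stack of 3*3 layers reaches the same receptive field as one large kernel with fewer weights, and it also adds a nonlinearity between the layers.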