当前位置:网站首页>Complete deep neural network CNN training with tensorflow to complete picture recognition case 2
Complete deep neural network CNN training with tensorflow to complete picture recognition case 2
2022-07-03 13:27:00 【Haibao 7】
To be continued . Previous link :https://blog.csdn.net/dongbao520/article/details/125456950
Convolutional neural networks
• Convolutional neural networks
• Visual cortex 、 Feel the field , Some neurons see the line , Some neurons see the line
Direction , Some neurons have larger receptive fields , Combine the patterns on the bottom
• 1998 year Yann LeCun Et al LeNet-5 framework , Widely used in hands
Written digit recognition , Including full connection layer and sigmoid Activation function , There are also volumes
Accumulation layer and pool layer
Convolutional neural networks (Convolutional Neural Networks, CNN) It is a kind of feedforward neural network with convolution calculation and depth structure (Feedforward Neural Networks), It's deep learning (deep learning) One of the representative algorithms of [1-2] . Convolutional neural network has the characteristics of representation learning (representation learning) Ability , The input information can be classified according to its hierarchical structure (shift-invariant classification), So it's also called “ Translation invariant artificial neural networks (Shift-Invariant Artificial Neural Networks, SIANN
Convolution neural network imitates biological visual perception (visual perception) Mechanism construction , Supervised learning and unsupervised learning , The sharing of convolution kernel parameters in the hidden layer and the sparsity of inter layer connections make the convolution neural network lattice with less computation (grid-like topology) features , For example, pixels and audio for learning 、 It has a stable effect and has no additional feature engineering on the data (feature engineering) Complete principle related requirements can ---->> Reference resources
For receptive field :
For pre trained networks
Reuse TensorFlow Model 
CNN The most important building unit is the convolution layer
• Neurons in the first convolution layer are not connected to every pixel of the input picture ,
Just connect the pixels of their receptive field , And so on , Of the second accretion layer
Each neuron is only connected to a small square God located in the first convolution layer
Jing Yuan
Convolution layer diagram 

Convolution cases :


In steps of 2, Then there are 
Filter Convolution kernel
• Convolution kernels
• Vertical line filter The middle column is 1, The surrounding areas are listed as 0
• Horizontal line filter Intermediate behavior 1, Surrounding behavior 0
• 7*7 matrix

In a feature map , All neurons share the same parameters (
weights bias), Weight sharing
• Different feature maps have different parameters

Convolution training process 
Padding Pattern
VALID
• Do not apply zero padding, It is possible to ignore the right or bottom of the picture , This depends stride Set up
• SAME
• If necessary, add zero padding, In this case , The number of output neurons is equal to the number of input neurons divided by the step size ceil(13/5)=3

Pooling Pooling Handle
The goal is downsampling subsample,shrink, Reduce the calculated load , Memory usage , The number of arguments ( It can also prevent over fitting )• Reducing the size of the input image also allows the neural network to withstand a little image translation , Not affected by location
• Just like convolutional neural networks , Each neuron in the pooling layer is connected to the neuron output in the upper layer , It only corresponds to a small area of receptive field . We have to define size , step ,padding type
• Pooled neurons have no weight value , It just aggregates the input according to the maximum or the average
• 2*2 The pooled core of , In steps of 2, There is no filling , Only the maximum value is passed down

Twice as long and twice as wide , area 4 Times smaller , lose 75% The input value of
• In general , The pooling layer works on each independent input channel , So the depth of output is the same as that of input
CNN framework
• Typical CNN The architecture heap lists some volume layers :
• Usually a convolution layer is followed by ReLU layer , Then there is a pool layer , Then there are other convolutions +ReLU layer , Then another pooling layer , The pictures transmitted through the network are getting smaller and smaller , But it's getting deeper and deeper , For example, more feature maps !
• Finally, the conventional feedforward neural network is added , By some fully connected layers +ReLU layers , Finally, the output layer prediction , For example, one softmax Class probability of layer output prediction
• A common misconception is that the convolution kernel is too large , You can use and 99 Two of the same effect of the nucleus 33 The core of , The advantage is that there will be fewer parameters , Simplify the operation .
To be continued ..
边栏推荐
- Oracle memory management
- JSON serialization case summary
- Asp.Net Core1.1版本没了project.json,这样来生成跨平台包
- Elk note 24 -- replace logstash consumption log with gohangout
- elk笔记24--用gohangout替代logstash消费日志
- CVPR 2022 image restoration paper
- Cadre de logback
- 今日睡眠质量记录77分
- Asp. Net core1.1 without project JSON, so as to generate cross platform packages
- STM32 and motor development (from MCU to architecture design)
猜你喜欢

常见的几种最优化方法Matlab原理和深度分析

My creation anniversary: the fifth anniversary

刚毕业的欧洲大学生,就能拿到美国互联网大厂 Offer?

PowerPoint 教程,如何在 PowerPoint 中將演示文稿另存為視頻?

Flick SQL knows why (10): everyone uses accumulate window to calculate cumulative indicators

Flink SQL knows why (16): dlink, a powerful tool for developing enterprises with Flink SQL

The principle of human voice transformer

Typeerror resolved: argument 'parser' has incorrect type (expected lxml.etree.\u baseparser, got type)

Multi table query of MySQL - multi table relationship and related exercises

Kivy tutorial how to automatically load kV files
随机推荐
JS convert pseudo array to array
February 14, 2022, incluxdb survey - mind map
Luogup3694 Bangbang chorus standing in line
编程内功之编程语言众多的原因
Logseq 评测:优点、缺点、评价、学习教程
The shortage of graphics cards finally came to an end: 3070ti for more than 4000 yuan, 2000 yuan cheaper than the original price, and 3090ti
Logback 日志框架
Image component in ETS development mode of openharmony application development
The difference between session and cookie
106. How to improve the readability of SAP ui5 application routing URL
Kotlin - improved decorator mode
[sort] bucket sort
MapReduce实现矩阵乘法–实现代码
2022-02-11 practice of using freetsdb to build an influxdb cluster
Detailed explanation of multithreading
Libuv库 - 设计概述(中文版)
父亲和篮球
Seven habits of highly effective people
JS 将伪数组转换成数组
刚毕业的欧洲大学生,就能拿到美国互联网大厂 Offer?