当前位置:网站首页>Complete deep neural network CNN training with tensorflow to complete picture recognition case 2
Complete deep neural network CNN training with tensorflow to complete picture recognition case 2
2022-07-03 13:27:00 【Haibao 7】
To be continued . Previous link :https://blog.csdn.net/dongbao520/article/details/125456950
Convolutional neural networks
• Convolutional neural networks
• Visual cortex 、 Feel the field , Some neurons see the line , Some neurons see the line
Direction , Some neurons have larger receptive fields , Combine the patterns on the bottom
• 1998 year Yann LeCun Et al LeNet-5 framework , Widely used in hands
Written digit recognition , Including full connection layer and sigmoid Activation function , There are also volumes
Accumulation layer and pool layer
Convolutional neural networks (Convolutional Neural Networks, CNN) It is a kind of feedforward neural network with convolution calculation and depth structure (Feedforward Neural Networks), It's deep learning (deep learning) One of the representative algorithms of [1-2] . Convolutional neural network has the characteristics of representation learning (representation learning) Ability , The input information can be classified according to its hierarchical structure (shift-invariant classification), So it's also called “ Translation invariant artificial neural networks (Shift-Invariant Artificial Neural Networks, SIANN
Convolution neural network imitates biological visual perception (visual perception) Mechanism construction , Supervised learning and unsupervised learning , The sharing of convolution kernel parameters in the hidden layer and the sparsity of inter layer connections make the convolution neural network lattice with less computation (grid-like topology) features , For example, pixels and audio for learning 、 It has a stable effect and has no additional feature engineering on the data (feature engineering) Complete principle related requirements can ---->> Reference resources
For receptive field :
For pre trained networks
Reuse TensorFlow Model 
CNN The most important building unit is the convolution layer
• Neurons in the first convolution layer are not connected to every pixel of the input picture ,
Just connect the pixels of their receptive field , And so on , Of the second accretion layer
Each neuron is only connected to a small square God located in the first convolution layer
Jing Yuan
Convolution layer diagram 

Convolution cases :


In steps of 2, Then there are 
Filter Convolution kernel
• Convolution kernels
• Vertical line filter The middle column is 1, The surrounding areas are listed as 0
• Horizontal line filter Intermediate behavior 1, Surrounding behavior 0
• 7*7 matrix

In a feature map , All neurons share the same parameters (
weights bias), Weight sharing
• Different feature maps have different parameters

Convolution training process 
Padding Pattern
VALID
• Do not apply zero padding, It is possible to ignore the right or bottom of the picture , This depends stride Set up
• SAME
• If necessary, add zero padding, In this case , The number of output neurons is equal to the number of input neurons divided by the step size ceil(13/5)=3

Pooling Pooling Handle
The goal is downsampling subsample,shrink, Reduce the calculated load , Memory usage , The number of arguments ( It can also prevent over fitting )• Reducing the size of the input image also allows the neural network to withstand a little image translation , Not affected by location
• Just like convolutional neural networks , Each neuron in the pooling layer is connected to the neuron output in the upper layer , It only corresponds to a small area of receptive field . We have to define size , step ,padding type
• Pooled neurons have no weight value , It just aggregates the input according to the maximum or the average
• 2*2 The pooled core of , In steps of 2, There is no filling , Only the maximum value is passed down

Twice as long and twice as wide , area 4 Times smaller , lose 75% The input value of
• In general , The pooling layer works on each independent input channel , So the depth of output is the same as that of input
CNN framework
• Typical CNN The architecture heap lists some volume layers :
• Usually a convolution layer is followed by ReLU layer , Then there is a pool layer , Then there are other convolutions +ReLU layer , Then another pooling layer , The pictures transmitted through the network are getting smaller and smaller , But it's getting deeper and deeper , For example, more feature maps !
• Finally, the conventional feedforward neural network is added , By some fully connected layers +ReLU layers , Finally, the output layer prediction , For example, one softmax Class probability of layer output prediction
• A common misconception is that the convolution kernel is too large , You can use and 99 Two of the same effect of the nucleus 33 The core of , The advantage is that there will be fewer parameters , Simplify the operation .
To be continued ..
边栏推荐
- Flink SQL knows why (XI): weight removal is not only count distinct, but also powerful duplication
- PowerPoint 教程,如何在 PowerPoint 中將演示文稿另存為視頻?
- 已解决TypeError: Argument ‘parser‘ has incorrect type (expected lxml.etree._BaseParser, got type)
- JS convert pseudo array to array
- SVN添加文件时的错误处理:…\conf\svnserve.conf:12: Option expected
- Convolution emotion analysis task4
- Mysqlbetween implementation selects the data range between two values
- Father and basketball
- PostgreSQL installation
- Smbms project
猜你喜欢

Image component in ETS development mode of openharmony application development

IDEA 全文搜索快捷键Ctr+Shift+F失效问题

MySQL installation, uninstallation, initial password setting and general commands of Linux

物联网毕设 --(STM32f407连接云平台检测数据)

Tutoriel PowerPoint, comment enregistrer une présentation sous forme de vidéo dans Powerpoint?

Multi table query of MySQL - multi table relationship and related exercises

Can newly graduated European college students get an offer from a major Internet company in the United States?

PowerPoint 教程,如何在 PowerPoint 中將演示文稿另存為視頻?

2022-02-14 analysis of the startup and request processing process of the incluxdb cluster Coordinator

February 14, 2022, incluxdb survey - mind map
随机推荐
PowerPoint 教程,如何在 PowerPoint 中將演示文稿另存為視頻?
Typeerror resolved: argument 'parser' has incorrect type (expected lxml.etree.\u baseparser, got type)
父亲和篮球
[Database Principle and Application Tutorial (4th Edition | wechat Edition) Chen Zhibo] [Chapter III exercises]
(first) the most complete way to become God of Flink SQL in history (full text 180000 words, 138 cases, 42 pictures)
PowerPoint tutorial, how to save a presentation as a video in PowerPoint?
JSP and filter
2022-02-14 analysis of the startup and request processing process of the incluxdb cluster Coordinator
Logback 日志框架
Sword finger offer 17 Print from 1 to the maximum n digits
MySQL installation, uninstallation, initial password setting and general commands of Linux
Spark practice 1: build spark operation environment in single node local mode
Flink SQL knows why (16): dlink, a powerful tool for developing enterprises with Flink SQL
User and group command exercises
[Database Principle and Application Tutorial (4th Edition | wechat Edition) Chen Zhibo] [Chapter IV exercises]
Sword finger offer 12 Path in matrix
Internet of things completion -- (stm32f407 connects to cloud platform detection data)
php:  The document cannot be displayed in Chinese
Error handling when adding files to SVN:.... \conf\svnserve conf:12: Option expected
Ubuntu 14.04 下开启PHP错误提示