当前位置:网站首页>Complete deep neural network CNN training with tensorflow to complete picture recognition case 2
Complete deep neural network CNN training with tensorflow to complete picture recognition case 2
2022-07-03 13:27:00 【Haibao 7】
To be continued . Previous link :https://blog.csdn.net/dongbao520/article/details/125456950
Convolutional neural networks
• Convolutional neural networks
• Visual cortex 、 Feel the field , Some neurons see the line , Some neurons see the line
Direction , Some neurons have larger receptive fields , Combine the patterns on the bottom
• 1998 year Yann LeCun Et al LeNet-5 framework , Widely used in hands
Written digit recognition , Including full connection layer and sigmoid Activation function , There are also volumes
Accumulation layer and pool layer
Convolutional neural networks (Convolutional Neural Networks, CNN) It is a kind of feedforward neural network with convolution calculation and depth structure (Feedforward Neural Networks), It's deep learning (deep learning) One of the representative algorithms of [1-2] . Convolutional neural network has the characteristics of representation learning (representation learning) Ability , The input information can be classified according to its hierarchical structure (shift-invariant classification), So it's also called “ Translation invariant artificial neural networks (Shift-Invariant Artificial Neural Networks, SIANN
Convolution neural network imitates biological visual perception (visual perception) Mechanism construction , Supervised learning and unsupervised learning , The sharing of convolution kernel parameters in the hidden layer and the sparsity of inter layer connections make the convolution neural network lattice with less computation (grid-like topology) features , For example, pixels and audio for learning 、 It has a stable effect and has no additional feature engineering on the data (feature engineering) Complete principle related requirements can ---->> Reference resources
For receptive field :
For pre trained networks Reuse TensorFlow Model
CNN The most important building unit is the convolution layer
• Neurons in the first convolution layer are not connected to every pixel of the input picture ,
Just connect the pixels of their receptive field , And so on , Of the second accretion layer
Each neuron is only connected to a small square God located in the first convolution layer
Jing Yuan
Convolution layer diagram
Convolution cases :
In steps of 2, Then there are
Filter Convolution kernel
• Convolution kernels
• Vertical line filter The middle column is 1, The surrounding areas are listed as 0
• Horizontal line filter Intermediate behavior 1, Surrounding behavior 0
• 7*7 matrix
In a feature map , All neurons share the same parameters (
weights bias), Weight sharing
• Different feature maps have different parameters
Convolution training process
Padding Pattern
VALID
• Do not apply zero padding, It is possible to ignore the right or bottom of the picture , This depends stride Set up
• SAME
• If necessary, add zero padding, In this case , The number of output neurons is equal to the number of input neurons divided by the step size ceil(13/5)=3
Pooling Pooling Handle
The goal is downsampling subsample,shrink, Reduce the calculated load , Memory usage , The number of arguments ( It can also prevent over fitting )• Reducing the size of the input image also allows the neural network to withstand a little image translation , Not affected by location
• Just like convolutional neural networks , Each neuron in the pooling layer is connected to the neuron output in the upper layer , It only corresponds to a small area of receptive field . We have to define size , step ,padding type
• Pooled neurons have no weight value , It just aggregates the input according to the maximum or the average
• 2*2 The pooled core of , In steps of 2, There is no filling , Only the maximum value is passed down
Twice as long and twice as wide , area 4 Times smaller , lose 75% The input value of
• In general , The pooling layer works on each independent input channel , So the depth of output is the same as that of input
CNN framework
• Typical CNN The architecture heap lists some volume layers :
• Usually a convolution layer is followed by ReLU layer , Then there is a pool layer , Then there are other convolutions +ReLU layer , Then another pooling layer , The pictures transmitted through the network are getting smaller and smaller , But it's getting deeper and deeper , For example, more feature maps !
• Finally, the conventional feedforward neural network is added , By some fully connected layers +ReLU layers , Finally, the output layer prediction , For example, one softmax Class probability of layer output prediction
• A common misconception is that the convolution kernel is too large , You can use and 99 Two of the same effect of the nucleus 33 The core of , The advantage is that there will be fewer parameters , Simplify the operation . To be continued ..
边栏推荐
- Flink SQL knows why (XIV): the way to optimize the performance of dimension table join (Part 1) with source code
- Smbms project
- 已解决(机器学习中查看数据信息报错)AttributeError: target_names
- Sword finger offer 12 Path in matrix
- 开始报名丨CCF C³[email protected]奇安信:透视俄乌网络战 —— 网络空间基础设施面临的安全对抗与制裁博弈...
- When we are doing flow batch integration, what are we doing?
- [Database Principle and Application Tutorial (4th Edition | wechat Edition) Chen Zhibo] [Chapter 6 exercises]
- The shortage of graphics cards finally came to an end: 3070ti for more than 4000 yuan, 2000 yuan cheaper than the original price, and 3090ti
- Flick SQL knows why (10): everyone uses accumulate window to calculate cumulative indicators
- Sitescms v3.0.2 release, upgrade jfinal and other dependencies
猜你喜欢
This math book, which has been written by senior ml researchers for 7 years, is available in free electronic version
这本数学书AI圈都在转,资深ML研究员历时7年之作,免费电子版可看
双链笔记 RemNote 综合评测:快速输入、PDF 阅读、间隔重复/记忆
Resolved (error in viewing data information in machine learning) attributeerror: target_ names
Kivy教程之 盒子布局 BoxLayout将子项排列在垂直或水平框中(教程含源码)
Idea full text search shortcut ctr+shift+f failure problem
MySQL_ JDBC
Internet of things completion -- (stm32f407 connects to cloud platform detection data)
Logseq 评测:优点、缺点、评价、学习教程
CVPR 2022 | 美团技术团队精选6篇优秀论文解读
随机推荐
2022-02-10 introduction to the design of incluxdb storage engine TSM
已解决TypeError: Argument ‘parser‘ has incorrect type (expected lxml.etree._BaseParser, got type)
Father and basketball
The network card fails to start after the cold migration of the server hard disk
刚毕业的欧洲大学生,就能拿到美国互联网大厂 Offer?
2022-02-14 analysis of the startup and request processing process of the incluxdb cluster Coordinator
Multi table query of MySQL - multi table relationship and related exercises
Sword finger offer 15 Number of 1 in binary
Elk note 24 -- replace logstash consumption log with gohangout
Flink SQL knows why (13): is it difficult to join streams? (next)
CVPR 2022 | 美团技术团队精选6篇优秀论文解读
服务器硬盘冷迁移后网卡无法启动问题
Annotation and reflection
The difference between stratifiedkfold (classification) and kfold (regression)
February 14, 2022, incluxdb survey - mind map
JSON serialization case summary
Asp. Net core1.1 without project JSON, so as to generate cross platform packages
Fabric.js 更换图片的3种方法(包括更换分组内的图片,以及存在缓存的情况)
Convolution emotion analysis task4
JS convert pseudo array to array