当前位置:网站首页>ML16 neural network (2)
ML16 neural network (2)
2022-07-29 06:07:00 【19-year-old flower girl】
neural network
The overall architecture
The middle phase is equivalent to the weight , The line in the first column , The first three inputs , The last four outputs , That's the weight W by 34 Matrix , Line in the second column , Input is 4, The output is 4, therefore W The weight matrix is 44 Of , The third column is the same . The result of hidden layer one is xw1, And then after the second floor 【xw1】*w2, Through the third layer, the output result is obtained .
however , Why should there be multiple layers W Weight , Can't one ?
The following explains why there should be multiple layers W The weight , Because there is only one W So it's a linear equation , In that case, classification can only be divided into linear , It is not conducive to the accuracy of classification results . Add multiple weights W after , Become a nonlinear equation ,max Is the activation function , Activation functions can be added to each layer .
Activation function
sigmoid function
At first, neural network is to take sigmoid When activating function , But because when x Greater than 10 Or less than -10 When , Values are very small , The gradient vanishes , When the number of layers is very deep, it will lead to no update ( because W Is too small ), It doesn't work much now .
ReLu function
Replaced the sigmoid function .
Demo example
Neural network demonstration example :http://cs.stanford.edu/people/karpathy/convnetjs/demo/classify2d.html
Change the number of neurons , Will make the classification effect change , The more classification, the better . But it will cause over fitting .
Solve over fitting
λ=0.1 Time has strong generalization ability . stay λ=0.001 when , Those protruding red pointed parts , Although the red dots are well divided , However, the probability of green in this position is relatively high , When a green dot appears in this place, it is easy to judge as a red dot , This is over fitting .
The whole process of neural network .
Data preprocessing
Data processing
chart 2 Make the mean 0; chart 3 Reduce variance .
Weight initialization
Be careful :w Cannot initialize with mean or zero .B The initialization of can be 0.
drop-out
Full connection , It is easy to cause over fitting .
Use drop-out Method , Choose no neurons at random each time .
边栏推荐
- [ml] PMML of machine learning model -- Overview
- Are you sure you know the interaction problem of activity?
- 【Transformer】AdaViT: Adaptive Vision Transformers for Efficient Image Recognition
- 四、One-hot和损失函数的应用
- 二、如何保存MNIST数据集中train和test的图片?
- [semantic segmentation] setr_ Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformer
- anaconda中移除旧环境、增加新环境、查看环境、安装库、清理缓存等操作命令
- 神经网络相关知识回顾(PyTorch篇)
- Lock lock of concurrent programming learning notes and its implementation basic usage of reentrantlock, reentrantreadwritelock and stampedlock
- [convolution kernel design] scaling up your kernels to 31x31: revising large kernel design in CNN
猜你喜欢
Flutter正在被悄悄放弃?浅析Flutter的未来
虚假新闻检测论文阅读(一):Fake News Detection using Semi-Supervised Graph Convolutional Network
【Transformer】AdaViT: Adaptive Tokens for Efficient Vision Transformer
clion+opencv+aruco+cmake配置
Is flutter being quietly abandoned? On the future of flutter
ML15-神经网络(1)
tensorboard使用
迁移学习——Transfer Joint Matching for Unsupervised Domain Adaptation
Flink connector Oracle CDC synchronizes data to MySQL in real time (oracle19c)
MarkDown简明语法手册
随机推荐
一、迁移学习与fine-tuning有什么区别?
四、One-hot和损失函数的应用
迁移学习——Low-Rank Transfer Subspace Learning
【Transformer】SOFT: Softmax-free Transformer with Linear Complexity
一、多个txt文件合并成1个txt文件
迁移学习——Robust Visual Domain Adaptation with Low-Rank Reconstruction
[semantic segmentation] overview of semantic segmentation
备份谷歌或其他浏览器插件
[semantic segmentation] Introduction to mapillary dataset
Spring, summer, autumn and winter with Miss Zhang (5)
简单聊聊 PendingIntent 与 Intent 的区别
【Attention】Visual Attention Network
二、深度学习数据增强方法汇总
MarkDown简明语法手册
神经网络相关知识回顾(PyTorch篇)
A preliminary study on fastjason's autotype
[target detection] KL loss: bounding box progression with uncertainty for accurate object detection
【Clustrmaps】访客统计
Set automatic build in idea - change the code, and refresh the page without restarting the project
Interesting talk about performance optimization thread pool: is the more threads open, the better?