当前位置:网站首页>Can deep learning solve the parameters of a specific function?
Can deep learning solve the parameters of a specific function?
2022-07-31 00:49:00 【Wanli Pengcheng in a blink of an eye】
Deep learning, as the most popular technology at the moment, can solve many industrial problems, such as sentiment analysis, fault prediction, face recognition, stock trend forecast, weather forecast and so on.These applications demonstrate the powerful capabilities of deep learning, so can deep learning techniques be applied to the parameter solving of specific functions?
Answer first, no.Deep learning technology cannot be applied to the parameter solving problem of a specific function, because although deep learning can fit any function, it cannot give a specific function form.Just like resnet and densenet, although they know the classification problem of the imagenet dataset that they solve very well, they cannot give a specific function form.
1. How to solve the parameters of a specific function
Then the next step is to ask what techniques in the field of deep learning can be used to solve the parameters of a specific function.This takes deep learning back to the level of neural networks. The essence of deep learning is to solve complex problems by increasing the depth of the model.The essence of neural network training and parameter tuning is gradient descent and backpropagation. Through gradient descent and backpropagation, parameters of any form of function can be solved.
2. How to construct a neural network for a specific problem
The thinking about this blog post actually comes from the following link: https://ask.csdn.net/questions/7764283
The subject in the question wants to use the neural network method to solve the parameters of the following function
In response to this question, the blogger's answer is: neural networks canFits any function, but cannot specify the function form of the fit, that is to say, the fully connected neural network constructed by conventional methods cannot solve the parameters of the specified function, because the fully connected neural network is nothing more than matrix multiplication and activation function., matrix multiplication cannot represent cos function and exponential function.Unless you implement a custom layer for these functions (implementing cos layer, exponential layer), you can constrain the parameters learned by the network.
Supplementary Note 1: If the fully connected neural network model does not have an activation function, it can only achieve the fitting of a linear function, and cannot achieve the fitting of a nonlinear function.Because the superposition of multiple fully connected layers is essentially a matrix multiplication, which can be simplified to a fully connected layer.
Supplementary Note 2: For a custom layer, solving the parameters of a specific function constrains the form of the function, which makes the initialization of parameters have an extremely serious impact on the training of the model.Unlike the fully connected neural network, only the parameters need to be initialized to a specific distribution.
Supplementary Note 3: According to Supplementary Note 2, when solving the parameters of a specific function, multiple groups of functions should be initialized for multi-channel training.When training a multi-channel model, the parameters of multiple models can be weighted and aggregated every certain rounds to speed up the convergence.
3. There is a problem with constructing a specific layer
In deep learning frameworks, such as tensorflow, paddle, and pytorch, all support custom layers, and any specified function can be implemented by the superposition of multiple custom layers.One of the problems here is the parameters of the layer, the forward propagation of the layer, and the calculation method of the layer gradient.
The parameters of the layer: This is actually the parameters of the function to be solved.It may be the coefficient of the cos function or the coefficient of the exponential function;
The forward propagation of the layer: here is the specific form of the specified function, such as the cos function form, the exponential function form;
Calculation method of layer gradient: Because the specific form of the function is specified in the forward propagation of the layer, the built-in automatic derivation method of the deep learning framework may not be available, and the calculation method of the specific layer derivative needs to be provided by yourself.
边栏推荐
猜你喜欢

Strict Mode for Databases

xss bypass: prompt(1)
![[In-depth and easy-to-follow FPGA learning 15---------- Timing analysis basics]](/img/a9/4c7a703a36a244394b586bfb42ab6b.png)
[In-depth and easy-to-follow FPGA learning 15---------- Timing analysis basics]

IOT跨平台组件设计方案

The difference between h264 and h265 decoding

binglog log tracking: data backup and backup tracking

BOM系列之Navigator对象

typescript15-(同时指定参数和返回值类型)

C语言力扣第48题之旋转图像。辅助数组

C language force buckles the rotating image of the 48th question.auxiliary array
随机推荐
typescript17 - function optional parameters
ShardingSphere之公共表实战(七)
神经网络(ANN)
Consistency and Consensus of Distributed Systems (1) - Overview
DOM系列之 offset 系列
Jetpack Compose learning (8) - State and remeber
Gabor滤波器学习笔记
Strict Mode for Databases
MySQL database advanced articles
Gabor filter study notes
C语言力扣第48题之旋转图像。辅助数组
从两个易错的笔试题深入理解自增运算符
过滤器(Filter)
typescript9 - common base types
从笔试包装类型的11个常见判断是否相等的例子理解:包装类型、自动装箱与拆箱的原理、装箱拆箱的发生时机、包装类型的常量池技术
DNS resolution process [visit website]
DOM系列之 client 系列
IOT跨平台组件设计方案
WEB Security Basics - - - Vulnerability Scanner
Jmeter parameter transfer method (token transfer, interface association, etc.)