当前位置:网站首页>Deep learning, thinking from one dimensional input to multi-dimensional feature input
Deep learning, thinking from one dimensional input to multi-dimensional feature input
2022-07-03 05:48:00 【code bean】
From one dimensional input to multidimensional features
When looking at one dimension alone , I haven't felt anything yet .
But when I see this picture , My first reaction was , The original meaning of activation function is !


( But here is another point to note : Here is only the input dimension increased , Output or 0 and 1, So this is still a problem of two categories , Only the input characteristics become multidimensional , When it comes to multi classification, we should introduce softmax 了 )
At first I was thinking , Why should we add the eight characteristics of a piece of data into a linear equation Give again “ Sigma ” function ?
Then think of , Because each feature contributes differently to the whole , So there are different wb, Yes wb Blessing , So adding eigenvalues is not incomprehensible . then Linear systems The result value of “ Sigma ” function , In this way, the system has nonlinearity , This is also the purpose of the activation function —— It can fit more complex decision boundaries .
Then look at this picture :

Two implications of this picture
1 All forms of multiplication and accumulation , In fact, it can be converted into Multiplication of vectors , In turn to , Why should vector multiplication be designed like this , It may be to simplify the writing of accumulation .
2 After being written as a vector, you can use many vector properties . such as , It should have been x*w, Now I want to become w*x, But matrices have no commutative law , But according to the characteristics of transpose , We can xw All transposed , So they can change positions .
But there are still problems , there b How did you get it ? From the formula, it seems to appear out of thin air !
So I feel that there is a little problem with this way of writing , The more common way to write it should be , Add a dimension :

That is, first in X Add a... In front of the 1, As the base of the offset term ,( here X From n The dimensional vector becomes n+1 Dimension vector , Turn into [1, x1,x2…] ), then , Let each classifier train its own bias term weight , So the weight of each classifier becomes n+1 dimension , namely [w0,w1,…], among ,w0 Is the weight of the offset term , therefore 1*w0 Is the bias of this classifier / Intercept . such , Just let the intercept b This seems to be related to the slope W Different parameters , It's all unified to the next framework , Make the model constantly adjust parameters in the process of training w0, So as to achieve adjustment b Purpose .
————————————————
Copyright notice : This paper is about CSDN Blogger 「 Almost Human V」 The original article of , follow CC 4.0 BY-SA Copyright agreement , For reprint, please attach the original source link and this statement .
Link to the original text :https://blog.csdn.net/Uwr44UOuQcNsUQb60zk2/article/details/81074408
But I still have a problem , If in ,X Add a... In front of the 1 For the base , But adjust w0 When , The base is 1, That's equivalent to every linear function b It's still the same ? Then if it is here 1 Change to the previous linear function b Is it more reasonable ?
I'm not sure here ( Follow up in understanding .)
In a word, this offset term b The role of is Build in space No Across the origin A straight line / Plane / hyperplane . In this way, we can better classify ( Better build decision boundaries )( Digression : But in Euclidean space vectors, the vectors we discuss all cross the origin )


Solved the case
The lines here are not the origin , But vectors in Euclidean space all cross the origin , How do these vectors come from , I don't know if you have the same question as me .
This is before me , A question raised :
After communicating with the teacher , My conclusion is that :
That can be understood from two aspects :
The first one is :
A one-dimensional Euclidean space X Store all x The possibility of , A one-dimensional Euclidean space Y Store all y The possibility of .
And in two Euclidean spaces xy There is a one-to-one correspondence , With this relationship ,
Each pair xy It can also form a new two-dimensional Euclidean space XY. If there is another one-dimensional Euclidean space B,
Store all intercept b, And this B and XY There is also a one-to-one correspondence , According to these relationships , We can describe the straight line of intercept .
such as XY There's an element in (x1, y1) B There's an element in b1, Through these three pieces of information, we can describe a line segment with intercept .
And this B It can also be used as a feature item , Put in X Matrix , A new column .The second kind :
Regard Euclidean space as a set of points , stay X and W Before expansion ,XY The space formed by all the points in is actually a straight line passing through the origin .
stay X and W After expanding one dimension ,XY The space formed by all points in is an intercept ( But the origin ) The straight line of .
( you 're right , I was really going to X Y W These spaces are confused , The teacher woke me up here !!!)summary :
Although the vectors in Euclidean space are all vectors passing through the origin , But if you take these vectors as a point ( Ignore origin ), These points constitute
A space . This space might be N A line in dimensional space , One face , Or a person This is the concept of subspace .
That's it , I hope I can help people who are as confused as me .
Reference material :
《PyTorch Deep learning practice 》 Complete the collection _ Bili, Bili _bilibili
Bias term in neural network b What is it ?_ Almost Human V The blog of -CSDN Blog
边栏推荐
- [teacher Zhao Yuqiang] RDB persistence of redis
- How to set up altaro offsite server for replication
- redis 无法远程连接问题。
- Sorry, this user does not exist!
- [untitled]
- 最大似然估计,散度,交叉熵
- [explain in depth the creation and destruction of function stack frames] | detailed analysis + graphic analysis
- Use telnet to check whether the port corresponding to the IP is open
- Analysis of the example of network subnet division in secondary vocational school
- @Solutions to null pointer error caused by Autowired
猜你喜欢

伯努利分布,二项分布和泊松分布以及最大似然之间的关系(未完成)
![[teacher Zhao Yuqiang] RDB persistence of redis](/img/cc/5509b62756dddc6e5d4facbc6a7c5f.jpg)
[teacher Zhao Yuqiang] RDB persistence of redis

Method of finding prime number

最大似然估计,散度,交叉熵

Life is a process of continuous learning

Strategy pattern: encapsulate changes and respond flexibly to changes in requirements

PHP notes are super detailed!!!
![[advanced pointer (1)] | detailed explanation of character pointer, pointer array, array pointer](/img/9e/a4558e8e53c9655cbc1a38e8c0536e.jpg)
[advanced pointer (1)] | detailed explanation of character pointer, pointer array, array pointer

为什么网站打开速度慢?

今天很多 CTO 都是被幹掉的,因為他沒有成就業務
随机推荐
[untitled]
Btrfs and ext4 - features, strengths and weaknesses
[teacher Zhao Yuqiang] MySQL high availability architecture: MHA
NG Textarea-auto-resize
2022.DAY592
Together, Shangshui Shuo series] day 9
redis 无法远程连接问题。
[together Shangshui Shuo series] day 7 content +day8
Simpleitk learning notes
The server data is all gone! Thinking caused by a RAID5 crash
pytorch 搭建神经网络最简版
Mapbox tasting value cloud animation
2022.6.30DAY591
[teacher Zhao Yuqiang] index in mongodb (Part 2)
Source insight automatic installation and licensing
Redis encountered noauth authentication required
Linux登录MySQL出现ERROR 1045 (28000): Access denied for user ‘root‘@‘localhost‘ (using password: YES)
The request database reported an error: "could not extract resultset; SQL [n/a]; needed exception is org.hibernate.exception.sqlgram"
1. 两数之和
Map的扩容机制
