当前位置:网站首页>What does the inner structure of the neural network "alchemy furnace" look like? An interpretation of the thesis by the doctor of Oxford University
What does the inner structure of the neural network "alchemy furnace" look like? An interpretation of the thesis by the doctor of Oxford University
2022-06-26 16:27:00 【Xiaobai learns vision】
Click on the above “ Xiaobai studies vision ”, Optional plus " Star standard " or “ Roof placement ”
Heavy dry goods , First time delivery Neural networks are like “ The alchemy furnace ” equally , Feed a lot of data , Maybe you can get a magical effect .

“ Alchemy ” After success , Neural networks can also predict data that we have not seen before ~
However , In this case , The neural network actually becomes “ black box ”—— It has certain functions , But I can't see how it works .
If we only do simple image classification , Actually, it's OK ; But if used in medicine , Predict the disease , So the neural network “ Judge ” You can't believe it .
If you can understand how it works , That's better. .
For this reason , A doctoral student from Oxford University Oana-Maria Camburu I wrote my graduation thesis 《 Explain the neural network (Explaining Deep Neural Networks)》.
In this paper , She took these “ black box ” Open one by one , The principle of neural network is explained in detail .
Why open the neural network “ black box ”?
in fact , The reason why neural networks work , The most intuitive reason is , it It consists of a large number of nonlinear functions .

These nonlinear functions , So that the network can learn various abstract level features in the original data .
However , It is precisely because of these nonlinear functions in neural networks , Makes it difficult for humans to understand , How they work .
This leads to the application of neural networks in disease prediction 、 Line of credit 、 In the direction of criminal law “ Not very popular ”.
Doctors and legal researchers tend to prefer interpretable models , For example, linear regression 、 Decision tree , Because neural networks do have problems in disease prediction :
People use neural networks to predict the development of pneumonia , One of the patient characteristics is History of asthma .

Neural networks are trained to predict , Patients with a history of asthma are less likely to die of pneumonia .
But the result is just the opposite , Asthma itself can make pneumonia worse .
The reason why the data show that asthma patients die less of pneumonia , Often because asthma can be detected early , So patients with pneumonia can be treated as soon as possible .
If this neural network is applied in practice , It will bring very dangerous results .
Besides , Even neural networks , It will also affect the gender of men and women stereotypes 、 Create racial prejudice .

for example , Investigation shows , Some corpora and models , When predicting recidivism , Will be more “ A preference for ” men .
Except for wrong predictions and race 、 Beyond sexism , Neural networks are still fragile .
Whether it's Make small changes to the image to deceive the classification algorithm 、 Still use speech recognition to hide NLP Model , The neural network is “ Explosion ray ” There are also many cases .
In order to apply neural network in more directions , In order to let us learn its principle better , The author explains the neural network from two aspects .
2 There are two ways to explain neural networks
“ Explain later ”
The first method , It is called feature-based interpretation , Also called “ Explain later ”—— Because this way , After the training of neural network , To explain its input characteristics .
This method is aimed at the words of the text (token)、 Or super pixels for images (super pixels), Conduct “ After the event ” explain .

At present, this method is widely used , Not prone to interpretation bias , But the authenticity of the interpretation method needs to be verified .
The fundamental principle here , It is the explanation given by studying the external explanation method 、 And the natural language interpretation generated by the model itself , Is there a correlation , And what is the relevance .
In the paper , The author introduces a new verification method , To judge the authenticity of the interpretation method .
Let the neural network explain itself
that , If the neural network can be trained at the same time 、 On one side “ Explain yourself ” Well ?
This is the second method mentioned in the paper , That is, a module for generating prediction interpretation is embedded in the model , Explain the predicted results .

As for whether the neural network is correct in explaining itself , It also requires human judgment .
In this , The author also introduces a judgment method , Judge the interpretation generated by the model itself , Thus, the result of neural network interpretation .

The detailed structure of neural network 、 Explain the method specifically to interested partners , You can check the address of the paper below ~
The authors introduce

Oana-Maria Camburu, From Romania , He is currently a doctoral student at Oxford University , Major in machine learning 、 Artificial intelligence and other directions .
In high school ,Oana-Maria Camburu Once obtained IMO( International Mathematical Olympiad competition ) Silver medal . She used to work in Mapu 、 Google internship , During my blog reading , The paper was ACL、EMNLP、IJCNLP It will be included in the summit .
Address of thesis :
https://arxiv.org/abs/2010.01496
The good news !
Xiaobai learns visual knowledge about the planet
Open to the outside world

download 1:OpenCV-Contrib Chinese version of extension module
stay 「 Xiaobai studies vision 」 Official account back office reply : Extension module Chinese course , You can download the first copy of the whole network OpenCV Extension module tutorial Chinese version , Cover expansion module installation 、SFM Algorithm 、 Stereo vision 、 Target tracking 、 Biological vision 、 Super resolution processing and other more than 20 chapters .
download 2:Python Visual combat project 52 speak
stay 「 Xiaobai studies vision 」 Official account back office reply :Python Visual combat project , You can download, including image segmentation 、 Mask detection 、 Lane line detection 、 Vehicle count 、 Add Eyeliner 、 License plate recognition 、 Character recognition 、 Emotional tests 、 Text content extraction 、 Face recognition, etc 31 A visual combat project , Help fast school computer vision .
download 3:OpenCV Actual project 20 speak
stay 「 Xiaobai studies vision 」 Official account back office reply :OpenCV Actual project 20 speak , You can download the 20 Based on OpenCV Realization 20 A real project , Realization OpenCV Learn advanced .
Communication group
Welcome to join the official account reader group to communicate with your colleagues , There are SLAM、 3 d visual 、 sensor 、 Autopilot 、 Computational photography 、 testing 、 Division 、 distinguish 、 Medical imaging 、GAN、 Wechat groups such as algorithm competition ( It will be subdivided gradually in the future ), Please scan the following micro signal clustering , remarks :” nickname + School / company + Research direction “, for example :” Zhang San + Shanghai Jiaotong University + Vision SLAM“. Please note... According to the format , Otherwise, it will not pass . After successful addition, they will be invited to relevant wechat groups according to the research direction . Please do not send ads in the group , Or you'll be invited out , Thanks for your understanding ~边栏推荐
- 电路中缓存的几种形式
- 请指教同花顺软件究竟是什么?网上开户是否安全么?
- 清华“神奇药水”登Nature:逆转干细胞分化,比诺奖成果更进一步,网友:不靠精子卵子就能创造生命了?!...
- Tencent Peking University's sparse large model training acceleration program het was selected into the VLDB of the international summit
- Least squares system identification class II: recursive least squares
- Acid of redis
- 对话长安马自达高层,全新产品将在Q4发布,空间与智能领跑日系
- Which position does Anxin securities rank? Is it safe to open an account?
- R语言广义线性模型函数GLM、glm函数构建逻辑回归模型(Logistic regression)、分析模型是否过离散(Overdispersion)、使用残差偏差与二项式模型中的剩余自由度的比率评估
- Codeforces Round #802 (Div. 2)
猜你喜欢

Niuke programming problem -- dynamic programming of must brush 101 (a thorough understanding of dynamic programming)

大话领域驱动设计——表示层及其他

【毕业季】致毕业生的一句话:天高任鸟飞,海阔凭鱼跃

Exquisite makeup has become the "soft power" of camping, and the sales of vipshop outdoor beauty and skin care products have surged

Net基于girdview控件实现删除与编辑行数据

理想路径问题

When a programmer is disturbed 10 times a day, the consequences are amazing!

Cloud platform monitoring system based on stm32+ Huawei cloud IOT design

Structure the graduation project of actual combat camp

Develop operator based on kubebuilder (for getting started)
随机推荐
Practice of federal learning in Tencent micro vision advertising
Structure the graduation project of actual combat camp
【从删库到跑路】MySQL基础 完结篇(入个门先跑路了。。)
『C语言』题集 of ⑩
若依如何实现接口限流?
When a programmer is disturbed 10 times a day, the consequences are amazing!
day10每日3题(1):逐步求和得到正数的最小值
国内首款开源 MySQL HTAP 数据库即将发布,三大看点提前告知
redis的二进制数组命令
Scala 基礎 (二):變量和數據類型
Cuckoo filter for Chang'an chain transaction
当一个程序员一天被打扰 10 次,后果很惊人!
Natural language inference with attention and fine tuning Bert pytorch
请指教同花顺软件究竟是什么?网上开户是否安全么?
Failed to get convolution algorithm. This is probably because cuDNN failed to initialize
R语言plotly可视化:plotly可视化归一化的直方图(historgram)并在直方图中添加密度曲线kde、并在直方图的底部边缘使用geom_rug函数添加边缘轴须图
R语言plotly可视化:小提琴图、多分类变量小提琴图、分组(grouped)小提琴图、分裂的分组小提琴图、每个小提琴图内部分为两组数据、每个分组占小提琴图的一半、自定义小提琴图的调色板、抖动数据点
MHA switching (recommended operation process)
[207] several possible causes of Apache crash
Scala Basics (II): variables and data types