当前位置:网站首页>French scholars: the explicability of counter attack under optimal transmission theory
French scholars: the explicability of counter attack under optimal transmission theory
2022-07-05 13:26:00 【I love computer vision】
Official account , Find out CV The beauty of Technology
This article shares papers 『When adversarial attacks become interpretable counterfactual explanations』, The explicability of counter attack under optimal transmission theory .
The details are as follows :
The author is from the third University of Toulouse in France and IRT Saint-Exupéry Research institute, .
Thesis link :https://arxiv.org/pdf/2206.06854.pdf
01
introduction
This paper is a theoretical article on countering attacks , The author provides a very reliable explanation for countering attacks . At present, the optimal transmission theory is a very popular direction in the deep learning theory , The author analyzes the phenomenon of anti attack from the perspective of optimal transmission theory . When learning a neural network with dual loss of optimal transmission problem , The gradient of the model is the direction of the optimal transmission scheme , It is also the direction closest to the confrontation sample .
Moving along the gradient to the decision boundary is no longer a counter attack , It's a counterfactual explanation , That is, it can be seen as explicitly transferring from one class to another . Through a large number of experiments on interpretable metrics, we can find , The simple saliency mapping method applied to the optimal transmission network is a reliable explanation , And it is superior to the latest interpretation method in the unconstrained model .
02
Optimal transmission , Robustness and interpretability
Let is about the optimal transmission scheme of minimizing the loss function . Given , Ling is about images . Because it is uncertain , Can be made as the biggest point about , Then there is the following proposition :
( Transmission scheme direction ) Let be an optimal solution to minimize the loss function . Given sum , So when , There are almost everywhere .
This proposition without regularization is true for dual problems . It proves that for most , Indicates the direction of the transmission scheme .
( Decision boundaries ) Let sum be two separable distributions with minimum distance , To minimize an optimal solution of the loss function , among . Given sum , Then there is and , Among them is the decision boundary .
Let sum be two separable distributions with minimum distance , To minimize an optimal solution of the loss function , among , Given , Then there are
Almost everywhere , among .
inference 1 indicate , The classifier based on the loss function can accurately obtain the countermeasure samples . under these circumstances , The best counter attack is in the gradient direction , All attacks applied to the optimal transmission neural network model , Such as attack or attack , Are equivalent to attacks .
To illustrate these propositions , The author learned a dense binary classifier with loss function to separate two complex distributions . The figure below (a) Two distributions are shown ( Blue and orange snowflakes ), The boundaries of learning ( Red dotted line ). The figure below (b) and (c) Shows the random samples in both distributions , Which is defined in proposition 2 Middle paragraph .
Just like the proposition 2 As described , This point falls right on the decision boundary . Besides , Such as proposition 1 Described , Each segment provides the orientation of the image relative to the transportation scheme .
The author proves that the optimal transmission neural network , Counter attacks are known in form , And easy to calculate . Besides , The author also proves that the counter attack is carried out along the transmission map , So fighting attacks is no longer an imperceptible modification , But an understandable transformation of the sample . The author will use these attributes to provide a natural counterfactual explanation , It has provable explanatory properties .
The counterfactual interpretation of the sample in a given class is the closest sample . Because the global information of and is usually not directly available , Therefore, the author only aims at the classifier to obtain its local information . under these circumstances , Counterfactual corresponds to proposition 2 Counter attack defined in . For classical neural networks , This can only be achieved by adding anti noise , This is not a valuable explanation . Because it only depends on and , The definition of this counterfactual interpretation is partial . contrary , The transmission scheme as the minimum of describes the optimal scheme from class to , So the transmission scheme is a global counterfactual explanation , And it is a partial explanation of .
It should be noted that , The transport scheme does not provide the closest sample on the opposite class , But it provides the closest average in the pairing process . According to the proposition 1, The image in the optimal transmission scheme is . Even if only partially known , at that time , It can be seen that on the boundary of decision , And the path of the optimal transmission scheme can be further determined .
In the past, saliency graph only provides a very intuitive and fuzzy explanation for classifier classification , In this paper , The author puts forward a very reliable explanation in the optimal transmission neural network . Indicates the direction of the optimal transmission scheme , Therefore, the saliency diagram shows the importance of each input feature in this direction .
03
hKR Loss function
You know, one disadvantage of functions is , It strongly depends on the parameters of the loss function . In the case of two categories , There are two parameters : Gap parameters and weight parameters respectively , It is used to balance the robustness and accuracy of the classification model . When classes are separable and small enough , The part of the loss function tends to . This makes it difficult to choose parameters , In this paper, the author proposes a new loss formula as follows :
402 Payment Required
Here is a learnable parameter , Is a new parameter . The gap at the boundary is uniformly distributed , When the ratio is , bring , Then the optimal clearance parameters can be obtained , The latter can be explained as the proportion of target data involved in the key part of the loss .
choice , In the process of optimization , The weight part of is roughly the same as the part . Using this method , The only parameter that can be selected is , It can be explained as the approximate error rate of the goal in the learning process . Given a multi classification problem with classes , Is a one to many two classifier , The loss function is as follows
among
402 Payment Required
The above formula has three main shortcomings : One is that the best margins for each class may be different , As a result, a large number of super parameters need to be adjusted ; The second is that the unbalanced distribution of samples in a large number of classes may lead to slow convergence of the model ; The third is ( Functions of real classes ) The weight relative to other categories decreases as the number of categories increases . To overcome these shortcomings , The author proposes a method based on Of Regularization loss function :among , For real categories , And other functions always have the same weight . In the early stages of learning , Because the value of is average ; During the training , The value of will vary gradually , Until a component reaches its maximum and stabilizes .
04
experimental result
The authors use insertion and deletion indexes to evaluate the quality of saliency map interpretation of optimal transmission neural networks . Classical interpretation methods are evaluated for two types of networks on and datasets .
The following table shows that the saliency map method becomes competitive in the indicators on the optimal transmission neural network and provides a more reliable explanation .
The following table evaluates two indicators and grade correlation coefficients of different data sets on the optimal transmission network . The interpretation distance on the optimal transmission network is much lower than the unconstrained interpretation distance , And very close to .
The following figure shows the qualitative visualization results , The intuitive discovery shows that the optimal neural network provides a better and clearer explanation .
From the quantitative results in the table below, we can find , Through all these experiments, it is concluded that , Use multiple types of interpretive indicators , The interpretability of the optimal transmission neural network is better than that of the unconstrained neural network .
The following two figures show the optimal transmission network learned on and data sets respectively , The following two figures show the original image , The average gradient on the channel , And images with transmission scheme direction . You can intuitively find , Most gradients are visually consistent .
END
Welcome to join 「 Against learning 」 Exchange group notes :Ad
边栏推荐
- 个人组件 - 消息提示
- leetcode:221. Maximum square [essence of DP state transition]
- Clock cycle
- Word document injection (tracking word documents) incomplete
- APICloud Studio3 API管理与调试使用教程
- 【Hot100】33. 搜索旋转排序数组
- CAN和CAN FD
- Binder communication process and servicemanager creation process
- Go pointer
- 初次使用腾讯云,解决只能使用webshell连接,不能使用ssh连接。
猜你喜欢
Android本地Sqlite数据库的备份和还原
Cf:a. the third three number problem
记录一下在深度学习-一些bug处理
C object storage
Reverse Polish notation
Talk about seven ways to realize asynchronous programming
Backup and restore of Android local SQLite database
Operational research 68 | the latest impact factors in 2022 were officially released. Changes in journals in the field of rapid care
The "Baidu Cup" CTF competition was held in February 2017, Web: explosion-2
Flutter draws animation effects of wave movement, curves and line graphs
随机推荐
Shandong University Summer Training - 20220620
go map
4年工作经验,多线程间的5种通信方式都说不出来,你敢信?
“百度杯”CTF比赛 九月场,Web:Upload
碎片化知识管理工具Memos
今年上半年,通信行业发生了哪些事?
Simple page request and parsing cases
Difference between avc1 and H264
Flutter InkWell & Ink组件
Binder通信过程及ServiceManager创建过程
多人合作项目查看每个人写了多少行代码
jenkins安装
数据湖(七):Iceberg概念及回顾什么是数据湖
SAE international strategic investment geometry partner
leetcode 10. Regular Expression Matching 正则表达式匹配 (困难)
MATLAB论文图表标准格式输出(干货)
解决uni-app配置页面、tabBar无效问题
DataPipeline双料入选中国信通院2022数智化图谱、数据库发展报告
【MySQL 使用秘籍】一网打尽 MySQL 时间和日期类型与相关操作函数(三)
Cf:a. the third three number problem