当前位置:网站首页>Auto SEG loss: automatic loss function design
Auto SEG loss: automatic loss function design
2022-06-30 10:41:00 【Xiaobai learns vision】
Click on the above “ Xiaobai studies vision ”, Optional plus " Star standard " or “ Roof placement ”
Heavy dry goods , First time delivery The author 丨 [email protected] You know
Source https://zhuanlan.zhihu.com/p/266102401
Edit the market platform
Reading guide
What is proposed in this paper Auto Seg-Loss Is designed to reduce the cost for a given index ( For example, the edge part IoU perhaps F-score) Trial and error costs in designing and adjusting loss functions , And further to the design of automatic loss function .
Some time ago , One piece of news is quite popular :
The National Swimming Championship has caused controversy , Fuyuanhui and other five top players in the preliminary round missed the final due to their low scores in the physical fitness test
Can the physical level reflect the competitive level ? For the average person , There is a positive correlation between physical fitness level and individual competition ability . However , What high-level athletes need is specific training for specific events , A higher level of physical fitness does not mean a better performance , For example, for some projects ( Such as long-distance running ) Come on , The upper limbs are too strong but a burden . therefore , Many high-level athletes even break the Asian record in special events , In the face of physical fitness test, I also failed . Think in reverse , If an athlete only takes the physical fitness test as his training goal everyday , The final long run 、 sprint 、 Pull in 、 Squatting and other projects are perfect , Then it is likely that he can achieve far more than ordinary people in some special projects by virtue of his physical quality , But not enough to be a top athlete .
Why say this news ? actually , If we think of our neural network model as an athlete , A lot of time , The evaluation index of the special competition faced by this athlete ( For example, in semantic segmentation mIoU) And its training goals ( For example, common Cross Entropy Loss) It's not exactly the same . Even though CE Loss You can train a good model most of the time , But this is by virtue of being strong enough “ Physical fitness ” The results obtained , Lack of targeted optimization for special projects .
that , Whether special evaluation indicators can be used , such as mIoU To guide the training ? Unfortunately , Most of the evaluation indicators are non differentiable , It is not possible to train directly through back propagation . Of course , This does not stop researchers . Many studies try to approximate the evaluation index by means of differentiability , Get a proxy loss function to guide the training ( such as The Lovász-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks and Learning Surrogates via Deep Embedding). however , These differentiable approximations do not necessarily make the model achieve good training results —— Although the training objectives are consistent , But the coach's training program ( gradient ) Quality also determines the athletes ( Model ) Training level . therefore , The design of proxy loss function needs expertise And higher trial and error costs ; even so , Many of the proxy loss functions designed are not enough to be independent , Need and CE Loss Joint training can achieve good results .
Auto Seg-Loss hope ( On the task of semantic segmentation ) Automate this process . Simply speaking , We found that the mainstream semantic segmentation indicators ( Basically by TP/TN/FP/FN form ) Can be written as a differentiable operation ( For example, add 、 ride )、 quantitative (one-hot) and logical operation ( And AND、 or OR) In the form of . because logical The operation is actually defined only in On , We use a parameterized surface logical Operation for interpolation , Make it in There is a differentiable definition on , And use softmax replace one-hot quantitative , Thus, a differentiable version of the proxy loss function is obtained . Next , We use reinforcement learning algorithms (PPO2) Search the shape of this surface , So that the agent loss function can guide the training well . chart 1 Is the overall framework of our algorithm .

At the time of implementation , We tried piecewise Bezier curve and Piecewise linear curve Two ways of parameterization . We proposed Truth table constraints and Monotonicity constraint As a priori , To limit the shape of the parametric surface . Experiments show that these two parameterization methods can get good proxy loss function , At the same time, these two constraints effectively improve the efficiency and effect of search .
We are PASCAL VOC and Cityscapes We did experiments on . Relative to manually designed proxy loss functions or Cross Entropy Improvement , The searched agent loss function can reach the main semantic segmentation index on par Or higher , In particular, the improvement of edge related indicators is relatively large . There are two points worth mentioning :
May benefit from the design of the search space , Our search efficiency is relatively high , stay VOC Upper use DeepLab V3+, about mIoU Your search only needs 8 In hours or so (8 card V100, In fact, half of the search results have basically converged , The time is roughly equivalent to the same data set two Normal training );
The agent loss function we searched can be well migrated to other Model architecture and Data sets , So you can use it many times with just one search ( More Than This , We found that mIoU The searched parameters also apply to FWIoU and Boundary IoU And other similar indicators ). The following two tables are the experimental results of two parameterized forms we tried .


For edge related indicators , We found that , Using edge index alone to guide the training will make the model only focus on the segmentation results of edges . chart 2 It shows this phenomenon . By comparing the edge index with the overall index ( Such as mIoU) Train in groups , The model can not only ensure the reasonable overall performance, but also improve the edge segmentation effect . Another interesting finding is , Used to evaluate edge accuracy Boundary F1 score When the allowable error is not 0 when , Used to guide training may cause jagged edges , This is actually a reflection of this indicator hack. We discussed this problem in the appendix .

We hope Auto Seg-Loss It can reduce the risk of , For a given metric ( For example, the edge part IoU perhaps F-score) Trial and error costs in designing and adjusting loss functions , Take a step forward to the design of automatic loss function . Our article is already in arxiv Hang it up , The code will also be open source and integrated into some open source Segmentation frame , I look forward to your trial and discussion !
Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation
https://arxiv.org/abs/2010.07930
The good news !
Xiaobai learns visual knowledge about the planet
Open to the outside world

download 1:OpenCV-Contrib Chinese version of extension module
stay 「 Xiaobai studies vision 」 Official account back office reply : Extension module Chinese course , You can download the first copy of the whole network OpenCV Extension module tutorial Chinese version , Cover expansion module installation 、SFM Algorithm 、 Stereo vision 、 Target tracking 、 Biological vision 、 Super resolution processing and other more than 20 chapters .
download 2:Python Visual combat project 52 speak
stay 「 Xiaobai studies vision 」 Official account back office reply :Python Visual combat project , You can download, including image segmentation 、 Mask detection 、 Lane line detection 、 Vehicle count 、 Add Eyeliner 、 License plate recognition 、 Character recognition 、 Emotional tests 、 Text content extraction 、 Face recognition, etc 31 A visual combat project , Help fast school computer vision .
download 3:OpenCV Actual project 20 speak
stay 「 Xiaobai studies vision 」 Official account back office reply :OpenCV Actual project 20 speak , You can download the 20 Based on OpenCV Realization 20 A real project , Realization OpenCV Learn advanced .
Communication group
Welcome to join the official account reader group to communicate with your colleagues , There are SLAM、 3 d visual 、 sensor 、 Autopilot 、 Computational photography 、 testing 、 Division 、 distinguish 、 Medical imaging 、GAN、 Wechat groups such as algorithm competition ( It will be subdivided gradually in the future ), Please scan the following micro signal clustering , remarks :” nickname + School / company + Research direction “, for example :” Zhang San + Shanghai Jiaotong University + Vision SLAM“. Please note... According to the format , Otherwise, it will not pass . After successful addition, they will be invited to relevant wechat groups according to the research direction . Please do not send ads in the group , Or you'll be invited out , Thanks for your understanding ~边栏推荐
- ArcGIS Pro + PS 矢量化用地规划图
- R语言plotly可视化:使用plotly可视化多分类模型的预测置信度、模型在2D网格中每个数据点预测的置信度、置信度定义为在某一点上最高分与其他类别得分之和之间的差值
- Robotframework learning notes: environment installation and robotframework browser plug-in installation
- Dow Jones Industrial Average
- WGet -- 404 not found due to spaces in URL
- The preliminary round of the sixth season of 2022 perfect children's model Hefei competition area was successfully concluded
- CVPR 2022 | Tsinghua & bytek & JD put forward BRT: Bridging Transformer for vision and point cloud 3D target detection
- 技能梳理[email protected]基于51系列单片机的智能仪器教具
- Node environment configuration
- 断路器HystrixCircuitBreaker
猜你喜欢

Robotframework learning notes: environment installation and robotframework browser plug-in installation

Auto Seg-Loss: 自动损失函数设计

Use keil5 software to simulate and debug gd32f305 from 0

What is the real performance of CK5, the king machine of CKB?

Android 开发面试真题进阶版(附答案解析)

ArcGIS Pro脚本工具(6)——修复CAD图层数据源

CSDN博客运营团队2022年H1总结

How to deploy deflationary combustion destruction contract code in BSC chain_ Deploy dividend and marketing wallet contract code

Arm新CPU性能提升22%,最高可组合12核,GPU首配硬件光追,网友:跟苹果的差距越来越大了...

Koreano essential creates a professional style
随机推荐
转卡通学习笔记
Apple's 5g chip was revealed to have failed in research and development, and the QQ password bug caused heated discussion. Wei Lai responded to the short selling rumors. Today, more big news is here
技能梳理[email protected]體感機械臂
nvm、nrm、npx使用(安装、基本命令、参数、curl、wget)
Chen Haotian won the national championship of the national finals of the 7th children's model star ceremony
Criu enables hot migration
记一次实习的经历,趟坑必备(一)
技能梳理[email protected]体感机械臂
Go -- maximum heap and minimum heap
ArcGIS Pro脚本工具(5)——排序后删除重复项
The AOV function of R language was used for repeated measures ANOVA (one intra group factor and one inter group factor) and interaction Plot function and boxplot to visualize the interaction
R language plot visualization: use plot to visualize the prediction confidence of the multi classification model, the prediction confidence of each data point of the model in the 2D grid, and the conf
Tooltips in the era of touch
The famous painter shiguoliang's "harvest season" digital collection was launched on the Great Wall Digital Art
机器学习面试准备(一)KNN
今晚19:00知识赋能第2期直播丨OpenHarmony智能家居项目之控制面板界面设计
Gd32 RT thread ota/bootloader driver function
ArcGIS Pro scripting tool (5) - delete duplicates after sorting
Compare the maximum computing power of the Cenozoic top ant s19xp and the existing s19pro in bitland
Getting started with X86 - take over bare metal control