Loss functions for deep models
2022-06-26 01:37:00 【Green Lantern swordsman】
Recently I did a small project on location prediction. Because it involves regression, I used the Huber loss function. Here I summarize the regression loss functions to deepen my understanding of them.
1. MSE and MAE: differences, advantages, and disadvantages
(1) Simply put, MSE is easy to compute, but MAE is more robust to outliers. If the training data is contaminated by outliers, MAE loss is the better choice. For an example, see link 1. It can also be understood intuitively: minimizing MSE is equivalent to taking the mean, while minimizing MAE is equivalent to taking the median.
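The mean/median intuition can be checked numerically. The following is a minimal sketch, not taken from the linked article: the target values and the brute-force grid of candidate constants are made up for illustration. It searches for the single constant prediction that minimizes each loss on data containing one outlier.

```python
import statistics

def mse(values, c):
    """Mean squared error of predicting the constant c for every value."""
    return sum((v - c) ** 2 for v in values) / len(values)

def mae(values, c):
    """Mean absolute error of predicting the constant c for every value."""
    return sum(abs(v - c) for v in values) / len(values)

# Hypothetical targets; the last value is an outlier.
targets = [1.0, 2.0, 3.0, 4.0, 100.0]

mean_value = statistics.mean(targets)      # 22.0, dragged up by the outlier
median_value = statistics.median(targets)  # 3.0, unaffected by the outlier

# Brute-force search over candidate constants 0.0, 0.1, ..., 100.0.
candidates = [i / 10 for i in range(0, 1001)]
best_for_mse = min(candidates, key=lambda c: mse(targets, c))
best_for_mae = min(candidates, key=lambda c: mae(targets, c))

print("MSE minimizer:", best_for_mse, "mean:", mean_value)
print("MAE minimizer:", best_for_mae, "median:", median_value)
```

The MSE minimizer follows the outlier, while the MAE minimizer stays with the bulk of the data, which is the robustness difference described above.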
(2) However, MAE has a serious problem (especially for neural networks): the magnitude of its gradient is always the same. In other words, even for small loss values the gradient is still large, which is not conducive to model learning. To work around this defect, we can use a varying learning rate and reduce it as the loss approaches its minimum.
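A small sketch of this behavior, under assumptions of my own: a single scalar prediction, a target of 1.05, and an exponential decay factor of 0.95 standing in for "reduce the learning rate near the minimum". None of these values come from the original project.

```python
def mae_grad(pred, target):
    """Subgradient of |pred - target| w.r.t. pred: always -1, 0, or 1."""
    diff = pred - target
    return (diff > 0) - (diff < 0)

def mse_grad(pred, target):
    """Gradient of (pred - target)**2 w.r.t. pred: shrinks with the error."""
    return 2.0 * (pred - target)

def descend_mae(start, target, lr, steps, decay=1.0):
    """Gradient descent on MAE; decay < 1 shrinks the learning rate each step."""
    pred = start
    for _ in range(steps):
        pred -= lr * mae_grad(pred, target)
        lr *= decay
    return pred

# Constant learning rate: the step size never shrinks, so the prediction
# keeps jumping back and forth around the target.
fixed = descend_mae(start=0.0, target=1.05, lr=0.1, steps=100)

# Decaying learning rate (hypothetical schedule): the steps shrink and the
# prediction settles close to the target.
decayed = descend_mae(start=0.0, target=1.05, lr=0.1, steps=100, decay=0.95)

print("error with fixed lr:  ", abs(fixed - 1.05))
print("error with decayed lr:", abs(decayed - 1.05))
```

Note that `mae_grad` returns the same magnitude for an error of 10 and an error of 0.01, whereas `mse_grad` scales with the error, which is why MSE needs no such schedule near the minimum.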
2. A better loss function
The Huber loss function; see link 2
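For reference, here is a minimal sketch of the standard Huber loss on a single residual (prediction minus target). The threshold `delta=1.0` is an assumed default for illustration, not a value from the project. The loss is quadratic for small residuals, like MSE, and linear for large ones, like MAE.

```python
def huber(residual, delta=1.0):
    """Huber loss: 0.5*r^2 if |r| <= delta, else delta*(|r| - 0.5*delta)."""
    abs_r = abs(residual)
    if abs_r <= delta:
        return 0.5 * residual ** 2
    return delta * (abs_r - 0.5 * delta)

def huber_grad(residual, delta=1.0):
    """Gradient w.r.t. the residual: r inside the threshold, +/-delta outside."""
    if abs(residual) <= delta:
        return residual
    return delta if residual > 0 else -delta

for r in (0.01, 0.5, 3.0, -5.0):
    print(r, huber(r), huber_grad(r))
```

The gradient shrinks with the residual near the minimum (avoiding the MAE problem) and is capped at `delta` for large residuals (keeping the robustness to outliers). In practice a framework implementation such as `tf.keras.losses.Huber` would normally be used instead of hand-written code.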
3. Focal loss for handling class imbalance
Alleviates the imbalance between positive and negative samples. Note: for class imbalance in Keras, you can train with class weights; I am not sure whether or how that relates to focal loss.
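One way to see the connection is to write both losses out. The sketch below assumes the usual binary form of focal loss, FL = -alpha_t * (1 - p_t)^gamma * log(p_t); the function names and the default `gamma=2.0`, `alpha=0.25` are illustrative choices, not taken from any link in this post. With `gamma=0` the modulating factor disappears and focal loss reduces to class-weighted cross entropy, so class weights act like the `alpha` part only.

```python
import math

EPS = 1e-7  # clip probabilities away from 0 and 1 to keep log() finite

def binary_focal_loss(prob, label, gamma=2.0, alpha=0.25):
    """Focal loss for one sample; prob is the predicted P(label == 1)."""
    prob = min(max(prob, EPS), 1.0 - EPS)
    p_t = prob if label == 1 else 1.0 - prob        # probability of the true class
    alpha_t = alpha if label == 1 else 1.0 - alpha  # per-class weight
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

def weighted_cross_entropy(prob, label, pos_weight=1.0, neg_weight=1.0):
    """Binary cross entropy with a fixed weight per class."""
    prob = min(max(prob, EPS), 1.0 - EPS)
    if label == 1:
        return -pos_weight * math.log(prob)
    return -neg_weight * math.log(1.0 - prob)

# gamma = 0: identical to class-weighted cross entropy.
same = binary_focal_loss(0.9, 1, gamma=0.0, alpha=0.25)
ce = weighted_cross_entropy(0.9, 1, pos_weight=0.25, neg_weight=0.75)

# gamma = 2: a well-classified sample (p_t = 0.9) is scaled by (1 - 0.9)^2.
easy = binary_focal_loss(0.9, 1, gamma=2.0)
hard = binary_focal_loss(0.1, 1, gamma=2.0)
print(same, ce, easy, hard)
```

So the two are related but not equivalent: class weights rescale the loss per class, while the `gamma` term additionally down-weights easy samples regardless of class.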
4. Cross entropy loss function
For the basics, see link 4.1
A past interview question: why use the cross-entropy loss function? See link 4.2
5. Loss functions in Siamese (twin) networks