当前位置:网站首页>Knowledge Distillation 7: Detailed Explanation of Knowledge Distillation Code
Knowledge Distillation 7: Detailed Explanation of Knowledge Distillation Code
2022-07-31 03:37:00 【@BangBang】
Code Description
Differences from yolov5-v6.1 code:
- data/hyps/hyp.scratch-low-distillation.yaml (added file, obtained by modifying hyp.scratch-low.yaml)
- utils/loss.py adds a function compute_distillation_output_loss
- train_distillation.py (added file, obtained by modifying train.py)
hyp.scratch-low-distillation.yaml
- Compared with the original
hyp.scratch-low.yaml
, this file has moredist
hyperparameters, which can be adjusted in the range of [0,1], close to 1The network will pay more attention to the distillation loss, and if it is close to 0, it will prefer thedetection
loss. This grass parameter is used to balance the detection loss and the distillation loss. - There are also corresponding weighted losses for
box loss, clss loss, obj loss
.
utils/loss.py
utils/loss.py
add a function compute_distillation_output_loss
边栏推荐
- web容器及IIS --- 中间件渗透方法1
- 安全20220712
- (Line segment tree) Summary of common problems of basic line segment tree
- 分布式锁以及实现方式三种
- Understanding and Using Unity2D Custom Scriptable Tiles (4) - Start to build a custom tile based on the Tile class (below)
- (树) 最近公共祖先(LCA)
- Pytest电商项目实战(上)
- Ambiguous method call.both
- Mysql 45 study notes (twenty-five) MYSQL guarantees high availability
- IDEA常用快捷键与插件
猜你喜欢
Daily practice of LeetCode - 138. Copy a linked list with random pointers
[Compilation principle] Lexical analysis program design principle and implementation
Getting Started with CefSharp - winform
The BP neural network
[C language] Preprocessing operation
LocalDate addition and subtraction operations and comparison size
【C语言】预处理操作
《DeepJIT: An End-To-End Deep Learning Framework for Just-In-Time Defect Prediction》论文笔记
【动态规划】连续子数组的最大和
A brief introduction to the CheckBox component of the basic components of Flutter
随机推荐
浅识Flutter 基本组件之showDatePicker方法
解析小结—自用
【动态规划】连续子数组的最大和
Pytest e-commerce project combat (on)
一份高质量的测试用例如何养成?
CloudCompare & PCL calculate the degree of overlap between two point clouds
errno错误码及含义(中文)
《DeepJIT: An End-To-End Deep Learning Framework for Just-In-Time Defect Prediction》论文笔记
els block to the right
Point Cloud DBSCAN Clustering (MATLAB, not built-in function)
[Swift] Customize the shortcut that pops up by clicking the APP icon
The els block moves the boundary to the right, and accelerates downward.
安全20220715
SocialFi 何以成就 Web3 去中心化社交未来
IDEA 注释报红解决
The distance value between two arrays of LeetCode simple questions
Day32 LeetCode
Detailed explanation of TCP (3)
$attrs/$listeners
顺序表的实现