Knowledge distillation learning notes
2022-06-29 21:01:00 【Study hard, depeng】
[Classic paper, quick read] Knowledge Distillation: a classic paper
Huang Zhenhua, Yang Shunzhi, Lin Wei, Ni Juan, Sun Shengli, Chen Yunwen, Tang Yong. A review of knowledge distillation [J]. Chinese Journal of Computers, 2022, 45(03): 624-653.
[Close reading of AI papers] Knowledge Distillation
1. MMRazor
Model compression toolbox

2. MMDeploy
Model deployment toolkit

3. Knowledge distillation

Soft targets contain more information than hard labels.
The soft targets predicted by the teacher network are used as labels to train the student network.
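
The idea of training the student on the teacher's soft targets can be sketched as follows. This is a minimal illustration, not the notes' own implementation: the logits, the temperature of 4.0, the weight `alpha`, and the function names `softmax` and `distillation_loss` are all assumptions chosen for the example. It uses the commonly described form of the loss, a weighted sum of a soft-target term and an ordinary hard-label term.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature gives a softer distribution."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, hard_label,
                      temperature=4.0, alpha=0.5):
    """Weighted sum of a soft-target term and a hard-label term (illustrative)."""
    soft_targets = softmax(teacher_logits, temperature)
    student_soft = softmax(student_logits, temperature)
    # Cross-entropy between the teacher's soft targets and the student's soft predictions
    soft_loss = -sum(p * math.log(q) for p, q in zip(soft_targets, student_soft))
    # Ordinary cross-entropy against the ground-truth class (temperature 1)
    hard_loss = -math.log(softmax(student_logits)[hard_label])
    # Multiplying by T^2 keeps the soft term's gradient scale comparable across temperatures
    return alpha * temperature ** 2 * soft_loss + (1 - alpha) * hard_loss

# Hypothetical logits for one sample with three classes
teacher_logits = [6.0, 2.0, 1.0]
student_logits = [3.0, 1.5, 0.5]

soft = softmax(teacher_logits, temperature=4.0)
loss = distillation_loss(student_logits, teacher_logits, hard_label=0)
print("soft targets:", soft)
print("distillation loss:", loss)
```

With a temperature above 1 the teacher's distribution keeps non-trivial probability on the wrong classes, which is the extra information a one-hot label does not carry.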


Knowledge distillation and label smoothing
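
As a rough illustration of how the two relate, the sketch below builds a label-smoothed target and a teacher soft target for the same sample. It assumes the standard label smoothing formula (mix the one-hot label with a uniform distribution using a factor `epsilon`); the logits, `epsilon`, the temperature, and the function names are hypothetical values picked for the example.

```python
import math

def smooth_labels(label, num_classes, epsilon=0.1):
    """Label smoothing: mix the one-hot target with a uniform distribution."""
    uniform = epsilon / num_classes
    return [(1.0 - epsilon) + uniform if k == label else uniform
            for k in range(num_classes)]

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax with max subtraction for numerical stability."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical three-class sample whose true class is 0
smoothed = smooth_labels(label=0, num_classes=3, epsilon=0.1)
teacher_soft = softmax([6.0, 2.0, 1.0], temperature=4.0)

print("label smoothing:", smoothed)      # every wrong class gets the same mass
print("teacher soft target:", teacher_soft)  # wrong classes get different masses
```

Both targets move probability mass away from the true class, but label smoothing spreads it uniformly, whereas the teacher's soft target distributes it according to how similar the teacher considers each class to be.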

Development trends in knowledge distillation

Question 1: Does knowledge distillation improve model accuracy?
Answer 1: Knowledge distillation is an emerging method for obtaining efficient, small-scale networks. Its main idea is to transfer the "knowledge" in a complex teacher model with strong learning ability to a simple student model. In addition, through optimization strategies such as mutual learning and self-learning between neural networks, and through data resources such as unlabeled and cross-modal data, it also significantly improves model performance.