Differences between machine learning in competitions and in industrial applications
2022-07-30 14:29:00 【Mao Feilong】
Machine learning in competitions differs greatly from machine learning in industrial applications. Competitions usually focus on pushing the evaluation metric as high as possible, while industrial applications pay more attention to stability, interpretability, and the application of domain expert knowledge.
Competitions
To climb the leaderboard, competitors push the evaluation metric to the extreme by whatever methods are available.
- Data quality: the data source is fixed, so improving data quality is not a focus
- Model application: new models, complex models, and model fusion (ensembling) are all used
- Feature engineering: computationally expensive data augmentation is used
- Parameter tuning: a great deal of model tuning is done
- Stability: the model runs offline, so stability requirements are low
- Domain expert knowledge: many competitions anonymize the original data (for example, by renaming fields) to prevent the use of expert knowledge, so domain expert knowledge is rarely used in competitions
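The model fusion mentioned in the list above can be sketched in its simplest form, a weighted average of the predictions of several models. This is only an illustration: the function name `blend_predictions`, the three prediction lists, and the weights are all hypothetical, and real competition entries often use more elaborate schemes such as stacking.

```python
from typing import List, Sequence


def blend_predictions(
    predictions: Sequence[Sequence[float]],
    weights: Sequence[float],
) -> List[float]:
    """Weighted average of per-model predictions (one inner sequence per model)."""
    if len(predictions) != len(weights):
        raise ValueError("need exactly one weight per model")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n_samples = len(predictions[0])
    if any(len(p) != n_samples for p in predictions):
        raise ValueError("all models must predict the same number of samples")
    # For each sample, combine the models' predictions using normalized weights.
    return [
        sum(w * p[i] for w, p in zip(weights, predictions)) / total
        for i in range(n_samples)
    ]


# Hypothetical predicted probabilities from three models on three samples.
model_a = [0.9, 0.2, 0.6]
model_b = [0.8, 0.4, 0.5]
model_c = [0.7, 0.3, 0.7]

# Assumed weights: model_a is trusted twice as much as the other two.
blended = blend_predictions([model_a, model_b, model_c], weights=[2.0, 1.0, 1.0])
print(blended)
```

In a competition the weights would typically be chosen on a validation set. The blend adds latency and makes individual predictions harder to explain, which is part of why industrial systems tend to avoid it.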
Industrial Applications
Once the requirements of the application scenario are met, industrial work usually pays more attention to model stability and to the continuous improvement of data quality.
- Data quality: the data changes constantly, so improving data quality is a focus
- Model application: mainstream, relatively simple models are generally used because they help with interpretability and debugging; complex models and model fusion are rarely used
- Feature engineering: the focus is on engineering performance, so computationally expensive data augmentation is generally not used
- Parameter tuning: once the hyperparameters are fixed, they stay unchanged for a long time (they are usually adjusted only a few times a year)
- Stability: the model is deployed online in real time in the production environment, so high stability is required
- Domain expert knowledge: expert knowledge and theoretical models are used in modeling
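Because production data changes constantly, one common way to watch for that change is to compare the distribution of a feature at training time with its distribution in recent traffic. The sketch below uses the population stability index (PSI) as one possible check. It is not a method described in this article, and the function names, bin count, and sample data are assumptions made for illustration.

```python
import math
from typing import List, Sequence


def bin_fractions(
    values: Sequence[float], edges: Sequence[float], eps: float = 1e-6
) -> List[float]:
    """Fraction of values in each bin; out-of-range values go to the edge bins."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        idx = 0
        for i in range(len(counts)):
            if v >= edges[i]:
                idx = i
        counts[idx] += 1
    # Floor empty bins at eps so the logarithm below stays defined.
    return [max(c / len(values), eps) for c in counts]


def population_stability_index(
    expected: Sequence[float], actual: Sequence[float], n_bins: int = 5
) -> float:
    """PSI between a reference sample and a new sample, using equal-width bins."""
    lo, hi = min(expected), max(expected)
    if hi == lo:
        raise ValueError("reference sample has no spread to bin")
    width = (hi - lo) / n_bins
    edges = [lo + i * width for i in range(n_bins + 1)]
    e = bin_fractions(expected, edges)
    a = bin_fractions(actual, edges)
    return sum((ai - ei) * math.log(ai / ei) for ei, ai in zip(e, a))


# Hypothetical feature values: training data versus shifted production data.
train_values = [i / 100 for i in range(100)]
live_values = [0.5 + i / 200 for i in range(100)]

psi_same = population_stability_index(train_values, train_values)
psi_shifted = population_stability_index(train_values, live_values)
print(psi_same, psi_shifted)
```

A PSI of zero means the two binned distributions match, and larger values indicate more drift. The alert threshold is a judgment call for each application; a frequently quoted rule of thumb treats values above roughly 0.25 as a significant shift.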









