当前位置:网站首页>Machine learning difference in the competition and industry application
Machine learning difference in the competition and industry application
2022-07-30 14:29:00 【Mao Feilong】
There is a big difference between machine learning in competitions and industrial applications. Competitions usually focus on the ultimate evaluation indicators, while industrial applications will pay more attention to the stability, interpretability andApplication of domain expert knowledge
Contest
In order to get the ranking of the competition, the evaluation indicators are improved through various methods to the extreme
- Data quality: the data source remains unchanged, and does not focus on data quality improvement
- Model application: methods for using new models, complex models, and model fusion
- Feature Engineering: Using Computationally Expensive Data Augmentation
- Tune: Do a lot of model tuning
- Stability: Offline model, low stability requirements
- Domain expert knowledge: Many competitions even desensitize the original data (such as re-marking field names) to prevent the use of expert knowledge, so domain expert knowledge is less used in the competition
Industrial Applications
We usually pay more attention to the stability of the model and the continuous improvement of data quality under the condition that the application scenario is satisfied
- Data Quality: Data is constantly changing, so focus on improving data quality
- Model application: generally use mainstream and relatively simple models, and rarely use complex models and model fusion methods, which are helpful for model interpretability and problem debugging
- Feature engineering: focus on engineering performance, generally do not use computationally expensive data augmentation
- Parameter adjustment: After the hyperparameters are fixed, they will not move for a long time (usually adjusted several times a year)
- Stability: Online real-time model deployment in the production environment requires high stability
- Domain expert knowledge: will use expert knowledge and theoretical models for modeling
边栏推荐
- SQL 26 calculation under 25 years of age or older and the number of users
- [ARC092D] Two Faced Edges
- CF1677E Tokitsukaze and Beautiful Subsegments
- 还在说软件测试没有中年危机?9年测试工程师惨遭淘汰
- 机器学习在竞赛和工业界应用区别
- Teach you how to write an eye-catching software testing resume, if you don't receive an interview invitation, I will lose
- The truth of the industry: I will only test those that have no future, and I panic...
- #第九章 子查询课后习题
- Conversion between pytorch and keras (the code takes LeNet-5 as an example)
- LeetCode二叉树系列——107.二叉树的层序遍历II
猜你喜欢

A new generation of open source free terminal tools, so cool

网络安全——lcx的使用

The truth of the industry: I will only test those that have no future, and I panic...

VLAN实验

svg波浪动画js特效代码

ARC117E零和范围2

Study Notes - Becoming a Data Analyst in Seven Weeks "Week 2: Business": Business Analysis Metrics

Skywalking入门

LoRaWAN网关源码分析(V1.0.2)

Conversion between pytorch and keras (the code takes LeNet-5 as an example)
随机推荐
Six-faced ant financial clothing, resisting the bombardment of the interviewer, came to interview for review
LeetCode二叉树系列——144.二叉树的最小深度
[VMware virtual machine installation mysql5.7 tutorial]
mongodb打破原则引入SQL,它到底想要干啥?
Web消息推送之SSE
Data Middle Office Construction (5): Breaking Enterprise Data Silos and Extracting Data Value
Flask Framework - Sijax
权威推荐!腾讯安全DDoS边缘安全产品获国际研究机构Omdia认可
CF1320E Treeland and Viruses
桌面软件开发框架大赏
数据中台建设(五):打破企业数据孤岛和提取数据价值
开始学习C语言了
网站添加能换装可互动的live 2d看板娘
Eight years of testing experience, why was the leader criticized: the test documents you wrote are not as good as those of fresh graduates
接口自动化框架,lm-easytest内测版发布,赶紧用起来~
Shell变量与赋值、变量运算、特殊变量、重定向与管渠
There is a risk of water ingress in the battery pack tray and there is a potential safety hazard. 52,928 Tang DMs are urgently recalled
5. DOM
什么是缺陷分析?一篇文章带你了解,测试工程师必备技能
LeetCode二叉树系列——515.最每个树行中找最大值