当前位置:网站首页>Machine learning difference in the competition and industry application
Machine learning difference in the competition and industry application
2022-07-30 14:29:00 【Mao Feilong】
There is a big difference between machine learning in competitions and industrial applications. Competitions usually focus on the ultimate evaluation indicators, while industrial applications will pay more attention to the stability, interpretability andApplication of domain expert knowledge
Contest
In order to get the ranking of the competition, the evaluation indicators are improved through various methods to the extreme
- Data quality: the data source remains unchanged, and does not focus on data quality improvement
- Model application: methods for using new models, complex models, and model fusion
- Feature Engineering: Using Computationally Expensive Data Augmentation
- Tune: Do a lot of model tuning
- Stability: Offline model, low stability requirements
- Domain expert knowledge: Many competitions even desensitize the original data (such as re-marking field names) to prevent the use of expert knowledge, so domain expert knowledge is less used in the competition
Industrial Applications
We usually pay more attention to the stability of the model and the continuous improvement of data quality under the condition that the application scenario is satisfied
- Data Quality: Data is constantly changing, so focus on improving data quality
- Model application: generally use mainstream and relatively simple models, and rarely use complex models and model fusion methods, which are helpful for model interpretability and problem debugging
- Feature engineering: focus on engineering performance, generally do not use computationally expensive data augmentation
- Parameter adjustment: After the hyperparameters are fixed, they will not move for a long time (usually adjusted several times a year)
- Stability: Online real-time model deployment in the production environment requires high stability
- Domain expert knowledge: will use expert knowledge and theoretical models for modeling
边栏推荐
- Six-faced ant financial clothing, resisting the bombardment of the interviewer, came to interview for review
- 关于String的一些思考
- 人社部公布“数据库运行管理员”成新职业,OceanBase参与制定职业标准
- Jenkins自动化部署项目
- ML之PDP:基于FIFA 2018 Statistics(2018年俄罗斯世界杯足球赛)球队比赛之星分类预测数据集利用DT决策树&RF随机森林+PDP部分依赖图可视化实现模型可解释性之详细攻略
- [VMware virtual machine installation mysql5.7 tutorial]
- ARC117E Zero-Sum Ranges 2
- [ARC092B] Two Sequences
- Flask Framework - Sijax
- pytorch与keras的相互转换(代码以LeNet-5为例)
猜你喜欢

pytorch与keras的相互转换(代码以LeNet-5为例)

新一代开源免费的终端工具,太酷了

A new generation of open source free terminal tools, so cool

戴墨镜的卡通太阳SVG动画js特效

Flask框架——Flask-SQLite数据库

高性能数据访问中间件 OBProxy(三):问题排查和服务运维

时序数据库在船舶风险管理领域的应用

【Advanced Mathematics】【7】Double Integral

5. DOM

ML之PDP:基于FIFA 2018 Statistics(2018年俄罗斯世界杯足球赛)球队比赛之星分类预测数据集利用DT决策树&RF随机森林+PDP部分依赖图可视化实现模型可解释性之详细攻略
随机推荐
记面试外包公司的一次经历,到底该不该去?
No-code development platform all application settings introductory tutorial
CF603E Pastoral Oddities
Learning notes - 7 weeks as data analyst "in the first week: data analysis of thinking"
CF338E Optimize!
Cookie simulation login "recommended collection"
jsArray array copy method performance test 2207300823
(论文翻译]未配对Image-To-Image翻译使用Cycle-Consistent敌对的网络
CF780G Andryusha and Nervous Barriers
BI-SQL丨WHILE
LeetCode二叉树系列——515.最每个树行中找最大值
selenium4+pyetsst+allure+pom进行自动化测试框架的最新设计
CF603E Pastoral Oddities
What is the level of Ali P7?
pytorch与keras的相互转换(代码以LeNet-5为例)
pytorch学习记录(五):卷积神经网络的实现
Baijiahao cancels the function of posting documents on the interface: the weight of the plug-in chain is blocked
关于String的一些思考
“12306” 的架构到底有多牛逼
UPC2022 Summer Individual Training Game 19 (B, P)