当前位置:网站首页>Ml self study notes 5
Ml self study notes 5
2022-07-29 06:16:00 【19-year-old flower girl】
Integrated algorithm and random forest
The integration algorithm
The integration algorithm is generally done with the tree model .
Bagging Is to train multiple tree models , Each tree model predicts a result , Sum the results of all tree models and then find the mean , A typical example is random forest , I'll talk about it later .
Boosting( Lifting algorithm ), Is to add a tree every time , Finally, the mean value is not sought
Random forests
brief introduction
- Random : Random sampling of data , For example, establish a random selection of each tree 80% The sample of ; Feature random sampling : Each tree is randomly selected 80% Sample characteristics of ( Then the comparison column can be specified );
- The forest : Multiple trees , Sum the results of each tree and find the mean .
Random forest advantages and
give Negev The important characteristics can be judged by the process in the figure below . First, find out the error rate of original feature modeling , Modify the characteristic value of the feature you want to judge ( Disrupt or change the value ) Then the other features remain unchanged, and then modeling and calculating the error rate ; The two results approximate that this feature is not important , The second result is much larger than the first result, indicating that this feature is very important . The importance can also be judged by the position of the feature in the tree , The closer to the root node, the more important ( It is obtained directly by algorithm ).
The more tree models, the better , After reaching a certain number, the accuracy rate is almost floating up and down .
Lifting algorithm (Boosting) summary
Introduction to the algorithm
Suppose the first tree predicts 950( The label is 1000), that B Just upgrade A Of ,B The prediction is not 1000 了 , Is to make up A Not done well ,B The goal is 1000-950=50, hypothesis B Predicted 30, turn C when ,C The tree regards the front as a whole , The prediction result is expected 1000-950-30=20, hypothesis C forecast 18,D It's also , The prediction label is 2.
Lifting algorithm formula
The previous part is the prediction result of the previous tree , The following part is the current tree model , Previous results Fm-1(xi) Add the result of the current tree h(xi) And the original result yi Compare , If improved, add , If there is no improvement, don't add . Guaranteed to join a tree , It will be stronger than before .
The model represents
- AdaBoost: If the first division is the result of Figure 1 , There is a wrong triangle division , Then the next round will increase the weight of this triangle
- Xgboost The algorithm is mentioned before ABCD How did the tree ascend .
Stacking model (stacking)
Model overview
Use multiple classifiers , such as LR( Logical regression ),DT( Decision tree ),RF( Random forests ), Make separate predictions , Then take the result as input and operate with a model .
边栏推荐
- 2022 spring move - core technology FPGA development post pen test question (original question and experience)
- HAL库学习笔记- 8 串口通信之概念
- Reading papers on false news detection (5): a semi supervised learning method for fake news detection in social media
- 【软件工程之美 - 专栏笔记】“一问一答”第2期 | 30个软件开发常见问题解决策略
- 新能源共享充电桩管理运营平台
- EPS32+Platform+Arduino 跑马灯
- 数论:px+py 不能表示的最大数为pq-p-q的证明
- 2022 spring recruit - Hesai technology FPGA technology post (one or two sides, collected from: Digital IC workers and FPGA Explorers)
- 智能温度控制系统
- NFC双向通讯13.56MHZ非接触式阅读器芯片--Si512替代PN512
猜你喜欢
基于F407ZGT6的WS2812B彩灯驱动
【软件工程之美 - 专栏笔记】16 | 怎样才能写好项目文档?
QT learning notes QT model/view
华为云14天鸿蒙设备开发-Day1环境搭建
【软件工程之美 - 专栏笔记】23 | 架构师:不想当架构师的程序员不是好程序员
【软件工程之美 - 专栏笔记】28 | 软件工程师的核心竞争力是什么?(下)
Based on STM32: couple interactive doll (design scheme + source code +3d drawing +ad circuit)
CS5340国产替代DP5340多比特音频 A/D 转换器
【软件工程之美 - 专栏笔记】“一问一答”第3期 | 18个软件开发常见问题解决策略
【软件工程之美 - 专栏笔记】13 | 白天开会,加班写代码的节奏怎么破?
随机推荐
华为云14天鸿蒙设备开发-Day7WIFI功能开发
低成本2.4GHz 无线收发芯片--Ci24R1
低功耗蓝牙5.0芯片nrf52832-QFAA
ArduinoIDE + STM32Link烧录调试
CS5340国产替代DP5340多比特音频 A/D 转换器
华为云14天鸿蒙设备开发-Day3内核开发
智慧充电桩系统由什么组成?
Based on STM32: couple interactive doll (design scheme + source code +3d drawing +ad circuit)
Hal library learning notes - 9 DMA
充电桩应用方案
基于STC51:四轴飞控开源项目原理图与源码(入门级DIY)
数论:px+py 不能表示的最大数为pq-p-q的证明
HAL库学习笔记- 9 DMA
【软件工程之美 - 专栏笔记】13 | 白天开会,加班写代码的节奏怎么破?
基于51单片机的直流电机调速系统(L298的使用)
Pytorch's data reading mechanism
倾角传感器用于通信铁塔、高压电塔长期监测
新能源共享充电桩管理运营平台
八大排序-----------快速排序
华为云14天鸿蒙设备开发-Day1环境搭建