ML self-study notes 5
2022-07-29 06:16:00 【19-year-old flower girl】
Ensemble algorithms and random forests
Ensemble algorithms
Ensemble algorithms are usually built from tree models.
Bagging trains multiple tree models independently. Each tree makes its own prediction, and the predictions of all trees are summed and averaged. Random forest is the typical example and is covered below.
Boosting adds one tree at a time, each tree correcting the ones before it. The final result is the sum of the trees' outputs, not their mean.
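The difference in how the two families combine their trees can be sketched in a few lines. This is only an illustration: the function names and the three bagging predictions are made up, while the boosting outputs 950, 30 and 18 reuse the numbers from the boosting example later in these notes.

```python
from statistics import mean

def bagging_predict(tree_predictions):
    """Bagging: every tree predicts the full target; the ensemble averages them."""
    return mean(tree_predictions)

def boosting_predict(tree_outputs):
    """Boosting: every tree predicts a correction; the ensemble sums them."""
    return sum(tree_outputs)

# Three independent trees, each aiming at the full label (hypothetical values).
bagged = bagging_predict([980, 1010, 1004])

# Three trees added one after another, each fixing what is still missing.
boosted = boosting_predict([950, 30, 18])

print(bagged, boosted)
```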
Random forests
Introduction
- Random: the data is sampled at random, for example each tree is built on a random 80% of the samples. The features are sampled at random as well: each tree uses a random 80% of the features (the ratio can be specified).
- Forest: many trees; the results of all trees are summed and averaged.
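The two kinds of randomness can be sketched as a per-tree sampling plan. This is a minimal sketch of the 80% scheme described above, not the behaviour of any particular library: the helper name `sample_rows_and_features` and the `frac` parameter are assumptions, and many implementations instead draw rows with replacement (bootstrap) and pick features per split rather than per tree.

```python
import random

def sample_rows_and_features(n_rows, n_features, frac=0.8, rng=None):
    """Pick the row indices and feature indices one tree is allowed to see."""
    rng = rng or random.Random()
    # Sampling without replacement, following the "80% of the samples" wording.
    rows = sorted(rng.sample(range(n_rows), int(n_rows * frac)))
    features = sorted(rng.sample(range(n_features), int(n_features * frac)))
    return rows, features

rng = random.Random(0)  # fixed seed so the plan is reproducible
plans = [sample_rows_and_features(10, 5, rng=rng) for _ in range(3)]

for tree_id, (rows, features) in enumerate(plans):
    print(f"tree {tree_id}: rows={rows} features={features}")
```

Each tree would then be trained only on its own rows and features, and the forest's prediction is the mean of the trees' predictions.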
Advantages of random forests
A random forest can tell you which features are important, using the process in the figure below. First, compute the error rate of a model built on the original features. Then perturb the values of the feature you want to judge (shuffle them or change them), leave the other features unchanged, and model and compute the error rate again. If the two error rates are close, the feature is not important. If the second is much larger than the first, the feature is very important. Importance can also be judged from a feature's position in the tree: the closer to the root node, the more important (the algorithm provides this directly).
More trees generally help, but only up to a point: beyond a certain number, the accuracy just fluctuates around the same level.
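The shuffle-and-compare procedure can be sketched as follows. The `model` function is a hypothetical stand-in for a fitted classifier that happens to look only at feature 0, and the data is synthetic. One assumption to note: this sketch reuses the already fitted model on the shuffled data, which is how permutation importance is commonly computed; retraining after shuffling is another possible reading of the notes.

```python
import random

def error_rate(model, X, y):
    """Fraction of rows the model gets wrong."""
    wrong = sum(1 for row, label in zip(X, y) if model(row) != label)
    return wrong / len(y)

def permutation_importance(model, X, y, feature, rng):
    """Increase in error rate after shuffling one feature column."""
    baseline = error_rate(model, X, y)
    column = [row[feature] for row in X]
    rng.shuffle(column)
    # Rebuild the rows with only the chosen feature replaced.
    X_shuffled = [row[:feature] + [value] + row[feature + 1:]
                  for row, value in zip(X, column)]
    return error_rate(model, X_shuffled, y) - baseline

def model(row):
    # Hypothetical fitted model: depends on feature 0 only.
    return 1 if row[0] > 0.5 else 0

rng = random.Random(0)
X = [[rng.random(), rng.random()] for _ in range(200)]
y = [model(row) for row in X]  # labels the model reproduces exactly

imp0 = permutation_importance(model, X, y, 0, rng)  # feature the model uses
imp1 = permutation_importance(model, X, y, 1, rng)  # feature the model ignores
print(imp0, imp1)
```

Shuffling the ignored feature leaves the error rate unchanged, while shuffling the used feature makes it jump, which is the "close" versus "much larger" comparison described above.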
Boosting overview
Introduction to the algorithm
Suppose the first tree, A, predicts 950 when the label is 1000. Tree B then improves on A: B does not try to predict 1000, it makes up for what A missed, so B's target is 1000-950=50. Suppose B predicts 30. When it is C's turn, C treats the earlier trees as a whole, so its target is 1000-950-30=20. Suppose C predicts 18. D works the same way, and its target is 2.
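The arithmetic of this example can be replayed with a short loop. The tree outputs are the numbers from the notes; the variable names are only illustrative.

```python
label = 1000
tree_outputs = {"A": 950, "B": 30, "C": 18}  # predictions given in the notes

remaining = label  # what is still unexplained before each tree is added
targets = {}
for name, output in tree_outputs.items():
    targets[name] = remaining  # the value this tree is asked to predict
    remaining -= output        # what is left for the next tree

targets["D"] = remaining       # D's target is whatever A, B and C left over
print(targets)
```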
Boosting formula
The first part is the prediction of the previous trees, and the second part is the current tree model. The previous result F_{m-1}(x_i) plus the output of the current tree h(x_i) is compared with the true value y_i. If the new tree improves the result, it is added; if there is no improvement, it is not added. This guarantees that the model is stronger after each tree is added than it was before.
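In symbols, this description is often written as the additive update below. It is the standard gradient-boosting form and not necessarily the exact formula these notes had in mind; the loss function L, the tree family H and the sample count n are assumed symbols that do not appear in the notes.

```latex
F_m(x) = F_{m-1}(x) + h_m(x), \qquad
h_m = \arg\min_{h \in \mathcal{H}} \sum_{i=1}^{n} L\bigl(y_i,\; F_{m-1}(x_i) + h(x_i)\bigr)
```

With a squared-error loss, the best h(x_i) is the one closest to the residual y_i - F_{m-1}(x_i), which matches the 1000-950=50 target in the example with trees A to D.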
Representative models
- AdaBoost: if the first split gives the result in Figure 1, where one triangle is misclassified, the next round increases the weight of that triangle.
- XGBoost: the algorithm follows the A, B, C, D example above, in which each tree improves on the trees before it.
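The AdaBoost reweighting step can be sketched with the textbook update rule. The notes do not give the formula, so the `alpha` computation and normalisation below are the standard AdaBoost form rather than something taken from the notes, and the five-point example is hypothetical. The sketch assumes the weighted error is strictly between 0 and 1.

```python
import math

def adaboost_reweight(weights, is_wrong):
    """One AdaBoost round: raise the weight of misclassified points."""
    # Weighted error of the current weak learner (assumed to be in (0, 1)).
    err = sum(w for w, wrong in zip(weights, is_wrong) if wrong)
    alpha = 0.5 * math.log((1 - err) / err)  # say of this learner
    updated = [w * math.exp(alpha if wrong else -alpha)
               for w, wrong in zip(weights, is_wrong)]
    total = sum(updated)
    return [w / total for w in updated], alpha

weights = [0.2] * 5                            # five points, equal weight
is_wrong = [False, False, True, False, False]  # the third point was misclassified

new_weights, alpha = adaboost_reweight(weights, is_wrong)
print(new_weights, alpha)
```

After the update, the misclassified point carries more weight than any correctly classified one, so the next weak learner is pushed to get it right.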
Stacking
Model overview
Use several classifiers, such as LR (logistic regression), DT (decision tree) and RF (random forest), to make predictions separately, then feed their results as input to another model that produces the final prediction.
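The two-level structure can be sketched in plain Python. Everything here is a stand-in: the three base functions are hypothetical rules playing the role of LR, DT and RF, and the second-level model is a deliberately tiny lookup table that learns the most common label for each pattern of base predictions. In practice the second-level model is usually a real learner trained on held-out predictions of the base models.

```python
from collections import Counter, defaultdict

# Hypothetical base classifiers standing in for LR, DT and RF.
def base_a(x):
    return 1 if x[0] > 0 else 0

def base_b(x):
    return 1 if x[1] > 0 else 0

def base_c(x):
    return 1 if x[0] + x[1] > 0 else 0

BASE_MODELS = [base_a, base_b, base_c]

def base_outputs(x):
    """First level: the predictions of all base models, used as new features."""
    return tuple(model(x) for model in BASE_MODELS)

def fit_meta(X, y):
    """Second level: majority label for each pattern of base predictions."""
    votes = defaultdict(Counter)
    for x, label in zip(X, y):
        votes[base_outputs(x)][label] += 1
    return {pattern: counts.most_common(1)[0][0]
            for pattern, counts in votes.items()}

def stacked_predict(meta, x, default=0):
    # Patterns never seen in training fall back to the default label.
    return meta.get(base_outputs(x), default)

# Toy data: the label is 1 only when both coordinates are positive.
X_train = [(1, 1), (1, -2), (-2, 1), (-1, -1), (2, -1), (-1, 2)]
y_train = [1, 0, 0, 0, 0, 0]

meta = fit_meta(X_train, y_train)
print(stacked_predict(meta, (3, 4)), stacked_predict(meta, (5, -1)))
```

On this toy data each base rule makes at least one mistake on its own, while the second-level model combines their outputs into the correct decision.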