当前位置:网站首页>Map of mL: Based on Boston house price regression prediction data set, an interpretable case of xgboost model using map value
Map of mL: Based on Boston house price regression prediction data set, an interpretable case of xgboost model using map value
2022-07-04 14:21:00 【A Virgo procedural ape】
ML And shap: be based on boston Boston house price regression forecast data set utilization shap It's worth it XGBoost Model implementation interpretability case
Catalog
# 4、 be based on XGBR Model implementation shap Value analysis
# 4.1、 Model building and training
# 4.2、 Output the importance of features based on the model itself
# 4.4、 utilize Shap Value interpretation XGBR Model
# 4.5、 be based on XGBoost Model implementation Shap Value visual analysis
be based on boston Boston house price regression forecast data set utilization shap It's worth it XGBoost Model implementation interpretability case
# 1、 Define datasets
Updating ……
# 2、 Data set preprocessing
Updating ……
# 4、 be based on XGBR Model implementation shap Value analysis
# 4.1、 Model building and training
# 4.2、 be based on The importance of the output characteristics of the model itself
XGBR_importance_dict: [('DIS', 57), ('RM', 42), ('LSTAT', 39), ('PTRATIO', 29), ('NOX', 28), ('TAX', 28), ('CRIM', 23), ('B', 15), ('AGE', 13), ('RAD', 8), ('INDUS', 8), ('CHAS', 4), ('ZN', 1)]
# 4.3、 The local independent graph visualizes how the change of a feature affects the output of the model and the distribution of the eigenvalue
# 4.4、 utilize Shap Value interpretation XGBR Model
# 4.5、 be based on XGBoost Model implementation Shap Value visual analysis
# (1)、 Use local independent graph to calculate shap value
# (2)、 Sample value of a column ( The eigenvalue )、 And the corresponding shap Value scatter visualization
# (3)、 Calculate for each feature in all samples shap Mean absolute value / Maximum absolute bar graph visualization
# (4)、 Calculate for each feature in all samples shap Visualization of mean absolute value bee colony graph
# (5)、 Calculate for each feature in all samples shap Average absolute value heat map visualization
# (6)、 be based on cluste The algorithm processes the characteristics of correlation and visualizes
边栏推荐
- R language dplyr package summary_ If function calculates the mean and median of all numerical data columns in dataframe data, and summarizes all numerical variables based on conditions
- docker-compose公网部署redis哨兵模式
- Learn kernel 3: use GDB to track the kernel call chain
- 数据仓库面试问题准备
- 聊聊保证线程安全的 10 个小技巧
- sql优化之查询优化器
- 吃透Chisel语言.04.Chisel基础(一)——信号类型和常量
- DDD application and practice of domestic hotel transactions -- Code
- 2022 practice questions and mock exams for the main principals of hazardous chemical business units
- Xcode 异常图片导致ipa包增大问题
猜你喜欢
[antd] how to set antd in form There is input in item Get input when gourp Value of each input of gourp
Vscode common plug-ins summary
gin集成支付宝支付
Test process arrangement (3)
Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua
RK1126平台OSD的实现支持颜色半透明度多通道支持中文
CVPR 2022 | greatly reduce the manual annotation required for zero sample learning, and propose category semantic embedding rich in visual information (source code download)
TestSuite and testrunner in unittest
Mask wearing detection based on yolov1
DDD application and practice of domestic hotel transactions -- Code
随机推荐
Use of tiledlayout function in MATLAB
[matlab] summary of conv, filter, conv2, Filter2 and imfilter convolution functions
golang fmt. Printf() (turn)
软件测试之测试评估
R language uses the DOTPLOT function of epidisplay package to visualize the frequency of data points in different intervals in the form of point graph, and uses the by parameter to specify the groupin
Use the default route as the route to the Internet
Incremental ternary subsequence [greedy training]
Detailed index of MySQL
LiveData
Ruichengxin micro sprint technology innovation board: annual revenue of 367million, proposed to raise 1.3 billion, Datang Telecom is a shareholder
R language ggplot2 visualization: gganimate package creates animated graph (GIF) and uses anim_ The save function saves the GIF visual animation
R language uses the mutation function of dplyr package to standardize the specified data column (using mean function and SD function), and calculates the grouping mean of the standardized target varia
奇妙秘境 码蹄集
IP lab monthly resumption · issue 5
[FAQ] summary of common causes and solutions of Huawei account service error 907135701
Why should Base64 encoding be used for image transmission
瑞吉外卖笔记
R language uses dplyr package group_ The by function and the summarize function calculate the mean and standard deviation of the target variables based on the grouped variables
Fs4059c is a 5V input boost charging 12.6v1.2a. Inputting a small current to three lithium battery charging chips will not pull it dead. The temperature is 60 ° and 1000-1100ma is recommended
[R language data science]: cross validation and looking back