当前位置:网站首页>正则过滤数据学习笔记(①)
正则过滤数据学习笔记(①)
2022-07-29 00:57:00 【Json159】
//$noTagsText = ' \n \n \n \n \n \n \n
// Total pending orders \n \n \n 2 \n \n \n \n \n \n \n \n \n \n Today \n \n 2 \n \n \n \n Yesterday \n \n 0 \n \n \n \n Older \n \n 0 \n \n \n Current Daily Order Volume Limitation \n \n 10000 \n \n \n \n \n \n\n \n \n \n \n \n \n your rating \n \n \n 4.0 \n \n \n \n \n \n \n \n \n \n Cancellation rate \n Excellent \n \n \n 0% \n \n \n \n \n Quality return rate \n Excellent \n \n \n 0% \n \n \n \n \n Average customer rating \n Bad \n \n \n 2.8 \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n Best Selling Product Sales Contribution \n \n \n 68% \n \n \n \n \n \n \n \n \n \n Best selling products low stock level \n 0 \n \n \n Total out of stock \n 49820 \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n New product creation in the last 14 days \n \n \n \n \n \n 726 \n \n \n \n \n \n Rejected products (poor quality) \n \n 13951 \n \n \n Rejected products (image missing) \n \n 51307 \n \n \n Approved products \n \n 317087 \n \n \n Pending products \n \n 19 \n \n \n \n \n \n \n';
// $noTagsText = preg_replace("/[\r\n\t]+/", PHP_EOL, $noTagsText);
// $noTagsText = preg_replace("/[\r\n\t]+/", PHP_EOL, $noTagsText);
// $noTagsText = preg_replace("/ +/", " ", $noTagsText);
// $noTagsText = str_replace('\n', PHP_EOL, $noTagsText);
/*$noTagsText = preg_replace("/<style.*?>.*?<\/style>/is", " ", $noTagsText); //过滤css*/
/*$noTagsText = preg_replace("/<script.*?>.*?<\/script>/is", " ", $noTagsText); //过滤script*/
// $noTagsText = preg_replace("/<!--.*?-->/is", " ", $noTagsText); //过滤html注释
/*$noTagsText = preg_replace("/<\/?[a-zA-Z]+.*?>/is", " ", $noTagsText); //过滤掉所有的html标签*/
// $noTagsText = preg_replace("/&[a-zA-Z]{2,};/", " ", $noTagsText); //过滤html实体
//换行符 制表符替换为空格
// $noTagsText = preg_replace("/[\r\n\t]/", PHP_EOL, $noTagsText);
// $noTagsText = strip_tags($noTagsText);
// $noTagsText = preg_replace_callback(
// ['/\\\\n/', '/ +/'],
// function ($matches) {
// if (reset($matches) == PHP_EOL) {
// return '';
// } else {
// return ' ';
// }
// },
// $noTagsText
// );
//
// var_export($noTagsText);
//
// return false;
$noTagsText = '<div class=\"home-stats-container\">\n <div class=\"col-lg-3 col-sm-6 col-md-6\">\n <div class=\"home-stats-block home-stats-color-blue\">\n <!-- Header -->\n <a class=\"home-stats-header-container\" href=\"/order/#!/reset-filters\">\n <span class=\"home-stats-header--row\">\n <span class=\"home-stats-header-text\">\n Total pending orders </span>\n </span>\n <div class=\"home-stats-header-body\">\n 2 </div>\n </a>\n <span class=\"js-tooltip\" data-toggle=\"tooltip\" data-placement=\"top\"\n data-original-title=\"Display of total order amount which is pending since the listed time.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </span>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a id=\"js-pending-orders-today\" class=\"home-stats-body-row\" href=\"/order/index/index/fromDate/2022-07-21/#!/reset-filters\">\n <span class=\"details-cell\">\n Today </span>\n <span class=\"numbers-cell\">\n 2 </span>\n</a>\n<a id=\"js-pending-orders-yesterday\" class=\"home-stats-body-row\" href=\"/order/index/index/fromDate/2022-07-20/toDate/2022-07-20/#!/reset-filters\">\n <span class=\"details-cell\">\n Yesterday </span>\n <span class=\"numbers-cell\">\n 0 </span>\n</a>\n<a id=\"js-pending-orders-older\" class=\"home-stats-body-row\" href=\"/order/index/index/fromDate/2021-07-21/toDate/2022-07-19/#!/reset-filters\">\n <span class=\"details-cell\">\n Older </span>\n <span class=\"numbers-cell\">\n 0 </span>\n</a>\n <a class=\"home-stats-body-row\" href=\"/order/#!/reset-filters\">\n <span class=\"details-cell\">Current Daily Order Volume Limitation</span>\n <span class=\"numbers-cell\">\n 10000 </span>\n </a>\n </div>\n <!-- Rows -->\n </div>\n</div>\n\n <div class=\"col-lg-3 col-md-6 col-sm-6\">\n <div class=\"home-stats-block home-stats-color-orange\">\n <!-- Header -->\n <div class=\"home-stats-header-container\">\n <div class=\"home-stats-header--row\">\n <div class=\"home-stats-header-text\">\n your rating </div>\n </div>\n <div class=\"home-stats-header-body\">\n 4.0 </div>\n </div>\n <a class=\"js-tooltip\" data-toggle=\"tooltip\" data-placement=\"top\" title=\"\"\n data-original-title=\"Calculation date: 2022-07-20.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </a>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a class=\"home-stats-body-row\"\n href=\"/order/index/index/filteredStatus/3\"\n >\n <span class=\"details-cell\">\n Cancellation rate <br/>\n <strong>Excellent</strong>\n </span>\n <span class=\"numbers-cell\">\n <span>0%<i\n class=\"font-icon\"></i></span>\n </span>\n </a>\n <a class=\"home-stats-body-row\"\n href=\"/order/index/index/filteredStatus/4\"\n >\n <span class=\"details-cell\">\n Quality return rate <br/>\n <strong>Excellent</strong>\n </span>\n <span class=\"numbers-cell\">\n <span>0%<i\n class=\"font-icon\"></i></span>\n </span>\n </a>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\">\n Average customer rating <br/>\n <strong>Bad</strong>\n </span>\n <span class=\"numbers-cell\">\n <span>2.8<i\n class=\"font-icon\"></i></span>\n </span>\n </div>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\">\n <br/>\n </span>\n <span class=\"numbers-cell\"></span>\n </div>\n </div>\n <!-- Rows -->\n </div>\n</div>\n \n <div class=\"col-lg-3 col-md-6 col-sm-6\">\n <div class=\"home-stats-block home-stats-color-purple\">\n <!-- Header -->\n <a class=\"home-stats-header-container\" href=\"/product/index/best-selling\">\n <span class=\"home-stats-header--row\">\n <span class=\"home-stats-header-text\">\n Best Selling Product Sales Contribution </span>\n </span>\n <div class=\"home-stats-header-body\">\n 68% </div>\n </a>\n <a class=\"js-tooltip pull-right\" data-toggle=\"tooltip\" data-placement=\"top\" title=\"\"\n data-original-title=\"Best Selling products are the top 20% of your shop assortment that sold the most in the past 14 days.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </a>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a class=\"home-stats-body-row\" href=\"/product/index/best-selling\">\n <span class=\"details-cell\">\n Best selling products low stock level </span>\n <span class=\"numbers-cell\">0</span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/sold-out\">\n <span class=\"details-cell\">Total out of stock</span>\n <span class=\"numbers-cell\">49820</span>\n </a>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\"></span>\n <span class=\"numbers-cell\"></span>\n </div>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\"></span>\n <span class=\"numbers-cell\"></span>\n </div>\n </div>\n <!-- Rows -->\n </div>\n </div>\n <div class=\"col-lg-3 col-md-6 col-sm-6\">\n <div class=\"home-stats-block home-stats-color-green\">\n <!-- Header -->\n <div class=\"home-stats-header-container\">\n <div class=\"home-stats-header--row\">\n <div class=\"home-stats-header-text\">\n New product creation in the last 14 days </div>\n <a class=\"js-tooltip\" data-toggle=\"tooltip\" data-placement=\"top\"\n data-original-title=\"Overview on amount of products, which got created as well as their approval summary.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </a>\n </div>\n <div class=\"home-stats-header-body\">\n 726 </div>\n </div>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a class=\"home-stats-body-row\" href=\"/product/index/rejected\">\n <span class=\"details-cell\">Rejected products (poor quality)</span>\n <span class=\"numbers-cell\">\n 13951 </span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/image-missing\">\n <span class=\"details-cell\">Rejected products (image missing)</span>\n <span class=\"numbers-cell\">\n 51307 </span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/active\">\n <span class=\"details-cell\">Approved products</span>\n <span class=\"numbers-cell\">\n 317087 </span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/pending\">\n <span class=\"details-cell\">Pending products</span>\n <span class=\"numbers-cell\">\n 19 </span>\n </a>\n </div>\n <!-- Rows -->\n </div>\n</div> <div class=\"clearfix\"></div>\n</div>\n';
// $noTagsText = preg_replace("/<!--.*?-->/is", " ", $noTagsText);
//
$noTagsText = str_replace('\n', PHP_EOL, $noTagsText);
$noTagsText = str_replace('\\', '', $noTagsText);
/*$noTagsText = preg_replace("/<style.*?>.*?<\/style>/is", " ", $noTagsText); //过滤css*/
/*$noTagsText = preg_replace("/<script.*?>.*?<\/script>/is", " ", $noTagsText); //过滤script*/
// $noTagsText = preg_replace("/<!--.*?-->/is", " ", $noTagsText); //过滤html注释
/*$noTagsText = preg_replace("/<\/?[a-zA-Z]+.*?>/is", " ", $noTagsText); //过滤掉所有的html标签*/
// $noTagsText = preg_replace("/&[a-zA-Z]{2,};/", " ", $noTagsText); //过滤html实体
//换行符 制表符替换为空格
// $noTagsText = preg_replace("/[\r\n\t]/", PHP_EOL, $noTagsText);
// $noTagsText = strip_tags($noTagsText);
$noTagsText = str_replace('\n', PHP_EOL, $noTagsText); // 替换本身的 \n 字符为换行
$noTagsText = preg_replace("/<!--.*?-->/is", "", $noTagsText); // 去除注释
$noTagsText = preg_replace("/[\r\n\t]/", PHP_EOL, $noTagsText);
$noTagsText = strip_tags($noTagsText); // 去除 标签
$noTagsText = preg_replace("(\s+\r)", PHP_EOL, $noTagsText); // 换行多个改为一个
$noTagsText = preg_replace_callback(
['/\\\\n/', '/ +/'],
function ($matches) {
if (reset($matches) == PHP_EOL) {
return '';
} else {
return ' ';
}
},
$noTagsText
);
边栏推荐
- [search] - iteration deepening / bidirectional dfs/ida*
- 【7.21-26】代码源 - 【体育节】【丹钓战】【最大权值划分】
- Read the recent trends of okaleido tiger and tap the value and potential behind it
- Merkel Studio - harmonyos implementation list to do
- With the explosive growth of digital identity in 2022, global organizations are facing greater network security
- LeetCode 113:路径总和 II
- Openpyxl cell center
- How to protect WordPress website from network attack? It is essential to take safety measures
- Where will Jinan win in hosting the first computing power conference?
- 【GoLang】同步锁 Mutex
猜你喜欢

T-sne降维

覆盖接入2w+交通监测设备,EMQ为深圳市打造交通全要素数字化新引擎

Openpyxl border
![[unity project practice] synthetic watermelon](/img/60/20d4ef6f4ad99a9bdb7dc2b4dba23b.png)
[unity project practice] synthetic watermelon

【观察】三年跃居纯公有云SaaS第一,用友YonSuite的“飞轮效应”
![[search] - DFS pruning and optimization](/img/d4/7c2fec02f5a6bcfa2d5e204398af01.png)
[search] - DFS pruning and optimization

【Web技术】1395- Esbuild Bundler HMR

ELMO,BERT和GPT简介

承办首届算力大会,济南胜在何处?

Data security is a competitive advantage. How can companies give priority to information security and compliance
随机推荐
围绕新市民金融聚焦差异化产品设计、智能技术提效及素养教育
golang启动报错【已解决】
JVM learning minutes
Use of resttemplate and Eureka
Openpyxl border
Making high-precision map based on autoware (V)
【HCIP】MGRE环境下OSPF实验,含多进程双向重发布及OSPF特殊区域
The solution to keep the height of the container unchanged during the scaling process of the uniapp movable view table
全面升级,你能想象的都在这————京东API接口
5g commercial third year: driverless "going up the mountain" and "going to the sea"
Sigma-DSP-OUTPUT
覆盖接入2w+交通监测设备,EMQ为深圳市打造交通全要素数字化新引擎
Cloud native application comprehensive exercise
Autoware reports an error: can't generate global path for start solution
T-sne dimensionality reduction
Three ways of creating indexes in MySQL
关于df[‘某一列名’][序号]
瑞吉外卖项目实战Day01
Tomorrow infinite plan, 2022 conceptual planning scheme for a company's yuanuniverse product launch
Reinforcement learning (II): SARS, with code rewriting