Study Notes on Filtering Data with Regular Expressions (①)
2022-07-29 00:57:00 【Json159】
//$noTagsText = ' \n \n \n \n \n \n \n
// Total pending orders \n \n \n 2 \n \n \n \n \n \n \n \n \n \n Today \n \n 2 \n \n \n \n Yesterday \n \n 0 \n \n \n \n Older \n \n 0 \n \n \n Current Daily Order Volume Limitation \n \n 10000 \n \n \n \n \n \n\n \n \n \n \n \n \n your rating \n \n \n 4.0 \n \n \n \n \n \n \n \n \n \n Cancellation rate \n Excellent \n \n \n 0% \n \n \n \n \n Quality return rate \n Excellent \n \n \n 0% \n \n \n \n \n Average customer rating \n Bad \n \n \n 2.8 \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n Best Selling Product Sales Contribution \n \n \n 68% \n \n \n \n \n \n \n \n \n \n Best selling products low stock level \n 0 \n \n \n Total out of stock \n 49820 \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n New product creation in the last 14 days \n \n \n \n \n \n 726 \n \n \n \n \n \n Rejected products (poor quality) \n \n 13951 \n \n \n Rejected products (image missing) \n \n 51307 \n \n \n Approved products \n \n 317087 \n \n \n Pending products \n \n 19 \n \n \n \n \n \n \n';
// $noTagsText = preg_replace("/[\r\n\t]+/", PHP_EOL, $noTagsText);
// $noTagsText = preg_replace("/ +/", " ", $noTagsText);
// $noTagsText = str_replace('\n', PHP_EOL, $noTagsText);
/*$noTagsText = preg_replace("/<style.*?>.*?<\/style>/is", " ", $noTagsText); // strip CSS*/
/*$noTagsText = preg_replace("/<script.*?>.*?<\/script>/is", " ", $noTagsText); // strip scripts*/
// $noTagsText = preg_replace("/<!--.*?-->/is", " ", $noTagsText); // strip HTML comments
/*$noTagsText = preg_replace("/<\/?[a-zA-Z]+.*?>/is", " ", $noTagsText); // strip all HTML tags*/
// $noTagsText = preg_replace("/&[a-zA-Z]{2,};/", " ", $noTagsText); // strip HTML entities
// Replace newlines and tabs (with PHP_EOL here)
// $noTagsText = preg_replace("/[\r\n\t]/", PHP_EOL, $noTagsText);
// $noTagsText = strip_tags($noTagsText);
// $noTagsText = preg_replace_callback(
// ['/\\\\n/', '/ +/'],
// function ($matches) {
// if (reset($matches) == PHP_EOL) {
// return '';
// } else {
// return ' ';
// }
// },
// $noTagsText
// );
//
// var_export($noTagsText);
//
// return false;
$noTagsText = '<div class=\"home-stats-container\">\n <div class=\"col-lg-3 col-sm-6 col-md-6\">\n <div class=\"home-stats-block home-stats-color-blue\">\n <!-- Header -->\n <a class=\"home-stats-header-container\" href=\"/order/#!/reset-filters\">\n <span class=\"home-stats-header--row\">\n <span class=\"home-stats-header-text\">\n Total pending orders </span>\n </span>\n <div class=\"home-stats-header-body\">\n 2 </div>\n </a>\n <span class=\"js-tooltip\" data-toggle=\"tooltip\" data-placement=\"top\"\n data-original-title=\"Display of total order amount which is pending since the listed time.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </span>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a id=\"js-pending-orders-today\" class=\"home-stats-body-row\" href=\"/order/index/index/fromDate/2022-07-21/#!/reset-filters\">\n <span class=\"details-cell\">\n Today </span>\n <span class=\"numbers-cell\">\n 2 </span>\n</a>\n<a id=\"js-pending-orders-yesterday\" class=\"home-stats-body-row\" href=\"/order/index/index/fromDate/2022-07-20/toDate/2022-07-20/#!/reset-filters\">\n <span class=\"details-cell\">\n Yesterday </span>\n <span class=\"numbers-cell\">\n 0 </span>\n</a>\n<a id=\"js-pending-orders-older\" class=\"home-stats-body-row\" href=\"/order/index/index/fromDate/2021-07-21/toDate/2022-07-19/#!/reset-filters\">\n <span class=\"details-cell\">\n Older </span>\n <span class=\"numbers-cell\">\n 0 </span>\n</a>\n <a class=\"home-stats-body-row\" href=\"/order/#!/reset-filters\">\n <span class=\"details-cell\">Current Daily Order Volume Limitation</span>\n <span class=\"numbers-cell\">\n 10000 </span>\n </a>\n </div>\n <!-- Rows -->\n </div>\n</div>\n\n <div class=\"col-lg-3 col-md-6 col-sm-6\">\n <div class=\"home-stats-block home-stats-color-orange\">\n <!-- Header -->\n <div class=\"home-stats-header-container\">\n <div class=\"home-stats-header--row\">\n <div class=\"home-stats-header-text\">\n your rating </div>\n 
</div>\n <div class=\"home-stats-header-body\">\n 4.0 </div>\n </div>\n <a class=\"js-tooltip\" data-toggle=\"tooltip\" data-placement=\"top\" title=\"\"\n data-original-title=\"Calculation date: 2022-07-20.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </a>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a class=\"home-stats-body-row\"\n href=\"/order/index/index/filteredStatus/3\"\n >\n <span class=\"details-cell\">\n Cancellation rate <br/>\n <strong>Excellent</strong>\n </span>\n <span class=\"numbers-cell\">\n <span>0%<i\n class=\"font-icon\"></i></span>\n </span>\n </a>\n <a class=\"home-stats-body-row\"\n href=\"/order/index/index/filteredStatus/4\"\n >\n <span class=\"details-cell\">\n Quality return rate <br/>\n <strong>Excellent</strong>\n </span>\n <span class=\"numbers-cell\">\n <span>0%<i\n class=\"font-icon\"></i></span>\n </span>\n </a>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\">\n Average customer rating <br/>\n <strong>Bad</strong>\n </span>\n <span class=\"numbers-cell\">\n <span>2.8<i\n class=\"font-icon\"></i></span>\n </span>\n </div>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\">\n <br/>\n </span>\n <span class=\"numbers-cell\"></span>\n </div>\n </div>\n <!-- Rows -->\n </div>\n</div>\n \n <div class=\"col-lg-3 col-md-6 col-sm-6\">\n <div class=\"home-stats-block home-stats-color-purple\">\n <!-- Header -->\n <a class=\"home-stats-header-container\" href=\"/product/index/best-selling\">\n <span class=\"home-stats-header--row\">\n <span class=\"home-stats-header-text\">\n Best Selling Product Sales Contribution </span>\n </span>\n <div class=\"home-stats-header-body\">\n 68% </div>\n </a>\n <a class=\"js-tooltip pull-right\" data-toggle=\"tooltip\" data-placement=\"top\" title=\"\"\n data-original-title=\"Best Selling products are the top 20% of your shop assortment that sold the most in the past 14 days.\">\n <i class=\"font-icon 
icon-info-circled-alt\"></i>\n </a>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a class=\"home-stats-body-row\" href=\"/product/index/best-selling\">\n <span class=\"details-cell\">\n Best selling products low stock level </span>\n <span class=\"numbers-cell\">0</span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/sold-out\">\n <span class=\"details-cell\">Total out of stock</span>\n <span class=\"numbers-cell\">49820</span>\n </a>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\"></span>\n <span class=\"numbers-cell\"></span>\n </div>\n <div class=\"home-stats-body-row\">\n <span class=\"details-cell\"></span>\n <span class=\"numbers-cell\"></span>\n </div>\n </div>\n <!-- Rows -->\n </div>\n </div>\n <div class=\"col-lg-3 col-md-6 col-sm-6\">\n <div class=\"home-stats-block home-stats-color-green\">\n <!-- Header -->\n <div class=\"home-stats-header-container\">\n <div class=\"home-stats-header--row\">\n <div class=\"home-stats-header-text\">\n New product creation in the last 14 days </div>\n <a class=\"js-tooltip\" data-toggle=\"tooltip\" data-placement=\"top\"\n data-original-title=\"Overview on amount of products, which got created as well as their approval summary.\">\n <i class=\"font-icon icon-info-circled-alt\"></i>\n </a>\n </div>\n <div class=\"home-stats-header-body\">\n 726 </div>\n </div>\n <!-- Header -->\n <!-- Rows -->\n <div class=\"home-stats-body-container\">\n <a class=\"home-stats-body-row\" href=\"/product/index/rejected\">\n <span class=\"details-cell\">Rejected products (poor quality)</span>\n <span class=\"numbers-cell\">\n 13951 </span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/image-missing\">\n <span class=\"details-cell\">Rejected products (image missing)</span>\n <span class=\"numbers-cell\">\n 51307 </span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/active\">\n <span class=\"details-cell\">Approved products</span>\n 
<span class=\"numbers-cell\">\n 317087 </span>\n </a>\n <a class=\"home-stats-body-row\" href=\"/product/index/pending\">\n <span class=\"details-cell\">Pending products</span>\n <span class=\"numbers-cell\">\n 19 </span>\n </a>\n </div>\n <!-- Rows -->\n </div>\n</div> <div class=\"clearfix\"></div>\n</div>\n';
// $noTagsText = preg_replace("/<!--.*?-->/is", " ", $noTagsText);
//
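$noTagsText = str_replace('\n', PHP_EOL, $noTagsText); // turn each literal backslash-n sequence into a real line break
$noTagsText = str_replace('\\', '', $noTagsText); // drop the remaining backslashes, so \" becomes "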
$noTagsText = preg_replace('/<!--.*?-->/s', '', $noTagsText); // remove HTML comments
$noTagsText = strip_tags($noTagsText); // remove HTML tags
$noTagsText = preg_replace('/ +/', ' ', $noTagsText); // collapse runs of spaces into a single space
// Treat tabs as line breaks, and collapse each run of line breaks (plus the
// whitespace around it) into one PHP_EOL. This works whether PHP_EOL is "\n" or "\r\n".
$noTagsText = preg_replace('/\s*[\r\n\t]\s*/', PHP_EOL, $noTagsText);
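The steps above can also be wrapped in one small function that returns the visible text as an array of lines. This is only a sketch under stated assumptions: the name `htmlToLines()` and the shortened `$sample` string are hypothetical and not part of the original notes, and the input is assumed to be escaped the same way as `$noTagsText` (literal `\n` and `\"` sequences inside a single-quoted string). It requires PHP 7.4 or later for the arrow function.

```php
<?php
// Hypothetical helper (not from the original notes): turn escaped HTML
// into a list of non-empty, trimmed text lines.
function htmlToLines(string $escapedHtml): array
{
    // Undo the literal escaping: backslash + n becomes a real newline,
    // then any remaining backslashes (as in \") are dropped.
    $text = str_replace('\n', "\n", $escapedHtml);
    $text = str_replace('\\', '', $text);

    // Remove HTML comments, then every remaining tag.
    $text = preg_replace('/<!--.*?-->/s', '', $text);
    $text = strip_tags($text);

    // Split on any line-break sequence, trim each piece,
    // and keep only the pieces that still contain text.
    $lines = array_map('trim', preg_split('/\R/', $text));
    return array_values(array_filter($lines, static fn ($line) => $line !== ''));
}

// Shortened, made-up sample in the same escaped form as $noTagsText.
$sample = '<div class=\"row\">\n <!-- Header -->\n <span>\n Today </span>\n <span>\n 2 </span>\n</div>\n';
print_r(htmlToLines($sample));
```

For this sample the function returns the two lines `Today` and `2`. Returning an array rather than one string makes it easier to pair each label with the number that follows it, although that pairing step is outside what the notes cover.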