当前位置:网站首页>Hologres shared cluster helps Taobao subscribe to the extreme refined operation
Hologres shared cluster helps Taobao subscribe to the extreme refined operation
2022-06-30 16:46:00 【InfoQ】
Taobao subscription is in urgent need of refined content operation


- High quality content circle selection : Distribute content at the front desk of Taobao subscription , Support multiple dimension feature filtering methods .
- Low quality content filtering : Yellow River, politics and meaningless content , Filter through feature filtering .
- The core content is put on display : The operation will select a batch of core in-depth operation contents for front office delivery , Through the circle selection system , Filter by different dimensions , The obtained content is used for the content aggregation page of the foreground
- Promote the content atmosphere : A number of activities were selected by the operation circle , When the front desk is revealed, it will be greatly promoted to strengthen the atmosphere
- The flow of cooperative merchants is inclined : Through the circle selection system , Circle the contents of a group of core cooperative businesses , Flow tilting is performed when the content on the front desk is displayed by hand .
Subscription features circle selection system engine selection
Architecture design of current subscription circle selection system
- Abstract the circle selection process into content id+ relation id+ Multi dimension indicator filtering , Get the filtered target content id The process of ;
- The process of creating an activity instance that contains a batch of content ;
- Configure the circlable information into filter items schema;
- Take the actual filter condition value as the filter item value.

The core demands of engine selection : flexibility & High performance
- Easy access , Reduce the complexity of filtering executable statement translation
- Performance and stability guarantee , The logic of circle selection changes with the operation strategy , Need to support complex query and rapid response .
- Support to add variable characteristic fields , Have some flexibility .

adopt Hologres Subscription system built by cluster
Hologres colony : Less data movement + Faster queries
- Fast access : Hologres The shared cluster only needs to establish an instance for quick use , It can facilitate the quick start of business , It basically meets most usage scenarios . When business development needs it , You can apply for an independent cluster and migrate , This is relatively friendly among many engines of the group . Subscription services are also built based on public clusters at the initial stage , In the later stage, independent clusters are gradually used .
- Seamless development :Hologres Supported by the SQL Query syntax and common SQL The query is basically consistent , Basically seamless use . The visual interface supports the function of one key synchronization table structure , It is especially suitable for students whose table structure changes frequently .
- Reduce data movement :Hologres The natural support is to read and store in the form of appearance MaxCompute Multiple project The data of , This allows you to aggregate data from different project Of offline data , Reduce the complexity of query , You can query directly without data import and export , It also lowers storage costs .
- Compared with MaxCompute Query for , High performance improvement . After many tests : The data volume is billion , Appearance query complex statements ( Contains multiple tables JOIN) It takes about 8-9 second ; It takes time to filter and query a single table 2 About seconds . Suitable for offline / Quasi real time query scenario . The internal table query is about 60ms, It can be used for online inquiry .
- By supporting UDF/ Expression push down , To achieve user-defined UDF Calculation ; Pushing down the expression can reduce the overhead of useless data transmission , Further improve performance .
adopt Hologres Best practices for setting up subscription systems


SELECT feed_id
FROM qn_xxx_provider AS a
WHERE a.xxx_pv > 30000
AND a.xxx_pctr > '0.1'
AND a.last_publish_time >= '2022-06-17 08:00:00'
AND a.biz_xxx_code = '111'
AND a.ds = MAX_PT('xxxxxx_table')
AND CAST(a.owner_xxx_id AS VARCHAR) IN (SELECT b.domain_xxx_id FROM xxxxxxx_table AS b WHERE b.rule_type = 12 AND b.channel_xxx_id = 137 AND b.dataset_xx_id = xxxxx AND b.ds = MAX_PT('xxxxx_odps_channel') )
and a.feed_id in (SELECT feed_id from xxxxx_submission_feed_hh where activity_id = 222 and approval_status=1 and ds = MAX_PT('xxxxx_submission_hh') and hh = '13')
;Business value
- Support for more real-time features : Hologres The performance of the internal meter is superior , Import live features into Hologres In the inner table , Support real-time feature query
- Reduce tuning GUC Use of parameters , for example :set hg_foreign_table_max_partition_limit =128;( Single adjustment query Number of access surface partitions ), It is expected that better product-oriented capabilities can be used to solve , Reduce GUC Use of parameters .
边栏推荐
- 备战数学建模34-BP神经网络预测2
- I 用c I 实现“栈”
- halcon知识:区域专题【07】
- KDD 2022 | how far are we from the general pre training recommendation model? Universal sequence representation learning model unisrec for recommender system
- 微信表情符号写入判决书,你发的OK、炸弹都可能成为“呈堂证供”
- “低代码”在企业数字化转型中扮演着什么角色?
- 安全帽佩戴检测算法研究
- POJ Project Summer
- 2022 Blue Bridge Cup group B -2022- (01 backpack to calculate the number of schemes)
- JS ES5也可以创建常量?
猜你喜欢

“低代码”在企业数字化转型中扮演着什么角色?

【牛客网刷题系列 之 Verilog快速入门】~ 位拆分与运算

Good partner for cloud skill improvement, senior brother cloud of Amazon officially opened today

Which direction should college students choose to find jobs after graduation?

MC Instruction Decoder

BC1.2 PD协议

'<', Hexadecimal value 0x3c, is an invalid problem solving

How cloudxr promotes the future development of XR

优惠券种类那么多,先区分清楚再薅羊毛!
![[BJDCTF2020]The mystery of ip|[CISCN2019 华东南赛区]Web11|SSTI注入](/img/c2/d6760826b81589781574aebff61f9a.png)
[BJDCTF2020]The mystery of ip|[CISCN2019 华东南赛区]Web11|SSTI注入
随机推荐
mysql8报错:ERROR 1410 (42000): You are not allowed to create a user with GRANT解决办法
IndexSearch
2022蓝桥杯国赛B组-2022-(01背包求方案数)
Half year inventory of new consumption in 2022: the industry is cold, but these nine tracks still attract gold
招标公告:2022年台州联通Oracle一体机和数据库维保服务项目
KDD 2022 | how far are we from the general pre training recommendation model? Universal sequence representation learning model unisrec for recommender system
牛客网:有多少个不同的二叉搜索树
Niuke: how many different binary search trees are there
MC Instruction Decoder
Mathematical modeling for war preparation 33- grey prediction model 2
Bc1.2 PD protocol
RT thread heap size setting
[time series database incluxdb] code example for configuring incluxdb+ data visualization and simple operation with C under Windows Environment
register_ Chrdev and CDEV_ init cdev_ Add usage differences
药品管理系统加数据库,一夜做完,加报告
Niuke network: longest continuous subarray with positive product
中国传奇教授李泽湘,正在批量制造独角兽
2022蓝桥杯国赛B组-费用报销-(线性dp|状态dp)
RT-Thread 堆區大小設置
Mathematical modeling for war preparation 36 time series model 2