Compiler optimization (4): induction variables
2022-07-07 19:52:00 【openEuler】
0. Background
0.1 Loops
Definition
A loop (which LLVM takes to mean a natural loop) is a set of nodes L in the CFG with the following properties [1][2]:
There is a single entry node (called the header), and this node dominates all nodes in the loop.
There is a back edge that enters the loop header.
Related terms
entering block: a node outside the loop that has an edge into the loop. When there is only one entering block and its only edge goes to the header, it is called the preheader. The preheader is not a loop node, yet it dominates the whole loop.
latch: a loop node that has an edge to the header.
backedge: an edge from a latch to the header.
exiting edge: an edge from inside the loop to outside the loop. The source of the edge is called the exiting block, and its target is called the exit block.

In the right-hand picture above, the yellow region is a loop and the red region is not. Why?
Because in the red region both a and c are entry nodes, so it does not satisfy the single-entry property.
0.2 Scalar Evolution (SCEV)
Definition
SCEV is a compiler analysis of variables (usually only integer-typed ones). It mainly determines how variables are updated inside a loop, and later optimizations are driven by this information.
Chains of recurrences
As shown in the figure, the induction variable var in the loop starts at start, is updated with the operator ϕ, and advances by step on each iteration:

Its chain of recurrences (chrec, Chains of Recurrences) is:
var = {start, ϕ , step}
// ϕ∈{+,∗}
// start: starting value
// step: step in each iteration
For example:
int m = 0;
for (int i = 0; i < n; i++) {
    m = m + n;
    *res = m;
}
Here the chain of recurrences of m is m = {0,+,n}.
1. Induction variables
1.1 Definition
An induction variable is a variable that increases or decreases by a fixed amount on every iteration of a loop, or that is a linear function of another induction variable.
For example [3], in the following loop i and j are both induction variables:
for (i = 0; i < 10; ++i) {
    j = 17 * i;
}
1.2 Benefits
Induction variable optimization has, among others, the following benefits:
Replace the original computation with simpler instructions.
For example, once the induction variables in the example above are identified, the multiplication can be replaced with a cheaper addition:
j = -17;
for (i = 0; i < 10; ++i) {
    j = j + 17;
}
Reduce the number of induction variables, which lowers register pressure.
extern int sum;
int foo(int n) {
    int i, j;
    j = 5;
    for (i = 0; i < n; ++i) {
        j += 2;
        sum += j;
    }
    return sum;
}
The loop currently has two induction variables, i and j. Expressing one of them in terms of the other gives the following:
extern int sum;
int foo(int n) {
    int i;
    for (i = 0; i < n; ++i) {
        sum += 5 + 2 * (i + 1);
    }
    return sum;
}
Induction variable substitution makes the relationship between a variable and the loop index explicit, which helps other analyses (such as dependence analysis). In the example below, c is rewritten as a function of the loop index:
int c, i;
c = 10;
for (i = 0; i < 10; i++) {
    c = c + 5; // c is incremented by 5 for each loop iteration
}
This is converted to:
int c, i;
c = 10;
for (i = 0; i < 10; i++) {
    c = 10 + 5 * (i + 1); // c is explicitly expressed as a function of loop index
}
2. Practice
2.1 Related compiler options
| Compiler | Option |
|---|---|
| gcc | -fivopts |
| Bisheng | -indvars |
2.2 Optimization example
In LLVM, the induction variable simplification pass (indvars) is located at: llvm\lib\Transforms\Scalar\IndVarSimplify.cpp
Let's walk through an example to see how the Bisheng compiler performs this optimization.
As shown in the figure below, suppose the body of the first func is the code to be optimized; the func below it shows the expected result:

Its IR test case, test.ll, is:

The compile command is :
opt test.ll -indvars -S
In this example, the header, the latch, and the exiting block are all the same basic block, namely bb5.

Step 1: Following the def-use relationships, visit the phi nodes in the loop's exit block, trace where their operands come from, compute the final value of each operand, and then replace the uses of the phi node with that value.
In the example, the pass computes %tmp2.lcssa. Its only operand is %tmp2 = add nuw nsw i32 %i.01.0, 3, and the loop containing this expression is bb5. The chain of recurrences of %tmp2 is therefore
%tmp2 = {3,+,3}<nuw><nsw><%bb5>
The maximum number of times the loop continues without exiting (the backedge-taken count) is 199999, so on exit %tmp2 = add(3, mul(3, 199999)) = 600000. The pass then checks that the replacement is not expensive (the cost computation differs from one architecture to another) and substitutes the value into the users of the phi node. The result is:

Step 2: Visit the exiting block, evaluate the branch condition, and delete the corresponding instructions according to the def-use relationships.
In the example, %0 in br i1 %0, label %bb5, label %bb7 is computed to be false. Once the branch instruction has been replaced, %0 = icmp ult i32 %tmp4, 200000 no longer has any user, so it is added to the list of dead instructions. The result is:

Step 3: Delete all dead instructions, and check whether their operands should be deleted as well.
In the example, %tmp4, the operand of %0, has another user, %x.03.0, so it cannot be deleted as a dead instruction. The result is:

Step 4: Delete the dead phi nodes in the header block.
In the example, %tmp4 and the phi node %x.03.0 form a cycle whose result is never used, so both are deleted. %tmp2 and %i.01.0 are deleted in the same way. The result is:

References
[1] https://llvm.org/docs/LoopTerminology.html
[2] Compilers: Principles, Techniques, and Tools (Chinese edition), Alfred V. Aho, Monica S. Lam, Ravi Sethi, et al.; translated by Zhao Jianhua, Zheng Tao, et al.
[3] https://en.wikipedia.org/wiki/Induction_variable


Click on Read the original Start using Bisheng compiler
This article is from the WeChat official account openEuler (openEulercommunity).