【Title】MLGOPerf: An ML Guided Inliner to Optimize Performance
【Authors】Amir H. Ashouri, Mostafa Elhoushi, Yuzhe Hua
【Publication date】2022.7.18
【Paper link】https://arxiv.org/pdf/2207.08389.pdf
【Why we recommend it】Over the past 25 years, machine learning has been applied extensively to compilers, notably to the optimization selection and phase-ordering problems. However, little of that work has been upstreamed into state-of-the-art compilers such as LLVM and integrated seamlessly into the optimization pipeline so that users can deploy it easily. MLGO was one of the first such projects, and it only targets binary code size, using an ML-based inliner trained with reinforcement learning. This paper introduces MLGOPerf, the first end-to-end framework that can optimize performance using LLVM's ML inliner. It uses a secondary ML model to generate the rewards for training a retargeted reinforcement learning agent. The secondary model does this by predicting the post-inlining speedup of the function under analysis, which also makes training the primary model fast. Experimental results show that, when trained for performance on the SPEC CPU2006 and Cbench benchmarks, MLGOPerf gains up to 1.8% and 2.2%, respectively, over LLVM's O3 optimization. In addition, the proposed method provides up to 26% more opportunities to autotune code regions, which can translate into an additional 3.7% speedup.
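The reward scheme summarized above can be illustrated with a toy sketch. This is not the MLGOPerf implementation: the two call-site features, the hand-written `predict_speedup` stand-in for the secondary model, and the small logistic policy are all hypothetical, chosen only to show how a predicted post-inlining speedup can serve as the reward in a REINFORCE-style update of an inline/no-inline policy, without running the compiled program.

```python
import math
import random

# Hypothetical call-site features: (callee_size, call_count), each scaled to [0, 1].

def predict_speedup(features, inline):
    """Stand-in for the secondary model (an assumption, not the paper's model).

    Toy rule: inlining small, frequently called callees helps; inlining
    large, rarely called callees hurts. Not inlining is neutral (1.0).
    """
    if not inline:
        return 1.0
    callee_size, call_count = features
    return 1.0 + 0.1 * call_count - 0.1 * callee_size


def inline_probability(weights, features):
    """Logistic policy: probability of choosing 'inline' for a call site."""
    z = weights[0] + weights[1] * features[0] + weights[2] * features[1]
    return 1.0 / (1.0 + math.exp(-z))


def train(steps=20000, lr=0.5, seed=0):
    """Train the toy policy with REINFORCE on randomly generated call sites."""
    rng = random.Random(seed)
    weights = [0.0, 0.0, 0.0]  # bias, callee_size weight, call_count weight
    for _ in range(steps):
        features = (rng.random(), rng.random())
        p = inline_probability(weights, features)
        inline = rng.random() < p
        # The reward comes from the predictor, not from a timed program run.
        reward = predict_speedup(features, inline) - 1.0
        # Gradient of log pi(action) for a Bernoulli-logistic policy.
        grad_scale = (1.0 - p) if inline else -p
        inputs = (1.0, features[0], features[1])
        for i in range(3):
            weights[i] += lr * reward * grad_scale * inputs[i]
    return weights


if __name__ == "__main__":
    w = train()
    print("small, hot callee :", inline_probability(w, (0.0, 1.0)))
    print("large, cold callee:", inline_probability(w, (1.0, 0.0)))
```

In this toy setting the learned policy ends up favoring inlining for small, frequently called callees over large, rarely called ones, because that is what the stand-in predictor rewards. Replacing measured runtimes with a predictor in this way is what keeps each training step cheap.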
Huawei | MLGOPerf: An ML Guided Inliner to Optimize Performance
2022-07-25 04:08:00 【Zhiyuan community】