SlideShare a Scribd company logo
Introduction to LLVM
on Program Analysis
Tao He
elfinhe@gmail.com
Department of Computer Science, Sun Yat-Sen University
Department of Computer Science and Engineering, HKUST
Group Discussion
June 2012
HKUST, Hong Kong, China
1/34
Outline
 Objectives
 A quick scenario
 LLVM IR
 ‘opt’ command
 Installation of LLVM
2/34
Objectives -
What do we want to do?
3/34
Objectives
 To implement a symbolic execution engine.
 A expression-based engine [BH07]
different from
most existing implementations (path-based
engines).
 Program analysis on C programs.
 To generate static single assignment (SSA)
representation of C first.
4/34
[BH07] Domagoj Babić and Alan J. Hu. Structural Abstraction of Software Verification Conditions. In Proceedings
of the 19th international conference on Computer aided verification (CAV'07), Lecture Notes in Computer Science,
2007, Volume 4590/2007, 366-378
A Quick Scenario -
What can LLVM do?
5/34
!A Quick Scenario
6/34
 Given a C program:
 #include <stdio.h>
 int branch(int n){
 if (n>0) printf("Positiven");
 else if (n==0) printf("Zeron");
 else if (n<0) printf("Negativen");
 return 0;
 }
 int main() {
 branch(-4); branch(0); branch(6);
 return 0;
 }
!A Quick Scenario
7/34
 Generate immediate representation (IR) of
LLVM – the SSA representation in LLVM
 clang -O3 -emit-llvm hello.c -S -o hello.ll
 define i32 @main() nounwind uwtable {
 %1 = alloca i32, align 4
 store i32 0, i32* %1
 %2 = call i32 @branch(i32 -4)
 %3 = call i32 @branch(i32 0)
 %4 = call i32 @branch(i32 6)
 ret i32 0
 }
 ...
[SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes.
URL: http://llvm.org/docs/Passes.html.
!A Quick Scenario
8/34
 Print call graph
 opt method_para_int_branch.ll -S -dot-
callgraph 2>output_file >/dev/null
 dot -Tsvg in.dot -o out.svg
[SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes.
URL: http://llvm.org/docs/Passes.html.
!A Quick Scenario
9/34
 Print control flow graph (CFG)
 opt method_para_int_branch.ll -S -dot-cfg
2>output_file >/dev/null
[SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes.
URL: http://llvm.org/docs/Passes.html.
# A Quick Scenario
10/34
 More:
 Dead Global Elimination
 Interprocedural Constant Propagation
 Dead Argument Elimination
 Inlining
 Reassociation
 Loop Invariant Code Motion
 Loop Opts
 Memory Promotion
 Dead Store Elimination
 Aggressive Dead Code Elimination
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
What is the SSA representation in LLVM?
- LLVM IR
11/34
LLVM IR
12/34
 “A Static Single Assignment (SSA) based
representation that provides type safety, low-
level operations, flexibility, and the capability
of representing 'all' high-level languages
cleanly.”
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
13/34
 Three address code
 SSA-based
 Three different forms
 An in-memory compiler IR
 An on-disk bitcode representation (suitable for
fast loading by a Just-In-Time compiler)
 A human readable assembly language
representation
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
14/34
 An example
 To multiply the integer variable '%X' by 8
 Syntax:
 <result> = mul <ty> <op1>, <op2>
 IR code:
 %result = mul i32 %X, 8
 More
 For floating point, use fmul
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
15/34
 Another example
 Instruction jump – to change control flow
 Branches or loops
 Syntax:
 br i1 <cond>, label <iftrue>, label <iffalse>
 br label <dest> ; Unconditional branch
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
16/34
 IR code:
 Test:
 %cond = icmp eq i32 %a, %b
 br i1 %cond, label %IfEqual, label %IfUnequal
 IfEqual:
 ret i32 1
 IfUnequal:
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
17/34
 3rd
example
 Function call
 A simplified syntax:
 <result> = call <ty> <fnptrval>(<function args>)
 IR code:
 call i32 (i8*, ...)* @printf(i8* %msg, i32 12, i8 42)
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
18/34
 4th
example
 Function definition
 A simplified syntax:
 define <ResultType> @<FunctionName> ([argument list]) { ... }
 IR code:
 define i32 @main() { … }
 define i32 @test(i32 %X, ...) { … }
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
LLVM IR
19/34
 The majority of instructions in C programs:
 Operations (binary/bitwise)
 Jumps
 Function calls
 Function definitions
 Many keywords in LLVM IR will not be
used for C programs. (e.g., invoke)
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
How to analyze programs
by using LLVM?
- ‘opt’ command
20/34
‘opt’ command
 Compiler is organized as a series of ‘passes’:
 Each pass is one analysis or transformation
21/34
[SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes.
URL: http://llvm.org/docs/Passes.html.
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
!‘opt’ command
 An example
 -dot-callgraph
22/34
[SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes.
URL: http://llvm.org/docs/Passes.html.
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
!‘opt’ command
23/34
An example
Print call graph: -dot-callgraph
 opt method_para_int_branch.ll -S -dot-
callgraph 2>output_file >/dev/null
 dot -Tsvg in.dot -o out.svg
[SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes.
URL: http://llvm.org/docs/Passes.html.
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
How to write your own pass?
24/34
How to write your own pass?
 Four types of pass:
 ModulePass: general interprocedural pass
 CallGraphSCCPass: bottom-up on the call graph
 FunctionPass: process a function at a time
 BasicBlockPass: process a basic block at a time
25/34
How to write your own pass?
 Two important classes
 User: http://llvm.org/docs/doxygen/html/classllvm_1_1User.html
 This class defines the interface that one who uses a
Value must implement.
 Instructions
 Constants
 Operators
 Value: http://llvm.org/docs/doxygen/html/classllvm_1_1Value.html
 It is the base class of all values computed by a
program that may be used as operands to other
values.
 e.g., instruction and function.
26/34
How to write your own pass?
 An example – print function names
27/34
How to write your own pass?
 An example – print function names
 First generate bytecode:
 clang -emit-llvm hello.c -o hello.bc
 Then
28/34
How to write your own pass?
 Another example – print def-use chain
29/34
How to install LLVM?
30/34
How to install LLVM?
 To compile programs faster and use built-in
transformation and analysis
 Install both ‘llvm’ and ‘clang’ from package
management software
 E.g., Synaptic, yum, apt.
 To write your own pass
 Build from source code and add your own pass
 http://llvm.org/docs/GettingStarted.html#quickstart
 http://llvm.org/docs/WritingAnLLVMPass.html
31/34
LLVM IR
32/34
 The majority of instructions in C programs:
 Operation (binary/bitwise)
 Jump
 Function call
 Function definition
[Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html
[LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini
Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
Q & A
33/34
Thank you!
Contact me via elfinhe@gmail.com
34/34

More Related Content

PDF
llvm-py: Writing Compilers In Python
PDF
Part II: LLVM Intermediate Representation
PPT
ODP
An Introduction to PC-Lint
PDF
The compilation process
PPTX
.Net platform an understanding
PDF
The essence of the VivaCore code analysis library
PPT
C++ to java
llvm-py: Writing Compilers In Python
Part II: LLVM Intermediate Representation
An Introduction to PC-Lint
The compilation process
.Net platform an understanding
The essence of the VivaCore code analysis library
C++ to java

What's hot (20)

PPTX
Transpilers(Source-to-Source Compilers)
PDF
LinkedIn - Disassembling Dalvik Bytecode
DOCX
C++ question and answers
PDF
Installation of PC-Lint and its using in Visual Studio 2005
PPTX
Linker and loader upload
PDF
Towards easy program migration using language virtualization
PDF
PIL - A Platform Independent Language
PPT
Nakov dot net-framework-overview-english
PPT
Overview of c++
PPTX
C++vs java
PPTX
Compilation of c
PPTX
C compilation process
PDF
Tail Call Elimination in Open Smalltalk
PPT
Net Framework Overview
PPT
How a Compiler Works ?
KEY
In-depth look at the Flex compiler and HFCD
PDF
C Programming - Refresher - Part I
PDF
CORBA Programming with TAOX11/C++11 tutorial
PDF
OFI libfabric Tutorial
PPT
Mixing Python and Java
Transpilers(Source-to-Source Compilers)
LinkedIn - Disassembling Dalvik Bytecode
C++ question and answers
Installation of PC-Lint and its using in Visual Studio 2005
Linker and loader upload
Towards easy program migration using language virtualization
PIL - A Platform Independent Language
Nakov dot net-framework-overview-english
Overview of c++
C++vs java
Compilation of c
C compilation process
Tail Call Elimination in Open Smalltalk
Net Framework Overview
How a Compiler Works ?
In-depth look at the Flex compiler and HFCD
C Programming - Refresher - Part I
CORBA Programming with TAOX11/C++11 tutorial
OFI libfabric Tutorial
Mixing Python and Java
Ad

Similar to Introduction to llvm (20)

PDF
TMPA-2017: Vellvm - Verifying the LLVM
PPTX
07 140430-ipp-languages used in llvm during compilation
PPTX
LLVM-Based-Compiler-for-a-Custom-Language (2).pptx
PDF
Smalltalk JIT Compilation: LLVM Experimentation
PDF
LCU14 209- LLVM Linux
PPTX
LLVM Compiler
PDF
[COSCUP 2021] LLVM Project: The Good, The Bad, and The Ugly
PDF
The true story_of_hello_world
PPTX
LLVM Optimizations for PGAS Programs -Case Study: LLVM Wide Optimization in C...
PDF
Appsec obfuscator reloaded
PPTX
LLVM Compiler for different lagguage.pptx
PDF
LLVM Workshop Osaka Umeda, Japan
PPTX
OptView2 - C++ on Sea 2022
PDF
Os Lattner
PPTX
Как работает LLVM бэкенд в C#. Егор Богатов ➠ CoreHard Autumn 2019
PPT
CC Week 11.ppt
PPTX
Reames-FalconKeynote java falcon about jit.pptx
PDF
BUD17-302: LLVM Internals #2
PDF
Lessons Of Binary Analysis - Christien Rioux
PDF
Developments in LLVM-based toolchains and tooling for RISC-V
TMPA-2017: Vellvm - Verifying the LLVM
07 140430-ipp-languages used in llvm during compilation
LLVM-Based-Compiler-for-a-Custom-Language (2).pptx
Smalltalk JIT Compilation: LLVM Experimentation
LCU14 209- LLVM Linux
LLVM Compiler
[COSCUP 2021] LLVM Project: The Good, The Bad, and The Ugly
The true story_of_hello_world
LLVM Optimizations for PGAS Programs -Case Study: LLVM Wide Optimization in C...
Appsec obfuscator reloaded
LLVM Compiler for different lagguage.pptx
LLVM Workshop Osaka Umeda, Japan
OptView2 - C++ on Sea 2022
Os Lattner
Как работает LLVM бэкенд в C#. Егор Богатов ➠ CoreHard Autumn 2019
CC Week 11.ppt
Reames-FalconKeynote java falcon about jit.pptx
BUD17-302: LLVM Internals #2
Lessons Of Binary Analysis - Christien Rioux
Developments in LLVM-based toolchains and tooling for RISC-V
Ad

More from Tao He (18)

PPTX
Java 并发编程笔记:01. 并行与并发 —— 概念
PPTX
A software fault localization technique based on program mutations
PDF
Testing survey
DOC
Testing survey by_directions
PPT
Smart debugger
PPT
Mutation testing
DOCX
C语言benchmark覆盖信息收集总结4
PPT
Django
DOC
基于覆盖信息的软件错误定位技术综述
DOC
Java覆盖信息收集工具比较
PPT
Testing group’s work on fault localization
PPTX
Muffler a tool using mutation to facilitate fault localization 2.0
PPTX
Muffler a tool using mutation to facilitate fault localization 2.3
PPT
Semantic Parsing in Bayesian Anti Spam
PPT
Problems
PPT
A survey of software testing
PPT
Cleansing test suites from coincidental correctness to enhance falut localiza...
PPTX
Concrete meta research - how to collect, manage, and read papers?
Java 并发编程笔记:01. 并行与并发 —— 概念
A software fault localization technique based on program mutations
Testing survey
Testing survey by_directions
Smart debugger
Mutation testing
C语言benchmark覆盖信息收集总结4
Django
基于覆盖信息的软件错误定位技术综述
Java覆盖信息收集工具比较
Testing group’s work on fault localization
Muffler a tool using mutation to facilitate fault localization 2.0
Muffler a tool using mutation to facilitate fault localization 2.3
Semantic Parsing in Bayesian Anti Spam
Problems
A survey of software testing
Cleansing test suites from coincidental correctness to enhance falut localiza...
Concrete meta research - how to collect, manage, and read papers?

Recently uploaded (20)

PPTX
Transform Your Business with a Software ERP System
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
2025 Textile ERP Trends: SAP, Odoo & Oracle
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PPTX
Introduction to Artificial Intelligence
PPTX
ai tools demonstartion for schools and inter college
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
medical staffing services at VALiNTRY
PDF
System and Network Administraation Chapter 3
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PPTX
ISO 45001 Occupational Health and Safety Management System
PDF
System and Network Administration Chapter 2
PPTX
history of c programming in notes for students .pptx
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
Transform Your Business with a Software ERP System
Softaken Excel to vCard Converter Software.pdf
2025 Textile ERP Trends: SAP, Odoo & Oracle
Navsoft: AI-Powered Business Solutions & Custom Software Development
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Introduction to Artificial Intelligence
ai tools demonstartion for schools and inter college
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
medical staffing services at VALiNTRY
System and Network Administraation Chapter 3
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
VVF-Customer-Presentation2025-Ver1.9.pptx
ISO 45001 Occupational Health and Safety Management System
System and Network Administration Chapter 2
history of c programming in notes for students .pptx
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Which alternative to Crystal Reports is best for small or large businesses.pdf

Introduction to llvm

  • 1. Introduction to LLVM on Program Analysis Tao He elfinhe@gmail.com Department of Computer Science, Sun Yat-Sen University Department of Computer Science and Engineering, HKUST Group Discussion June 2012 HKUST, Hong Kong, China 1/34
  • 2. Outline  Objectives  A quick scenario  LLVM IR  ‘opt’ command  Installation of LLVM 2/34
  • 3. Objectives - What do we want to do? 3/34
  • 4. Objectives  To implement a symbolic execution engine.  A expression-based engine [BH07] different from most existing implementations (path-based engines).  Program analysis on C programs.  To generate static single assignment (SSA) representation of C first. 4/34 [BH07] Domagoj Babić and Alan J. Hu. Structural Abstraction of Software Verification Conditions. In Proceedings of the 19th international conference on Computer aided verification (CAV'07), Lecture Notes in Computer Science, 2007, Volume 4590/2007, 366-378
  • 5. A Quick Scenario - What can LLVM do? 5/34
  • 6. !A Quick Scenario 6/34  Given a C program:  #include <stdio.h>  int branch(int n){  if (n>0) printf("Positiven");  else if (n==0) printf("Zeron");  else if (n<0) printf("Negativen");  return 0;  }  int main() {  branch(-4); branch(0); branch(6);  return 0;  }
  • 7. !A Quick Scenario 7/34  Generate immediate representation (IR) of LLVM – the SSA representation in LLVM  clang -O3 -emit-llvm hello.c -S -o hello.ll  define i32 @main() nounwind uwtable {  %1 = alloca i32, align 4  store i32 0, i32* %1  %2 = call i32 @branch(i32 -4)  %3 = call i32 @branch(i32 0)  %4 = call i32 @branch(i32 6)  ret i32 0  }  ... [SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes. URL: http://llvm.org/docs/Passes.html.
  • 8. !A Quick Scenario 8/34  Print call graph  opt method_para_int_branch.ll -S -dot- callgraph 2>output_file >/dev/null  dot -Tsvg in.dot -o out.svg [SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes. URL: http://llvm.org/docs/Passes.html.
  • 9. !A Quick Scenario 9/34  Print control flow graph (CFG)  opt method_para_int_branch.ll -S -dot-cfg 2>output_file >/dev/null [SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes. URL: http://llvm.org/docs/Passes.html.
  • 10. # A Quick Scenario 10/34  More:  Dead Global Elimination  Interprocedural Constant Propagation  Dead Argument Elimination  Inlining  Reassociation  Loop Invariant Code Motion  Loop Opts  Memory Promotion  Dead Store Elimination  Aggressive Dead Code Elimination [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 11. What is the SSA representation in LLVM? - LLVM IR 11/34
  • 12. LLVM IR 12/34  “A Static Single Assignment (SSA) based representation that provides type safety, low- level operations, flexibility, and the capability of representing 'all' high-level languages cleanly.” [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 13. LLVM IR 13/34  Three address code  SSA-based  Three different forms  An in-memory compiler IR  An on-disk bitcode representation (suitable for fast loading by a Just-In-Time compiler)  A human readable assembly language representation [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 14. LLVM IR 14/34  An example  To multiply the integer variable '%X' by 8  Syntax:  <result> = mul <ty> <op1>, <op2>  IR code:  %result = mul i32 %X, 8  More  For floating point, use fmul [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 15. LLVM IR 15/34  Another example  Instruction jump – to change control flow  Branches or loops  Syntax:  br i1 <cond>, label <iftrue>, label <iffalse>  br label <dest> ; Unconditional branch [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 16. LLVM IR 16/34  IR code:  Test:  %cond = icmp eq i32 %a, %b  br i1 %cond, label %IfEqual, label %IfUnequal  IfEqual:  ret i32 1  IfUnequal: [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 17. LLVM IR 17/34  3rd example  Function call  A simplified syntax:  <result> = call <ty> <fnptrval>(<function args>)  IR code:  call i32 (i8*, ...)* @printf(i8* %msg, i32 12, i8 42) [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 18. LLVM IR 18/34  4th example  Function definition  A simplified syntax:  define <ResultType> @<FunctionName> ([argument list]) { ... }  IR code:  define i32 @main() { … }  define i32 @test(i32 %X, ...) { … } [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 19. LLVM IR 19/34  The majority of instructions in C programs:  Operations (binary/bitwise)  Jumps  Function calls  Function definitions  Many keywords in LLVM IR will not be used for C programs. (e.g., invoke) [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 20. How to analyze programs by using LLVM? - ‘opt’ command 20/34
  • 21. ‘opt’ command  Compiler is organized as a series of ‘passes’:  Each pass is one analysis or transformation 21/34 [SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes. URL: http://llvm.org/docs/Passes.html. [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 22. !‘opt’ command  An example  -dot-callgraph 22/34 [SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes. URL: http://llvm.org/docs/Passes.html. [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 23. !‘opt’ command 23/34 An example Print call graph: -dot-callgraph  opt method_para_int_branch.ll -S -dot- callgraph 2>output_file >/dev/null  dot -Tsvg in.dot -o out.svg [SH] Reid Spencer and Gordon Henriksen. LLVM's Analysis and Transform Passes. URL: http://llvm.org/docs/Passes.html. [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 24. How to write your own pass? 24/34
  • 25. How to write your own pass?  Four types of pass:  ModulePass: general interprocedural pass  CallGraphSCCPass: bottom-up on the call graph  FunctionPass: process a function at a time  BasicBlockPass: process a basic block at a time 25/34
  • 26. How to write your own pass?  Two important classes  User: http://llvm.org/docs/doxygen/html/classllvm_1_1User.html  This class defines the interface that one who uses a Value must implement.  Instructions  Constants  Operators  Value: http://llvm.org/docs/doxygen/html/classllvm_1_1Value.html  It is the base class of all values computed by a program that may be used as operands to other values.  e.g., instruction and function. 26/34
  • 27. How to write your own pass?  An example – print function names 27/34
  • 28. How to write your own pass?  An example – print function names  First generate bytecode:  clang -emit-llvm hello.c -o hello.bc  Then 28/34
  • 29. How to write your own pass?  Another example – print def-use chain 29/34
  • 30. How to install LLVM? 30/34
  • 31. How to install LLVM?  To compile programs faster and use built-in transformation and analysis  Install both ‘llvm’ and ‘clang’ from package management software  E.g., Synaptic, yum, apt.  To write your own pass  Build from source code and add your own pass  http://llvm.org/docs/GettingStarted.html#quickstart  http://llvm.org/docs/WritingAnLLVMPass.html 31/34
  • 32. LLVM IR 32/34  The majority of instructions in C programs:  Operation (binary/bitwise)  Jump  Function call  Function definition [Lat] Chris Lattner. LLVM Language Reference Manual. URL: http://llvm.org/docs/LangRef.html [LA04] Chris Lattner and Vikram Adve. The LLVM Compiler Framework and Infrastructure Tutorial. Mini Workshop on Compiler Research Infrastructures (LCPC'04), West Lafayette, Indiana, Sep. 2004.
  • 34. Thank you! Contact me via elfinhe@gmail.com 34/34