Construct an Efficient and
Secure Microkernel for IoT
Jim Huang ( 黃敬群 ) <jserv.tw@gmail.com>
CTO, SSXelerator, Inc.
May 29, 2015 / CTHPC
Rights to copy
Attribution – ShareAlike 3.0
You are free
to copy, distribute, display, and perform the work
to make derivative works
to make commercial use of the work
Under the following conditions
Attribution. You must give the original author credit.
Share Alike. If you alter, transform, or build upon this work, you may distribute the
resulting work only under a license identical to this one.
For any reuse or distribution, you must make clear to others the license terms of this work.
Any of these conditions can be waived if you get permission from the copyright holder.
Your fair use and other rights are in no way affected by the above.
License text: http://creativecommons.org/licenses/by-sa/3.0/legalcode
© Copyright 2015 Jim Huang
Corrections, suggestions, contributions and translations
are welcome!
Latest update: May 29, 2015
Goals of This Presentation
• The promise of the IoT won’t be fulfilled until integrated
software platforms are available that allow software
developers to develop these devices efficiently and in
the most cost-effective manner possible.
• Introduce the F9 microkernel, a new open source
implementation built from scratch, which applies
modern kernel techniques to deeply
embedded devices.
• Characteristics of F9 microkernel
– Efficiency: performance + power consumption
– Security: memory protection + isolated execution
– Flexible development environment
Wait!
Why do we need yet another kernel?
TCB (Trusted Computing Base)
traditional
embedded
Linux/
Windows
Microkernel
based
all code 100,000 LoC 10,000 LoC
System
TCB
source: Diagram from Kashin Lin (NEWS Lab)
Bugs inside “Bigger than Bigger” Kernels
• Drivers cause 85% of Windows XP crashes.
– Michael M. Swift, Brian N. Bershad, Henry M. Levy: “Improving the
Reliability of Commodity Operating Systems”, SOSP 2003
• Error rate in Linux drivers is 3x (maximum: 10x) that of the rest of the kernel
– Andy Chou, Junfeng Yang, Benjamin Chelf, Seth Hallem, Dawson R.
Engler: “An Empirical Study of Operating System Errors”, SOSP 2001
• Causes for driver bugs
– 23% programming error
– 38% mismatch regarding device specification
– 39% OS-driver-interface misconceptions
– Leonid Ryzhyk, Peter Chubb, Ihor Kuz and Gernot Heiser: “Dingo:
Taming device drivers”, EuroSys 2009
Linux Device Driver bugs
[Dingo: Taming device drivers, 2009]
• BlackHat 2013
– MACTANS: INJECTING MALWARE INTO IOS DEVICES
VIA MALICIOUS CHARGERS
– http://www.blackhat.com/us-13/briefings.html#Lau
• "we demonstrate how an iOS device can be compromised
within one minute of being plugged into a malicious charger. We
first examine Apple’s existing security mechanisms to protect
against arbitrary software installation, then describe how USB
capabilities can be leveraged to bypass these defense
mechanisms."
Attack iOS through USB charger!
(fatal problem of in-kernel USB stack)
Microkernel
• Minimalist approach
– IPC, virtual memory, thread scheduling
• Put the rest into user space
– Device drivers, networking, file system, user interface
• Disadvantages
– Lots of system calls and context switches
• Examples: Mach, L4, QNX, MINIX, IBM K42
Principle of least authority (POLA)
A capability is a communicable, unforgeable token of authority. It refers
to a value that references an object along with an associated set of access
rights. A user program on a capability-based operating system must use a
capability to access an object.
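The definition above can be illustrated with a minimal sketch. The rights bits, the table size and the names `cap_lookup` and `cap_derive` are assumptions made for this example and are not taken from F9 or any particular L4 kernel. User code holds only a slot index, which is what keeps the token unforgeable in this model.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Access rights carried by a capability (hypothetical bit layout). */
enum cap_rights { CAP_READ = 1u << 0, CAP_WRITE = 1u << 1, CAP_GRANT = 1u << 2 };

/* A capability pairs an object reference with the rights it conveys. */
struct capability {
    void    *object;   /* the object being referenced, NULL if slot is empty */
    uint32_t rights;   /* bitwise OR of enum cap_rights */
};

/* Per-task capability table, kept by the kernel in this model. */
#define CAP_SLOTS 8
struct cap_table { struct capability slot[CAP_SLOTS]; };

/* Return the object if slot idx holds a capability with all `needed`
 * rights, otherwise NULL. */
void *cap_lookup(const struct cap_table *t, size_t idx, uint32_t needed)
{
    assert(t != NULL);
    if (idx >= CAP_SLOTS || t->slot[idx].object == NULL)
        return NULL;
    if ((t->slot[idx].rights & needed) != needed)
        return NULL;
    return t->slot[idx].object;
}

/* Copy a capability into another slot with rights masked by `mask`.
 * Rights can shrink but never grow, and the source must carry CAP_GRANT. */
bool cap_derive(struct cap_table *t, size_t src, size_t dst, uint32_t mask)
{
    assert(t != NULL);
    if (src >= CAP_SLOTS || dst >= CAP_SLOTS || t->slot[src].object == NULL)
        return false;
    if (!(t->slot[src].rights & CAP_GRANT))
        return false;
    t->slot[dst].object = t->slot[src].object;
    t->slot[dst].rights = t->slot[src].rights & mask;
    return true;
}
```

Every access goes through `cap_lookup`, so a task can only reach objects for which it holds a capability with sufficient rights.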
Microkernel Concepts
• Minimal kernel and hardware enforce separation
• Only kernel runs in CPU privileged mode
• Components are user level processes
• No restrictions on component software
• Reuse of legacy software
• “A concept is tolerated inside the microkernel only if
moving it outside the kernel, i.e., permitting competing
implementations, would prevent the implementation of
the system's required functionality.” – Jochen Liedtke
“Worse Is Better”, Richard P. Gabriel
               New Jersey style             MIT style
               [UNIX, Bell Labs]            [Multics]
Simplicity     No.1 consideration;          Interface > Implementation
               Implementation > Interface
Correctness    mostly                       100%
Consistency    mostly                       100%
Completeness   de facto                     mostly
• Design competition between New Jersey and MIT style
• Interface first [Multics] → Implementation first [Unix] →
Interface first [Mach] → Implementation first [Linux] →
Interface first [seL4]
Microkernel
• Put the rest into user space
– Device drivers, networking, file system, user interface
[Figure: File System, Networking, Multimedia, Windowing, Process Manager and Application components plugged into the microkernel's message bus. The microkernel and the Process Manager are the only trusted components.]
Applications and Drivers
• Are processes which plug into a message bus
• Reside in their own memory-protected address space
• Have a well-defined message interface
• Cannot corrupt other software components
• Can be started, stopped and upgraded on the fly
Microkernel: Definitions
• A kernel technique that provides only the minimum OS
services.
– Address Spaces
– Inter-process Communication (IPC)
– Thread Management
– Unique Identifiers
• All other services are done at user space
independently.
3 Generations of Microkernel
• Mach (1985-1994)
– replace pipes with IPC (more general) 
– improved stability (vs monolithic kernels)
– poor performance
• L3 & L4 (1990-2001)
– order of magnitude improvement in IPC performance
• written in assembly, sacrificed CPU portability
• only synchronous IPC (build async on top of sync)
– very small kernel: more functions moved to userspace
• seL4, Fiasco.OC, Coyotos, NOVA (2000-)
– platform independence
– verification, security, multiple CPUs, etc.
3 Generations of Microkernel
• Generation 1: Mach (1985-1994)
• Generation 2: L3 & L4 (1990-2001)
• Generation 3: seL4, Fiasco.OC, NOVA (2000-)
Performance of 1st Generation
CMU Mach (1985), Chorus (1987), MkLinux (1996)
• Does not prohibit caching
• Reduce number of copies of data occupying memory
– Copy-to-use, copy-to-kernel
– More memory for caching
• I/O operations reduced by a factor of 10
• Context switch overhead
– Cost of kernel overhead can be up to 800 cycles.
• Address Space Switches
– Expensive Page Table and Segment Switch Overhead
– Untagged TLB = Bad performance
L4: the 2nd Generation
• Similar to Mach
– Started from scratch, rather than monolithic
– But even more minimal
• minimality principle for L4:
A concept is tolerated inside the microkernel only if moving it
outside the kernel, i.e., permitting competing implementations,
would prevent the implementation of the system's required
functionality.
• Tasks, threads, IPC
– Uses only 12k of memory
– API size of Mach: 140 functions (asynchronous IPC, threads, scheduling,
memory management, resource access permissions)
– API size of L4: 7 functions (synchronous IPC, threads, scheduling, memory
management)
Performance Gain (1st to 2nd Generation)
• Why 1st-generation kernels were slow: poor design
[Liedtke SOSP'95]
– complex API
– Too many features
– Poor design and
implementation
– Large cache footprint ⇒
memory-bandwidth limited
• L4 is fast due to small
cache footprint
– 10–14 I-cache lines
– 8 D-cache lines
– Small cache footprint ⇒
CPU limited
L4 Family (incomplete)
Source: Microkernel-based Operating Systems – Introduction,
Carsten Weinhold, TU Dresden (2012)
L4 Family: OKL4
• L4 implementations on embedded
processors
– ARM, MIPS
• Wombat: portable virtualized Linux
for embedded systems
• Utilize ARM FCSE (fast context-
switching extension) for ARMv5
Commercial L4: from NICTA to OKLabs
• L4::Pistachio microkernel was originally developed at
Karlsruhe University. NICTA had ported it to a number
of architectures, including ARM, had optimized it for
use in resource-constrained embedded systems.
• In 2004, Qualcomm engaged NICTA in a consulting
arrangement to deploy L4 on Qualcomm's wireless
communication chips.
• The engagement with Qualcomm grew to a volume
where it was too significant a development/engineering
effort to be done inside the research organization.
– Commercialized as Open Kernel Labs
• Acquired by General Dynamics in 2012
Source: http://microkerneldude.wordpress.com/2012/10/02/giving-it-away-part-2-on-microkernels-and-the-national-interes/
OKL4 Use Cases
Each secure cell in the system offers
isolation from software in other cells
Existing software components can
be reused in new designs
Microvisor tames the complexity of
dispatching multi-OS workloads across
multiple physical CPUs
Moving from 2nd to 3rd Generation
OKL4
• Dumped recursive address-space model
– reduced kernel complexity
– First L4 kernel with capability-based
access control
OKL4 Microvisor
• Removed synchronous IPC
• Removed kernel-scheduled threads
seL4
• All memory management at user level
– no kernel heap!
• Formal proof of functional correctness
• Performance on par with fastest kernels
– <200 cycle IPC on ARM11 without
assembler fastpath
Problems in 2nd Generation
• microkernel needs memory for its abstractions
– tasks: page tables
– threads: kernel-TCB
– capability tables
– IPC wait queues
– mapping database
– kernel memory is limited
– opens the possibility of DoS attacks
seL4 as 3rd Generation Microkernel
• Functional Correctness [SOSP’09]
• Timeliness (known WCET) [RTSS’11,EuroSys’12]
• Translation Correctness [PLDI’13]
• Fast (258 cycle IPC roundtrip on 1GHz Cortex-A9)
• Safety: specifically temporal properties.
• Minimal TCB (~9000 SLoC)
F9: A new microkernel designed for
Deeply Embedded Devices
Deeply Embedded Devices
• Power awareness; solid and limited applications
• Multi-tasking or cooperative scheduling is still required
• IoT (Internet of Things) is the specialized derivative
with networking facility
• Communication capability is built-in for some products
• Example: AIRO wristband (health tracker)
http://www.weweartech.com/amazing-new-uses-smart-watches/
HRV Knows You
Design Considerations of IoT
• Network
– IoT networks must be scalable in order to support the dynamic nature of
the IoT (as devices are added and removed from the network).
• Security
– Integration of security protocols for encryption and authentication must
always be required.
– Before any data is transferred, the source of the data needs to be
verified.
– The use of encryption prevents the loss of data to passive listeners, but it
does not prevent the alteration of data while traversing the network.
• Power Management
– Take advantage of processors with many low-power features,
including DVFS and hibernate.
• Need for a full-featured RTOS framework
– Nucleus RTOS ecosystem for the development
of connected IoT devices
Source: “Internet of Things (IoT) Design Considerations for Embedded
Connected Devices”, Mentor Graphics
Advanced Software Requirements of IoT Products
• Over-The-Air (OTA) update with a double bank firmware update
mechanism. The switch to a new version is only operated when
the newly downloaded content is fully validated.
• A dedicated first stage loader/diagnostic/recovery application is
used for this update mechanism. It provides full access to all
internal and external memories.
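The double-bank rule above (switch only after full validation) can be sketched as follows. This is a minimal model under stated assumptions: the bank names, the `fw_image` layout and the additive checksum are placeholders for illustration, and a real product would verify a cryptographic signature instead.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

enum bank { BANK_A = 0, BANK_B = 1 };

/* A downloaded image plus the checksum it is expected to have (hypothetical). */
struct fw_image {
    const uint8_t *data;
    size_t         len;
    uint32_t       expected_sum;
};

/* Which bank the loader boots from. */
struct boot_state { enum bank active; };

/* Toy integrity check: sum of all bytes. Stands in for a real signature check. */
static uint32_t fw_checksum(const uint8_t *p, size_t n)
{
    uint32_t sum = 0;
    for (size_t i = 0; i < n; i++)
        sum += p[i];
    return sum;
}

/* Switch to the other bank only if the new image validates.
 * Returns 1 on switch, 0 if the running firmware is kept. */
int ota_commit(struct boot_state *st, const struct fw_image *img)
{
    assert(st != NULL && img != NULL);
    if (img->data == NULL || fw_checksum(img->data, img->len) != img->expected_sum)
        return 0;                      /* invalid download: stay on the old bank */
    st->active = (st->active == BANK_A) ? BANK_B : BANK_A;
    return 1;
}
```

Because the active bank is changed only at the very end, an interrupted or corrupted download leaves the device booting the previous firmware.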
ARM mbed
supports WiFi, Bluetooth, 2G, 3G, and LTE, among other communication technologies, and the software
could make it easier for hardware companies to get devices to market without
spending a lot of time building custom firmware.
Characteristics of F9 Microkernel
https://github.com/f9micro
Unique Characteristics
• BSD Licensing (two-clause), suitable for both research
and commercial usage.
– Commercial adoption since 2014
• Efficiency
– Optimized for ARM Cortex-M3/M4
– performance: fast IPC and well-structured designs
– energy-saving: tickless scheduling, adaptive power
management
• Security
– memory protection: MPU guarded
– Isolated execution: L4 based, capabilities model
• Flexible development
– Kprobes
– profile-directed optimizations
Why are current systems unreliable?
• Problem 1: “Systems are huge"
– No single person can understand the whole system
> F9 Microkernel has only 3K LoC of portable C
• Problem 2: “Bug fixes usually introduce new bugs."
> F9 introduces execution domains and on-the-fly patches
• Problem 3: “Poor fault isolation"
– No isolation between system components
– OS contains hundreds of procedures linked together as a
single binary program running in kernel mode.
> F9 is built from scratch and well-engineered for isolation
F9 Microkernel Architecture
[Figure: architecture diagram divided into user space and kernel space, with an untrusted domain and a trusted domain. Labelled components: Parent Partition, Applications, Application Framework, VM Worker Processes, File System Server, Memory Management Server, Media Driver, Network Driver, Network Stack, board-specific code, Task Management, Scheduling Policy, (Interrupts), KProbes, In-kernel debugger and the F9 Microkernel.]
Principles
• F9 follows the fundamental principles of L4
microkernels
– implements address spaces, thread management,
and IPC only in the privileged kernel.
• Designed and customized for ARM Cortex-M,
supporting NVIC (Nested Vectored Interrupt
Controller), Bit Banding, MPU (Memory Protection
Unit)
Thread
• Each thread has its own TCB (Thread Control Block)
and is addressed by its global id.
• The dispatcher is responsible for switching contexts.
Threads with the same priority are executed in a
round-robin fashion.
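The dispatching rule above can be sketched as a small selection function. This is an illustrative model, not F9's scheduler: the `tcb` fields, the convention that a larger number means higher priority, and the name `pick_next` are all assumptions.

```c
#include <assert.h>
#include <stddef.h>

/* Minimal stand-in for a thread control block; field names are illustrative. */
struct tcb {
    unsigned global_id;  /* identifier used to address the thread */
    int      priority;   /* larger value = more urgent (an assumption) */
    int      runnable;   /* non-zero if the thread may be dispatched */
};

/* Pick the next thread: the highest-priority runnable one, scanning from
 * the slot after `current` so that equal-priority threads take turns.
 * Returns the chosen index, or -1 if nothing is runnable. */
int pick_next(const struct tcb *threads, size_t n, size_t current)
{
    assert(threads != NULL && n > 0 && current < n);
    int best = -1;
    for (size_t step = 1; step <= n; step++) {
        size_t i = (current + step) % n;
        if (!threads[i].runnable)
            continue;
        /* Strict '>' keeps the first candidate found after `current`,
         * which is what produces round-robin among equals. */
        if (best < 0 || threads[i].priority > threads[best].priority)
            best = (int)i;
    }
    return best;
}
```

Calling `pick_next` repeatedly with the previously chosen index alternates between threads of equal priority, and only falls back to lower priorities when no higher one is runnable.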
Memory Management
• Split into three concepts:
– Memory pool, which represents an area of the physical address space
with specific attributes.
– Flexible page, which describes a region of an address space that is
always aligned to its size. Unlike in other L4 implementations, a flexible page
in F9 represents an MPU region.
– Address space, which is made up of these flexible pages.
• System calls are provided to manage address spaces:
– Grant: memory page is granted to a new user and cannot be used
anymore by its former user.
– Map: This implements shared memory – the memory page is
passed to another task but can be used by both tasks.
– Flush: The memory page that has been mapped to other users
will be flushed out of their address space.
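The three operations above can be modelled with a toy mapping database. This sketch assumes a fixed number of address spaces and pages and tracks only who holds a page and who it was mapped from. The names `as_map`, `as_grant` and `as_flush` are hypothetical and do not correspond to F9's system-call interface.

```c
#include <assert.h>
#include <stdbool.h>

#define N_SPACES 4
#define N_PAGES  8
#define NO_SPACE (-1)

/* holds[s][p]:  address space s currently has page p.
 * parent[s][p]: the space that mapped p into s, or NO_SPACE. */
static bool holds[N_SPACES][N_PAGES];
static int  parent[N_SPACES][N_PAGES];

static bool valid(int s, int p)
{
    return s >= 0 && s < N_SPACES && p >= 0 && p < N_PAGES;
}

/* Give `owner` sole possession of page p, as the first process has at boot. */
void as_init_page(int owner, int p)
{
    assert(valid(owner, p));
    for (int s = 0; s < N_SPACES; s++) {
        holds[s][p] = false;
        parent[s][p] = NO_SPACE;
    }
    holds[owner][p] = true;
}

/* Map: share p, so that both spaces can use it afterwards. */
bool as_map(int from, int to, int p)
{
    assert(valid(from, p) && valid(to, p));
    if (from == to || !holds[from][p] || holds[to][p])
        return false;
    holds[to][p] = true;
    parent[to][p] = from;
    return true;
}

/* Flush: revoke p from every space that received it, directly or
 * indirectly, from `from`. `from` itself keeps the page. */
void as_flush(int from, int p)
{
    assert(valid(from, p));
    for (int s = 0; s < N_SPACES; s++) {
        if (holds[s][p] && parent[s][p] == from) {
            as_flush(s, p);            /* revoke mappings derived from s first */
            holds[s][p] = false;
            parent[s][p] = NO_SPACE;
        }
    }
}

/* Grant: hand p over, so that the former user can no longer use it. */
bool as_grant(int from, int to, int p)
{
    assert(valid(from, p) && valid(to, p));
    if (from == to || !holds[from][p] || holds[to][p])
        return false;
    holds[to][p] = true;
    parent[to][p] = parent[from][p];   /* `to` takes over from's position */
    for (int s = 0; s < N_SPACES; s++) /* from's mappings now derive from `to` */
        if (holds[s][p] && parent[s][p] == from)
            parent[s][p] = to;
    holds[from][p] = false;
    parent[from][p] = NO_SPACE;
    return true;
}
```

Since a page can only be mapped into a space that does not yet hold it, the parent links form a tree and the recursive flush always terminates.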
IPC
• F9 adopts the concept of the UTCB (user-level thread
control block). A UTCB is a small thread-specific
region in the thread's virtual address space
that is always mapped. Access to the
UTCB can therefore never raise a page fault, which makes it
well suited for the kernel to access system-call arguments,
in particular the IPC payload copied from/to user threads.
• Kernel provides synchronous IPC (inter-process
communication), for which short IPC carries payload in
CPU registers only and full IPC copies message
payload via the UTCBs of the communicating parties.
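The split between short and full IPC can be sketched as below. This is a simplified model, not F9's IPC path: the register and UTCB word counts are assumed values, and treating the first words as register-carried with the remainder in the UTCB is one plausible reading of the description above.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define IPC_REG_WORDS   4   /* message words that fit in CPU registers (assumed) */
#define UTCB_MSG_WORDS 16   /* message words held in a UTCB (assumed) */

/* User-level thread control block: always mapped, so copies cannot fault. */
struct utcb { uint32_t mr[UTCB_MSG_WORDS]; };

struct thread {
    uint32_t    regs[IPC_REG_WORDS];  /* saved message registers */
    struct utcb utcb;
};

enum ipc_kind { IPC_SHORT, IPC_FULL, IPC_TOO_LONG };

/* Transfer `words` message words from sender to receiver.
 * Short IPC touches registers only; full IPC also copies via the UTCBs. */
enum ipc_kind ipc_transfer(const struct thread *from, struct thread *to,
                           size_t words)
{
    assert(from != NULL && to != NULL);
    if (words > IPC_REG_WORDS + UTCB_MSG_WORDS)
        return IPC_TOO_LONG;

    size_t in_regs = words < IPC_REG_WORDS ? words : IPC_REG_WORDS;
    memcpy(to->regs, from->regs, in_regs * sizeof(uint32_t));
    if (words <= IPC_REG_WORDS)
        return IPC_SHORT;

    memcpy(to->utcb.mr, from->utcb.mr,
           (words - IPC_REG_WORDS) * sizeof(uint32_t));
    return IPC_FULL;
}
```

The short path avoids any memory copy beyond the register file, which is the reason small messages are cheap in this design.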
Microkernel Paging
• Microkernel forwards page fault to a pager server.
• Kernel or server decides which pages need to be
written to disk in low memory situations.
• Pager server handles writing pages to disk.
Recursive Address Space
• Initial address space controlled by first process.
– Controls all available memory.
– Other address spaces empty at boot.
• Other processes obtain memory pages from the first process, or
from other processes that got their pages from the first.
• Why is memory manager flexibility useful?
– Different applications: real-time, multimedia, disk cache.
[Figure: Grant, Map and Flush operations between address spaces.]
Ktable: fast memory pool
[Figure: a ktable with used and unused slots, managed through ktable_init(), ktable_alloc() and ktable_free().]
• Ktable makes it easier to allocate and deallocate
objects of a pre-defined size and number
• Can be optimized with Bit-banding of ARM Cortex-M
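A fixed-size object pool of this kind can be sketched with a bitmap of used slots. The code below is an illustration only: the struct layout and the `kt_*` names are assumptions and deliberately differ from F9's `ktable_*` functions, whose real signatures are not shown in the slides.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define KT_MAX_OBJS 32   /* one bit per slot in a 32-bit bitmap */

/* Illustrative fixed-size object table; not F9's actual layout. */
struct ktable {
    uint32_t used;       /* bit i set = slot i is allocated */
    size_t   obj_size;   /* size of one object in bytes */
    size_t   num;        /* number of slots, at most KT_MAX_OBJS */
    uint8_t *data;       /* backing storage of num * obj_size bytes */
};

void kt_init(struct ktable *kt, void *storage, size_t obj_size, size_t num)
{
    assert(kt != NULL && storage != NULL);
    assert(obj_size > 0 && num > 0 && num <= KT_MAX_OBJS);
    kt->used = 0;
    kt->obj_size = obj_size;
    kt->num = num;
    kt->data = storage;
}

/* Return the first free slot, or NULL if the table is full. */
void *kt_alloc(struct ktable *kt)
{
    assert(kt != NULL);
    for (size_t i = 0; i < kt->num; i++) {
        if (!(kt->used & (UINT32_C(1) << i))) {
            kt->used |= UINT32_C(1) << i;
            return kt->data + i * kt->obj_size;
        }
    }
    return NULL;
}

/* Release a slot previously returned by kt_alloc(). */
void kt_free(struct ktable *kt, void *obj)
{
    assert(kt != NULL && obj != NULL);
    size_t off = (size_t)((uint8_t *)obj - kt->data);
    assert(off % kt->obj_size == 0 && off / kt->obj_size < kt->num);
    kt->used &= ~(UINT32_C(1) << (off / kt->obj_size));
}
```

On a Cortex-M part with bit-banding, each bit of `used` could instead be written through its bit-band alias word, turning the read-modify-write into a single store.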
Interrupt Handling
• Two-stage interrupt handling
– ISR: IRQ context
– Softirq
• Thread context
• Real time preemptive characteristic
• Can be scheduled like any other threads in the system
• Handled in both kernel thread and user-space
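The two stages can be sketched as a pending bitmap that the ISR sets and a thread later drains. This is a simplified single-core model under stated assumptions: the softirq ids, the function names and the demo handler are hypothetical, and a real kernel would need to mask interrupts or use atomic operations around the bitmap.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical softirq identifiers. */
enum softirq_id { SOFTIRQ_TIMER, SOFTIRQ_UART, SOFTIRQ_COUNT };

typedef void (*softirq_fn)(void);

static softirq_fn handlers[SOFTIRQ_COUNT];
static volatile uint32_t pending;   /* bit i set = softirq i is waiting */

void softirq_register(enum softirq_id id, softirq_fn fn)
{
    assert((int)id >= 0 && id < SOFTIRQ_COUNT && fn != NULL);
    handlers[id] = fn;
}

/* Stage 1, IRQ context: do the minimum and mark the work as pending. */
void isr_raise(enum softirq_id id)
{
    assert((int)id >= 0 && id < SOFTIRQ_COUNT);
    pending |= UINT32_C(1) << id;
}

/* Stage 2, thread context: called from an ordinary schedulable thread.
 * Returns the number of handlers that ran. */
int softirq_run(void)
{
    int ran = 0;
    for (int id = 0; id < SOFTIRQ_COUNT; id++) {
        if (pending & (UINT32_C(1) << id)) {
            pending &= ~(UINT32_C(1) << id);
            if (handlers[id] != NULL) {
                handlers[id]();
                ran++;
            }
        }
    }
    return ran;
}

/* Demo handler used only to observe that stage 2 executed. */
static int demo_runs;
void demo_softirq(void) { demo_runs++; }
```

Raising the same softirq twice before it runs coalesces into one execution, which keeps the time spent in IRQ context short and bounded.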
Energy efficiency: Tickless
• Introduce a tickless timer, which allows the ARM Cortex-M
to wake up only when needed, either at a scheduled
time or on an interrupt event.
• This results in lower current consumption than
the common approach of using the system timer,
SysTick, which requires a constantly running,
high-frequency clock.
How Tick is Implemented
[Figure: hardware timer and CPU. The CPU controls the timer, reads its count and sets up the timeout value; the timer raises a timeout interrupt, on which the CPU adjusts system time and handles the timeout event.]
• Hardware timer device
– Asserts an interrupt after a programmable interval
– Tick processing is done in the timeout Interrupt Service
Routine (ISR)
SysTick in ARM Cortex-M4
• Count-down timer
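[Figure: SysTick count-down logic. The current value (e.g. 0x12abc) is decremented by 1; when it reaches 0 an interrupt is raised and, if auto-reload is enabled, the counter is reloaded with the reload value (e.g. 0x36000).]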
• Timeout ISR
– Increase system ticks
– Execute handler of timeout event
– Re-schedule if required
CPU Operating States
[Figure: state diagram of CPU operating states: processes/threads/tasks, idle thread, ISR, softirq, sleep and deep sleep. Transitions are labelled INT (interrupt), CTX (context switch) and T (after a while).]
Time Diagram of Legacy Ticks
[Figure: timeline of hardware timer interrupts and CPU activities around event1 to event4.]
Context Switch overhead
[Figure: the CPU operating-state diagram from “CPU Operating States”, repeated.]
Regular Power Consumption
[Figure: the CPU operating-state diagram from “CPU Operating States”, repeated.]
Time Diagram of Legacy Ticks
[Figure: timeline of hardware timer interrupts and CPU activities around event1 to event4, annotated “CPU woken up for timekeeping only”.]
Solution: Tickless scheduling
[Figure: timeline of timer interrupts around event1 to event4, comparing the previous CPU activities with the new CPU activities.]
Drawback of Tickless scheduling
• Tickless is not free
– “It increases the number of instructions executed on
the path to and from the idle loop.”
– “On many architectures, dyntick-idle mode also
increases the number of expensive clock-
reprogramming operations”
– Source: P. E. McKenney (May 14, 2013),
“NO_HZ: Reducing Scheduling-Clock Ticks”
• Systems with aggressive real-time response
constraints often run periodic tick
Tickless scheduling in F9
• Enter tickless right before going to CPU idle state
– Set interval of next timer interrupt as delta of next event
– Or KTIMER_MAXTICKS
• Adjust system time after waking up
• Tickless compensation
– SysTick frequency is distorted when entering/exiting standby mode
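The steps above can be sketched as two small helpers: one computes how long to sleep, the other adjusts system time on wake-up. This is a minimal sketch under stated assumptions: the value given to `KTIMER_MAXTICKS` is a placeholder rather than F9's, and the function names and the compensation parameter are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

#define KTIMER_MAXTICKS 0x10000u   /* placeholder value, not F9's */

/* Ticks to program into the timer before entering idle.
 * next_event is the absolute time of the next event, or NULL if none.
 * Returns 0 when an event is already due, so the CPU should not idle. */
uint32_t tickless_interval(uint64_t now, const uint64_t *next_event)
{
    if (next_event == NULL)
        return KTIMER_MAXTICKS;        /* nothing scheduled: sleep the maximum */
    if (*next_event <= now)
        return 0;
    uint64_t delta = *next_event - now;
    return delta > KTIMER_MAXTICKS ? KTIMER_MAXTICKS : (uint32_t)delta;
}

/* After wake-up: advance system time by the ticks actually slept, plus a
 * compensation term measured by another timer to correct for distortion
 * while entering and leaving standby. */
uint64_t tickless_wakeup(uint64_t now, uint32_t slept_ticks,
                         uint32_t compensation)
{
    assert(slept_ticks <= KTIMER_MAXTICKS);
    return now + slept_ticks + compensation;
}
```

Capping the interval at `KTIMER_MAXTICKS` keeps the programmed value within the range of the hardware counter even when no event is pending.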
[Figure: timeline of timer interrupts, CPU activity (idle, active, handling other interrupts) and SysTick while tickless.]
Tickless compensation
(compensation from general purpose timer)
[Figure: the same timeline with tickless compensation applied.]
• System activity during idle with and without
periodic ticks
• System activity during idle with and
without deferrable timer usage in
ondemand
Kprobes: dynamic instrumentation
• Inspired by Linux Kernel, allowing developers to gather
additional information about kernel operation without
recompiling or rebooting the kernel.
• It enables locations in the kernel to be instrumented
with code, and the instrumentation code runs when the
ARM core encounters that probe point.
• Once the instrumentation code completes execution,
the kernel continues normal execution.
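The mechanism can be illustrated with a simulated probe table. This sketch does not patch instructions or take a real debug exception: `probe_dispatch` stands in for the trap taken when the core reaches a probe point, and all names and signatures here are assumptions, not the F9 or Linux Kprobes API.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Instrumentation callback: receives the probed address and user context. */
typedef void (*probe_handler)(uintptr_t addr, void *ctx);

struct probe {
    uintptr_t     addr;     /* location in the kernel being instrumented */
    probe_handler handler;  /* code to run when that location is reached */
    void         *ctx;      /* opaque data passed to the handler */
};

#define MAX_PROBES 4
static struct probe probes[MAX_PROBES];
static size_t n_probes;

/* Register a probe at run time. Returns 0 on success, -1 if the table is full. */
int probe_register(uintptr_t addr, probe_handler handler, void *ctx)
{
    assert(handler != NULL);
    if (n_probes == MAX_PROBES)
        return -1;
    probes[n_probes++] = (struct probe){ addr, handler, ctx };
    return 0;
}

/* Stand-in for the trap taken when execution reaches `addr`: run every
 * matching handler, then return so normal execution can continue.
 * Returns the number of handlers that ran. */
size_t probe_dispatch(uintptr_t addr)
{
    size_t ran = 0;
    for (size_t i = 0; i < n_probes; i++) {
        if (probes[i].addr == addr) {
            probes[i].handler(addr, probes[i].ctx);
            ran++;
        }
    }
    return ran;
}

/* Example handler: count hits in an int supplied as ctx. */
void probe_count_hits(uintptr_t addr, void *ctx)
{
    (void)addr;
    ++*(int *)ctx;
}
```

Because probes are added to a table at run time, the kernel image itself does not need to be recompiled or rebooted to collect the extra information.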
Application Development
• Partial POSIX support
• configurable debug console
• memory dump
• thread profiling
– name, uptime, stack allocated/current/used
• memory profiling
– kernel table, pool free/allocated size, fragmentation
• On-going PGO (Profile-guided optimization) and
AutoFDO support
Commercial Adoption
• F9 microkernel is used by Genesi USA, Inc. in its smart
solutions for the Internet of Things
http://genesi.company/solutions/embedded
• Genesi's Radix K1 is a low-cost embedded device built
around a Freescale ARM Cortex-M4
– 100 MHz MCU with 512 KB of flash and 128 KB of built-in RAM,
and a 4G GSM module.
• The device-to-server communication link uses WAMP, a
WebSocket subprotocol, and the data exchanged is encrypted
using CycloneSSL.
• Basic memory protection is available through built-in MPU.
Conclusion
• Minimizing TCB is vital for building secure IoT
systems, and L4 based designs bring temporal
isolation, asymmetric protection, safe bounded
resource sharing achieved through scheduling
contexts, criticality, and temporal exceptions.
• The ARM Cortex-M processor enables highly deterministic
real-time applications on high-performance,
low-cost platforms, and the F9 microkernel uses Cortex-M's
advantages to build an efficient and secure TCB.
• The value of open source is the community made up of
people who have dedicated their time and their life to
see its success. So, commercial adoption is feasible.
Reference
• From L3 to seL4: What Have We Learnt in 20 Years of
L4 Microkernels? Kevin Elphinstone and Gernot
Heiser, NICTA/UNSW
• Microkernel Construction
http://os.inf.tu-dresden.de/Studium/MkK/
• Microkernel-based Operating Systems
http://www.inf.tu-dresden.de/index.php?node_id=1314
• Getting maximum mileage out of tickless, Intel Open
Source Technology Center
• F9 Microkernel ktimer, Viller Hsiao

More Related Content

PDF
給自己更好未來的 3 個練習:嵌入式作業系統設計、實做,與移植 (2015 年春季 ) 課程說明
PDF
中輟生談教育: 完全用開放原始碼軟體進行 嵌入式系統教學
PDF
What Can Compilers Do for Us?
PDF
用十分鐘 向jserv學習作業系統設計
PDF
Share the Experience of Using Embedded Development Board
PDF
Learn C Programming Language by Using GDB
PDF
from Source to Binary: How GNU Toolchain Works
PDF
LLVM 總是打開你的心:從電玩模擬器看編譯器應用實例
給自己更好未來的 3 個練習:嵌入式作業系統設計、實做,與移植 (2015 年春季 ) 課程說明
中輟生談教育: 完全用開放原始碼軟體進行 嵌入式系統教學
What Can Compilers Do for Us?
用十分鐘 向jserv學習作業系統設計
Share the Experience of Using Embedded Development Board
Learn C Programming Language by Using GDB
from Source to Binary: How GNU Toolchain Works
LLVM 總是打開你的心:從電玩模擬器看編譯器應用實例

What's hot (20)

PDF
The Microkernel Mach Under NeXTSTEP
PDF
Interpreter, Compiler, JIT from scratch
PDF
Launch the First Process in Linux System
PDF
Embedded_Linux_Booting
PDF
ARM and SoC Traning Part I -- Overview
PDF
淺談探索 Linux 系統設計之道
PDF
Let's trace Linux Lernel with KGDB @ COSCUP 2021
PDF
Making Linux do Hard Real-time
PDF
How A Compiler Works: GNU Toolchain
PDF
ゲーム開発者のための C++11/C++14
PDF
Jagan Teki - U-boot from scratch
PDF
GNU ld的linker script簡介
PDF
ZynqMPのQEMU
PDF
ZynqMP勉強会
PDF
Kernel Recipes 2019 - Faster IO through io_uring
PPTX
最新UE4タイトルでのローカライズ事例 (UE4 Localization Deep Dive)
PDF
Embedded Virtualization applied in Mobile Devices
PDF
Arm device tree and linux device drivers
PDF
Unityでパフォーマンスの良いUIを作る為のTips
PPTX
The Microkernel Mach Under NeXTSTEP
Interpreter, Compiler, JIT from scratch
Launch the First Process in Linux System
Embedded_Linux_Booting
ARM and SoC Traning Part I -- Overview
淺談探索 Linux 系統設計之道
Let's trace Linux Lernel with KGDB @ COSCUP 2021
Making Linux do Hard Real-time
How A Compiler Works: GNU Toolchain
ゲーム開発者のための C++11/C++14
Jagan Teki - U-boot from scratch
GNU ld的linker script簡介
ZynqMPのQEMU
ZynqMP勉強会
Kernel Recipes 2019 - Faster IO through io_uring
最新UE4タイトルでのローカライズ事例 (UE4 Localization Deep Dive)
Embedded Virtualization applied in Mobile Devices
Arm device tree and linux device drivers
Unityでパフォーマンスの良いUIを作る為のTips
Ad

Viewers also liked (20)

PDF
F9: A Secure and Efficient Microkernel Built for Deeply Embedded Systems
PDF
Develop Your Own Operating Systems using Cheap ARM Boards
PDF
The Internals of "Hello World" Program
PDF
進階嵌入式作業系統設計與實做 (2015 年秋季 ) 課程說明
PDF
Microkernel Evolution
PDF
2016 年春季嵌入式作業系統課程說明
PDF
Hints for L4 Microkernel
PDF
L4 Microkernel :: Design Overview
PDF
Virtual Machine Constructions for Dummies
PDF
Xvisor: embedded and lightweight hypervisor
PDF
Making Linux do Hard Real-time
PDF
Shorten Device Boot Time for Automotive IVI and Navigation Systems
PDF
Lecture notice about Embedded Operating System Design and Implementation
PDF
Implement Runtime Environments for HSA using LLVM
PDF
olibc: Another C Library optimized for Embedded Linux
PDF
Explore Android Internals
PDF
Priority Inversion on Mars
PDF
Hardware Accelerated 2D Rendering for Android
PDF
進階嵌入式系統開發與實作 (2013 秋季班 ) 課程說明
PDF
Faults inside System Software
F9: A Secure and Efficient Microkernel Built for Deeply Embedded Systems
Develop Your Own Operating Systems using Cheap ARM Boards
The Internals of "Hello World" Program
進階嵌入式作業系統設計與實做 (2015 年秋季 ) 課程說明
Microkernel Evolution
2016 年春季嵌入式作業系統課程說明
Hints for L4 Microkernel
L4 Microkernel :: Design Overview
Virtual Machine Constructions for Dummies
Xvisor: embedded and lightweight hypervisor
Making Linux do Hard Real-time
Shorten Device Boot Time for Automotive IVI and Navigation Systems
Lecture notice about Embedded Operating System Design and Implementation
Implement Runtime Environments for HSA using LLVM
olibc: Another C Library optimized for Embedded Linux
Explore Android Internals
Priority Inversion on Mars
Hardware Accelerated 2D Rendering for Android
進階嵌入式系統開發與實作 (2013 秋季班 ) 課程說明
Faults inside System Software
Ad

Similar to Construct an Efficient and Secure Microkernel for IoT (20)

PDF
μ-Kernel Evolution
PPTX
Microkernels and Beyond
PPTX
Introduction to Operating system and graduate
PDF
Walking around linux kernel
PDF
A Reimplementation of NetBSD Based on a Microkernel by Andrew S. Tanenbaum
PDF
Microkernel design
PDF
The Quest for the Perfect API
PPTX
13-kernels operating system introduction
PDF
Analysis of Practicality and Performance Evaluation for Monolithic Kernel and...
ODP
A tour of F9 microkernel and BitSec hypervisor
PPTX
Monolithic kernel vs. Microkernel
PPT
PDF
Hardware/Software Co-Design for Efficient Microkernel Execution
PDF
2024 in Microkernels (a year in review lightning talk)
PPTX
Operating system 15 micro kernel based os
PPT
casecomp.ppt. shsjsi sjsjjsjsjsjsuaiajjajwjsjsksks
PDF
TDC2016SP - Trilha Linux Embarcado
PDF
Exploiting Modern Microarchitectures: Meltdown, Spectre, and other Attacks
PPTX
KERNEL.pptx
PDF
Status of Embedded Linux
μ-Kernel Evolution
Microkernels and Beyond
Introduction to Operating system and graduate
Walking around linux kernel
A Reimplementation of NetBSD Based on a Microkernel by Andrew S. Tanenbaum
Microkernel design
The Quest for the Perfect API
13-kernels operating system introduction
Analysis of Practicality and Performance Evaluation for Monolithic Kernel and...
A tour of F9 microkernel and BitSec hypervisor
Monolithic kernel vs. Microkernel
Hardware/Software Co-Design for Efficient Microkernel Execution
2024 in Microkernels (a year in review lightning talk)
Operating system 15 micro kernel based os
casecomp.ppt. shsjsi sjsjjsjsjsjsuaiajjajwjsjsksks
TDC2016SP - Trilha Linux Embarcado
Exploiting Modern Microarchitectures: Meltdown, Spectre, and other Attacks
KERNEL.pptx
Status of Embedded Linux

More from National Cheng Kung University (6)

PDF
PyPy's approach to construct domain-specific language runtime
PDF
從線上售票看作業系統設計議題
PDF
進階嵌入式系統開發與實做 (2014 年秋季 ) 課程說明
PDF
Open Source from Legend, Business, to Ecosystem
PDF
Summer Project: Microkernel (2013)
PDF
Develop Your Own Operating System
PyPy's approach to construct domain-specific language runtime
從線上售票看作業系統設計議題
進階嵌入式系統開發與實做 (2014 年秋季 ) 課程說明
Open Source from Legend, Business, to Ecosystem
Summer Project: Microkernel (2013)
Develop Your Own Operating System

Recently uploaded (20)

PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
Digital Logic Computer Design lecture notes
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
UNIT 4 Total Quality Management .pptx
PPT
Mechanical Engineering MATERIALS Selection
PPT
Project quality management in manufacturing
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PDF
PPT on Performance Review to get promotions
PPTX
Construction Project Organization Group 2.pptx
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PPTX
web development for engineering and engineering
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
Digital Logic Computer Design lecture notes
Internet of Things (IOT) - A guide to understanding
UNIT 4 Total Quality Management .pptx
Mechanical Engineering MATERIALS Selection
Project quality management in manufacturing
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
CYBER-CRIMES AND SECURITY A guide to understanding
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPT on Performance Review to get promotions
Construction Project Organization Group 2.pptx
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Embodied AI: Ushering in the Next Era of Intelligent Systems
Operating System & Kernel Study Guide-1 - converted.pdf
web development for engineering and engineering

Construct an Efficient and Secure Microkernel for IoT

  • 1. Construct an Efficient and Secure Microkernel for IoT Jim Huang ( 黃敬群 ) <jserv.tw@gmail.com> CTO, SSXelerator, Inc. May 29, 2015 / CTHPC
  • 2. Rights to copy Attribution – ShareAlike 3.0 You are free to copy, distribute, display, and perform the work to make derivative works to make commercial use of the work Under the following conditions Attribution. You must give the original author credit. Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under a license identical to this one. For any reuse or distribution, you must make clear to others the license terms of this work. Any of these conditions can be waived if you get permission from the copyright holder. Your fair use and other rights are in no way affected by the above. License text: http://creativecommons.org/licenses/by-sa/3.0/legalcode © Copyright 2015 Jim Huang Corrections, suggestions, contributions and translations are welcome! Latest update: May 29, 2015
  • 3. Goals of This Presentation • The promise of the IoT won’t be fulfilled until integrated software platforms are available that allow software developers to develop these devices efficiently and in the most cost-effective manner possible. • Introduce F9 microkernel, new open source implementation built from scratch, which deploys modern kernel techniques dedicated to deeply embedded devices. • Characteristics of F9 microkernel – Efficiency: performance + power consumption – Security: memory protection + isolated execution – Flexible development environment
  • 4. Wait! Why do we need yet another kernel?
  • 5. TCB (Trusted Computing Base) traditional embedded Linux/ Windows Microkernel based all code 100,000 LoC 10,000 LoC System TCB source: Diagram from Kashin Lin (NEWS Lab)
  • 6. Bugs inside “Bigger than Bigger” Kernels• Drivers cause 85% of Windows XP crashes. – Michael M. Swift, Brian N. Bershad, Henry M. Levy: “Improving the Reliability of Commodity Operating Systems”, SOSP 2003 • Error rate in Linux drivers is 3x (maximum: 10x) – Andy Chou, Junfeng Yang, Benjamin Chelf, Seth Hallem, Dawson R. Engler: “An Empirical Study of Operating System Errors”, SOSP 2001 • Causes for driver bugs – 23% programming error – 38% mismatch regarding device specification – 39% OS-driver-interface misconceptions – Leonid Ryzhyk, Peter Chubb, Ihor Kuz and Gernot Heiser: “Dingo: Taming device drivers”, EuroSys 2009
  • 7. Linux Device Driver bugs [Dingo: Taming device drivers, 2009]
  • 8. • BlackHat 2013 – MACTANS: INJECTING MALWARE INTO IOS DEVICES VIA MALICIOUS CHARGERS – http://www.blackhat.com/us-13/briefings.html#Lau • "we demonstrate how an iOS device can be compromised within one minute of being plugged into a malicious charger. We first examine Apple’s existing security mechanisms to protect against arbitrary software installation, then describe how USB capabilities can be leveraged to bypass these defense mechanisms." Attack iOS through USB charger! (fatal problem of in-kernel USB stack)
  • 9. Microkernel • Minimalist approach – IPC, virtual memory, thread scheduling • Put the rest into user space – Device drivers, networking, file system, user interface • Disadvantages – Lots of system calls and context switches • Examples: Mach, L4, QNX, MINIX, IBM K42
  • 10. Principle of least authority (POLA), also known as the principle of least privilege • A capability is a communicable, unforgeable token of authority. It refers to a value that references an object along with an associated set of access rights. • A user program on a capability-based operating system must use a capability to access an object.
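The capability idea on this slide can be illustrated with a small Python model. This is only a sketch under stated assumptions: the names `Capability`, `derive`, `invoke`, and the `read`/`write` rights are hypothetical and are not taken from F9 or any L4 API, and Python cannot make a token truly unforgeable (a real kernel keeps capabilities in protected kernel memory).

```python
from dataclasses import dataclass

# Hypothetical right names, chosen for illustration only.
READ, WRITE = "read", "write"


@dataclass(frozen=True)
class Capability:
    """A token that pairs an object reference with a set of access rights."""
    obj: dict
    rights: frozenset

    def derive(self, rights):
        """Hand out a weaker capability: rights can shrink but never grow."""
        return Capability(self.obj, self.rights & frozenset(rights))


def invoke(cap, op, key, value=None):
    """Every access goes through a capability; missing rights are refused."""
    if op not in cap.rights:
        raise PermissionError(f"capability lacks '{op}' right")
    if op == READ:
        return cap.obj.get(key)
    cap.obj[key] = value
    return None


# The owner holds full rights and passes a read-only capability to a client.
sensor_log = {"temp": 21}
owner_cap = Capability(sensor_log, frozenset({READ, WRITE}))
reader_cap = owner_cap.derive({READ})
```

A holder of `reader_cap` can read `sensor_log` but any write attempt raises `PermissionError`, which is the least-authority property: a component receives only the rights it needs.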
  • 11. Microkernel Concepts • Minimal kernel and hardware enforce separation • Only kernel runs in CPU privileged mode • Components are user level processes • No restrictions on component software • Reuse of legacy software • “A concept is tolerated inside the microkernel only if moving it outside the kernel, i.e., permitting competing implementations, would prevent the implementation of the system's required functionality.” – Jochen Liedtke
  • 12. “Worse Is Better”, Richard P. Gabriel • New Jersey style [UNIX, Bell Labs] vs. MIT style [Multics] – Simplicity: No. 1 consideration, implementation > interface (New Jersey); interface > implementation (MIT) – Correctness: mostly (New Jersey); 100% (MIT) – Consistency: mostly (New Jersey); 100% (MIT) – Completeness: de facto (New Jersey); mostly (MIT) • Design competition between New Jersey and MIT style • Interface first [Multics] → Implementation first [Unix] → Interface first [Mach] → Implementation first [Linux] → Interface first [seL4]
  • 13. Microkernel • Put the rest into user space – Device drivers, networking, file system, user interface • Diagram: File System, Networking, Multi-media, Windowing, Process Manager, and Application plug into a Message Bus on top of the microkernel • Microkernel + Process Manager are the only trusted components • Applications and Drivers – Are processes which plug into a message bus – Reside in their own memory-protected address space – Have a well defined message interface – Cannot corrupt other software components – Can be started, stopped and upgraded on the fly
  • 14. Microkernel: Definitions • A kernel design that provides only the minimum OS services. – Address Spaces – Inter-process Communication (IPC) – Thread Management – Unique Identifiers • All other services are implemented independently in user space.
  • 15. 3 Generations of Microkernel • Mach (1985-1994) – replace pipes with IPC (more general) – improved stability (vs monolithic kernels) – poor performance • L3 & L4 (1990-2001) – order of magnitude improvement in IPC performance • written in assembly, sacrificed CPU portability • only synchronous IPC (build async on top of sync) – very small kernel: more functions moved to userspace • seL4, Fiasco.OC, Coyotos, NOVA (2000-) – platform independence – verification, security, multiple CPUs, etc.
  • 16. 3 Generations of Microkernel • Generation 1: Mach (1985-1994) • Generation 2: L3 & L4 (1990-2001) • Generation 3: seL4, Fiasco.OC, NOVA (2000-)
  • 17. Performance of 1st Generation CMU Mach (1985), Chorus (1987), MkLinux (1996) • Does not prohibit caching • Reduce number of copies of data occupying memory – Copy-to-use, copy-to-kernel – More memory for caching • I/O operations reduced by a factor of 10 • Context switch overhead – Cost of kernel overhead can be up to 800 cycles. • Address Space Switches – Expensive Page Table and Segment Switch Overhead – Untagged TLB = Bad performance
  • 18. L4: the 2nd Generation • Similar to Mach – Started from scratch, rather than monolithic – But even more minimal • Minimality principle for L4: A concept is tolerated inside the microkernel only if moving it outside the kernel, i.e., permitting competing implementations, would prevent the implementation of the system's required functionality. • Tasks, threads, IPC – Uses only 12k of memory – API size of Mach: 140 functions (Asynchronous IPC, Threads, Scheduling, Memory management, Resource access permissions) – API size of L4: 7 functions (Synchronous IPC, Threads, Scheduling, Memory management)
  • 19. Performance Gain (1st to 2nd Generation) • Why 1st-generation kernels were slow: poor design [Liedtke SOSP'95] – Complex API – Too many features – Poor design and implementation – Large cache footprint ⇒ memory-bandwidth limited • L4 is fast due to small cache footprint – 10–14 I-cache lines – 8 D-cache lines – Small cache footprint ⇒ CPU limited
  • 20. L4 Family (incomplete) Source: Microkernel-based Operating Systems – Introduction, Carsten Weinhold, TU Dresden (2012)
  • 21. L4 Family: OKL4 • L4 implementations on embedded processors – ARM, MIPS • Wombat: portable virtualized Linux for embedded systems • Utilize ARM FCSE (fast context-switching extension) for ARMv5
  • 22. Commercial L4: from NICTA to OKLabs • The L4::Pistachio microkernel was originally developed at Karlsruhe University. NICTA ported it to a number of architectures, including ARM, and optimized it for use in resource-constrained embedded systems. • In 2004, Qualcomm engaged NICTA in a consulting arrangement to deploy L4 on Qualcomm's wireless communication chips. • The engagement with Qualcomm grew to a volume where it was too significant a development/engineering effort to be done inside the research organization. – Commercialized! Open Kernel Labs • Acquired by General Dynamics in 2012 Source: http://microkerneldude.wordpress.com/2012/10/02/giving-it-away-part-2-on-microkernels-and-the-national-interes/
  • 23. OKL4 Use Cases Each secure cell in the system offers isolation from software in other cells Existing software components can be reused in new designs Microvisor tames the complexity of dispatching multi-OS workloads across multiple physical CPUs
  • 24. Moving from 2nd to 3rd Generation OKL4 • Dumped recursive address-space model – reduced kernel complexity – First L4 kernel with capability-based access control OKL4 Microvisor • Removed synchronous IPC • Removed kernel-scheduled threads seL4 • All memory management at user level – no kernel heap! • Formal proof of functional correctness • Performance on par with fastest kernels – <200 cycle IPC on ARM11 without assembler fastpath
  • 25. Problems in the 2nd Generation • Microkernel needs memory for its abstractions – tasks: page tables – threads: kernel-TCB – capability tables – IPC wait queues – mapping database • Kernel memory is limited – opens the possibility of DoS attacks
  • 26. seL4 as 3rd Microkernel • Functional Correctness [SOSP’09] • Timeliness (known WCET) [RTSS’11,EuroSys’12] • Translation Correctness [PLDI’13] • Fast (258 cycle IPC roundtrip on 1GHz Cortex-A9) • Safety: specifically temporal properties. • Minimal TCB (~9000 SLoC)
  • 27. F9: A new microkernel designed for Deeply Embedded Devices
  • 28. Deeply Embedded Devices • Power awareness; solid and limited applications • Multi-tasking or cooperative scheduling is still required • IoT (Internet of Things) is the specialized derivative with networking facility • Communication capability is built-in for some products • Example: AIRO wristband (health tracker) http://www.weweartech.com/amazing-new-uses-smart-watches/
  • 30. Design Considerations of IoT • Network – IoT networks must be scalable in order to support the dynamic nature of the IoT (as devices are added and removed from the network). • Security – Integration of security protocols for encryption and authentication must always be required. – Before any data is transferred, the source of the data needs to be verified. – The use of encryption prevents the loss of data to passive listeners, but it does not prevent the alteration of data while traversing the network. • Power Management – Use processors with many low-power features, including DVFS and hibernate. • Need for full-featured RTOS framework
  • 31. Nucleus RTOS ecosystem for the development of connected IoT devices Source: INTERNET OF THINGS (IoT) DESIGN CONSIDERATIONS FOR EMBEDDED CONNECTED DEVICES, Mentor Graphics
  • 32. Advanced Software Requirements of IoT Products • Over-The-Air (OTA) update with a double bank firmware update mechanism. The switch to a new version is only operated when the newly downloaded content is fully validated. • A dedicated first stage loader/diagnostic/recovery application is used for this update mechanism. It provides full access to all internal and external memories.
  • 33. ARM mbed supports WiFi, Bluetooth, 2G, 3G, and LTE communication technologies, among other technologies, and the software could make it easier for hardware companies to get devices to market without spending a lot of time building custom firmware.
  • 34. Characteristics of F9 Microkernel https://github.com/f9micro
  • 35. Unique Characteristics • BSD Licensing (two-clause), suitable for both research and commercial usage. – Commercial adaptation since 2014 • Efficiency – Optimized for ARM Cortex-M3/M4 – performance: fast IPC and well-structured designs – energy-saving: tickless scheduling, adaptive power management • Security – memory protection: MPU guarded – Isolated execution: L4 based, capabilities model • Flexible development – Kprobes – profile-directed optimizations
  • 36. Why are current systems unreliable? • Problem 1: “Systems are huge” – No single person can understand the whole system > F9 Microkernel has only 3K LoC of portable C • Problem 2: “Bug fixes usually introduce new bugs.” > F9 introduces execution domains and on-the-fly patches • Problem 3: “Poor fault isolation” – No isolation between system components – OS contains hundreds of procedures linked together as a single binary program running in kernel mode. > F9 is built from scratch and well-engineered for isolation
  • 37. F9 Microkernel Architecture [diagram; labels: User Space, Kernel Space, Untrusted Domain, Trusted Domain, Parent Partition, Applications, VM Worker Process, File System Server, Application Framework, Memory Management Server, (Interrupts), Scheduling Policy, Media Driver, Network Driver, Network Stack, Board Specific, Task Management, KProbes, In-kernel Debugger, F9 Microkernel]
  • 38. Principles • F9 follows the fundamental principles of L4 microkernels – implements address spaces, thread management, and IPC only in the privileged kernel. • Designed and customized for ARM Cortex-M, supporting NVIC (Nested Vectored Interrupt Controller), Bit Banding, MPU (Memory Protection Unit)
  • 39. Thread • Each thread has its own TCB (Thread Control Block) and is addressed by its global ID. • The dispatcher is responsible for switching contexts. Threads with the same priority are executed in a round-robin fashion.
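The dispatching rule on this slide (highest priority first, round-robin among equals) can be sketched as a small Python model. The class name `Dispatcher`, its methods, and the convention that a larger number means a higher priority are assumptions made for illustration; they do not mirror F9's actual scheduler code.

```python
from collections import deque


class Dispatcher:
    """Toy model: pick the highest-priority ready thread, rotating equals."""

    def __init__(self):
        self.ready = {}  # priority -> deque of thread ids

    def add(self, tid, priority):
        """Make a thread ready at the given priority."""
        self.ready.setdefault(priority, deque()).append(tid)

    def pick_next(self):
        """Return the next thread id to run, or None if nothing is ready."""
        if not self.ready:
            return None
        top = max(self.ready)  # assumption: larger number = higher priority
        queue = self.ready[top]
        tid = queue.popleft()
        queue.append(tid)  # move to the back so equal-priority peers take turns
        return tid
```

With two threads at priority 1 and an idle thread at priority 0, repeated calls to `pick_next()` alternate between the two priority-1 threads and never select the idle thread while they remain ready.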
  • 40. Memory Management • Split into three concepts: – Memory pool, which represents an area of the physical address space with specific attributes. – Flexible page, which describes an always size-aligned region of an address space. Unlike other L4 implementations, flexible pages in F9 represent MPU regions instead. – Address space, which is made up of these flexible pages. • System calls are provided to manage address spaces: – Grant: The memory page is granted to a new user and can no longer be used by its former user. – Map: This implements shared memory; the memory page is passed to another task but can be used by both tasks. – Flush: The memory page that has been mapped to other users is flushed out of their address spaces.
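The semantics of grant, map, and flush described on this slide can be modeled in a few lines of Python. This is a simplified sketch, not F9's implementation: `AddressSpace`, `grant`, `map_page`, and `flush` are hypothetical names, pages are plain integers rather than MPU regions, and the recursive revocation in `flush` is an assumption borrowed from the general L4 model.

```python
class AddressSpace:
    """Toy address space: a set of pages plus a record of outgoing mappings."""

    def __init__(self, name):
        self.name = name
        self.pages = set()    # pages this space can currently access
        self.mapped_out = {}  # page -> list of spaces this space mapped it to


def grant(src, dst, page):
    """Grant: dst gains the page and src loses it."""
    if page not in src.pages:
        raise KeyError(page)
    src.pages.discard(page)
    dst.pages.add(page)


def map_page(src, dst, page):
    """Map: both src and dst can use the page afterwards (shared memory)."""
    if page not in src.pages:
        raise KeyError(page)
    if page in dst.pages:
        raise ValueError("page already present in destination")
    dst.pages.add(page)
    src.mapped_out.setdefault(page, []).append(dst)


def flush(src, page):
    """Flush: revoke the page from every space it was mapped to, recursively."""
    for dst in src.mapped_out.pop(page, []):
        dst.pages.discard(page)
        flush(dst, page)
```

For example, if a root space maps a page to task A and A maps it on to task B, then `flush(root, page)` removes the page from both A and B while root keeps it.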
  • 41. IPC • F9 adopts the concept of UTCBs (user-level thread-control blocks). A UTCB is a small thread-specific region in the thread's virtual address space, which is always mapped. Therefore, access to the UTCB can never raise a page fault, which makes it ideal for the kernel to access system-call arguments, in particular IPC payload copied from/to user threads. • The kernel provides synchronous IPC (inter-process communication), for which short IPC carries payload in CPU registers only and full IPC copies message payload via the UTCBs of the communicating parties.
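The difference between short and full IPC can be pictured with the following Python sketch. It is an illustrative model only: `REG_WORDS` (how many message words fit in registers), the `Thread` class, and `ipc_send` are assumptions, and the blocking rendezvous that makes the IPC synchronous is not modeled.

```python
REG_WORDS = 8  # assumption: number of message words that fit in CPU registers


class Thread:
    def __init__(self, name):
        self.name = name
        self.regs = [0] * REG_WORDS  # stands in for CPU registers
        self.utcb = []               # stands in for the always-mapped UTCB


def ipc_send(sender, receiver, payload):
    """Deliver payload and report which path was used: 'short' or 'full'."""
    words = list(payload)
    if len(words) <= REG_WORDS:
        # Short IPC: the payload travels in registers only.
        receiver.regs = words + [0] * (REG_WORDS - len(words))
        return "short"
    # Full IPC: the sender stages the message in its UTCB and the kernel
    # copies it into the receiver's UTCB. Both are always mapped, so the
    # copy cannot page-fault.
    sender.utcb = words
    receiver.utcb = list(sender.utcb)
    return "full"
```

A two-word message takes the short path and never touches a UTCB, while a ten-word message takes the full path and lands in the receiver's UTCB.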
  • 42. Microkernel Paging • Microkernel forwards page fault to a pager server. • Kernel or server decides which pages need to be written to disk in low memory situations. • Pager server handles writing pages to disk.
  • 43. Recursive Address Space • Initial address space controlled by first process. – Controls all available memory. – Other address spaces empty at boot. • Other processes obtain memory pages from the first process or from other processes that got pages from the first. • Why is memory manager flexibility useful? – Different applications: real-time, multimedia, disk cache. • Operations: Grant, Map, Flush
  • 44. Ktable: fast memory pool • API: ktable_init(), ktable_alloc(), ktable_free() [diagram: table slots marked Used / Unused] • Ktable is in charge of allocation and deallocation of objects of pre-defined size and number • Can be optimized with bit-banding on ARM Cortex-M
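A fixed-size pool of this kind is commonly tracked with an allocation bitmap, and the following Python sketch shows that idea. It is an assumption about the general technique, not a transcription of F9's ktable code: the class `KTable` and its methods only loosely mirror the `ktable_alloc()` / `ktable_free()` names on the slide, and the first-fit scan is chosen for simplicity.

```python
class KTable:
    """Fixed-size object pool tracked by an allocation bitmap."""

    def __init__(self, num_objects):
        self.num = num_objects
        self.bitmap = 0  # bit i set means slot i is in use

    def alloc(self):
        """Return the index of the first free slot, or None if the pool is full."""
        for i in range(self.num):
            if not (self.bitmap >> i) & 1:
                self.bitmap |= 1 << i
                return i
        return None

    def free(self, i):
        """Release slot i; freeing an unused or out-of-range slot is an error."""
        if not 0 <= i < self.num or not (self.bitmap >> i) & 1:
            raise ValueError(f"slot {i} is not allocated")
        self.bitmap &= ~(1 << i)
```

Because the object size and count are fixed up front, allocation never fragments and never needs a general-purpose heap. On Cortex-M, bit-banding would let each bitmap bit be read or written through its own alias word, which is presumably the optimization the slide refers to.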
  • 45. Interrupt Handling • Two-stage interrupt handling – ISR: IRQ context – Softirq • Thread context • Real time preemptive characteristic • Can be scheduled like any other threads in the system • Handled in both kernel thread and user-space
  • 46. Energy efficiency: Tickless • Introduce a tickless timer, which allows the ARM Cortex-M to wake up only when needed, either at a scheduled time or on an interrupt event. • This results in lower current consumption than the common approach using the system timer, SysTick, which requires a constantly running, high-frequency clock.
  • 47. How Tick is Implemented • Hardware timer device – Asserts an interrupt after a programmable interval – Tick handling is done in the timeout Interrupt Service Routine (ISR) [diagram: CPU ↔ Hardware Timer; Control, Read Counts, Timeout interrupt; CPU steps: set up timeout value, adjust system time, handle timeout event, …]
  • 48. SysTick in ARM Cortex-M4 • Count-down timer [diagram: reload value 0x36000, current value 0x12abc; the counter decrements by 1 and, on reaching 0, raises an interrupt and auto-reloads] • Timeout ISR – Increase system ticks – Execute handler of timeout event – Re-schedule if required
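The count-down behaviour on this slide can be sketched as a simplified Python model. The class `SysTick` and its fields are hypothetical and ignore the real peripheral's register interface; the only behaviour modeled is decrement, interrupt on reaching zero, and automatic reload on the following clock.

```python
class SysTick:
    """Simplified count-down timer with auto-reload."""

    def __init__(self, reload):
        self.reload = reload
        self.current = reload
        self.ticks = 0  # system tick count, incremented by the timeout ISR

    def clock(self, cycles=1):
        """Advance the timer by the given number of clock cycles."""
        for _ in range(cycles):
            if self.current == 0:
                self.current = self.reload  # auto-reload after hitting zero
            else:
                self.current -= 1
                if self.current == 0:
                    self.ticks += 1  # timeout ISR: increase system ticks
```

In this model the steady-state tick period is `reload + 1` clock cycles, so the CPU is interrupted at a fixed rate whether or not any event is due. That fixed wake-up cost is what tickless scheduling tries to avoid.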
  • 49. CPU Operating States [state diagram] – States: Processes / Threads / Tasks, Idle thread, ISR, Softirq, Sleep, Deep sleep – Transitions: INT (interrupt), CTX (context switch), T (after a while)
  • 50. Time Diagram of Legacy Ticks event1 event3 event2 event4 HW Timer interrupt CPU activities
  • 51. Context Switch overhead [same state diagram as CPU Operating States: Processes / Threads / Tasks, Idle thread, ISR, Softirq, Sleep, Deep sleep; transitions INT, CTX, T]
  • 52. Regular Power Consumption [same state diagram as CPU Operating States: Processes / Threads / Tasks, Idle thread, ISR, Softirq, Sleep, Deep sleep; transitions INT, CTX, T]
  • 53. Time Diagram of Legacy Ticks event1 event3 event2 event4 HW Timer interrupt CPU activities CPU woken up for timekeeping only
  • 55. Drawback of Tickless scheduling • Tickless is not free – “It increases the number of instructions executed on the path to and from the idle loop.” – “On many architectures, dyntick-idle mode also increases the number of expensive clock- reprogramming operations” – Source: P. E. McKenney (May 14, 2013), “NO_HZ: Reducing Scheduling-Clock Ticks” • Systems with aggressive real-time response constraints often run periodic tick
  • 56. Tickless scheduling in F9 • Enter tickless right before going to CPU idle state – Set interval of next timer interrupt as delta of next event – Or KTIMER_MAXTICKS • Adjust system time after waking up • Tickless Compensation – SysTick frequency distortion when entering/exiting standby mode [timeline diagram: timer interrupt, SysTick, CPU activity (active, idle, tickless, handle other interrupt)]
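The rule for programming the next timer interrupt can be written down as a short Python sketch. Only the name `KTIMER_MAXTICKS` comes from the slide; its value here, and the helper functions `next_timeout` and `adjust_system_time`, are hypothetical and serve only to illustrate the arithmetic.

```python
KTIMER_MAXTICKS = 1000  # hypothetical value; the real limit is platform-specific


def next_timeout(now, event_deadlines):
    """Ticks to sleep before the next timer interrupt.

    The interval is the delta to the nearest pending event, capped at
    KTIMER_MAXTICKS when no event is near (or none is pending at all).
    """
    pending = [deadline - now for deadline in event_deadlines if deadline > now]
    if not pending:
        return KTIMER_MAXTICKS
    return min(min(pending), KTIMER_MAXTICKS)


def adjust_system_time(now, ticks_slept):
    """After waking up, advance system time by the ticks that were skipped."""
    return now + ticks_slept
```

For example, at tick 100 with events due at ticks 150 and 400, the timer is programmed for 50 ticks, so the CPU takes one wake-up instead of 50 periodic tick interrupts.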
  • 57. Tickless compensation (compensation from general purpose timer) Timer interrupt idle idle active handle other interrupt tickless CPU Activity active systick • System activity during idle with and without periodic ticks • System activity during idle with and without deferrable timer usage in ondemand
  • 58. Kprobes: dynamic instrumentation • Inspired by Linux Kernel, allowing developers to gather additional information about kernel operation without recompiling or rebooting the kernel. • It enables locations in the kernel to be instrumented with code, and the instrumentation code runs when the ARM core encounters that probe point. • Once the instrumentation code completes execution, the kernel continues normal execution.
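The probe-point flow on this slide (instrumentation runs when execution reaches the probe, then normal execution continues) can be modeled in Python. This is a loose analogy under stated assumptions: the `Kprobes` class and `probed` decorator are hypothetical, and a Python wrapper stands in for what a real kprobes implementation does by intercepting the instruction at the probe address.

```python
class Kprobes:
    """Toy registry of probe points and their run-time handlers."""

    def __init__(self):
        self.probes = {}  # probe point name -> list of handlers

    def register(self, point, handler):
        """Attach a handler at run time; no rebuild of the probed code needed."""
        self.probes.setdefault(point, []).append(handler)

    def unregister(self, point, handler):
        """Detach a previously registered handler."""
        self.probes[point].remove(handler)

    def hit(self, point, *args):
        """Run every handler attached to this probe point."""
        for handler in self.probes.get(point, []):
            handler(*args)


def probed(kprobes, point):
    """Mark a function as a probe point (stand-in for a patched instruction)."""
    def wrap(fn):
        def inner(*args):
            kprobes.hit(point, *args)  # instrumentation code runs first
            return fn(*args)           # then normal execution continues
        return inner
    return wrap
```

A handler such as `seen.append` can be registered on a probe point, observe the arguments of each call, and later be unregistered, all while the probed function keeps returning its normal result.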
  • 59. Application Development • Partial POSIX support • configurable debug console • memory dump • thread profiling – name, uptime, stack allocated/current/used • memory profiling – kernel table, pool free/allocated size, fragmentation • On-going PGO (Profile-guided optimization) and AutoFDO support
  • 60. Commercial Adaptation • F9 microkernel is used by Genesi USA, Inc. as smart solutions for the internet of things http://genesi.company/solutions/embedded • Genesi's Radix K1 is a low-cost embedded device built around a 100 MHz Freescale ARM Cortex-M4 based MCU with 512 KB of flash, 128 KB of built-in RAM, and a 4G GSM module. • The device ←→ server communication link uses WAMP, a WebSocket subprotocol, and the data exchanged is encrypted using CycloneSSL. • Basic memory protection is available through the built-in MPU.
  • 61. Conclusion • Minimizing TCB is vital for building secure IoT systems, and L4 based designs bring temporal isolation, asymmetric protection, safe bounded resource sharing achieved through scheduling contexts, criticality, and temporal exceptions. • ARM Cortex-M processors enable highly deterministic real-time applications on high-performance, low-cost platforms, and the F9 microkernel utilizes Cortex-M advantages to build an efficient and secure TCB. • The value of open source is the community made up of people who have dedicated their time and their life to see its success. So, commercial adaptation is feasible.
  • 62. Reference • From L3 to seL4: What Have We Learnt in 20 Years of L4 Microkernels? Kevin Elphinstone and Gernot Heiser, NICTA/UNSW • Microkernel Construction, http://os.inf.tu-dresden.de/Studium/MkK/ • Microkernel-based Operating Systems, http://www.inf.tu-dresden.de/index.php?node_id=1314 • Getting maximum mileage out of tickless, Intel Open Source Technology Center • F9 Microkernel ktimer, Viller Hsiao