© 2019 FotoNation
An Ultra-low-power Multi-core
Engine for Inference on
Encrypted DNNs
Petronel Bigioi
XPERI
May 2019
Company Overview
FotoNation – XPERI's Trusted Brand
Portfolio of Trusted Brands
- Licensing: semiconductor intellectual property
- Imaging and computer vision silicon IP cores and solutions
- Audio technology solutions
- Automotive audio, data, and digital radio broadcast solutions
- Semiconductor and interconnect packaging technology & solutions
3.4+ B Devices · 70+ M Cars · 1+ B Devices · 2+ B Devices · 100+ B Devices
Imaging and Inference at the Edge
Always-on inference: AI operates while the device is "off" (e.g. ultra-low-power face detection as an enabler)
Battery-powered head-mounted displays for AR or MR (e.g. iris recognition for device access, gaze tracking, etc.)
Smart appliances: from TVs to receivers to toasters to… (e.g. ultra-low-power people detection as an enabler)
Driver and in-cabin monitoring for autonomous driving (e.g. always-on occupancy assistant)
Edge Challenges
Edge Inference Challenges
- Ultra-low-power operation for battery-powered devices
- Consumer privacy protected through local-only processing
- Quality and performance comparable to cloud solutions
- Scalable and flexible engines
- True parallel processing, depending on the application
Ingredients for Successful AI-based Edge Solutions
Many years of investment …
Computer Vision R&D Infrastructure: computer-generated and real images – ground-truth data sets for effective NN training, testing and validation
Core Imaging & ML R&D: research into various image processing and core ML methods and architectures for differentiation
Imaging & Inference Engines: ultra-low-power, high-performance and scalable engines & development tools
Vision Testing Infrastructure: product testing with more than 40 million images, marked and annotated for various features
A Sample of Our Acquisition Systems Investment
Reverse Engineering – the Danger
- In the edge processing model, neural networks sit in the device's permanent storage, widely exposed to various types of reverse engineering
- Network representation patterns can be identified and localized in the storage contents
- Once the network representation is known, the architecture and weight values can be obtained
- Networks can then be remapped and run on alternative architectures
- Many years of investment… gone!
Edge Inference Solution
IPU – Image Processing Unit
Preprocessing Cores
- Stream conditioning & statistics for analytics
- Frame to frame registration
- Image enhancement or analytics
Dedicated Cores
- Face detection engine
- People detection engine
- Image enhancement engine
- Image resampling engine
Programmable CNN Cores
- PCNN 1.2 (small, 72 OPS/cycle) and/or PCNN 2.1 (large, 1024 OPS/cycle)
- PCNN cluster engine for scalable AI (supporting up to four small and/or large PCNNs)
- Multiple clusters supported
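For a sense of scale, the OPS/cycle figures translate into peak throughput once a clock is assumed. The deck quotes no clock frequency, so the 400 MHz below is purely a hypothetical illustration:

```python
# Back-of-envelope peak throughput for the two PCNN variants.
# OPS/cycle figures are from the slide; the clock frequency is an
# assumption for illustration only (the deck does not quote one).
ASSUMED_CLOCK_HZ = 400e6  # hypothetical 400 MHz

def gops(ops_per_cycle, clock_hz=ASSUMED_CLOCK_HZ):
    """Peak throughput in billions of operations per second."""
    return ops_per_cycle * clock_hz / 1e9

pcnn_small = gops(72)     # PCNN 1.2
pcnn_large = gops(1024)   # PCNN 2.1

print(f"PCNN 1.2: {pcnn_small:.1f} GOPS, PCNN 2.1: {pcnn_large:.1f} GOPS")
# A cluster of four large cores would peak at four times that figure.
print(f"4x PCNN 2.1 cluster: {4 * pcnn_large:.1f} GOPS")
```

At the assumed clock this works out to roughly 29 GOPS for the small core and 410 GOPS for the large one; real figures scale linearly with the actual clock.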
[SoC block diagram: camera sensor (MIPI) feeding an ISP and the IPU; CPU, GPU, display/LTM, COMMS, DDR controller and Flash I/F on the system bus; the IPU comprises preprocessing cores, dedicated cores and programmable cores, with FotoNation IP alongside third-party blocks]
IPU Highlights
- Focus on ultra-low-power imaging AI
- Maximizes quality and performance of imaging solutions
- Flexible deployment enabling market-specific, game-changing use cases
- Enables concurrent processing
- Secure deployment
- Use-case driven: each IPU deployment is unique to the addressed use case and device
Concurrent Execution Comparison (NPU1 vs IPU2)
[Charts: per-network frame rates (fps) and core utilization for the NPU vs the IPU while concurrently running Face Detection (NN01), Object Detection (NN02), Face Recognition (NN03) and Object Classification (NN04)]

POWER (mW), per scenario:
Scenario             NPU (per NN = total)          IPU (per NN = total)        IPU advantage
NN01                 512 = 512                     28 = 28                     18x
NN(01+02)            256 + 365 = 621               28 + 40 = 68                9x
NN(01+02+03)         192 + 274 + 266 = 732         28 + 40 + 155 = 223         3x
NN(01+02+03+04)      163 + 234 + 266 + 114 = 777   28 + 40 + 155 + 61 = 284    2.7x
1) NPU – Neural Processing Unit (competitor)
2) IPU – Image Processing Unit (FotoNation)
At the same utilization, the IPU delivers more than three times the performance and is three times more power efficient than the NPU…
… in other words, at the same performance the IPU is more than 9 times more power efficient …
… and at the same power consumption the NPU is more than 9 times slower than the IPU …
… the IPU is simply better!
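The headline ratios can be recomputed directly from the per-scenario power totals on the slide (the 3x figure for three networks is a coarse rounding of ~3.3x):

```python
# Recompute the IPU power advantage from the slide's totals (mW),
# for the cumulative concurrency scenarios NN01 .. NN01+02+03+04.
npu_mw = {"NN01": 512, "NN01+02": 621, "NN01+02+03": 732, "NN01+02+03+04": 777}
ipu_mw = {"NN01": 28,  "NN01+02": 68,  "NN01+02+03": 223, "NN01+02+03+04": 284}

for scenario in npu_mw:
    ratio = npu_mw[scenario] / ipu_mw[scenario]
    print(f"{scenario}: NPU {npu_mw[scenario]} mW vs IPU {ipu_mw[scenario]} mW "
          f"-> {ratio:.1f}x advantage")
```

This reproduces the quoted 18x, 9x, ~3x and 2.7x figures from the raw numbers.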
PCNN – Programmable CNN Engine
Image pre-processing
engine as ‘layer 0’
72 OPS/cycle or
1024 OPS/cycle
Support for compression, quantization and on-the-fly decryption
16-bit floating-point
internal operation
supporting weights as
small as 2 bits
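The PCNN's exact weight format is not disclosed; a generic symmetric uniform quantizer (a hypothetical sketch, not the PCNN's actual scheme) illustrates what 2-bit weights imply for storage versus precision:

```python
def quantize_weights(weights, bits=2):
    """Symmetric uniform quantization of weights to `bits`-bit signed codes.

    Illustration only: 2-bit codes cut weight storage 8x versus FP16,
    at the cost of representing each weight with one of very few levels.
    """
    qmax = 2 ** (bits - 1) - 1                       # 1 for 2 bits, 7 for 4 bits
    scale = max(abs(w) for w in weights) / qmax or 1.0
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale

def dequantize(codes, scale):
    # Reconstruct approximate weights for the FP16 internal math.
    return [c * scale for c in codes]

w = [0.9, -0.4, 0.05, -1.2, 0.7]                     # toy weight vector
codes, scale = quantize_weights(w, bits=2)
w_hat = dequantize(codes, scale)
```

In practice low-bit schemes are usually applied per-channel with the network retrained or fine-tuned to absorb the quantization error.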
PCNN Highlights
Scalable and flexible, configurable
SRAM size to address specific use
cases
Low power consumption
(22 nm FDSOI tech)
• 18 mW for PCNN 1.2
• 120 mW for PCNN 2.1
Built-in real-time, on-the-fly NN
decryption engine
Separate memory channels for network
fetch and intermediate layers/input
Separate cache for code/net and
intermediate layers
[PCNN core block diagram: the PCNN engine with its register bank; MAP RD, MAP WR and CODE RD channels to the system bus (AXI) toward DDR and Flash; IRQ line and APB control bus to the CPU]
NN Protection Engineering Solution
SOFTWARE, privately run by the NN designer:
- Generate the NN secret random key Nsec and publish Npub = base * Nsec
- Obtain Cpub from the chip manufacturer and compute Kstream = Cpub * Nsec
- Stream-cipher the neural network; ship the NN encrypted with Kstream

SOFTWARE, run by the chip manufacturer:
- Generate the chip secret random key Csec and publish Cpub = base * Csec
- Burn Csec into on-chip fuses

HARDWARE (FotoNation's IP and the customer's NN):
- On chip, recompute Kstream = Npub * Csec from the fused Csec and the supplied Npub
- Stream-decipher the encrypted NN on the fly and feed it to the inference engine
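The key agreement has the classic Diffie–Hellman shape: Kstream = Npub * Csec = Cpub * Nsec. As a toy sketch (substituting modular exponentiation over a prime field for the curve25519 scalar multiplication the design actually uses), both sides derive the same Kstream without ever exchanging a secret:

```python
import secrets

# Toy Diffie-Hellman sketch of the Kstream derivation. The real design
# performs "base * secret" as scalar multiplication on curve25519; here
# modular exponentiation plays the same role purely for illustration.
P = 2**127 - 1      # a Mersenne prime; fine for a toy demonstration
BASE = 5

def keypair():
    sec = secrets.randbelow(P - 2) + 1   # secret random key
    pub = pow(BASE, sec, P)              # public key = "base * secret"
    return sec, pub

n_sec, n_pub = keypair()   # NN designer:       Nsec, Npub = base * Nsec
c_sec, c_pub = keypair()   # chip manufacturer: Csec, Cpub = base * Csec
                           # (Csec is burned into on-chip fuses)

k_designer = pow(c_pub, n_sec, P)   # offline: Kstream = Cpub * Nsec
k_chip     = pow(n_pub, c_sec, P)   # on-chip: Kstream = Npub * Csec
assert k_designer == k_chip         # same Kstream on both sides
```

Only public values (Npub, Cpub) ever cross between the designer and the chip; the stream key exists in the clear only inside the hardware.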
Decryption Features
Encryption based on two secret 255-bit keys:
• Manufacturer key and
• NN owner key
Secret keys processed offline by proprietary software, based on the public-domain curve25519
Secret manufacturer key stored in on-chip fuses
NN designer's public key uploaded on-chip after power-up
• Plain message processed offline with proprietary software
• Data encrypted/decrypted with the Trivium stream cipher
• Data decryption is implemented in hardware
• 128-bit plain data is generated on the fly from the 128-bit encrypted data
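Trivium itself is a public eSTREAM cipher with an 80-bit key and IV, well suited to tiny hardware. A bit-level software model (an illustration only, not the chip's hardware implementation) shows why decryption is the same cheap XOR as encryption:

```python
def trivium_keystream(key_bits, iv_bits, nbits):
    """Bit-level software model of the Trivium stream cipher
    (80-bit key and IV per the eSTREAM specification)."""
    assert len(key_bits) == 80 and len(iv_bits) == 80
    s = [0] * 288                      # state s1..s288, 0-indexed here
    s[0:80] = key_bits                 # key loaded into s1..s80
    s[93:173] = iv_bits                # IV loaded into s94..s173
    s[285:288] = [1, 1, 1]             # s286..s288 set to one
    out = []
    for i in range(4 * 288 + nbits):   # 1152 warm-up rounds, then output
        t1 = s[65] ^ s[92]
        t2 = s[161] ^ s[176]
        t3 = s[242] ^ s[287]
        if i >= 4 * 288:
            out.append(t1 ^ t2 ^ t3)   # keystream bit
        t1 ^= (s[90] & s[91]) ^ s[170]
        t2 ^= (s[174] & s[175]) ^ s[263]
        t3 ^= (s[285] & s[286]) ^ s[68]
        # Shift the three registers, feeding each with the next one's tap.
        s = [t3] + s[0:92] + [t1] + s[93:176] + [t2] + s[177:287]
    return out

def xor_bits(data, keystream):
    # Encryption and decryption are the same XOR with the keystream.
    return [d ^ k for d, k in zip(data, keystream)]

key = [1, 0] * 40
iv = [0, 1] * 40
plain = [1, 0, 1, 1] * 32              # 128 plaintext bits
ks = trivium_keystream(key, iv, len(plain))
cipher = xor_bits(plain, ks)
assert xor_bits(cipher, ks) == plain   # round trip recovers the plaintext
```

In hardware this maps to three shift registers and a handful of gates, which is why on-the-fly decryption adds so little power.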
PCNN Clusters
Clusters can mix PCNN 1.2 and PCNN 2.1 cores.
A cluster accommodates up to 4 PCNNs executing the same network or individual networks.
For more processing power, several clusters can be connected.
PCNN configuration is flexible (e.g. 1 PCNN executing network 1 and 3 PCNNs executing network 2);
true concurrent network execution.
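The flexible split can be pictured as a mapping from cores to networks. The sketch below is a toy software model (core and network names are illustrative, and the real dispatch is done by the cluster's controller, not Python threads):

```python
from concurrent.futures import ThreadPoolExecutor

# Toy model of flexible PCNN-to-network assignment in a 4-core cluster:
# one core runs network 1 while three cores share network 2, all
# concurrently. Names are hypothetical illustrations.
assignment = {
    "pcnn0": "network1",
    "pcnn1": "network2",
    "pcnn2": "network2",
    "pcnn3": "network2",
}

def run_inference(core, network):
    # Stand-in for launching a network on one PCNN core.
    return f"{core} ran {network}"

with ThreadPoolExecutor(max_workers=4) as pool:   # concurrency model
    futures = {core: pool.submit(run_inference, core, net)
               for core, net in assignment.items()}
results = {core: f.result() for core, f in futures.items()}
```

Changing the assignment dict is all it takes to remap cores between networks, which mirrors the "1 + 3" flexibility described above.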
PCNN - Cluster Architecture
Scalable and flexible to address
specific market needs:
- Mobile
- Automotive
- Home
Ultra low power
Secure
[PCNN cluster block diagram: host CPU and system memory (DDR, Flash) on the system bus (AXI); the PCNN-CLUSTER core contains a RISC controller, mailbox, configuration bus (AHB), four PCNNs, an SRAM controller with shared SRAM, and an arbiter linking to/from other PCNN clusters]
IPU – PCNN Development Tools
NN design and training in standard frameworks (Caffe, TensorFlow, Torch, Theano, MatConvNet, etc.) produces the NN structure and weights.
A Converter feeds these into the PCNN Configuration Tool, which emits a performance report and the PCNN binary.
The input image is normalised and converted to FP16, yielding a normalised 8-bit or FP16 input map.
Both the PCNN IP and a bit-exact SW model consume the binary and input map and produce the PCNN results.
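The FP16 conversion step can be sketched with Python's half-precision struct code. The [0, 255] → [0, 1] mapping below is an assumed normalisation for illustration, not the tool's documented scheme:

```python
import struct

def to_fp16(x):
    """Round a Python float to IEEE 754 half precision, the format the
    input-map conversion step targets."""
    return struct.unpack('<e', struct.pack('<e', x))[0]

def normalise_u8(pixel):
    # One plausible 8-bit normalisation: map [0, 255] to [0.0, 1.0].
    # The tool's exact normalisation is not specified in the deck.
    return to_fp16(pixel / 255.0)

row = [0, 64, 128, 255]                 # toy pixel row
fp16_row = [normalise_u8(p) for p in row]
```

Round-tripping through the 'e' format makes the half-precision rounding explicit, which is exactly what a bit-exact SW model must reproduce to match the hardware.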
IPU – Dedicated Detectors Tool
Dedicated detector tool – training mode
Inputs:
- Database and ground-truth data
- Neural network settings
- Pre-trained networks
Output: binaries loadable by both the simulation tool and the HW accelerator on FPGA/ASIC

Dedicated detector tool – simulator mode
Inputs:
- Input images
Results:
- Detections
- Input images with bounding boxes overlaid
Conclusions
IPU is an ultra-low-power multi-core engine optimized for imaging AI at the edge
IPU prevents neural network reverse engineering and intellectual property
theft by supporting on-the-fly decryption and inference on encrypted DNNs
IPU supports true multi-tasking networks via the “cluster” concept
IPU is scalable to ANY market via two dimensions:
• Cluster of multiple programmable CNN engines
• Cluster of clusters
Resources
IPU: https://www.fotonation.com/products/optimize/
Encryption/decryption: https://en.wikipedia.org/wiki/Curve25519
Thank you