From IoT to Big Data
A Step-by-Step Approach
Agenda
• IoT – Data Visibility
• IoT – Edge Streaming Analytics
• Big Data Analytics for IoT
• Hardware Platform for a Pilot
IoT Project Successes
• Connected healthcare
• Connected building
• Physical security
• Real-time analytics
Regions: US, MEA, UK
Use cases: first responder, virtual patient observation
$142 million in IoT opportunities
Scenario
The Big Data Landscape
The IoT World Forum Model
• Collaboration: business & people process
• Application: report, control
• Data Abstraction: aggregation and access
• Data Accumulation: storage
• Edge Computing: data analysis & transformation
• Connectivity: communication & processing
• Physical Devices & Controllers
But I just wanted visibility into my data...
IoT – Data Acquisition
[Diagram labels: PINS, DB, MCU, MCU, PLC, Private or Public Cloud, Controller]
IoT – Visualization Solutions
[Diagram label: Controller]
PaaS: Using a ready-made platform
[Diagram: Producer, Kafka, Consumer]
PaaS: Building your own platform
JSON / REST / HTTP
[Diagram: Client and Server exchanging JSON through a REST interface]
Request: HTTP POST /service/weather with body [{"city": "Paris", "units": "C"}]
Response: [{"low": "16", "high": "23"}]
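The request/response exchange on this slide can be sketched with Python's standard library. This is a minimal illustration, not the presenter's implementation: the `FORECASTS` table, the `answer_weather_queries` helper, and the `serve` function are hypothetical names, and the only data is the Paris example from the slide.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical in-memory forecast table; the single entry mirrors the slide.
FORECASTS = {("Paris", "C"): {"low": "16", "high": "23"}}


def answer_weather_queries(request_body):
    """Turn a JSON list of {city, units} queries into a JSON list of answers."""
    queries = json.loads(request_body)
    # Unknown city/unit pairs get an empty object rather than an error.
    answers = [FORECASTS.get((q["city"], q["units"]), {}) for q in queries]
    return json.dumps(answers)


class WeatherHandler(BaseHTTPRequestHandler):
    """REST interface: accepts HTTP POST on /service/weather."""

    def do_POST(self):
        if self.path != "/service/weather":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        body = answer_weather_queries(self.rfile.read(length)).encode("utf-8")
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port=8080):
    """Run the service until interrupted (the port is an arbitrary choice)."""
    HTTPServer(("127.0.0.1", port), WeatherHandler).serve_forever()
```

Keeping the JSON logic in a plain function, separate from the HTTP handler, makes it possible to check the payload format without opening a socket.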
IoT WF Model x DD Model
DD Offers Full Stack Solution
IoT World Forum Model
• Collaboration: business & people process
• Application: report, control
• Data Abstraction: aggregation and access
• Data Accumulation: storage
• Edge Computing: data analysis & transformation
• Connectivity: communication & processing
• Physical Devices & Controllers
Dimension Data Solution Focus
• Applications
• Secure Connectivity
I would like to act immediately in "critical" situations.
IoT – Edge Streaming Analytics
[Diagram labels: Controller, MCU, MCU, PLC]
[Diagram columns: Quark, Edge Devices, Communication, Centralized Analytics]
[Diagram labels: Kafka, MQTT, Watson IoT Platform, Custom Hub, Custom Apps]
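To make the idea of acting at the edge concrete, here is a minimal sketch of a streaming check that could run on an edge device: it keeps a short rolling window of sensor readings and raises an alert locally, so only alerts need to travel to the centralized analytics tier. The class name, window size, and critical limit are illustrative assumptions, not part of the deck.

```python
from collections import deque


class EdgeMonitor:
    """Keeps a short window of readings and flags critical values locally."""

    def __init__(self, window_size, critical_limit):
        self.window = deque(maxlen=window_size)  # old readings fall off automatically
        self.critical_limit = critical_limit

    def ingest(self, value):
        """Return an alert dict if the rolling mean exceeds the limit, else None."""
        self.window.append(value)
        rolling_mean = sum(self.window) / len(self.window)
        if rolling_mean > self.critical_limit:
            return {"alert": "critical", "rolling_mean": rolling_mean}
        return None


def run_at_edge(readings, window_size=3, critical_limit=80.0):
    """Process a stream of readings and collect only the alerts to forward."""
    monitor = EdgeMonitor(window_size, critical_limit)
    alerts = []
    for value in readings:
        alert = monitor.ingest(value)
        if alert is not None:
            alerts.append(alert)
    return alerts
```

In a real deployment the alerts would be published over a transport such as MQTT or Kafka; that part is left out here to keep the sketch self-contained.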
IoT WF Model x DD Model
DD Offers Full Stack Solution
IoT World Forum Model
• Collaboration: business & people process
• Application: report, control
• Data Abstraction: aggregation and access
• Data Accumulation: storage
• Edge Computing: data analysis & transformation
• Connectivity: communication & processing
• Physical Devices & Controllers
Dimension Data Solution Focus
• Applications
• Distributed Computing
• Secure Connectivity
I want to extract value from my IoT data!
What Do We Need?
• Fast Data – the ability to process sensor streams
• Big Data – the capacity to store sensor data and analytics results (files, tables, etc.)
• An analytics tool
• A visualization / query tool
• A scalable, highly available, and secure environment
What Would the Big Data Solution Architecture for IoT Look Like?
[Diagram: architecture attributed to Twitter's Nathan Marz (2013)]
• Data: streaming data
• Batch Layer
• Speed Layer
• Serving Layer: taking action, visualization
[Diagram labels: distributed file system, analytical model development, ML model]
Supervised learning: classification (discrete output), regression (continuous output)
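The batch, speed, and serving layers named on this slide follow the pattern commonly known as the Lambda Architecture. The sketch below illustrates the division of labor with plain Python data structures; the function names and the per-sensor mean query are assumptions chosen for illustration, and a real system would use a distributed file system and a stream processor instead of in-memory dictionaries.

```python
from collections import defaultdict


def batch_view(master_dataset):
    """Batch layer: recompute per-sensor totals from all stored events."""
    totals = defaultdict(lambda: {"count": 0, "sum": 0.0})
    for sensor_id, value in master_dataset:
        totals[sensor_id]["count"] += 1
        totals[sensor_id]["sum"] += value
    return dict(totals)


class SpeedLayer:
    """Speed layer: incrementally tracks events not yet covered by the batch view."""

    def __init__(self):
        self.recent = defaultdict(lambda: {"count": 0, "sum": 0.0})

    def on_event(self, sensor_id, value):
        self.recent[sensor_id]["count"] += 1
        self.recent[sensor_id]["sum"] += value


def serve_mean(sensor_id, batch, speed):
    """Serving layer: merge the batch and real-time views to answer a query."""
    empty = {"count": 0, "sum": 0.0}
    b = batch.get(sensor_id, empty)
    s = speed.recent.get(sensor_id, empty)  # .get avoids creating new entries
    count = b["count"] + s["count"]
    return (b["sum"] + s["sum"]) / count if count else None
```

The batch view is slow to rebuild but complete; the speed layer covers the gap since the last rebuild, and the serving layer hides that split from whoever runs the query.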
Which Big Data Analytics Tool Should You Invest In?
"All the major commercial advanced analytics platforms use Spark."
Gartner 2016 Magic Quadrant for Advanced Analytics
"For data science using scalable, open-source applications, Spark is the unanimous choice."
Why Is Spark the Unanimous Choice?
• Python API!
• Built for Big Data, but "run" it wherever you want...
• "One Shop, All the Goods!"
Components on top of Spark Core
• Spark SQL: structured data
• Spark Streaming: real-time
• MLlib: machine learning
• GraphX: graph processing
Enterprise-Grade Big Data with Spark
• "Web-scale" storage
• High availability
• Mirroring & snapshots
• Integrated security
• Etc.
[Diagram: Batch, Interactive, and Real-Time applications on YARN (Data Operating System), over HDFS (Hadoop Distributed File System), on nodes 1 to N]
"Enterprise-Grade" Hadoop Distribution
• HDFS: robust, scalable file system
• YARN: resource manager and scheduler
IoT WF Model x DD Model
DD Offers Full Stack Solution
IoT World Forum Model
• Collaboration: business & people process
• Application: report, control
• Data Abstraction: aggregation and access
• Data Accumulation: storage
• Edge Computing: data analysis & transformation
• Connectivity: communication & processing
• Physical Devices & Controllers
Dimension Data Solution Focus
• Applications
• Distributed Computing
• Secure Connectivity
• Analytics
And what about the network and compute infrastructure?
Anatomy of MapReduce Traffic
Many-to-Many Traffic Pattern
[Diagram: Map 1, Map 2, Map 3, ... Map N feed Reducer 1, Reducer 2, Reducer 3, ... Reducer N through the shuffle; output replication goes to HDFS]
[Diagram labels: NameNode, JobTracker, ZooKeeper]
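The many-to-many pattern comes from the shuffle: any mapper may hold keys that belong to any reducer. The following single-process sketch of a word count records which mapper sends data to which reducer, so the traffic pattern becomes visible. It is an illustration only; the function names are hypothetical and the CRC32-based partitioner is an arbitrary stable choice, not Hadoop's own partitioner.

```python
import zlib
from collections import defaultdict


def partition(key, num_reducers):
    """Pick a reducer for a key using a hash that is stable across runs."""
    return zlib.crc32(key.encode("utf-8")) % num_reducers


def map_phase(split):
    """Mapper: emit (word, 1) for every word in its input split."""
    return [(word, 1) for word in split.split()]


def shuffle(map_outputs, num_reducers):
    """Route each pair to its reducer and record the mapper-to-reducer flows."""
    reducer_inputs = [defaultdict(list) for _ in range(num_reducers)]
    flows = set()
    for mapper_id, pairs in enumerate(map_outputs):
        for key, value in pairs:
            reducer_id = partition(key, num_reducers)
            reducer_inputs[reducer_id][key].append(value)
            flows.add((mapper_id, reducer_id))
    return reducer_inputs, flows


def reduce_phase(reducer_inputs):
    """Reducers: sum the values grouped under each key."""
    result = {}
    for grouped in reducer_inputs:
        for key, values in grouped.items():
            result[key] = sum(values)
    return result


def word_count(splits, num_reducers):
    """Run map, shuffle, and reduce; return the counts and the network flows."""
    map_outputs = [map_phase(split) for split in splits]
    reducer_inputs, flows = shuffle(map_outputs, num_reducers)
    return reduce_phase(reducer_inputs), flows
```

With M mappers and R reducers the set of flows can grow to M x R pairs, which is why the shuffle stresses the links between servers rather than the links out of the cluster.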
The Fabric Matters for the Big Data Cluster
• 10 GE fabric
• Horizontal (east-west) traffic
• Low latency
• Adequate "oversubscription"
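Oversubscription is usually expressed as the ratio between the bandwidth offered to servers and the uplink bandwidth available to carry their traffic out of the switch. A short worked calculation follows; the port counts and speeds in the usage note are hypothetical examples, not figures from the deck.

```python
from fractions import Fraction


def oversubscription_ratio(server_ports, server_port_gbps, uplink_ports, uplink_port_gbps):
    """Ratio of total server-facing bandwidth to total uplink bandwidth."""
    downstream = server_ports * server_port_gbps
    upstream = uplink_ports * uplink_port_gbps
    return Fraction(downstream, upstream)


def describe(ratio):
    """Format a ratio in the usual N:M notation."""
    return "%d:%d" % (ratio.numerator, ratio.denominator)
```

For instance, `describe(oversubscription_ratio(32, 10, 8, 10))` gives `"4:1"`: thirty-two 10 GE server ports sharing eight 10 GE uplinks. What counts as adequate depends on how much of the workload is shuffle-heavy east-west traffic.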
QoS Makes a Difference
[Chart: HBase (YCSB) READ average latency (us) over time during an HBase major compaction, with QoS and without QoS; about 45% read improvement with QoS]
[Chart: buffer cells used over time; TeraSort held on queues]
Starter Kits with Cisco Servers and Networking
[Chart: ratings for MPP, Hadoop, and NoSQL workloads across Compute, IO Bandwidth, and Capacity; values as extracted: 5, 3, 4, 5, 3, 4, 2, 5, 4]

Starter
• Designed for: performance and density for analytics engines, NoSQL databases, and entry-level Hadoop deployments
• Server: UCS C220 M4
• CPU: 2 x Intel Xeon E5-2620 v3 (15M Cache, 2.40 GHz)
• Memory: 256 GB
• Storage controller: Cisco 12-Gbps SAS Modular RAID Controller with 2-GB FBWC
• Storage: 8 x 1.2-TB 10K SAS SFF HDD
• Network controller: Cisco UCS VIC 1227, 2 x 10GE SFP+
• Network and cluster scaling: 2 x Cisco UCS 6248UP FIs; scales up to 32 servers with no additional switching infrastructure
• Cisco single SKU: UCS-SL-CPA3-S (8 servers)

High Performance
• Designed for: extreme performance and density for analytics engines
• Server: UCS C220 M4
• CPU: 2 x Intel Xeon E5-2680 v3 (30M Cache, 2.50 GHz)
• Memory: 256 GB
• Storage controller: Cisco 12-Gbps SAS Modular RAID Controller with 2-GB FBWC
• Storage: 2 x 1.2-TB 10K SAS SFF HDD, 6 x 400-GB SAS SSD
• Network controller: Cisco UCS VIC 1227, 2 x 10GE SFP+
• Network and cluster scaling: 2 x Cisco UCS 6248UP FIs; scales up to 32 servers with no additional switching infrastructure
• Cisco single SKU: UCS-SL-CPA3-H (8 servers)
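As a rough sizing aid, the raw disk capacity of each kit can be derived from the drive counts above. The sketch assumes the storage figures are per server and that each SKU contains 8 servers; it also applies a replication factor of 3, the HDFS default, purely as an illustration. It ignores operating system space, temporary data, and file system overhead, so real usable capacity will be lower.

```python
def raw_capacity_gb(servers, drives):
    """Total raw capacity; drives is a list of (count_per_server, size_gb)."""
    per_server = sum(count * size_gb for count, size_gb in drives)
    return servers * per_server


def replicated_capacity_gb(raw_gb, replication=3):
    """Capacity left for unique data when every block is stored `replication` times."""
    return raw_gb // replication


# Assumption: 8 servers per kit, drive counts are per server.
STARTER_RAW = raw_capacity_gb(8, [(8, 1200)])
HIGH_PERF_RAW = raw_capacity_gb(8, [(2, 1200), (6, 400)])
```

Under these assumptions the Starter kit holds 76,800 GB raw (about 25,600 GB of unique data at three replicas) and the High Performance kit holds 38,400 GB raw.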
Tested and Certified Starter Kits = Predictable Performance
Tested and Validated
From IoT PoC to Data Center Integration
• PoC: separate infrastructure
• Limited Production: performance, management, continuity, etc.
• Enterprise Platform: orchestration, IaaS, capacity planning
[Diagram labels: SAN/NAS Arrays, Blade Servers]
IoT WF Model x DD Model
DD Offers Full Stack Solution
IoT World Forum Model
• Collaboration: business & people process
• Application: report, control
• Data Abstraction: aggregation and access
• Data Accumulation: storage
• Edge Computing: data analysis & transformation
• Connectivity: communication & processing
• Physical Devices & Controllers
Dimension Data Solution Focus
• Applications
• Distributed Computing
• Secure Connectivity
• Analytics
• Hybrid Cloud Computing
The Internet of Things
helping our clients do incredible things
Luis Filipe Silva
Senior Solutions Architect, Brazil
Dimension Data Americas
Phone: +55 (11) 3878 66549
+55 (21) 99858 9137
Filipe.silva@dimensiondata.com
