SlideShare a Scribd company logo
Real-time Analytics with
Presto and Apache Pinot
Xiang Fu
Sept. 24, 2020
User Facing Applications Business Facing Metrics
Apache Pinot
Anomaly Detection
- Ingestion: Millions of events/sec
- Workload: Thousands of queries/sec
- Performance: Millisecond
Fact Table
Dimension Table Pre-Join Pre-Aggregation Pre-Cube
Presto Pinot
Latency
Flexibility
low
high
low
high
Latency vs Flexibility
SPEED
FLEXIBILITY
Presto + Pinot
Presto + Pinot
SPEED
FLEXIBILITY
Thank you
- Getting Started
https://tinyurl.com/prestoPinotTutorial
- Pinot Slack Channel
https://tinyurl.com/pinotSlackChannel
Contributors: Devesh Agrawal, Dharak
Kharod, Haibo Wang, James Sun, Venki
Korukanti, Xiang Fu, Zhenxiao Luo

More Related Content

PDF
Introduction to Stream Processing
PDF
Building an open data platform with apache iceberg
PPTX
Flexible and Real-Time Stream Processing with Apache Flink
PDF
Apache Iceberg - A Table Format for Hige Analytic Datasets
PDF
Pinot: Enabling Real-time Analytics Applications @ LinkedIn's Scale
PDF
Efficient Data Storage for Analytics with Apache Parquet 2.0
PDF
Parquet Hadoop Summit 2013
PDF
Pinot: Near Realtime Analytics @ Uber
Introduction to Stream Processing
Building an open data platform with apache iceberg
Flexible and Real-Time Stream Processing with Apache Flink
Apache Iceberg - A Table Format for Hige Analytic Datasets
Pinot: Enabling Real-time Analytics Applications @ LinkedIn's Scale
Efficient Data Storage for Analytics with Apache Parquet 2.0
Parquet Hadoop Summit 2013
Pinot: Near Realtime Analytics @ Uber

What's hot (20)

PDF
Cloud DW technology trends and considerations for enterprises to apply snowflake
PDF
A Thorough Comparison of Delta Lake, Iceberg and Hudi
PDF
Apache Druid 101
PDF
XStream: stream processing platform at facebook
PDF
Airflow introduction
PPTX
Autoscaling Flink with Reactive Mode
PDF
Scalability, Availability & Stability Patterns
PDF
[234]멀티테넌트 하둡 클러스터 운영 경험기
PDF
Parquet Strata/Hadoop World, New York 2013
PPTX
Introduction to Apache Flink
PPTX
Apache NiFi in the Hadoop Ecosystem
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
PPTX
The columnar roadmap: Apache Parquet and Apache Arrow
PDF
From Zero to Hero with Kafka Connect
PDF
Top 5 Mistakes When Writing Spark Applications
PDF
Iceberg + Alluxio for Fast Data Analytics
PDF
PDF
Pinot: Realtime OLAP for 530 Million Users - Sigmod 2018
PDF
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
PDF
Building robust CDC pipeline with Apache Hudi and Debezium
Cloud DW technology trends and considerations for enterprises to apply snowflake
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Apache Druid 101
XStream: stream processing platform at facebook
Airflow introduction
Autoscaling Flink with Reactive Mode
Scalability, Availability & Stability Patterns
[234]멀티테넌트 하둡 클러스터 운영 경험기
Parquet Strata/Hadoop World, New York 2013
Introduction to Apache Flink
Apache NiFi in the Hadoop Ecosystem
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
The columnar roadmap: Apache Parquet and Apache Arrow
From Zero to Hero with Kafka Connect
Top 5 Mistakes When Writing Spark Applications
Iceberg + Alluxio for Fast Data Analytics
Pinot: Realtime OLAP for 530 Million Users - Sigmod 2018
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
Building robust CDC pipeline with Apache Hudi and Debezium
Ad

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Approach and Philosophy of On baking technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Big Data Technologies - Introduction.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Cloud computing and distributed systems.
Review of recent advances in non-invasive hemoglobin estimation
20250228 LYD VKU AI Blended-Learning.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
sap open course for s4hana steps from ECC to s4
Approach and Philosophy of On baking technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
“AI and Expert System Decision Support & Business Intelligence Systems”
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Big Data Technologies - Introduction.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Encapsulation_ Review paper, used for researhc scholars
Understanding_Digital_Forensics_Presentation.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Spectral efficient network and resource selection model in 5G networks
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Ad

Real-time Analytics with Presto and Apache Pinot

Editor's Notes

  • #3: Realtime OLAP Database Columnar, Indexed Storage Low latency analytics Distributed – highly available, reliable, scalable Lambda architecture Offline data pushes Real-time stream ingestion Open Source
  • #6: Pinot - Fast single table OLAP Presto - Powerful connector ecosystem Complete system - covers entire landscape Get the best of Presto and Pinot Proven stack at Uber and many more