Leveraging the Power of Cassandra: Operational Reporting & Interactive Analysis

Ernesto Ongaro
BI Consultant Jaspersoft
Agenda

• Requirements for Cassandra reporting and analysis
• Current state of reporting and analysis
• Architectural approaches
• Demo
• Q&A

©2013 Jaspersoft Corporation. Proprietary and Confidential

Requirements for Cassandra reporting and analysis

• People want access to the data in Cassandra
• Most consumers of data are not technical
• Traditional reporting and analytics tools don’t work with Cassandra
• Building reports from scratch is not easy or fun
• Providing ad-hoc analytics is very complicated

Current State of Reporting & Analytics

• Connectors are for RDBMS only
• Expensive
• Desktop
• Standalone

Advantages to using a reporting and analysis framework

Feature                   Build it yourself   Use a framework
Visual report designer    ✖                   ✔
Security                  ✖                   ✔
Scheduling                ✖                   ✔
Web access                ✖                   ✔
API                       ✖                   ✔
Self-service queries      ✖                   ✔
Charting libraries        ✖                   ✔
Metadata layer            ✖                   ✔
Input controls            ✖                   ✔
Flexibility               ✔                   ✔

Architectural Approaches

Four methods to visualize your Cassandra data:
① ETL approach (Extract, Transform, Load)
② Direct access reports and dashboards
③ Direct access data exploration
④ Approaches 1-3 combined with Hadoop Hive

1 – ETL Approach

• Most traditional approach
• Data is extracted via batch method
• Option with most connectors
• ETL process is most robust option

[Diagram labels: BI Platform, SQL, ETL, RDBMS]
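
The batch flow on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how JaspersoftETL works: the sample rows, the `readings` table, and the function names are hypothetical, an in-memory list stands in for a batch read from Cassandra, and SQLite stands in for the RDBMS that the BI platform would query with SQL.

```python
import sqlite3

# Hypothetical rows standing in for a batch read from a Cassandra table.
SOURCE_ROWS = [
    {"sensor_id": "a1", "ts": "2013-10-17T09:00:00", "reading": "21.5"},
    {"sensor_id": "a1", "ts": "2013-10-17T10:00:00", "reading": "22.5"},
    {"sensor_id": "b2", "ts": "2013-10-17T09:30:00", "reading": "n/a"},
    {"sensor_id": "b2", "ts": "2013-10-17T10:30:00", "reading": "18.0"},
]


def extract():
    """Extract: yield raw rows (a real job would read from Cassandra)."""
    yield from SOURCE_ROWS


def transform(rows):
    """Transform: cast types and drop rows that fail a basic quality check."""
    for row in rows:
        try:
            reading = float(row["reading"])
        except ValueError:
            continue  # skip readings that cannot be parsed as numbers
        day = row["ts"][:10]  # keep only the date part of the timestamp
        yield (row["sensor_id"], day, reading)


def load(conn, records):
    """Load: write the cleaned records into a relational table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS readings "
        "(sensor_id TEXT, day TEXT, reading REAL)"
    )
    conn.executemany("INSERT INTO readings VALUES (?, ?, ?)", records)
    conn.commit()


# SQLite is an assumption here; any RDBMS reachable via SQL would play this role.
conn = sqlite3.connect(":memory:")
load(conn, transform(extract()))

# The reporting side then uses plain SQL against the relational copy.
report = conn.execute(
    "SELECT sensor_id, COUNT(*), AVG(reading) FROM readings "
    "GROUP BY sensor_id ORDER BY sensor_id"
).fetchall()
```

Once the data sits in a relational table, any SQL-based reporting tool can query it, at the cost of the data being only as fresh as the last batch run.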

JaspersoftETL

• Powered by [logo not captured]
• Over 450 connectors
• Data quality, transformations, aggregations

2 – Direct Access Reports and Dashboards

• Reports are developed using Jaspersoft Studio (Eclipse-based designer)
• Lowest latency
• Good supplement to ETL when “near time” is required
• Connector based on https://github.com/Netflix/astyanax

[Diagram labels: BI Platform, CQL3 Native Connector]
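
To make the direct-access idea concrete, here is a rough sketch of how a report's input-control values might be turned into a parameterized CQL3 query. It does not reflect the Jaspersoft connector's actual interface: `build_report_query`, the `metrics.readings` table, and its columns are all hypothetical, and the `?` placeholders are the bind markers CQL uses in prepared statements.

```python
def build_report_query(table, columns, filters):
    """Return (cql, params) for a report query using '?' bind markers.

    `filters` maps column names to the values chosen in the report's
    input controls; every filter becomes an equality condition.
    """
    if not filters:
        # Cassandra queries generally need to restrict the partition key,
        # so this sketch refuses to build an unfiltered query.
        raise ValueError("at least one filter is required")
    where = " AND ".join(f"{name} = ?" for name in filters)
    cql = f"SELECT {', '.join(columns)} FROM {table} WHERE {where}"
    return cql, list(filters.values())


# Hypothetical table and input-control values, for illustration only.
cql, params = build_report_query(
    "metrics.readings",
    ["sensor_id", "ts", "reading"],
    {"sensor_id": "a1", "day": "2013-10-17"},
)
```

Because the query runs against Cassandra at report time, the report reflects the current data, which is where the low latency on this slide comes from.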
Example Dashboard

3 – Direct Access Exploration

• Allows users to explore data (vs. pre-defined reports + dashboards)
• Loads results of a query into memory where further filtering, grouping and aggregation occurs

[Diagram labels: In-Memory OLAP Engine, BI Platform, CQL3 Native Connector]
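
The "load once, then slice in memory" pattern described above can be sketched as follows. This is a simplified stand-in for an in-memory OLAP engine, not Jaspersoft's implementation: the `ROWS` data and the `explore` helper are hypothetical, and the rows are assumed to have already been fetched by a single query.

```python
from collections import defaultdict

# Hypothetical query result, already loaded into memory.
ROWS = [
    {"region": "EU", "product": "A", "amount": 10.0},
    {"region": "EU", "product": "B", "amount": 5.0},
    {"region": "US", "product": "A", "amount": 7.5},
    {"region": "US", "product": "A", "amount": 2.5},
]


def explore(rows, group_by, measure, where=None):
    """Filter, group, and sum in-memory rows without re-querying the source."""
    totals = defaultdict(float)
    for row in rows:
        if where is not None and not where(row):
            continue  # row excluded by the user's filter
        key = tuple(row[col] for col in group_by)
        totals[key] += row[measure]
    return dict(totals)


# First view: total amount per region.
by_region = explore(ROWS, ["region"], "amount")

# Drill-down: product A only, split by region and product.
product_a = explore(
    ROWS, ["region", "product"], "amount",
    where=lambda row: row["product"] == "A",
)
```

Each new grouping or filter reuses the rows already in memory, so the user can change the view repeatedly without issuing another query to Cassandra.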

Example OLAP View

4 – Hadoop Hive

• Good for massive data
• Batch process
• Native Hadoop Hive connector as well

[Diagram labels: BI Platform, HQL, SQL, ETL, RDBMS, libhive]
Demonstration

Demo flow:
• Example Dashboard + report
• Jaspersoft Studio
• Ad-hoc Exploration

Demo environment:
• Jaspersoft 5.5 – runs on Tomcat 7
• DataStax Enterprise 3.1 (Cassandra 1.2.10.1)

Questions?
www.jaspersoft.com
BigData@jaspersoft.com

Conclusion

• Four different ways to get insights from Cassandra
• Commercial open-source software
• Get started at http://jaspersoft.com
• Thank you!



C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting and Interactive Analysis
