SlideShare a Scribd company logo
Version 1.0
Databricks and Cassandra
In Cassandra Lunch #72, we will discuss how we can use
Databricks with Cassandra.
Arpan Patel
Engineer @ Anant
Databricks
● One unified platform for data and AI built on lakehouse
architecture
● Offers free Community Edition (micro-cluster+ cluster
manager + notebook environment)
● Reliable data engineering
● SQL Analytics on all your data
● Collaborative data science
● Production machine learning
● Fully managed cloud service -> security, reliability,
scaling, and performance
● Rooted in open source -> original creators of Apache
Spark, Delta Lake, and MLflow
● Additional technologies -> TensorFlow, Redash, and R
Demo
● Databricks Community Edition
● DataStax Astra
● Databricks Notebook to do read / writes on Astra
Strategy: Scalable Fast Data
Architecture: Cassandra, Spark, Kafka
Engineering: Node, Python, JVM,CLR
Operations: Cloud, Container
Rescue: Downtime!! I need help.
www.anant.us | solutions@anant.us | (855) 262-6826
3 Washington Circle, NW | Suite 301 | Washington, DC 20037

More Related Content

PPTX
Data Engineer's Lunch #46: Node.js and API calls
PPTX
Data Engineer’s Lunch #45: Apache Livy
PPTX
Building a REST API with Cassandra on Datastax Astra Using Python and Node
PDF
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
PDF
Kafka for begginer
PPTX
Cassandra Lunch #87: Recreating Cassandra.api using Astra and Stargate
PPTX
Data Engineer's Lunch #54: dbt and Spark
PDF
[WSO2Con USA 2018] Deploying Applications in K8S and Docker
Data Engineer's Lunch #46: Node.js and API calls
Data Engineer’s Lunch #45: Apache Livy
Building a REST API with Cassandra on Datastax Astra Using Python and Node
A Microservices approach with Cassandra and Quarkus | DevNation Tech Talk
Kafka for begginer
Cassandra Lunch #87: Recreating Cassandra.api using Astra and Stargate
Data Engineer's Lunch #54: dbt and Spark
[WSO2Con USA 2018] Deploying Applications in K8S and Docker

What's hot (20)

PDF
Serverless stream processing of Debezium data change events with Knative | De...
PDF
Creating a Kafka Topic. Super easy? | Andrew Stevenson and Marios Andreopoulo...
PDF
Persist your data in an ephemeral k8 ecosystem
PPTX
Zabbix at scale with Elasticsearch
PDF
5 - Hands-on Kubernetes Workshop:
PDF
Moving 150 TB of data resiliently on Kafka With Quorum Controller on Kubernet...
PPTX
Cloud-Based Event Stream Processing Architectures and Patterns with Apache Ka...
PDF
7 - Monitoring Kubernetes with Elastic
PPTX
Backup multi-cloud solution based on named pipes
PDF
4 - Customer story: Telenet
PDF
Storage os kubernetes clusters need persistent data
PPTX
Introduction to Container Storage Interface (CSI)
PDF
Scylla: 1 Million CQL operations per second per server
PDF
Azure Cosmos DB Kafka Connectors | Abinav Rameesh, Microsoft
PPTX
MongoDB vs Scylla: Production Experience from Both Dev & Ops Standpoint at Nu...
PDF
Lookout on Scaling Security to 100 Million Devices
PDF
Scylla Summit 2022: Stream Processing with ScyllaDB
PDF
Cloudian HyperStore Features and Benefits
PPTX
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
PPTX
Scality S3 Server: Node js Meetup Presentation
Serverless stream processing of Debezium data change events with Knative | De...
Creating a Kafka Topic. Super easy? | Andrew Stevenson and Marios Andreopoulo...
Persist your data in an ephemeral k8 ecosystem
Zabbix at scale with Elasticsearch
5 - Hands-on Kubernetes Workshop:
Moving 150 TB of data resiliently on Kafka With Quorum Controller on Kubernet...
Cloud-Based Event Stream Processing Architectures and Patterns with Apache Ka...
7 - Monitoring Kubernetes with Elastic
Backup multi-cloud solution based on named pipes
4 - Customer story: Telenet
Storage os kubernetes clusters need persistent data
Introduction to Container Storage Interface (CSI)
Scylla: 1 Million CQL operations per second per server
Azure Cosmos DB Kafka Connectors | Abinav Rameesh, Microsoft
MongoDB vs Scylla: Production Experience from Both Dev & Ops Standpoint at Nu...
Lookout on Scaling Security to 100 Million Devices
Scylla Summit 2022: Stream Processing with ScyllaDB
Cloudian HyperStore Features and Benefits
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
Scality S3 Server: Node js Meetup Presentation
Ad

Similar to Apache Cassandra Lunch #72: Databricks and Cassandra (20)

PPTX
Apache Cassandra Lunch #93: K8ssandra on Digital Ocean
PDF
5 Factors When Selecting a High Performance, Low Latency Database
PPTX
Apache Cassandra Lunch #94: StreamSets and Cassandra
PPTX
Jump Start with Apache Spark 2.0 on Databricks
PDF
Spark Saturday: Spark SQL & DataFrame Workshop with Apache Spark 2.3
PPTX
Apache cassandra lunch #82 instaclustr managed cassandra and next.js
PPTX
Apache Cassandra Lunch #82: Instaclustr Managed Cassandra and Next.js
PDF
Lambda Architecture with Spark Streaming, Kafka, Cassandra, Akka, Scala
PPTX
Data Engineering A Deep Dive into Databricks
PPTX
Announcing Spark Driver for Cassandra
PDF
Designing Low-Latency Systems with Rust: An Architectural Deep Dive
PDF
Cassandra & Spark for IoT
PDF
Apache Cassandra overview
PDF
Kafka spark cassandra webinar feb 16 2016
PDF
Kafka spark cassandra webinar feb 16 2016
PDF
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
PDF
Sa introduction to big data pipelining with cassandra & spark west mins...
PPTX
Azure Databricks - An Introduction 2019 Roadshow.pptx
PPTX
Apache Cassandra introduction
PDF
5 Comparing Microsoft Big Data Technologies for Analytics
Apache Cassandra Lunch #93: K8ssandra on Digital Ocean
5 Factors When Selecting a High Performance, Low Latency Database
Apache Cassandra Lunch #94: StreamSets and Cassandra
Jump Start with Apache Spark 2.0 on Databricks
Spark Saturday: Spark SQL & DataFrame Workshop with Apache Spark 2.3
Apache cassandra lunch #82 instaclustr managed cassandra and next.js
Apache Cassandra Lunch #82: Instaclustr Managed Cassandra and Next.js
Lambda Architecture with Spark Streaming, Kafka, Cassandra, Akka, Scala
Data Engineering A Deep Dive into Databricks
Announcing Spark Driver for Cassandra
Designing Low-Latency Systems with Rust: An Architectural Deep Dive
Cassandra & Spark for IoT
Apache Cassandra overview
Kafka spark cassandra webinar feb 16 2016
Kafka spark cassandra webinar feb 16 2016
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Sa introduction to big data pipelining with cassandra & spark west mins...
Azure Databricks - An Introduction 2019 Roadshow.pptx
Apache Cassandra introduction
5 Comparing Microsoft Big Data Technologies for Analytics
Ad

More from Anant Corporation (20)

PPTX
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
PPTX
QLoRA Fine-Tuning on Cassandra Link Data Set (1/2) Cassandra Lunch 137
PDF
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
PDF
Data Engineer's Lunch 96: Intro to Real Time Analytics Using Apache Pinot
PDF
NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval...
PDF
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
PPTX
YugabyteDB Developer Tools
PPTX
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
PPTX
Machine Learning Orchestration with Airflow
PDF
Cassandra Lunch 130: Recap of Cassandra Forward Talks
PDF
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
PDF
Data Engineer's Lunch 89: Machine Learning Orchestration with AirflowMachine ...
PDF
Cassandra Lunch 129: What’s New: Apache Cassandra 4.1+ Features & Future
PDF
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
PDF
Data Engineer's Lunch #85: Designing a Modern Data Stack
PPTX
PDF
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
PDF
Apache Cassandra Lunch 120: Apache Cassandra Monitoring Made Easy with AxonOps
PPTX
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
PPTX
Data Engineer's Lunch #82: Automating Apache Cassandra Operations with Apache...
LLM Fine Tuning with QLoRA Cassandra Lunch 4, presented by Anant
QLoRA Fine-Tuning on Cassandra Link Data Set (1/2) Cassandra Lunch 137
Kono.IntelCraft.Weekly.AI.LLM.Landscape.2024.02.28.pdf
Data Engineer's Lunch 96: Intro to Real Time Analytics Using Apache Pinot
NoCode, Data & AI LLM Inside Bootcamp: Episode 6 - Design Patterns: Retrieval...
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
YugabyteDB Developer Tools
Episode 2: The LLM / GPT / AI Prompt / Data Engineer Roadmap
Machine Learning Orchestration with Airflow
Cassandra Lunch 130: Recap of Cassandra Forward Talks
Data Engineer's Lunch 90: Migrating SQL Data with Arcion
Data Engineer's Lunch 89: Machine Learning Orchestration with AirflowMachine ...
Cassandra Lunch 129: What’s New: Apache Cassandra 4.1+ Features & Future
Data Engineer's Lunch #86: Building Real-Time Applications at Scale: A Case S...
Data Engineer's Lunch #85: Designing a Modern Data Stack
Data Engineer's Lunch #83: Strategies for Migration to Apache Iceberg
Apache Cassandra Lunch 120: Apache Cassandra Monitoring Made Easy with AxonOps
Apache Cassandra Lunch 119: Desktop GUI Tools for Apache Cassandra
Data Engineer's Lunch #82: Automating Apache Cassandra Operations with Apache...

Recently uploaded (20)

PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Supervised vs unsupervised machine learning algorithms
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Mega Projects Data Mega Projects Data
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Introduction to Knowledge Engineering Part 1
Acceptance and paychological effects of mandatory extra coach I classes.pptx
.pdf is not working space design for the following data for the following dat...
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Clinical guidelines as a resource for EBP(1).pdf
Qualitative Qantitative and Mixed Methods.pptx
Introduction-to-Cloud-ComputingFinal.pptx
annual-report-2024-2025 original latest.
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Supervised vs unsupervised machine learning algorithms

Apache Cassandra Lunch #72: Databricks and Cassandra

  • 1. Version 1.0 Databricks and Cassandra In Cassandra Lunch #72, we will discuss how we can use Databricks with Cassandra. Arpan Patel Engineer @ Anant
  • 2. Databricks ● One unified platform for data and AI built on lakehouse architecture ● Offers free Community Edition (micro-cluster+ cluster manager + notebook environment) ● Reliable data engineering ● SQL Analytics on all your data ● Collaborative data science ● Production machine learning ● Fully managed cloud service -> security, reliability, scaling, and performance ● Rooted in open source -> original creators of Apache Spark, Delta Lake, and MLflow ● Additional technologies -> TensorFlow, Redash, and R
  • 3. Demo ● Databricks Community Edition ● DataStax Astra ● Databricks Notebook to do read / writes on Astra
  • 4. Strategy: Scalable Fast Data Architecture: Cassandra, Spark, Kafka Engineering: Node, Python, JVM,CLR Operations: Cloud, Container Rescue: Downtime!! I need help. www.anant.us | solutions@anant.us | (855) 262-6826 3 Washington Circle, NW | Suite 301 | Washington, DC 20037