SlideShare a Scribd company logo
Wizard Driven AI
Anomaly Detection with
Databricks in Azure
Naomi Kaduwela
Head of Kavi Labs
Rajesh Inbasekaran
CTO
Agenda
Naomi
▪ Fraud Prevention Opportunity
▪ Why AI Audits
▪ Rise of Citizen Data Scientists
▪ Solution Approach
▪ Designing for Citizen Data Scientists
▪ Anomaly Lifecycle
▪ Deployment Options
▪ Success Stories
Rajesh
▪ How the Solution Works
▪ Cloud Native, Serverless Architecture
▪ Databricks Integration
Billions of Dollars of Opportunity
$350 B
Fraudulent Healthcare
spending*
* According to the National Health Care
Anti-Fraud Association
$25 B
Spent annually by US Banks on
anti-money laundering
compliance*
* According to Forbes
$40 B
Total annual cost of Insurance
Fraud (excluding health
insurance)*
* According to the FBI
Ideal
for
AI!
Why AI Audits
Data Volume &
Complex Patterns
Need to
Adapt to New Changes
High Frequency
Transactions
Transaction
Flagging
Actor-to-Actor
Flagging
AI flags the root cause of Anomalies in a Scalable way!
Rise of the Citizen Data Scientist
Thanks to technology abstraction
Data Scientists can now focus on solving
the business problem
Accelerating time to value
& maximizing their human potential!
Solution Approach
1. Different Anomaly
Signatures (possible fraud)
exist within same data
3. Despite different
methods, a holistic view of
anomaly is required for
business
2. Different methods are
efficient in detecting
different Anomaly
Signatures
4. Management of entire
anomaly lifecycle
management is critical for
effectiveness and efficiency
Designing for Citizen Data Scientists
Business Benefits
04
● Holistic and meaningful view
● Aggregate model into quantifiable business
opportunity
Evaluation & Visualizations
03
● Collect and report model metrics
● In built visualizations aid understanding
Portfolio of Algorithms
02
● Diverse portfolio of algorithms available
● Ability to compare parameters across methods &
combine multiple AI methods together
Wizard Driven, No code ML
01
● No programming required
● Enable Citizen Data Scientists
Anomaly Lifecycle Management
05
● Track from detection to actual recovery
● Human in the Loop for continuous improvement
Wizard Driven, No-Code ML
Screenshot
Portfolio of Algorithms
• Unsupervised • Supervised
Distribution Clustering
Association
Sequencing
Historical
Occurrence
Random
Forest
Neural
Network
Evaluation & Visualization
Business Benefits
7,761,096 $768,408,624 929,412 $21,263,307
26,452 $ 4,723,995
36,573 $ 5,263,785
119,079 $ 20,536,362
295,041 $ 262,482
21,099 $ 3,760,542
123,243 $ 4,062,246
308,025 $ 3,384,471
2019 Business Benefits Summary
Anomaly Opportunity Breakdown By Method
Billing Error
Duplicate Repair
Labor Overcharging
Material Overcharging
Over Repair
Wrong Shop
Wrong Repair
Opportunity Records Savings
From Statistical Anomaly to Confirmed Fraud
Raw Data
Predicted
Anomaly
Possible Fraud
Confirmed
Fraud
Actual
Recovery
Human in the Loop Anomaly Lifecycle
Model
Building
Update
Feedback
Anomaly
Detection
Recovery
Process
Anomaly
Validation
Citizen Data Scientist
Business SME
Deployment Options
Estimate
Option 1: Prevention
Real Time Scoring
at Time of Estimate
to Prevent Fraud
Money is Exchanged
Payment
Option 2: Reclaim
Batch Processing
Post Invoicing
to Reclaim Fraud
Invoice
Enterprise Tech Stack Integration
Digital Solutions
Layer
KPIs and Metrics, Descriptive Dashboards. AI Audits
Data Services
Layer
Integration, Transformation, Governance, Security,
Orchestration, Data Catalog
Source Systems &
Infrastructure
Ingestion of Internal Systems, Industry Systems, Customer
Systems. Storage & Compute
Success Stories
▪ $6.8M of potential FW&A in
prescription drug claims
▪ $7M of opportunity in Equipment
Repair Bill Invoicing Audits
• Transportation
• Pharma & Healthcare
ROI is High! Payment time is Short!
How the Solution Works
Cloud Native Serverless Architecture
Databricks Integration
Batch
• Jobs API
• 2.0/jobs/run-now
• Python Task
• Python Params
Interactive
• Notebook Task
• Notebook Params
Wizard Driven AI Anomaly Detection
Thank You!
Please share your feedback!
Feel free to reach out
https://www.linkedin.com/in/naomikaduwela/
https://www.linkedin.com/in/rajeshin/

More Related Content

PPTX
Anatomy of a data driven architecture - Tamir Dresher
PDF
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
PDF
Databricks: A Tool That Empowers You To Do More With Data
PPTX
Big Data Analytics
PPTX
Feature store: Solving anti-patterns in ML-systems
PDF
Data Analytics PowerPoint Presentation Slides
PDF
Designing An Enterprise Data Fabric
PPTX
Scaling Data Quality @ Netflix
Anatomy of a data driven architecture - Tamir Dresher
IoT Architectures for a Digital Twin with Apache Kafka, IoT Platforms and Mac...
Databricks: A Tool That Empowers You To Do More With Data
Big Data Analytics
Feature store: Solving anti-patterns in ML-systems
Data Analytics PowerPoint Presentation Slides
Designing An Enterprise Data Fabric
Scaling Data Quality @ Netflix

What's hot (20)

PDF
Data science - An Introduction
PPTX
data-mesh-101.pptx
PPTX
Delta Lake with Azure Databricks
PPTX
Data Science presentation for elementary school students
PDF
Wide Column Store NoSQL vs SQL Data Modeling
PDF
Big Data Analytics for Real Time Systems
PDF
AI: A risk and way to manage risk
PPTX
Data analytics
PPTX
Frame - Feature Management for Productive Machine Learning
PPTX
ADF Demo_ppt.pptx
PDF
Zipline—Airbnb’s Declarative Feature Engineering Framework
PDF
Role of Data Cleaning in Data Warehouse
PDF
Snowflake Data Science and AI/ML at Scale
PDF
The Feature Store in Hopsworks
PPTX
Introduction to data science
PDF
Power BI Architecture
PPTX
Data analytics
PPTX
BI-Analytics-Overview.pptx
PPTX
Azure data platform overview
PDF
Owning Your Own (Data) Lake House
Data science - An Introduction
data-mesh-101.pptx
Delta Lake with Azure Databricks
Data Science presentation for elementary school students
Wide Column Store NoSQL vs SQL Data Modeling
Big Data Analytics for Real Time Systems
AI: A risk and way to manage risk
Data analytics
Frame - Feature Management for Productive Machine Learning
ADF Demo_ppt.pptx
Zipline—Airbnb’s Declarative Feature Engineering Framework
Role of Data Cleaning in Data Warehouse
Snowflake Data Science and AI/ML at Scale
The Feature Store in Hopsworks
Introduction to data science
Power BI Architecture
Data analytics
BI-Analytics-Overview.pptx
Azure data platform overview
Owning Your Own (Data) Lake House
Ad

Similar to Wizard Driven AI Anomaly Detection with Databricks in Azure (20)

PDF
Artificial_Intelligence_Techniques_for_Fraud_Detec.pdf
PDF
How AI is preventing account fraud at web scale
DOCX
Fraud Detection Engine Using AI for a Fintech App (1).docx
PDF
20181129 keynote augmented intelligence and artificial intelligence
PPTX
Hyf project ideas_02
PDF
Operationalize deep learning models for fraud detection with Azure Machine Le...
PPTX
Expert Network - Financial Predictions with Machine Learning
PDF
A Comprehensive Introduction to Anomaly Detection in Machine Learning | USAII®
PDF
Mtc strategy-briefing-houston-pd m-05212018-3
PDF
Exploring Generative AI Use Cases for Accounts Payable Automation1.pdf
PDF
Is Machine learning useful for Fraud Prevention?
PPTX
2019 gam-mc 2-15-19
PDF
Trivadis TechEvent 2017 Demystifying AI, ML and Data Science by Marc Schöni
PDF
20180509 energy - v001
PDF
Artificial Intellegence (AI) FINDS A WAY: How Machine Learning is the Future ...
PPTX
Neotys PAC - Andreas Grabner
PPTX
Presentation made to ICICI for useless stuff
PDF
Utilizing Machine Learning In Banking To Prevent Fraud.pdf
PPTX
Credit Card Fraud Detection Using AI.pptx
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
Artificial_Intelligence_Techniques_for_Fraud_Detec.pdf
How AI is preventing account fraud at web scale
Fraud Detection Engine Using AI for a Fintech App (1).docx
20181129 keynote augmented intelligence and artificial intelligence
Hyf project ideas_02
Operationalize deep learning models for fraud detection with Azure Machine Le...
Expert Network - Financial Predictions with Machine Learning
A Comprehensive Introduction to Anomaly Detection in Machine Learning | USAII®
Mtc strategy-briefing-houston-pd m-05212018-3
Exploring Generative AI Use Cases for Accounts Payable Automation1.pdf
Is Machine learning useful for Fraud Prevention?
2019 gam-mc 2-15-19
Trivadis TechEvent 2017 Demystifying AI, ML and Data Science by Marc Schöni
20180509 energy - v001
Artificial Intellegence (AI) FINDS A WAY: How Machine Learning is the Future ...
Neotys PAC - Andreas Grabner
Presentation made to ICICI for useless stuff
Utilizing Machine Learning In Banking To Prevent Fraud.pdf
Credit Card Fraud Detection Using AI.pptx
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
Ad

More from Databricks (20)

PPTX
DW Migration Webinar-March 2022.pptx
PPTX
Data Lakehouse Symposium | Day 1 | Part 1
PPT
Data Lakehouse Symposium | Day 1 | Part 2
PPTX
Data Lakehouse Symposium | Day 2
PPTX
Data Lakehouse Symposium | Day 4
PDF
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
PDF
Democratizing Data Quality Through a Centralized Platform
PDF
Learn to Use Databricks for Data Science
PDF
Why APM Is Not the Same As ML Monitoring
PDF
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
PDF
Stage Level Scheduling Improving Big Data and AI Integration
PDF
Simplify Data Conversion from Spark to TensorFlow and PyTorch
PDF
Scaling your Data Pipelines with Apache Spark on Kubernetes
PDF
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
PDF
Sawtooth Windows for Feature Aggregations
PDF
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
PDF
Re-imagine Data Monitoring with whylogs and Spark
PDF
Raven: End-to-end Optimization of ML Prediction Queries
PDF
Processing Large Datasets for ADAS Applications using Apache Spark
PDF
Massive Data Processing in Adobe Using Delta Lake
DW Migration Webinar-March 2022.pptx
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 4
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
Democratizing Data Quality Through a Centralized Platform
Learn to Use Databricks for Data Science
Why APM Is Not the Same As ML Monitoring
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
Stage Level Scheduling Improving Big Data and AI Integration
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Sawtooth Windows for Feature Aggregations
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Re-imagine Data Monitoring with whylogs and Spark
Raven: End-to-end Optimization of ML Prediction Queries
Processing Large Datasets for ADAS Applications using Apache Spark
Massive Data Processing in Adobe Using Delta Lake

Recently uploaded (20)

PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Business_Capability_Map_Collection__pptx
PDF
Introduction to Data Science and Data Analysis
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Global Data and Analytics Market Outlook Report
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
annual-report-2024-2025 original latest.
PDF
Business Analytics and business intelligence.pdf
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPT
Predictive modeling basics in data cleaning process
PDF
Microsoft 365 products and services descrption
PPTX
Introduction to Inferential Statistics.pptx
PDF
Introduction to the R Programming Language
ISS -ESG Data flows What is ESG and HowHow
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
Qualitative Qantitative and Mixed Methods.pptx
Business_Capability_Map_Collection__pptx
Introduction to Data Science and Data Analysis
[EN] Industrial Machine Downtime Prediction
Global Data and Analytics Market Outlook Report
SAP 2 completion done . PRESENTATION.pptx
Copy of 16 Timeline & Flowchart Templates – HubSpot.pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
annual-report-2024-2025 original latest.
Business Analytics and business intelligence.pdf
Optimise Shopper Experiences with a Strong Data Estate.pdf
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Predictive modeling basics in data cleaning process
Microsoft 365 products and services descrption
Introduction to Inferential Statistics.pptx
Introduction to the R Programming Language

Wizard Driven AI Anomaly Detection with Databricks in Azure

  • 1. Wizard Driven AI Anomaly Detection with Databricks in Azure Naomi Kaduwela Head of Kavi Labs Rajesh Inbasekaran CTO
  • 2. Agenda Naomi ▪ Fraud Prevention Opportunity ▪ Why AI Audits ▪ Rise of Citizen Data Scientists ▪ Solution Approach ▪ Designing for Citizen Data Scientists ▪ Anomaly Lifecycle ▪ Deployment Options ▪ Success Stories Rajesh ▪ How the Solution Works ▪ Cloud Native, Serverless Architecture ▪ Databricks Integration
  • 3. Billions of Dollars of Opportunity $350 B Fraudulent Healthcare spending* * According to the National Health Care Anti-Fraud Association $25 B Spent annually by US Banks on anti-money laundering compliance* * According to Forbes $40 B Total annual cost of Insurance Fraud (excluding health insurance)* * According to the FBI
  • 4. Ideal for AI! Why AI Audits Data Volume & Complex Patterns Need to Adapt to New Changes High Frequency Transactions Transaction Flagging Actor-to-Actor Flagging AI flags the root cause of Anomalies in a Scalable way!
  • 5. Rise of the Citizen Data Scientist Thanks to technology abstraction Data Scientists can now focus on solving the business problem Accelerating time to value & maximizing their human potential!
  • 6. Solution Approach 1. Different Anomaly Signatures (possible fraud) exist within same data 3. Despite different methods, a holistic view of anomaly is required for business 2. Different methods are efficient in detecting different Anomaly Signatures 4. Management of entire anomaly lifecycle management is critical for effectiveness and efficiency
  • 7. Designing for Citizen Data Scientists Business Benefits 04 ● Holistic and meaningful view ● Aggregate model into quantifiable business opportunity Evaluation & Visualizations 03 ● Collect and report model metrics ● In built visualizations aid understanding Portfolio of Algorithms 02 ● Diverse portfolio of algorithms available ● Ability to compare parameters across methods & combine multiple AI methods together Wizard Driven, No code ML 01 ● No programming required ● Enable Citizen Data Scientists Anomaly Lifecycle Management 05 ● Track from detection to actual recovery ● Human in the Loop for continuous improvement
  • 8. Wizard Driven, No-Code ML Screenshot
  • 9. Portfolio of Algorithms • Unsupervised • Supervised Distribution Clustering Association Sequencing Historical Occurrence Random Forest Neural Network
  • 11. Business Benefits 7,761,096 $768,408,624 929,412 $21,263,307 26,452 $ 4,723,995 36,573 $ 5,263,785 119,079 $ 20,536,362 295,041 $ 262,482 21,099 $ 3,760,542 123,243 $ 4,062,246 308,025 $ 3,384,471 2019 Business Benefits Summary Anomaly Opportunity Breakdown By Method Billing Error Duplicate Repair Labor Overcharging Material Overcharging Over Repair Wrong Shop Wrong Repair Opportunity Records Savings
  • 12. From Statistical Anomaly to Confirmed Fraud Raw Data Predicted Anomaly Possible Fraud Confirmed Fraud Actual Recovery
  • 13. Human in the Loop Anomaly Lifecycle Model Building Update Feedback Anomaly Detection Recovery Process Anomaly Validation Citizen Data Scientist Business SME
  • 14. Deployment Options Estimate Option 1: Prevention Real Time Scoring at Time of Estimate to Prevent Fraud Money is Exchanged Payment Option 2: Reclaim Batch Processing Post Invoicing to Reclaim Fraud Invoice
  • 15. Enterprise Tech Stack Integration Digital Solutions Layer KPIs and Metrics, Descriptive Dashboards. AI Audits Data Services Layer Integration, Transformation, Governance, Security, Orchestration, Data Catalog Source Systems & Infrastructure Ingestion of Internal Systems, Industry Systems, Customer Systems. Storage & Compute
  • 16. Success Stories ▪ $6.8M of potential FW&A in prescription drug claims ▪ $7M of opportunity in Equipment Repair Bill Invoicing Audits • Transportation • Pharma & Healthcare ROI is High! Payment time is Short!
  • 18. Cloud Native Serverless Architecture
  • 19. Databricks Integration Batch • Jobs API • 2.0/jobs/run-now • Python Task • Python Params Interactive • Notebook Task • Notebook Params
  • 20. Wizard Driven AI Anomaly Detection Thank You! Please share your feedback! Feel free to reach out https://www.linkedin.com/in/naomikaduwela/ https://www.linkedin.com/in/rajeshin/