SlideShare a Scribd company logo
Automade
Test Automation
Data Vault and Data
Warehouse Automation
9th of December
Stefaan De Vos
Test Automation
Challenges
• Lead times of BI / Analytical projects are ever
decreasing
• Tools
• Automation (Wherescape...)
• Virtualization (Denodo, CIS...)
• Appliances (Netezza, DATAllegro...)
• Methodologies
• Data Vault
• Agile DWH
BI went Agile, but Testing didn’t
Challenges
• Data
• Larger data volumes
• IoT
• Unstructured data / poor data quality
• Social networks
• No relevant test data sets available
• The number of possible test cases are near
infinite
Data Volume & Quality
Challenges
Review
requirements
Review
requirements
Define testing
Strategy
Define testing
Strategy
Prepare Test DataPrepare Test DataEntry / Exit criteria
Design Test Cases &
Scripts
Design Test Cases &
Scripts
Configure the test
environment
Configure the test
environment
Prepare & Design
Agree on Entry / Exit
test criteria
Agree on Entry / Exit
test criteria
Integration
testing
Integration
testing
Performance testingPerformance testing
Acceptance testingAcceptance testing
- Basic testing
- DI jobs accessible
- Reports accessible
- Cleansed
- Complete
- Correct
- Integrated
- Valid reports
- Relevant data
- Available data
- Consistent data
- Accessible
- DI & BI integration
- Full test cycles
- Check NFRs
- Scalable
- Check SLAs
- Peak user
- Peak loads
- Functional
- User
- Production
- SME
- End user
Construct
Defect metrics
review
Defect metrics
review
Performance
statistics
Performance
statistics
Lessons learntLessons learnt
Process & Quality Improvement
Accept
Data Completeness
testing
Data Completeness
testing
BI & Analytical
Testing
BI & Analytical
Testing
Smoke test
Unit test
Smoke test
Unit test
BI Testing Lifecycle
Challenges
** Regulatory compliance might require to use the v-model, e.g. validated environments.
Functional
analysis
Requirements
functional
Non-functional
Technical
Design
Construction
Unit
Smoke Test
Data
Completeness
BI &
Analytical
Integration
testing
Performance
User
Production
AcceptanceValidation
Verification
BI Testing V-Model
Challenge
• Lots of data
• Lots of testing to be done
• Little time to do it
Summary
Test Automation
Complete DWH Testing
• Complete testing is a must
• It requires:
• Testing methodology
• Right project culture/mindset and organization
• Tools
Our View
Complete DWH Testing
Database
integrity
checking.
Risk based
testing
Effective defect
management
and
collaboration
End to End
Performance
testing
Adherence to
compliance and
regulatory
standards
and…AUTOMATE
Critical Success Factors
Test Automation
Leverage social development principles to deepen
functionalities
Testers
- functional testing
- regression testing
- result analysis
Developers / DBAs
- unit testing
- result analysis
Data Analysts
- review, analyze
data
- verify mapping
failures
Operations teams
- monitoring
- result analysis
Collaboration/WorkflowTestManagement
Rational Quality Manager JIRA Team Foundation ServerQuality Center
Test Automation
What to Automate?
Complex
Functional
SQL
Validation
Reconciliation
Test Automation
Basic functionality
• Auto detection of anomalies (or at least prior to being detected by a user)
• Targeted regression testing for planned changes
• Data error identification errors by comparing (huge) data result sets
• Data error detection via a rules engine
• In between data layer reconciliation and auditing
• Test data generation
Additional functionality
• No programming or coding
• Heterogeneous connectivity
• Collaboration and workflow capabilities
• Visually attractive development and monitoring environment
• Intuitive reports & dashboards
Functionality
Test Automation
• Shorten regression cycles
• Save report developers time
• Test the same data set in less time
• Test more
• Faster deployment of defect resolution cycle
• Faster deployment of enhancements
• Less cumbersome upgrades/migrations
• Enabler for
• Implementing continuous testing
• operationalization of testing
Benefits
Test Automation
Vendors - tools
• ICEDQ
• RTTS -
QuerySurge
Unconnected ETL
Source Data Layer Target Data Layer
ETL - ELT
1 3
2
Load test data
Execute Job externally
Extract result
4 Compare & report
• Validation of the business rules
implemented in ETL processes
are assessed by comparing the
results against a ground truth.
• The ETL processes are executed
separated from the test
automation tool
• Less adequate for multi-step ETL
processes.
Approach
Test Automation
Vendors - tools
• Zuzena
Connected ETL
• The business rules present
in the ETL tool are analyzed
and the test results are
assessed against the
anticipated results.
• The test automation tool
executes the ETL processes.
Approach
Source Data Layer Target Data Layer
2 4Load test data Extract result
5 Compare & report
1
Analyze logic
3
Run job
Test Automation
Vendors - tools
• Report Valid8tor (BO)
• 360Bind (BO)
• Integrity Manager
(MSTR)
• Report Validator - BSP
Software (Cognos)
Report Integrity validator
• Parallel testing of 2 live
systems
• Comparing against a
(historical) ground truth
• Comparing against
known good baselines
Approach
Reports
Reporting data Layer
Load test data Scrape data
4 Compare & report
1 3
Run report
2
Test automation
QuerySurge
• Integrates with HP QC / IBM RQM / MSFT
TFS
• Provides collaboration features
pulls data from data sources
pulls data from target data store
compares data quickly
generates reports, audit trails
reports
SQL
Design Tests
Scheduling
Reporting
Run
Dashboard
Wizards
Data
Health
Dashboard
• a SQL execution and data comparison tool
working against heterogeneous
datasources.
• No programming needed
Automade introduction
• Automade
• Provides an agile answer to the ever-increasing
information appetite
• By automating the mind-numbing aspects of
constructing, maintaining and testing of data
warehouses.
• Automade is a spinoff of MindThegap and
has established partnerships with
Test automation
Thank you for listening,
Any questions?
Feel free to send questions & feedback to stefaan.devos@automade.be

More Related Content

DOC
Etl And Data Test Guidelines For Large Applications
PDF
Data Warehouse Testing: It’s All about the Planning
PDF
Big Data: Big SQL and HBase
PDF
Architecting Modern Data Platforms
DOC
Data warehouse concepts
PDF
How to Improve Data Analysis Through Visualization in Tableau
PPT
An introduction to data warehousing
PPTX
Microsoft Data Platform - What's included
Etl And Data Test Guidelines For Large Applications
Data Warehouse Testing: It’s All about the Planning
Big Data: Big SQL and HBase
Architecting Modern Data Platforms
Data warehouse concepts
How to Improve Data Analysis Through Visualization in Tableau
An introduction to data warehousing
Microsoft Data Platform - What's included

What's hot (20)

PPTX
ETL Process
DOC
Testing data warehouse applications by Kirti Bhushan
PPTX
Data type[s] on MS SQL Server
PPT
Date warehousing concepts
PDF
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
PPTX
PPTX
Tableau slideshare
PDF
Tableau Tutorial Complete by Rohit Dubey
PPTX
Introducing Azure SQL Data Warehouse
PDF
Key Considerations While Rolling Out Denodo Platform
PPTX
Introducción a Oracle Audit Vault
PPTX
NOVA SQL User Group - Azure Synapse Analytics Overview - May 2020
PPTX
Dimensional model | | Fact Tables | | Types
PPTX
03. Data Exploration.pptx
PPT
Lecture 09 dblc centralized vs decentralized design
DOC
Data Warehouse (ETL) testing process
PPTX
Data warehouse system and its concepts
PPTX
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
PDF
Snowflake Data Governance
ETL Process
Testing data warehouse applications by Kirti Bhushan
Data type[s] on MS SQL Server
Date warehousing concepts
Tableau Tutorial For Beginners | Tableau Training For Beginners | Tableau Cer...
Tableau slideshare
Tableau Tutorial Complete by Rohit Dubey
Introducing Azure SQL Data Warehouse
Key Considerations While Rolling Out Denodo Platform
Introducción a Oracle Audit Vault
NOVA SQL User Group - Azure Synapse Analytics Overview - May 2020
Dimensional model | | Fact Tables | | Types
03. Data Exploration.pptx
Lecture 09 dblc centralized vs decentralized design
Data Warehouse (ETL) testing process
Data warehouse system and its concepts
Differentiate Big Data vs Data Warehouse use cases for a cloud solution
Snowflake Data Governance
Ad

Viewers also liked (20)

PDF
WhereScape, the pioneer in data warehouse automation software
PDF
QuerySurge - the automated Data Testing solution
PDF
WhereScape - Business Intelligence for Growth
PDF
Smarter Analytics: Supporting the Enterprise with Automation
PPT
Best Practices for Building a Warehouse Quickly
PPTX
What is a Data Warehouse and How Do I Test It?
PPT
Data Quality Testing Generic (http://www.geektester.blogspot.com/)
PDF
Data Strategies in Testing
PDF
Go past average!
PDF
The Right Data Warehouse: Automation Now, Business Value Thereafter
PPTX
Continuous Integration and the Data Warehouse - PASS SQL Saturday Slovenia
PPTX
Market Research & Competitive Intelligence
PDF
What is Competitive Intelligence (CI) and What It Should Include
PPT
Competitive intelligence - industry research and benchmarking
PDF
Big Data and Competitive Intelligence
PPT
Mobile Business Intelligence - Yellowfin
PPT
Competitive Intelligence and Big Data
PDF
Data Vault Introduction
PPT
Smarter Eduction - Higher Education Summit 2011 - D Watt
PDF
Creating Better Customer Experiences Online (with Top Tasks) presented by Ger...
WhereScape, the pioneer in data warehouse automation software
QuerySurge - the automated Data Testing solution
WhereScape - Business Intelligence for Growth
Smarter Analytics: Supporting the Enterprise with Automation
Best Practices for Building a Warehouse Quickly
What is a Data Warehouse and How Do I Test It?
Data Quality Testing Generic (http://www.geektester.blogspot.com/)
Data Strategies in Testing
Go past average!
The Right Data Warehouse: Automation Now, Business Value Thereafter
Continuous Integration and the Data Warehouse - PASS SQL Saturday Slovenia
Market Research & Competitive Intelligence
What is Competitive Intelligence (CI) and What It Should Include
Competitive intelligence - industry research and benchmarking
Big Data and Competitive Intelligence
Mobile Business Intelligence - Yellowfin
Competitive Intelligence and Big Data
Data Vault Introduction
Smarter Eduction - Higher Education Summit 2011 - D Watt
Creating Better Customer Experiences Online (with Top Tasks) presented by Ger...
Ad

Similar to Test Automation for Data Warehouses (20)

PDF
How to Automate your Enterprise Application / ERP Testing
PPTX
Data Warehouse Testing in the Pharmaceutical Industry
PPTX
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
PDF
Creating a Data validation and Testing Strategy
PDF
Leveraging HPE ALM & QuerySurge to test HPE Vertica
PPT
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
PPTX
Query Wizards - data testing made easy - no programming
PDF
AI Assisted Continuous Testing - Talk Track v2.pdf
PDF
Automate ETL Testing, Data Warehouse & Migration Testing The Agile Way - iceDQ
PDF
Data Warehouse Testing—The Next Opportunity for QA Leaders
PPTX
rough-work.pptx
PPTX
DWBI Testing and Analytics Testing Services
PDF
Ray Scott - Agile Solutions – Leading with Test Data Management - EuroSTAR 2012
PPTX
How To Avoid Continuously Delivering Faulty Software
PPTX
SCRIMPS-STD: Test Automation Design Principles - and asking the right questions!
PPTX
How to Avoid Continuously Delivering Faulty Software
PPTX
ETL_TESTING.pptx
DOC
reddythippa ETL 8Years
PDF
Resume sailaja
PPTX
Small is Beautiful- Fully Automate your Test Case Design
How to Automate your Enterprise Application / ERP Testing
Data Warehouse Testing in the Pharmaceutical Industry
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Creating a Data validation and Testing Strategy
Leveraging HPE ALM & QuerySurge to test HPE Vertica
Choosing the Right Business Intelligence Tools for Your Data and Architectura...
Query Wizards - data testing made easy - no programming
AI Assisted Continuous Testing - Talk Track v2.pdf
Automate ETL Testing, Data Warehouse & Migration Testing The Agile Way - iceDQ
Data Warehouse Testing—The Next Opportunity for QA Leaders
rough-work.pptx
DWBI Testing and Analytics Testing Services
Ray Scott - Agile Solutions – Leading with Test Data Management - EuroSTAR 2012
How To Avoid Continuously Delivering Faulty Software
SCRIMPS-STD: Test Automation Design Principles - and asking the right questions!
How to Avoid Continuously Delivering Faulty Software
ETL_TESTING.pptx
reddythippa ETL 8Years
Resume sailaja
Small is Beautiful- Fully Automate your Test Case Design

More from Patrick Van Renterghem (20)

PDF
Ethical AI at VDAB, presented by Vincent Buekenhout (Ethical AI Lead, VDAB) a...
PDF
Implementing error-proof, business-critical Machine Learning, presentation by...
PDF
Building Trust and Explainability into Chatbots: the Partena Ziekenfonds Busi...
PDF
AI & Ethics: The Belgian Industry Vision & Initiatives, presentation by Jelle...
PDF
Responsible AI: An Example AI Development Process with Focus on Risks and Con...
PDF
Fairness and Transparency: Algorithmic Explainability, some Legal and Ethical...
PPTX
How obedient digital twins and intelligent beings contribute to ethics and ex...
PDF
He Said, She Said: Finding and Fixing Bias in NLP (Natural Language Processin...
PDF
Introduction to Bias in Machine Learning, presented by Matthias Feys, CTO @ M...
PDF
Business Case: Ozitem Groupe, where 80% of the company is working remotely. R...
PDF
Digital Workplace Case Study: How the Municipality of Duffel successfully swi...
PDF
Unleashing the Full Potential of People, Teams and SOLVAY, presented by Bruce...
PDF
The Building Blocks of a Digital Workplace, presented by Sam Marshall at the ...
PDF
Engie's Digital Workplace and "Connecting the company" business case, present...
PDF
Face your communication challenges when implementing a digital workplace, bas...
PDF
The first steps in Recticel's Digital Workplace program by Kenneth Meuleman (...
PDF
Presentation by Dave Geentjens at the "Successful Digital Workplace Adoption"...
PDF
Tim scottkoenverheyenpresentation
PDF
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
PDF
Presentation by Luc Delanglez (DataLumen) at the Data Vault Modelling and Dat...
Ethical AI at VDAB, presented by Vincent Buekenhout (Ethical AI Lead, VDAB) a...
Implementing error-proof, business-critical Machine Learning, presentation by...
Building Trust and Explainability into Chatbots: the Partena Ziekenfonds Busi...
AI & Ethics: The Belgian Industry Vision & Initiatives, presentation by Jelle...
Responsible AI: An Example AI Development Process with Focus on Risks and Con...
Fairness and Transparency: Algorithmic Explainability, some Legal and Ethical...
How obedient digital twins and intelligent beings contribute to ethics and ex...
He Said, She Said: Finding and Fixing Bias in NLP (Natural Language Processin...
Introduction to Bias in Machine Learning, presented by Matthias Feys, CTO @ M...
Business Case: Ozitem Groupe, where 80% of the company is working remotely. R...
Digital Workplace Case Study: How the Municipality of Duffel successfully swi...
Unleashing the Full Potential of People, Teams and SOLVAY, presented by Bruce...
The Building Blocks of a Digital Workplace, presented by Sam Marshall at the ...
Engie's Digital Workplace and "Connecting the company" business case, present...
Face your communication challenges when implementing a digital workplace, bas...
The first steps in Recticel's Digital Workplace program by Kenneth Meuleman (...
Presentation by Dave Geentjens at the "Successful Digital Workplace Adoption"...
Tim scottkoenverheyenpresentation
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...
Presentation by Luc Delanglez (DataLumen) at the Data Vault Modelling and Dat...

Recently uploaded (20)

PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Launch Your Data Science Career in Kochi – 2025
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Lecture1 pattern recognition............
PPTX
1_Introduction to advance data techniques.pptx
PPTX
Database Infoormation System (DBIS).pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd
Business Acumen Training GuidePresentation.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
oil_refinery_comprehensive_20250804084928 (1).pptx
Launch Your Data Science Career in Kochi – 2025
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Business Ppt On Nestle.pptx huunnnhhgfvu
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Reliability_Chapter_ presentation 1221.5784
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
.pdf is not working space design for the following data for the following dat...
Lecture1 pattern recognition............
1_Introduction to advance data techniques.pptx
Database Infoormation System (DBIS).pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
CEE 2 REPORT G7.pptxbdbshjdgsgjgsjfiuhsd

Test Automation for Data Warehouses

  • 1. Automade Test Automation Data Vault and Data Warehouse Automation 9th of December Stefaan De Vos
  • 3. Challenges • Lead times of BI / Analytical projects are ever decreasing • Tools • Automation (Wherescape...) • Virtualization (Denodo, CIS...) • Appliances (Netezza, DATAllegro...) • Methodologies • Data Vault • Agile DWH BI went Agile, but Testing didn’t
  • 4. Challenges • Data • Larger data volumes • IoT • Unstructured data / poor data quality • Social networks • No relevant test data sets available • The number of possible test cases are near infinite Data Volume & Quality
  • 5. Challenges Review requirements Review requirements Define testing Strategy Define testing Strategy Prepare Test DataPrepare Test DataEntry / Exit criteria Design Test Cases & Scripts Design Test Cases & Scripts Configure the test environment Configure the test environment Prepare & Design Agree on Entry / Exit test criteria Agree on Entry / Exit test criteria Integration testing Integration testing Performance testingPerformance testing Acceptance testingAcceptance testing - Basic testing - DI jobs accessible - Reports accessible - Cleansed - Complete - Correct - Integrated - Valid reports - Relevant data - Available data - Consistent data - Accessible - DI & BI integration - Full test cycles - Check NFRs - Scalable - Check SLAs - Peak user - Peak loads - Functional - User - Production - SME - End user Construct Defect metrics review Defect metrics review Performance statistics Performance statistics Lessons learntLessons learnt Process & Quality Improvement Accept Data Completeness testing Data Completeness testing BI & Analytical Testing BI & Analytical Testing Smoke test Unit test Smoke test Unit test BI Testing Lifecycle
  • 6. Challenges ** Regulatory compliance might require to use the v-model, e.g. validated environments. Functional analysis Requirements functional Non-functional Technical Design Construction Unit Smoke Test Data Completeness BI & Analytical Integration testing Performance User Production AcceptanceValidation Verification BI Testing V-Model
  • 7. Challenge • Lots of data • Lots of testing to be done • Little time to do it Summary
  • 9. Complete DWH Testing • Complete testing is a must • It requires: • Testing methodology • Right project culture/mindset and organization • Tools Our View
  • 10. Complete DWH Testing Database integrity checking. Risk based testing Effective defect management and collaboration End to End Performance testing Adherence to compliance and regulatory standards and…AUTOMATE Critical Success Factors
  • 11. Test Automation Leverage social development principles to deepen functionalities Testers - functional testing - regression testing - result analysis Developers / DBAs - unit testing - result analysis Data Analysts - review, analyze data - verify mapping failures Operations teams - monitoring - result analysis Collaboration/WorkflowTestManagement Rational Quality Manager JIRA Team Foundation ServerQuality Center
  • 14. Test Automation Basic functionality • Auto detection of anomalies (or at least prior to being detected by a user) • Targeted regression testing for planned changes • Data error identification errors by comparing (huge) data result sets • Data error detection via a rules engine • In between data layer reconciliation and auditing • Test data generation Additional functionality • No programming or coding • Heterogeneous connectivity • Collaboration and workflow capabilities • Visually attractive development and monitoring environment • Intuitive reports & dashboards Functionality
  • 15. Test Automation • Shorten regression cycles • Save report developers time • Test the same data set in less time • Test more • Faster deployment of defect resolution cycle • Faster deployment of enhancements • Less cumbersome upgrades/migrations • Enabler for • Implementing continuous testing • operationalization of testing Benefits
  • 16. Test Automation Vendors - tools • ICEDQ • RTTS - QuerySurge Unconnected ETL Source Data Layer Target Data Layer ETL - ELT 1 3 2 Load test data Execute Job externally Extract result 4 Compare & report • Validation of the business rules implemented in ETL processes are assessed by comparing the results against a ground truth. • The ETL processes are executed separated from the test automation tool • Less adequate for multi-step ETL processes. Approach
  • 17. Test Automation Vendors - tools • Zuzena Connected ETL • The business rules present in the ETL tool are analyzed and the test results are assessed against the anticipated results. • The test automation tool executes the ETL processes. Approach Source Data Layer Target Data Layer 2 4Load test data Extract result 5 Compare & report 1 Analyze logic 3 Run job
  • 18. Test Automation Vendors - tools • Report Valid8tor (BO) • 360Bind (BO) • Integrity Manager (MSTR) • Report Validator - BSP Software (Cognos) Report Integrity validator • Parallel testing of 2 live systems • Comparing against a (historical) ground truth • Comparing against known good baselines Approach Reports Reporting data Layer Load test data Scrape data 4 Compare & report 1 3 Run report 2
  • 20. QuerySurge • Integrates with HP QC / IBM RQM / MSFT TFS • Provides collaboration features pulls data from data sources pulls data from target data store compares data quickly generates reports, audit trails reports SQL Design Tests Scheduling Reporting Run Dashboard Wizards Data Health Dashboard • a SQL execution and data comparison tool working against heterogeneous datasources. • No programming needed
  • 21. Automade introduction • Automade • Provides an agile answer to the ever-increasing information appetite • By automating the mind-numbing aspects of constructing, maintaining and testing of data warehouses. • Automade is a spinoff of MindThegap and has established partnerships with
  • 22. Test automation Thank you for listening, Any questions? Feel free to send questions & feedback to stefaan.devos@automade.be