SlideShare a Scribd company logo
Ab Initio
From the Data Warehousing
Perspective
Data Warehousing
Why did it arise?
 Large Corporations gathered huge amounts of
data.
 Sooner, they were data rich but information
poor.
 Large body of disparate data, difficult to make
informed business decisions.
Solution!!!
 Users wanted more control over their data
 Each department request data specific to their
department.
 Varying amounts of data from the same
source.
One Solution to the PROBLEMS---
Information/Data Warehouse.
What is Data Warehousing?
 Data Warehousing is an architecture
 System for storing, retrieving and managing
large amounts of any type of data.
 Data warehousing concerned in moving data
from its current location to the data warehouse
and transforming data into information.
E.T.L-----What is it??
 E.T.L stands for Extraction, Transformation
Loading.
 This is the principle used in Data Warehousing
Characteristics of Data Warehousing
 Application independent
 Collected at any moment in a business cycle.
 Metadata has been created for it.
 Easily understood by a non-technical
Characteristics of data in data warehouse
 Subject Oriented
 Integrated
 Non-volatile
 Time Variant
Subject-Oriented
 Focus on entities rather than on process.
 A Subject-Oriented data warehouse is called a
Data Mart
ETL Tools
 Many tools are being used in Data
Warehousing for the purpose of ETL.
 Ab Initio is one of the major ETL tool.
What is Ab Initio?
 Latin word, meaning “From First Principles”
 ETL tool, developed by Ab Initio software
corporation (http://www.abinitio.com)
 Used in data warehousing, batch processing
and application integration.
Why Ab Initio?
 Achieving Scalability
 Reduced Development Time
 Managing Metadata
 Integrating Other Applications
Features
 Basic Components: Filter by Expression, Reformat, Sort, Join,
Rollup, Dedup
 Database Components: Input, Output and Update Table;
db_config_utility
 Built in Functions: Ab Initio built-in functions are those which
 can manipulate strings, dates, and numbers
 can access system properties
 Vectors: An array of same type of elements that is repeated
Look Up Function
 Built-in function within a transform function that
allows a transform component to retrieve records
from a look up file
 Held in main memory
 Faster as searching and retrieval is key based
 Not connected to other components in a graph
Alternatives for Ab Initio
 Informatica and Ascential are alternatives for
Ab Initio but the main disadvantage is they are
tougher to work with.
Highlights
 Every plug-in facility available from industry
leaders Informatica and Ascential incorporated
into Ab Initio
 Fastest ETL, possible to extract 41 million
rows of data from an Oracle 8i database
(Geneva billing system!) in about 5.2 minutes
Success Stories
 Bank of Montreal-Moving 10 terabytes (TB)
of data daily and analyzing it done using Ab
Initio
 Premier Inc. (www.premierinc.com health
care services) successfully handled 14 TB of
data using Ab Initio to achieve scalability and
data quality
End
 Thank You for your time!!

More Related Content

PPT
Generic Graph And Psets
PDF
Shell scripting
PDF
Datastage real time scenario
PPTX
Oop’s Concept and its Real Life Applications
PPT
Chapter1 introduction
PDF
Linux Directory Structure
PPTX
Disk and File System Management in Linux
PPTX
The string class
Generic Graph And Psets
Shell scripting
Datastage real time scenario
Oop’s Concept and its Real Life Applications
Chapter1 introduction
Linux Directory Structure
Disk and File System Management in Linux
The string class

What's hot (20)

PDF
Learning c - An extensive guide to learn the C Language
PPTX
Union in c language
PPTX
Linux file system
PPTX
Basic commands of linux
PPTX
Data structure tries
PPTX
Polish Notation In Data Structure
PPTX
Files and directories in Linux 6
PPT
Unit 3
PPT
Structure c
PPTX
Introduction to Vim
PDF
Data Warehouse Architecture
PPT
Xfs file system for linux
PPTX
Hashing In Data Structure
PDF
Ibm pure data system for analytics n200x
PPT
Linux presentation
PDF
PPSX
Complete C programming Language Course
PPTX
Linux basic commands
PDF
Introduction to firewalls through Iptables
PPTX
Raid level
Learning c - An extensive guide to learn the C Language
Union in c language
Linux file system
Basic commands of linux
Data structure tries
Polish Notation In Data Structure
Files and directories in Linux 6
Unit 3
Structure c
Introduction to Vim
Data Warehouse Architecture
Xfs file system for linux
Hashing In Data Structure
Ibm pure data system for analytics n200x
Linux presentation
Complete C programming Language Course
Linux basic commands
Introduction to firewalls through Iptables
Raid level
Ad

Viewers also liked (20)

PDF
Aboutsip - SIP Routing
PPT
Optimization Analysis Case Example
DOCX
Tanglewood 3
PPT
The welch way
PPTX
Advanced Work Packaging in Construction: An Introduction
PPTX
Teamcenter – sap integration gateway
PPTX
Hedge Fund Strategies: Credit Funds
PDF
Real-time, Sensor-based Monitoring of Shipping Containers
PPT
Designing your Product as a Platform
PPTX
Pilling and abrasion Testing of fabrics
PPTX
Shear centre
PPT
Chapter 1 modes of international trade transactions
PPTX
Office 365-single-sign-on-with-adfs
PDF
Revenue assurance 101
PPTX
XRF Theory and Application
PPT
One Page Talent Management
PDF
Acquisition Candidate Analysis
PDF
Textile management system review iii
PPT
Branding in Pharmaceuticals
PPT
Metadata in data warehouse
Aboutsip - SIP Routing
Optimization Analysis Case Example
Tanglewood 3
The welch way
Advanced Work Packaging in Construction: An Introduction
Teamcenter – sap integration gateway
Hedge Fund Strategies: Credit Funds
Real-time, Sensor-based Monitoring of Shipping Containers
Designing your Product as a Platform
Pilling and abrasion Testing of fabrics
Shear centre
Chapter 1 modes of international trade transactions
Office 365-single-sign-on-with-adfs
Revenue assurance 101
XRF Theory and Application
One Page Talent Management
Acquisition Candidate Analysis
Textile management system review iii
Branding in Pharmaceuticals
Metadata in data warehouse
Ad

Similar to What is the future of etl tools like ab initio (20)

PPT
DW 101
PPTX
MIS and Business Functions, TPS/DSS/ESS, MIS and Business Processes, Impact o...
PDF
Implementation of Data Marts in Data ware house
DOCX
Unit 1
PDF
Big data analytics beyond beer and diapers
PPT
Data warehouse presentation
PPT
20IT501_DWDM_PPT_Unit_I.ppt
PPTX
Datawarehouse
PDF
Top 60+ Data Warehouse Interview Questions and Answers.pdf
PPTX
ETL processes , Datawarehouse and Datamarts.pptx
PPT
Datawarehousing
PDF
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
PPT
20IT501_DWDM_PPT_Unit_I.ppt
PDF
single store faster analytics for warehousing
PPTX
Data Warehouse
DOCX
UNIT-5 DATA WAREHOUSING.docx
PPT
Data Warehouse Basic Guide
DOC
Data warehouse concepts
PPT
20IT501_DWDM_PPT_Unit_I.ppt
DW 101
MIS and Business Functions, TPS/DSS/ESS, MIS and Business Processes, Impact o...
Implementation of Data Marts in Data ware house
Unit 1
Big data analytics beyond beer and diapers
Data warehouse presentation
20IT501_DWDM_PPT_Unit_I.ppt
Datawarehouse
Top 60+ Data Warehouse Interview Questions and Answers.pdf
ETL processes , Datawarehouse and Datamarts.pptx
Datawarehousing
[IJET-V1I5P5] Authors: T.Jalaja, M.Shailaja
20IT501_DWDM_PPT_Unit_I.ppt
single store faster analytics for warehousing
Data Warehouse
UNIT-5 DATA WAREHOUSING.docx
Data Warehouse Basic Guide
Data warehouse concepts
20IT501_DWDM_PPT_Unit_I.ppt

Recently uploaded (20)

PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
Trump Administration's workforce development strategy
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Lesson notes of climatology university.
PPTX
Pharma ospi slides which help in ospi learning
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
Yogi Goddess Pres Conference Studio Updates
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Classroom Observation Tools for Teachers
PDF
Weekly quiz Compilation Jan -July 25.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Complications of Minimal Access Surgery at WLH
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
01-Introduction-to-Information-Management.pdf
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Trump Administration's workforce development strategy
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Lesson notes of climatology university.
Pharma ospi slides which help in ospi learning
FourierSeries-QuestionsWithAnswers(Part-A).pdf
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Yogi Goddess Pres Conference Studio Updates
STATICS OF THE RIGID BODIES Hibbelers.pdf
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
VCE English Exam - Section C Student Revision Booklet
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Orientation - ARALprogram of Deped to the Parents.pptx
Classroom Observation Tools for Teachers
Weekly quiz Compilation Jan -July 25.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Complications of Minimal Access Surgery at WLH
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
01-Introduction-to-Information-Management.pdf

What is the future of etl tools like ab initio

  • 1. Ab Initio From the Data Warehousing Perspective
  • 2. Data Warehousing Why did it arise?  Large Corporations gathered huge amounts of data.  Sooner, they were data rich but information poor.  Large body of disparate data, difficult to make informed business decisions.
  • 3. Solution!!!  Users wanted more control over their data  Each department request data specific to their department.  Varying amounts of data from the same source. One Solution to the PROBLEMS--- Information/Data Warehouse.
  • 4. What is Data Warehousing?  Data Warehousing is an architecture  System for storing, retrieving and managing large amounts of any type of data.  Data warehousing concerned in moving data from its current location to the data warehouse and transforming data into information.
  • 5. E.T.L-----What is it??  E.T.L stands for Extraction, Transformation Loading.  This is the principle used in Data Warehousing
  • 6. Characteristics of Data Warehousing  Application independent  Collected at any moment in a business cycle.  Metadata has been created for it.  Easily understood by a non-technical
  • 7. Characteristics of data in data warehouse  Subject Oriented  Integrated  Non-volatile  Time Variant
  • 8. Subject-Oriented  Focus on entities rather than on process.  A Subject-Oriented data warehouse is called a Data Mart
  • 9. ETL Tools  Many tools are being used in Data Warehousing for the purpose of ETL.  Ab Initio is one of the major ETL tool.
  • 10. What is Ab Initio?  Latin word, meaning “From First Principles”  ETL tool, developed by Ab Initio software corporation (http://www.abinitio.com)  Used in data warehousing, batch processing and application integration.
  • 11. Why Ab Initio?  Achieving Scalability  Reduced Development Time  Managing Metadata  Integrating Other Applications
  • 12. Features  Basic Components: Filter by Expression, Reformat, Sort, Join, Rollup, Dedup  Database Components: Input, Output and Update Table; db_config_utility  Built in Functions: Ab Initio built-in functions are those which  can manipulate strings, dates, and numbers  can access system properties  Vectors: An array of same type of elements that is repeated
  • 13. Look Up Function  Built-in function within a transform function that allows a transform component to retrieve records from a look up file  Held in main memory  Faster as searching and retrieval is key based  Not connected to other components in a graph
  • 14. Alternatives for Ab Initio  Informatica and Ascential are alternatives for Ab Initio but the main disadvantage is they are tougher to work with.
  • 15. Highlights  Every plug-in facility available from industry leaders Informatica and Ascential incorporated into Ab Initio  Fastest ETL, possible to extract 41 million rows of data from an Oracle 8i database (Geneva billing system!) in about 5.2 minutes
  • 16. Success Stories  Bank of Montreal-Moving 10 terabytes (TB) of data daily and analyzing it done using Ab Initio  Premier Inc. (www.premierinc.com health care services) successfully handled 14 TB of data using Ab Initio to achieve scalability and data quality
  • 17. End  Thank You for your time!!