SlideShare a Scribd company logo
Using Big Data for Customer
Analytics at Transamerica
David Beaudoin
Vishal Bamba
John LoGiudice
Enterprise Computing Community Conference
Marist College
June 12-14, 2016
About Transamerica
Transamerica is the US-based brand of Aegon, a Dutch financial services firm.
Aegon is one of the world's leading providers of life insurance, pensions and
asset management and is helping approximately 30 million customers globally to
achieve a lifetime of financial security.
2
About Transamerica
Operating under the Transamerica brand,
the Americas is Aegon’s largest market,
with two-thirds of the company’s
underlying earnings generated here.
Transamerica products and services help
more than 27 million customers protect
against financial risk, build financial
security and create successful retirements.
3
About Transamerica
Transamerica is among the top ten largest providers of variable annuities,
individual universal life and individual term life in the US. In 2015, over $6B in
benefits were paid.
With over $100B in assets under administration, we serve more than 4 million
retirement plan participants across the entire spectrum of defined benefit and
defined contribution plans.
4
As you may
have guessed…
That is a
WHOLE LOT of
data to manage!
5
Data Architecture
• Rich data environment across organizational business
units, comprised of many source systems across various
platforms…
• A consistent enterprise view of data across business
units is required.
6
Data Architecture
7
As in many industries, we
are focused on leveraging
technology to build
data-driven customer
relationships.
How can the current
data architecture
support this strategic direction?
Data Architecture
8
Enterprise Data Management
Crossroads!
Another traditional warehouse
project?
OR
Enterprise data hub/lake/ocean with
new “Big Data” technologies?
A New Strategy for Customer Data…
EMAP: Enterprise Marketing Analytics Platform
• Integration of both internal and external data sources to
provide a 360-degree view of customer relationships for
marketing, planning and operational analytics.
• Solution must provide strong data governance and
security, and facilitate the use of clean, trustable, high
quality data by business stakeholders and end users.
9
EMAP – Customer 360
What is your current & future
value to the business?
Initial Use Cases
Customer 360
Investment Advisor and Producer Profile
Journey Map
Customer Lifetime Value
Marketing Attribution
Asset Retention
Foundational
Applied
Enterprise Data Hub / Data Lake
12
Customer Data Lake – Iterative Approach
Big
Data
Management
Data
Integration
HDFS
Big
Data
management
Data
Quality
Hive
Hive / Impala
MR /
Spark
Cleansed
Files
Individual Household
Informatica Big Data Management Cloudera Big Data Platform
Big
Data
Analytics
Datameer
Extract Load & Transform
Data Quality –Cleaning, Identity Resolution
Admin
Extracts
Partner
Files
IVR
Enrichmen
t
Inputs
CRM
Solicitation
History
Weblogs
BI
Tableau
H2O – ML
Consumption
Big
Data
Relationship
Management
Identity
Resolution
Search
-
Solr
EMAP Solution Architecture
Technology Stack
15
Identity Resolution
Data Quality
Ingest & Profile Cleanse &
Transform
Standardize /
cleanse
phone, email,
address
DOB
BDM
Tool: Developer &
Analyst Tool
Source 1
Source N
Masking
Cleanse,
Standardize,
Mask Personal
Data
BDM
Tool: Developer
BDRM
matching/linking
(assign cluster ID)
Pre BDRM
Managed Views
Pre BDRM
Managed Views
BDM
Managed Views
(Current)
BDM System
Tables (Date)
Hive
BDRM
Hbase/Hive
Post BDRM
Managed Views
Post BDRM
Managed Views
Post BDRM
Managed Views
Profile
Data Discovery
Design/Define
rules
Build/Define
Ref Table
BDM
Tool: Analyst
Tool
Address
Doctor
process
EMAP Data Process Flow
Informatica Big Data Developer
HDFS Data Management Structure
18
Managed by IT
• Core set of Informatica developers to create the
mappings to ingest data from sources into HDFS
(jobs run and monitored by IT)
• Users can request access via a defined security
approval process (Access can be limited at the
file level)
• Access will be tracked and can be included in
auditing reports/events
• SSN will only be allowed in the core data
ingested from source systems.
Managed by the Team
• Governed onboarding process:
IT will assist the team with
setting up directories, reviewing
recommended data flow
process, reviewing security
privileges, education on platform
security and compliance issues,
and providing basic setup
guidance and assistance
• Access will be tracked and can
be included in auditing
reports/events
MANAGED DATA TEAM DATA
Physically
Separated
by
Directories
Audit Events
Audit events/actions Include:
• Failed login attempts
• Role changes
• Data access
• Denied data access
• Any many more …
Alerts -- Can email an alert when a
specific event occurs
Data captured about the audit
event:
• Date
• Command performed
• Object affected
• User that performed the
action
• IP Address
Audit
Events
ARE NOT
Stored on
the
Hadoop
Cluster
Summary of Key Benefits
• Provides a single platform to house key customer and prospect data
sources
• Establishes persistent keys across previously disparate data sources
• Provides for rapid intake of new data sources (structured and
unstructured)
• Eliminates today’s data intake and append bottleneck
• Empowers analysts to explore all data elements
• Increases processing power for statistical analysis
• Improved recruiting and retention of Data Engineers and Data Scientists
20
Lessons Learned
• Tie to business use cases to demonstrate value
• Align with larger enterprise strategy (business and technical)
• Socialize the platform and vision
• Technology is changing rapidly
• Partner with key vendors
• Partner with business
• Invest in a PoC
• Small team with the right skillset - innovative and curious
21
Lessons Learned
• Big Data requires Data Governance
• Establish tools and processes to support data governance from the start
• Important to have Data Stewards: Profile, validate, catalog, metadata
creation, lineage
22
Transamerica is
Hiring!
bigdatajobs@transamerica.com
Questions?

More Related Content

PDF
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
PDF
Big Data LDN 2017: Data Governance Reimagined
PPTX
Customer Intelligence_ Harnessing Elephants at Transamerica Presentation (1)
PPTX
Klarna Tech Talk - Mind the Data!
PDF
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
PDF
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
PDF
Data-Ed Online Webinar: Business Value from MDM
PDF
Data-Ed: Business Value From MDM
Webinar #2 - Transforming Challenges into Opportunities for Credit Unions
Big Data LDN 2017: Data Governance Reimagined
Customer Intelligence_ Harnessing Elephants at Transamerica Presentation (1)
Klarna Tech Talk - Mind the Data!
¿En qué se parece el Gobierno del Dato a un parque de atracciones?
Implementar una estrategia eficiente de gobierno y seguridad del dato con la ...
Data-Ed Online Webinar: Business Value from MDM
Data-Ed: Business Value From MDM

Similar to Using Big Data and AI for Customer Analytics (20)

PDF
Active Governance Across the Delta Lake with Alation
PDF
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
PPTX
DataSpryng Overview
PPTX
Customer-Centric Data Management for Better Customer Experiences
PPTX
Customer-Centric Data Management for Better Customer Experiences
PDF
Five Things to Consider About Data Mesh and Data Governance
PDF
Data Lake Architecture – Modern Strategies & Approaches
PDF
Keyrus US Information
PDF
Keyrus US Information
PDF
Data Management Meets Human Management - Why Words Matter
PPTX
Perspectives on Ethical Big Data Governance
PPTX
Chapter 4 : Introduction to BigData.pptx
PDF
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
PDF
Data-Ed Webinar: The Importance of MDM
PDF
Trends in Enterprise Advanced Analytics
PDF
Die Big Data Fabric als Enabler für Machine Learning & AI
PPTX
ANIn Pune July 2024 | Bootstrapping Data Mesh for a Complex Enterprise by Bal...
PPTX
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
PDF
Workable Enteprise Data Governance
PDF
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
Active Governance Across the Delta Lake with Alation
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DataSpryng Overview
Customer-Centric Data Management for Better Customer Experiences
Customer-Centric Data Management for Better Customer Experiences
Five Things to Consider About Data Mesh and Data Governance
Data Lake Architecture – Modern Strategies & Approaches
Keyrus US Information
Keyrus US Information
Data Management Meets Human Management - Why Words Matter
Perspectives on Ethical Big Data Governance
Chapter 4 : Introduction to BigData.pptx
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Data-Ed Webinar: The Importance of MDM
Trends in Enterprise Advanced Analytics
Die Big Data Fabric als Enabler für Machine Learning & AI
ANIn Pune July 2024 | Bootstrapping Data Mesh for a Complex Enterprise by Bal...
WCIT 2014 Rohit Tandon - Big Data to Drive Business Results: HP HAVEn
Workable Enteprise Data Governance
BDW Chicago 2016 - Ramu Kalvakuntla, Sr. Principal - Technical - Big Data Pra...
Ad

Recently uploaded (20)

PPTX
Introduction to Artificial Intelligence
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PPTX
history of c programming in notes for students .pptx
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
System and Network Administration Chapter 2
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Introduction to Artificial Intelligence
CHAPTER 2 - PM Management and IT Context
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Design an Analysis of Algorithms I-SECS-1021-03
Design an Analysis of Algorithms II-SECS-1021-03
How to Choose the Right IT Partner for Your Business in Malaysia
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Which alternative to Crystal Reports is best for small or large businesses.pdf
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
history of c programming in notes for students .pptx
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Reimagine Home Health with the Power of Agentic AI​
System and Network Administration Chapter 2
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Adobe Illustrator 28.6 Crack My Vision of Vector Design
wealthsignaloriginal-com-DS-text-... (1).pdf
Navsoft: AI-Powered Business Solutions & Custom Software Development
Upgrade and Innovation Strategies for SAP ERP Customers
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Ad

Using Big Data and AI for Customer Analytics

  • 1. Using Big Data for Customer Analytics at Transamerica David Beaudoin Vishal Bamba John LoGiudice Enterprise Computing Community Conference Marist College June 12-14, 2016
  • 2. About Transamerica Transamerica is the US-based brand of Aegon, a Dutch financial services firm. Aegon is one of the world's leading providers of life insurance, pensions and asset management and is helping approximately 30 million customers globally to achieve a lifetime of financial security. 2
  • 3. About Transamerica Operating under the Transamerica brand, the Americas is Aegon’s largest market, with two-thirds of the company’s underlying earnings generated here. Transamerica products and services help more than 27 million customers protect against financial risk, build financial security and create successful retirements. 3
  • 4. About Transamerica Transamerica is among the top ten largest providers of variable annuities, individual universal life and individual term life in the US. In 2015, over $6B in benefits were paid. With over $100B in assets under administration, we serve more than 4 million retirement plan participants across the entire spectrum of defined benefit and defined contribution plans. 4
  • 5. As you may have guessed… That is a WHOLE LOT of data to manage! 5
  • 6. Data Architecture • Rich data environment across organizational business units, comprised of many source systems across various platforms… • A consistent enterprise view of data across business units is required. 6
  • 7. Data Architecture 7 As in many industries, we are focused on leveraging technology to build data-driven customer relationships. How can the current data architecture support this strategic direction?
  • 8. Data Architecture 8 Enterprise Data Management Crossroads! Another traditional warehouse project? OR Enterprise data hub/lake/ocean with new “Big Data” technologies?
  • 9. A New Strategy for Customer Data… EMAP: Enterprise Marketing Analytics Platform • Integration of both internal and external data sources to provide a 360-degree view of customer relationships for marketing, planning and operational analytics. • Solution must provide strong data governance and security, and facilitate the use of clean, trustable, high quality data by business stakeholders and end users. 9
  • 10. EMAP – Customer 360 What is your current & future value to the business?
  • 11. Initial Use Cases Customer 360 Investment Advisor and Producer Profile Journey Map Customer Lifetime Value Marketing Attribution Asset Retention Foundational Applied
  • 12. Enterprise Data Hub / Data Lake 12
  • 13. Customer Data Lake – Iterative Approach
  • 14. Big Data Management Data Integration HDFS Big Data management Data Quality Hive Hive / Impala MR / Spark Cleansed Files Individual Household Informatica Big Data Management Cloudera Big Data Platform Big Data Analytics Datameer Extract Load & Transform Data Quality –Cleaning, Identity Resolution Admin Extracts Partner Files IVR Enrichmen t Inputs CRM Solicitation History Weblogs BI Tableau H2O – ML Consumption Big Data Relationship Management Identity Resolution Search - Solr EMAP Solution Architecture
  • 16. Identity Resolution Data Quality Ingest & Profile Cleanse & Transform Standardize / cleanse phone, email, address DOB BDM Tool: Developer & Analyst Tool Source 1 Source N Masking Cleanse, Standardize, Mask Personal Data BDM Tool: Developer BDRM matching/linking (assign cluster ID) Pre BDRM Managed Views Pre BDRM Managed Views BDM Managed Views (Current) BDM System Tables (Date) Hive BDRM Hbase/Hive Post BDRM Managed Views Post BDRM Managed Views Post BDRM Managed Views Profile Data Discovery Design/Define rules Build/Define Ref Table BDM Tool: Analyst Tool Address Doctor process EMAP Data Process Flow
  • 17. Informatica Big Data Developer
  • 18. HDFS Data Management Structure 18 Managed by IT • Core set of Informatica developers to create the mappings to ingest data from sources into HDFS (jobs run and monitored by IT) • Users can request access via a defined security approval process (Access can be limited at the file level) • Access will be tracked and can be included in auditing reports/events • SSN will only be allowed in the core data ingested from source systems. Managed by the Team • Governed onboarding process: IT will assist the team with setting up directories, reviewing recommended data flow process, reviewing security privileges, education on platform security and compliance issues, and providing basic setup guidance and assistance • Access will be tracked and can be included in auditing reports/events MANAGED DATA TEAM DATA Physically Separated by Directories
  • 19. Audit Events Audit events/actions Include: • Failed login attempts • Role changes • Data access • Denied data access • Any many more … Alerts -- Can email an alert when a specific event occurs Data captured about the audit event: • Date • Command performed • Object affected • User that performed the action • IP Address Audit Events ARE NOT Stored on the Hadoop Cluster
  • 20. Summary of Key Benefits • Provides a single platform to house key customer and prospect data sources • Establishes persistent keys across previously disparate data sources • Provides for rapid intake of new data sources (structured and unstructured) • Eliminates today’s data intake and append bottleneck • Empowers analysts to explore all data elements • Increases processing power for statistical analysis • Improved recruiting and retention of Data Engineers and Data Scientists 20
  • 21. Lessons Learned • Tie to business use cases to demonstrate value • Align with larger enterprise strategy (business and technical) • Socialize the platform and vision • Technology is changing rapidly • Partner with key vendors • Partner with business • Invest in a PoC • Small team with the right skillset - innovative and curious 21
  • 22. Lessons Learned • Big Data requires Data Governance • Establish tools and processes to support data governance from the start • Important to have Data Stewards: Profile, validate, catalog, metadata creation, lineage 22