Security & Governance in Big Data
Security
BIG DATA IS STILL A NEW TECHNOLOGY FOR MOST ORGANISATIONS, AND ANY TECHNOLOGY THAT IS NOT WELL UNDERSTOOD WILL INTRODUCE NEW VULNERABILITIES
Many businesses already use Big Data to store data. However, they may not have the access-rights controls that security requires.
Big Data breaches can be big, so data security becomes even more critical.
CRITICAL RISKS IN BIG DATA PLATFORMS
∎ BIG data, BIGGER security leaks
∎ The nodes in a cluster may not be adequately hardened, leaving a large attack surface
∎ Access to data from multiple locations may not be sufficiently controlled
∎ Regulatory requirements for logs and audit trails may not be fulfilled
BIG CHALLENGE
∎ The platform must have a comprehensive security solution
∎ Secure integration with BBVA security is required
WHAT IS IT?
GOSEC is a centralized security component that manages fine-grained access control across Big Data services (HDFS, Cassandra, Kafka) and Big Data web applications such as Viewer.
GOSEC lets administrators manage policies that control access to files, topics, tables, databases… These policies can be set for individual users or groups.
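As a rough illustration of what such a policy might look like, the sketch below models a policy as a service, a resource pattern, a set of actions, and the users or groups it is assigned to. This is not GOSEC's actual data model or API: the `Policy` class, the `is_allowed` function, the glob-style resource patterns and the sample names are all assumptions made for the example.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass(frozen=True)
class Policy:
    """Hypothetical policy record; field names are illustrative only."""
    service: str                      # e.g. "hdfs", "kafka", "cassandra"
    resource: str                     # glob pattern such as "/data/raw/*"
    actions: frozenset                # e.g. frozenset({"read", "write"})
    users: frozenset = frozenset()    # individual users the policy targets
    groups: frozenset = frozenset()   # groups the policy targets

    def applies_to(self, user, user_groups):
        # A policy targets a request if the user is named directly
        # or belongs to at least one of the policy's groups.
        return user in self.users or bool(self.groups & set(user_groups))

    def allows(self, service, resource, action):
        return (
            service == self.service
            and action in self.actions
            and fnmatchcase(resource, self.resource)
        )


def is_allowed(policies, user, user_groups, service, resource, action):
    """Deny by default: allow only if some policy matches identity and request."""
    return any(
        p.applies_to(user, user_groups) and p.allows(service, resource, action)
        for p in policies
    )


# Sample policies (made-up names): one assigned to a group, one to a user.
POLICIES = [
    Policy("hdfs", "/data/raw/*", frozenset({"read"}),
           groups=frozenset({"analysts"})),
    Policy("kafka", "payments.*", frozenset({"read", "write"}),
           users=frozenset({"alice"})),
]
```

With these sample policies, a member of `analysts` can read files under `/data/raw/` but cannot write them, and only `alice` can use the `payments.*` topics.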
GOSEC currently covers:
Authentication
Authorization
Audit
OVERVIEW
GOSEC MANAGER
USERS & GROUPS MANAGEMENT
Users & groups are not managed directly in GOSEC. They are always retrieved from the organization's Identity Provider (LDAP).
USERS & GROUPS MANAGEMENT
Roles & profiles are an easy way to set up security
policies for dynamic group memberships or user tasks.
USERS & GROUPS MANAGEMENT
ACLs can be created to grant or revoke access to different resources.
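The grant/revoke idea can be sketched with a small in-memory store. This is only an illustration under assumed names (`AclStore`, `grant`, `revoke`, `check`, and the `group:`/`cassandra:` naming scheme are hypothetical), not the way GOSEC stores or evaluates ACLs.

```python
from collections import defaultdict


class AclStore:
    """Hypothetical in-memory ACL store: (principal, resource) -> actions."""

    def __init__(self):
        self._entries = defaultdict(set)

    def grant(self, principal, resource, action):
        # Record that the principal may perform the action on the resource.
        self._entries[(principal, resource)].add(action)

    def revoke(self, principal, resource, action):
        # discard() is a no-op if the action was never granted.
        self._entries[(principal, resource)].discard(action)

    def check(self, principal, resource, action):
        # Anything not explicitly granted is denied.
        return action in self._entries.get((principal, resource), set())


acl = AclStore()
acl.grant("group:analysts", "cassandra:sales.orders", "select")
granted = acl.check("group:analysts", "cassandra:sales.orders", "select")
acl.revoke("group:analysts", "cassandra:sales.orders", "select")
after_revoke = acl.check("group:analysts", "cassandra:sales.orders", "select")
```

Here `granted` is `True` and `after_revoke` is `False`; a principal with no entry at all is denied as well.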
USERS & GROUPS MANAGEMENT
Every operation is audited by GOSEC. Only security admins have access to the audit log.
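A minimal sketch of this behaviour, assuming an in-memory log and a fixed set of security admins, could look as follows. The `AuditLog` class, its method names and the record fields are invented for the example and do not describe GOSEC's real audit format.

```python
from datetime import datetime, timezone


class AuditLog:
    """Hypothetical audit log: anyone's operations are recorded,
    but only security admins may read the records back."""

    def __init__(self, security_admins):
        self._admins = set(security_admins)
        self._records = []

    def record(self, actor, operation, resource, outcome):
        # Append one entry per operation, with a UTC timestamp.
        self._records.append({
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "actor": actor,
            "operation": operation,
            "resource": resource,
            "outcome": outcome,
        })

    def read(self, requester):
        if requester not in self._admins:
            raise PermissionError(f"{requester} may not read the audit log")
        # Return a copy so callers cannot alter the stored records.
        return list(self._records)


log = AuditLog(security_admins={"secadmin"})
log.record("alice", "read", "hdfs:/data/raw/a.csv", "allowed")
log.record("bob", "write", "kafka:payments.eu", "denied")
entries = log.read("secadmin")
```

Calling `log.read("alice")` raises `PermissionError`, because `alice` is not in the admin set.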
SO, WHAT MUST WE DO NOW?
BBVA INTEGRATION
∎ Synchronized IDP Armadillo and Global Directory
∎ Data Encoding for Security
∎ Strategy to prevent internal and external data leakage
Governance
“DATA IS GOING TO DEFINE THE COMPETITIVE ADVANTAGE IN THE GLOBAL FINANCIAL ECOSYSTEM OF THE FUTURE”
“WE ARE BEGINNING TO BUILD DATA-DRIVEN BANK”
Francisco González, November 2015
WHY IS DATA GOVERNANCE NECESSARY?
∎ Large data volumes and various data types
∎ Democratize the use of the data with new flexible and agile exploitation
∎ Data management policies that ensure quality & traceability
Data Centric
GOVERNANCE MODEL TRANSVERSAL AND CONVERGENT ACROSS ALL GEOGRAPHIES OF THE GROUP
GLOBAL
WHAT DO WE EXPECT FROM DATA GOVERNANCE?
∎ Data Dictionary: functional and technical level
∎ Lineage: traceability of data throughout its life cycle
∎ Quality: data quality and process quality
∎ Standards and Best Practices: standards for each platform technology
∎ Visualization and Exploration: graphical solution for governance data
THANKS!
Any questions?
@datiobd
info@datiobd.com
datio-big-data
