SlideShare a Scribd company logo
Data
Architecture
OMG – It’s Made of People!
Mark Madsen, Teradata
@markmadsen
https://www.linkedin.com/in/markmadsen/
The Man. The Myth. The Mark.
Fellow
Technology & Innovation Office
President
Autonomous Robotics
Artificial Intelligence
Data work is not easy. Ask any user.
Technology exists to help the
organization to be more productive
Organizations are made of people
Our goal is to make it easy for
organizations (people) to use data
Data architecture is the foundation
on which this work depends
Why This Topic? Have you tried
turning it off
and on again?
What Do We Mean By Data Architecture?
Data Storage? Data Models? Data Technologies?
What Do We Mean By Data Architecture?
Data Storage? Data Models? Data Technologies?
Data Architecture is
Processes, Standards, and Policies
that address an organization’s collection, storage,
management, and use of data.
It tells you something about what and how
but doesn’t dictate implementation.
You should be able to answer these
key questions:
1. What do you collect, and why?
2. Where do you keep data, and why?
3. How do you organize, curate, and
integrate data?
This takes organization and methods
Where to focus?
Do you focus on organizing
books?
That’s the data-first
approach. Organize
everything up front without
knowing how it is used.
Organize the data wrong
and nobody can find or use
anything.
Where to focus?
Do you focus on the
building that stores books?
That’s the technology-first
approach. Don’t organize
anything in advance. Use
technology to sort it out.
You may have a catalog of
all the contents. Good luck
finding what you need.
Focus on the people and what they do.
Not the books.
Not the building.
What people say
I want self-service!
What they mean
Users think “self-service” in
terms of a finished data
product – self service equals
an answer to a question.
What people say
I want self-service!
What developers
hear
Developers think
“self-service” is data access,
which means the user must
be self-reliant.
Hearing a need, ask:
“Why is this an unmet need?”
Bad IT and organizational policies cause more problems
than technology failures or bad data.
Policy is a part of architecture that is ignored.
Shape architecture for people.
Don’t try to force people to technology.
• Get a quick answer
• Solve a one-off problem
• Analyze causes of a problem
• Build a predictive model
• Make repetitive decisions
• Use data in a routine process
• Make a complex decision
• Do experiments and analyze results
• Explain a situation to someone else
• Choose a course of action
• Convince others to take action
Architecture focuses on
what people want to do
How To Understand What Data Is Being Used?
Monitor the data environments.
Capture what data is used.
Catalogs of data don’t tell you anything
about use – and use changes over time.
This means users shouldn’t control
storage. Copies they make outside your
view are invisible.
So: you must give them a place to work
and not restrict it.
Focus on visibility of use
Different Views – Data and Users
The value of data is tied to its use.
This shows relationships between
people and data used.
70% of the data is used and reused
constantly. 30% of the data is used
by one or a few people, often new
data with undetermined value.
Usage information shows where and
how you should focus curation –
what you need to manage based on
the people using data.
Finally: establish curation practices based on data use
Curation is about what data is used, by whom, and for what purposes
Collect, Label, Link Categorize, Organize Index, Catalog, Place
The amount of available data
is vast. You can’t store it all.
You can’t analyze it all.
Choose wisely.
There’s a difference between
organizing datasets and data
modeling. One is oriented to
datasets and their use, and
one to the contents of the
datasets.
An important and oft-ignored
element of data architecture is
making sure the data is
findable and accessible by the
people who need it. This is a
curation task, not a data
management task
Thank you.
©2021 Teradata
Thank you.
©2021 Teradata

More Related Content

PDF
Solve User Problems: Data Architecture for Humans
PDF
Pay no attention to the man behind the curtain - the unseen work behind data ...
PDF
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
PDF
Architecting a Platform for Enterprise Use - Strata London 2018
PDF
How to understand trends in the data & software market
PDF
Assumptions about Data and Analysis: Briefing room webcast slides
PDF
Building a Data Platform Strata SF 2019
PDF
The Black Box: Interpretability, Reproducibility, and Data Management
Solve User Problems: Data Architecture for Humans
Pay no attention to the man behind the curtain - the unseen work behind data ...
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Platform for Enterprise Use - Strata London 2018
How to understand trends in the data & software market
Assumptions about Data and Analysis: Briefing room webcast slides
Building a Data Platform Strata SF 2019
The Black Box: Interpretability, Reproducibility, and Data Management

What's hot (20)

PDF
Operationalizing Machine Learning in the Enterprise
PDF
Everything Has Changed Except Us: Modernizing the Data Warehouse
PDF
Disruptive Innovation: how do you use these theories to manage your IT?
PDF
Bi isn't big data and big data isn't BI (updated)
PDF
Everything has changed except us
PDF
Briefing room: An alternative for streaming data collection
PDF
Building Data Science Teams
 
PDF
Big Data and Bad Analogies
PPTX
Strata Data Conference 2019 : Scaling Visualization for Big Data in the Cloud
PPTX
Machine Learning in Big Data
PPTX
Managing Data Science | Lessons from the Field
PDF
Intro to Data Science for Non-Data Scientists
PDF
Big dataplatform operationalstrategy
PDF
Analytics 3.0 Measurable business impact from analytics & big data
PPTX
Building Data Science Teams: A Moneyball Approach
PDF
5 Factors Impacting Your Big Data Project's Performance
PDF
How to Build Data Science Teams
PPTX
Moving Data Science from an Event to A Program: Considerations in Creating Su...
PDF
Big Data - Insights & Challenges
PDF
Big Data: Issues and Challenges
Operationalizing Machine Learning in the Enterprise
Everything Has Changed Except Us: Modernizing the Data Warehouse
Disruptive Innovation: how do you use these theories to manage your IT?
Bi isn't big data and big data isn't BI (updated)
Everything has changed except us
Briefing room: An alternative for streaming data collection
Building Data Science Teams
 
Big Data and Bad Analogies
Strata Data Conference 2019 : Scaling Visualization for Big Data in the Cloud
Machine Learning in Big Data
Managing Data Science | Lessons from the Field
Intro to Data Science for Non-Data Scientists
Big dataplatform operationalstrategy
Analytics 3.0 Measurable business impact from analytics & big data
Building Data Science Teams: A Moneyball Approach
5 Factors Impacting Your Big Data Project's Performance
How to Build Data Science Teams
Moving Data Science from an Event to A Program: Considerations in Creating Su...
Big Data - Insights & Challenges
Big Data: Issues and Challenges
Ad

Similar to Data Architecture: OMG It’s Made of People (20)

PDF
DataEd Slides: Data Architecture versus Data Modeling
PDF
Data-Ed Online Webinar: Data Architecture Requirements
PDF
Data Architecture vs Data Modeling
PDF
Data Architecture Strategies
PPT
Data Architecture for Data Governance
PPTX
april2023.pptx
PDF
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
PDF
Data-Ed Online: Data Architecture Requirements
PDF
Data-Ed: Data Architecture Requirements
PDF
Data+Management+Masterclasssdfsdfsdfsd.pdf
PDF
Information is at the heart of all architecture disciplines & why Conceptual ...
PDF
Data-Ed Webinar: Data Architecture Requirements
PDF
Data-Ed: Data Architecture Requirements
PDF
Data Architecture - The Foundation for Enterprise Architecture and Governance
PDF
Building the Architecture for Analytic Competition
PPTX
Chapter 4: Data Architecture Management
PDF
chapter4-220725121544-5ef6271b.pdf
PDF
Data Architecture for Solutions.pdf
PDF
DataEd Slides: Data Modeling is Fundamental
PDF
Information is at the heart of all architecture disciplines
DataEd Slides: Data Architecture versus Data Modeling
Data-Ed Online Webinar: Data Architecture Requirements
Data Architecture vs Data Modeling
Data Architecture Strategies
Data Architecture for Data Governance
april2023.pptx
Data-Ed Slides: Data Architecture Strategies - Constructing Your Data Garden
Data-Ed Online: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
Data+Management+Masterclasssdfsdfsdfsd.pdf
Information is at the heart of all architecture disciplines & why Conceptual ...
Data-Ed Webinar: Data Architecture Requirements
Data-Ed: Data Architecture Requirements
Data Architecture - The Foundation for Enterprise Architecture and Governance
Building the Architecture for Analytic Competition
Chapter 4: Data Architecture Management
chapter4-220725121544-5ef6271b.pdf
Data Architecture for Solutions.pdf
DataEd Slides: Data Modeling is Fundamental
Information is at the heart of all architecture disciplines
Ad

More from mark madsen (15)

PDF
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
PDF
A Pragmatic Approach to Analyzing Customers
PDF
Building the Enterprise Data Lake: A look at architecture
PDF
Briefing Room analyst comments - streaming analytics
PDF
On the edge: analytics for the modern enterprise (analyst comments)
PDF
Crossing the chasm with a high performance dynamically scalable open source p...
PDF
Don't let data get in the way of a good story
PDF
Don't follow the followers
PDF
Exploring cloud for data warehousing
PDF
Open Data: Free Data Isn't the Same as Freeing Data
PDF
Exploring cloud for data warehousing
PDF
Wake up and smell the data
PDF
Big Data Wonderland: Two Views on the Big Data Revolution
PDF
Using Data Virtualization to Integrate With Big Data
PDF
One Size Doesn't Fit All: The New Database Revolution
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Pragmatic Approach to Analyzing Customers
Building the Enterprise Data Lake: A look at architecture
Briefing Room analyst comments - streaming analytics
On the edge: analytics for the modern enterprise (analyst comments)
Crossing the chasm with a high performance dynamically scalable open source p...
Don't let data get in the way of a good story
Don't follow the followers
Exploring cloud for data warehousing
Open Data: Free Data Isn't the Same as Freeing Data
Exploring cloud for data warehousing
Wake up and smell the data
Big Data Wonderland: Two Views on the Big Data Revolution
Using Data Virtualization to Integrate With Big Data
One Size Doesn't Fit All: The New Database Revolution

Recently uploaded (20)

PPTX
Database Infoormation System (DBIS).pptx
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPT
Quality review (1)_presentation of this 21
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
Computer network topology notes for revision
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
Lecture1 pattern recognition............
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Fluorescence-microscope_Botany_detailed content
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
Global journeys: estimating international migration
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Database Infoormation System (DBIS).pptx
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Quality review (1)_presentation of this 21
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Computer network topology notes for revision
Data_Analytics_and_PowerBI_Presentation.pptx
.pdf is not working space design for the following data for the following dat...
Business Ppt On Nestle.pptx huunnnhhgfvu
climate analysis of Dhaka ,Banglades.pptx
Lecture1 pattern recognition............
Moving the Public Sector (Government) to a Digital Adoption
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Fluorescence-microscope_Botany_detailed content
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
IBA_Chapter_11_Slides_Final_Accessible.pptx
Global journeys: estimating international migration
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf

Data Architecture: OMG It’s Made of People

  • 1. Data Architecture OMG – It’s Made of People! Mark Madsen, Teradata @markmadsen https://www.linkedin.com/in/markmadsen/
  • 2. The Man. The Myth. The Mark. Fellow Technology & Innovation Office President Autonomous Robotics Artificial Intelligence
  • 3. Data work is not easy. Ask any user. Technology exists to help the organization to be more productive Organizations are made of people Our goal is to make it easy for organizations (people) to use data Data architecture is the foundation on which this work depends Why This Topic? Have you tried turning it off and on again?
  • 4. What Do We Mean By Data Architecture? Data Storage? Data Models? Data Technologies?
  • 5. What Do We Mean By Data Architecture? Data Storage? Data Models? Data Technologies? Data Architecture is Processes, Standards, and Policies that address an organization’s collection, storage, management, and use of data. It tells you something about what and how but doesn’t dictate implementation. You should be able to answer these key questions: 1. What do you collect, and why? 2. Where do you keep data, and why? 3. How do you organize, curate, and integrate data?
  • 7. Where to focus? Do you focus on organizing books? That’s the data-first approach. Organize everything up front without knowing how it is used. Organize the data wrong and nobody can find or use anything.
  • 8. Where to focus? Do you focus on the building that stores books? That’s the technology-first approach. Don’t organize anything in advance. Use technology to sort it out. You may have a catalog of all the contents. Good luck finding what you need.
  • 9. Focus on the people and what they do. Not the books. Not the building.
  • 10. What people say I want self-service! What they mean Users think “self-service” in terms of a finished data product – self service equals an answer to a question.
  • 11. What people say I want self-service! What developers hear Developers think “self-service” is data access, which means the user must be self-reliant.
  • 12. Hearing a need, ask: “Why is this an unmet need?” Bad IT and organizational policies cause more problems than technology failures or bad data. Policy is a part of architecture that is ignored.
  • 13. Shape architecture for people. Don’t try to force people to technology.
  • 14. • Get a quick answer • Solve a one-off problem • Analyze causes of a problem • Build a predictive model • Make repetitive decisions • Use data in a routine process • Make a complex decision • Do experiments and analyze results • Explain a situation to someone else • Choose a course of action • Convince others to take action Architecture focuses on what people want to do
  • 15. How To Understand What Data Is Being Used? Monitor the data environments. Capture what data is used. Catalogs of data don’t tell you anything about use – and use changes over time. This means users shouldn’t control storage. Copies they make outside your view are invisible. So: you must give them a place to work and not restrict it. Focus on visibility of use
  • 16. Different Views – Data and Users The value of data is tied to its use. This shows relationships between people and data used. 70% of the data is used and reused constantly. 30% of the data is used by one or a few people, often new data with undetermined value. Usage information shows where and how you should focus curation – what you need to manage based on the people using data.
  • 17. Finally: establish curation practices based on data use Curation is about what data is used, by whom, and for what purposes Collect, Label, Link Categorize, Organize Index, Catalog, Place The amount of available data is vast. You can’t store it all. You can’t analyze it all. Choose wisely. There’s a difference between organizing datasets and data modeling. One is oriented to datasets and their use, and one to the contents of the datasets. An important and oft-ignored element of data architecture is making sure the data is findable and accessible by the people who need it. This is a curation task, not a data management task
  • 18. Thank you. ©2021 Teradata Thank you. ©2021 Teradata