SlideShare a Scribd company logo
Designing
fit for purpose
data structures
Stephen Bounds
Director and PrincipalConsultant
About me
• 20 years experience in IM, IT & KM
• 5 years running KnowQuestion
• 10 years as a Federal public servant
• Last public service job Deputy CIO,
Information Management at
Australian Communications and
Media Authority (ACMA)
Today’s workshop
• Tensions in data structure choices
• Information value over time
• Access / capture trade off
• Process / practice layering
• Recordkeeping and archiving
The four tensions
Access
Usefulness
Preservation
Privacy
The four tensions
Access
Usefulness
Preservation
Privacy
Long term
considerations
Short term
considerations
How is information useful?
For me
For my
team
For my
organisation
n
For the public
public
When is it useful?
Now
Next
Later
Eventually
Psychological distance
We think more concretely about things that are
nearer to us, and more abstractly about things further away
Temporal distance
Whether an event is a long time
in the future or the past
Spatial distance
Whether you are separated by
a large geographic distance
Social distance
Whether you feel you know
someone well or not
Hypothetical distance
Whether you think an event is
likely to occur or not
Relative information value
0
2
4
6
8
10
Me Team Organisation Everyone
Perceived value over time
0
2
4
6
8
10
Now Soon Later Eventually
Reused information over time
0
20
40
60
80
100
Now Soon Later Eventually
%reused
Value of reused information
0
2
4
6
8
10
Now Soon Later Eventually
Access / capture costs
Costs of …
Less
structured
More
structured
Capture Lower Higher
Access Higher Lower
Impact of trade off
Cost of
capture
Cost of
access
Exercise 1:Value assessment
A social worker has to fill out a case notes record every time
they visit a client.
They normally take handwritten notes and then transcribe
into the system back in the office.
• Assess each field on the basis of:
• Scope of information value
• Change in information value over time
• Capture / access trade offs being made
Exercise 1:Value assessment
• It is natural for people to discount the need for
information that is of less direct value to them
• How do you get buy-in when capturing information that is not
directly relevant?
Key design problem
• Case work is uncertain
• Results of any case hugely variable
• Subject to wide range of unpredictable factors
• With a large volume of cases, it is possible to establish if
systems are producing better results more of the time
• Propensity  inclination to behave in a particular way
Value assessment
No long term tracking
tracking
Pr
Case cost ($)
Cost ($)
Minimal long term
impact
0.5 $80 $0
Significant long term
impact
0.45 $80 $2,000
Critical long term impact 0.05 $80 $25,000
Value assessment
Long term tracking Pr
Case cost ($)
Cost ($)
Minimal long term
impact
0.6 $130 $0
Significant long term
impact
0.37 $130 $2,000
Critical long term impact 0.03 $130 $25,000
Monte Carlo simulations
• Monte Carlo simulations build a model of a world and use
this to simulate outcomes.
• Re-running the model multiple times shows both
variability and expected (average) results.
• Can approximate real-world results
without a known algorithm, eg
calculating area of a circle:
Value simulations
$-
$500
$1,000
$1,500
$2,000
$2,500
$3,000
Thousands
Exercise 2:Value perception
An updated case file format has been implemented which
adds additional structured information.
Specifically, the new form now references people by ID, and
categorises referral type and incident type
• Case officers are pushing back on the extra overhead.
• What alternative options exist?
• Simulate a relative cost/benefit of each pathway
• How do we validate our assumptions?
Exercise 2:Value perception
Note about modelling
• Assumptions are critical
• Never hide these – key to informed consent
Process-practice layering
• All organisations do it, but often not consciously
thought about
• Practice – free-form execution that respects the
expertise of individuals and teams
• Process – structured set of repeatable steps
• Process “touch points” used within practice to
provide accountability and evidence
Process-practice layering
S1
S2
S3
A1
A2
A call centre
S1
S2
A1
A barrister
S1
S2
A1
Information serves
different purposes
Evidence = That decisions and actions were taken
Rationale = Why decisions and actions were taken
Type of record: objective, factual, explicit
subjective, opinion, tacit
Very broadly …
More structured information
Less structured information
Common objection
More structured information
Less structured information Valuable BI
Options to enrich data
• Get someone else to add the structure
• Contemporaneous
• Post hoc
• Automatic post processing (eg text to speech),
keyword matching, semantic analysis
• Inferences and trends based on client clustering
• Last (worst) resort: get the expert to do it
Exercise 3: Process / practice
• Which parts of the case management workflow
are process and/or practice? Why?
• Where is data enrichment most necessary, and
what is the business value?
• How do you build a business case around
changing the responsibility for data capture and
enrichment?
Recordkeeping and archiving
Long
term data
storage
Records
Archives
Backups
3 different, overlapping concepts
Collection type Purpose Retention
Records Evidence about a business
process, both of actions and
supporting decisions
Kept until no longer
needed
Archives Information with intrinsic value
outside of the business process
that generated it
Kept indefinitely
Backups Redundant information that
supports timely disaster
recovery
Kept as long as risk
management requires it
Remember…
Not all records are in an archive
Not all archives contain records
Keep or destroy?
• Archives Act 1983 has no proactive requirement to
destroy – NAA approves retention rules for records
• Privacy Act 1988 requires reasonable steps to destroy
personal information once it is no longer required
• Discussion point –
• Why destroy information?
• Why keep information?
Information collection lifecycle
Commissioning Creation
Maintenance/
Migration
Disposal/
Loss
Transferral/
Duplication
Archive management
To unlock value of archive to users, key requirements are:
• Identification – Persistent, actionable identifiers
• Discoverability – Indexing, cataloguing, and access
• Preservation – Integrity, authenticity, and readability
• Decision support – Evaluating relevance and currency
Exercise 4: Information collections
• Consider the listed collections of information
• Which are records, archives and/or backups?
• Which hold information relevant to Privacy Act?
• Trace 2 of these information collections from
commissioning through to final disposal.
Thank you!

More Related Content

PPT
Critical Success Factors
PPT
Bippt
PPTX
MIS 2101 Diamond Peer Exam 2 Review
PPTX
3.7 developing ict solutions
PPT
Turban dss9e ch01
PPT
PDF
Decision support systems and business intelligence
Critical Success Factors
Bippt
MIS 2101 Diamond Peer Exam 2 Review
3.7 developing ict solutions
Turban dss9e ch01
Decision support systems and business intelligence

What's hot (19)

PPT
Ch01 A decision support system (DSS)
PDF
Data Privatisation, Data Anonymisation, Data Pseudonymisation and Differentia...
PPTX
Decision Support System - Management Information System
PPTX
Mis & Decision Making
PPT
Decision Support Systems
PPTX
BI and DSS
PDF
Future Proof Your DAM
PPTX
Decision support systems and business intelligence
PPTX
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
PPTX
Decision support system
PPT
Chapter 8 Mis Decision Support System
PPT
Thesis Concept Km
PPSX
Introduction to DSS
PPT
Mark Dean Notes
PPTX
Information and decision support system
PPT
Lecture9 Systems The Systems Perspective Of A Dss
PPSX
Eduserv Symposium 2013 - Threat or opportunity?
PPTX
Smart Data Module 2 d drive_own data
PPT
Slide01 introductory
Ch01 A decision support system (DSS)
Data Privatisation, Data Anonymisation, Data Pseudonymisation and Differentia...
Decision Support System - Management Information System
Mis & Decision Making
Decision Support Systems
BI and DSS
Future Proof Your DAM
Decision support systems and business intelligence
Auxilion - The Implications of Big Data on the Roadmap Towards Business Intel...
Decision support system
Chapter 8 Mis Decision Support System
Thesis Concept Km
Introduction to DSS
Mark Dean Notes
Information and decision support system
Lecture9 Systems The Systems Perspective Of A Dss
Eduserv Symposium 2013 - Threat or opportunity?
Smart Data Module 2 d drive_own data
Slide01 introductory
Ad

Similar to Designing fit for purpose data structures (20)

PPTX
Cost and Value analysis of digital data archiving
PPTX
15. Brian Bailey presentation 2 DQ Asia Pacific 2010
PPTX
Reflections On The Value Proposition For Information Quality
PPT
Lesson 6 value & importance of information
PPTX
BTEC National in ICT: Unit 3 - Data vs Information
PDF
University of Hertfordshire researcher development - research data management
PPTX
Workshop: Processes and practices for effective information governance
PDF
Data Cleaning
PPTX
Long-term storage – will it fill up with the good stuff, or the big, bad, an...
PPT
Effective Information Management V2 18sep2008
PDF
Information Management White Paper
PDF
Information Governance for Smarter Government Strategy and Solutions
PPTX
Bosman and Kramer Open Research: A 2024 NISO Training Series, Session Four: O...
PDF
Data warehousing change in a challenging environment
PPT
Databases
PPT
Databases
PPT
Grace Currie Ann Jebson First Things First
PPT
Data Warehouse techniques on Intermediate Census and Demographic Statistics W...
PPT
Using data effectively worskhop presentation
PPTX
Databases
Cost and Value analysis of digital data archiving
15. Brian Bailey presentation 2 DQ Asia Pacific 2010
Reflections On The Value Proposition For Information Quality
Lesson 6 value & importance of information
BTEC National in ICT: Unit 3 - Data vs Information
University of Hertfordshire researcher development - research data management
Workshop: Processes and practices for effective information governance
Data Cleaning
Long-term storage – will it fill up with the good stuff, or the big, bad, an...
Effective Information Management V2 18sep2008
Information Management White Paper
Information Governance for Smarter Government Strategy and Solutions
Bosman and Kramer Open Research: A 2024 NISO Training Series, Session Four: O...
Data warehousing change in a challenging environment
Databases
Databases
Grace Currie Ann Jebson First Things First
Data Warehouse techniques on Intermediate Census and Demographic Statistics W...
Using data effectively worskhop presentation
Databases
Ad

More from Stephen Bounds (10)

PPTX
G-Checks : A holistic strategic planning framework
PPTX
Relative Return on Investment
PPTX
Why trust is vital to success with Knowledge Management
PPTX
Introduction to G-CHECKS
PPTX
IT and IG - Friends or frenemies?
PPTX
knowquestion :: agile team management
PPTX
KnowQuestion :: Introduction to G-CHECKS
PPT
Impersonal trust in organisations
PPT
Practical KM Strategy And Tactics
PPT
Rethinking how your organisation collaborates
G-Checks : A holistic strategic planning framework
Relative Return on Investment
Why trust is vital to success with Knowledge Management
Introduction to G-CHECKS
IT and IG - Friends or frenemies?
knowquestion :: agile team management
KnowQuestion :: Introduction to G-CHECKS
Impersonal trust in organisations
Practical KM Strategy And Tactics
Rethinking how your organisation collaborates

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Electronic commerce courselecture one. Pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Machine learning based COVID-19 study performance prediction
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Encapsulation theory and applications.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPT
Teaching material agriculture food technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
KodekX | Application Modernization Development
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Diabetes mellitus diagnosis method based random forest with bat algorithm
Electronic commerce courselecture one. Pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Review of recent advances in non-invasive hemoglobin estimation
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Machine learning based COVID-19 study performance prediction
Chapter 3 Spatial Domain Image Processing.pdf
Empathic Computing: Creating Shared Understanding
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Encapsulation theory and applications.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Teaching material agriculture food technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
KodekX | Application Modernization Development
Encapsulation_ Review paper, used for researhc scholars
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...

Designing fit for purpose data structures

  • 1. Designing fit for purpose data structures Stephen Bounds Director and PrincipalConsultant
  • 2. About me • 20 years experience in IM, IT & KM • 5 years running KnowQuestion • 10 years as a Federal public servant • Last public service job Deputy CIO, Information Management at Australian Communications and Media Authority (ACMA)
  • 3. Today’s workshop • Tensions in data structure choices • Information value over time • Access / capture trade off • Process / practice layering • Recordkeeping and archiving
  • 5. The four tensions Access Usefulness Preservation Privacy Long term considerations Short term considerations
  • 6. How is information useful? For me For my team For my organisation n For the public public
  • 7. When is it useful? Now Next Later Eventually
  • 8. Psychological distance We think more concretely about things that are nearer to us, and more abstractly about things further away Temporal distance Whether an event is a long time in the future or the past Spatial distance Whether you are separated by a large geographic distance Social distance Whether you feel you know someone well or not Hypothetical distance Whether you think an event is likely to occur or not
  • 9. Relative information value 0 2 4 6 8 10 Me Team Organisation Everyone
  • 10. Perceived value over time 0 2 4 6 8 10 Now Soon Later Eventually
  • 11. Reused information over time 0 20 40 60 80 100 Now Soon Later Eventually %reused
  • 12. Value of reused information 0 2 4 6 8 10 Now Soon Later Eventually
  • 13. Access / capture costs Costs of … Less structured More structured Capture Lower Higher Access Higher Lower
  • 14. Impact of trade off Cost of capture Cost of access
  • 15. Exercise 1:Value assessment A social worker has to fill out a case notes record every time they visit a client. They normally take handwritten notes and then transcribe into the system back in the office. • Assess each field on the basis of: • Scope of information value • Change in information value over time • Capture / access trade offs being made
  • 16. Exercise 1:Value assessment • It is natural for people to discount the need for information that is of less direct value to them • How do you get buy-in when capturing information that is not directly relevant?
  • 17. Key design problem • Case work is uncertain • Results of any case hugely variable • Subject to wide range of unpredictable factors • With a large volume of cases, it is possible to establish if systems are producing better results more of the time • Propensity  inclination to behave in a particular way
  • 18. Value assessment No long term tracking tracking Pr Case cost ($) Cost ($) Minimal long term impact 0.5 $80 $0 Significant long term impact 0.45 $80 $2,000 Critical long term impact 0.05 $80 $25,000
  • 19. Value assessment Long term tracking Pr Case cost ($) Cost ($) Minimal long term impact 0.6 $130 $0 Significant long term impact 0.37 $130 $2,000 Critical long term impact 0.03 $130 $25,000
  • 20. Monte Carlo simulations • Monte Carlo simulations build a model of a world and use this to simulate outcomes. • Re-running the model multiple times shows both variability and expected (average) results. • Can approximate real-world results without a known algorithm, eg calculating area of a circle:
  • 22. Exercise 2:Value perception An updated case file format has been implemented which adds additional structured information. Specifically, the new form now references people by ID, and categorises referral type and incident type • Case officers are pushing back on the extra overhead. • What alternative options exist? • Simulate a relative cost/benefit of each pathway • How do we validate our assumptions?
  • 23. Exercise 2:Value perception Note about modelling • Assumptions are critical • Never hide these – key to informed consent
  • 24. Process-practice layering • All organisations do it, but often not consciously thought about • Practice – free-form execution that respects the expertise of individuals and teams • Process – structured set of repeatable steps • Process “touch points” used within practice to provide accountability and evidence
  • 28. Information serves different purposes Evidence = That decisions and actions were taken Rationale = Why decisions and actions were taken Type of record: objective, factual, explicit subjective, opinion, tacit
  • 29. Very broadly … More structured information Less structured information
  • 30. Common objection More structured information Less structured information Valuable BI
  • 31. Options to enrich data • Get someone else to add the structure • Contemporaneous • Post hoc • Automatic post processing (eg text to speech), keyword matching, semantic analysis • Inferences and trends based on client clustering • Last (worst) resort: get the expert to do it
  • 32. Exercise 3: Process / practice • Which parts of the case management workflow are process and/or practice? Why? • Where is data enrichment most necessary, and what is the business value? • How do you build a business case around changing the responsibility for data capture and enrichment?
  • 33. Recordkeeping and archiving Long term data storage Records Archives Backups
  • 34. 3 different, overlapping concepts Collection type Purpose Retention Records Evidence about a business process, both of actions and supporting decisions Kept until no longer needed Archives Information with intrinsic value outside of the business process that generated it Kept indefinitely Backups Redundant information that supports timely disaster recovery Kept as long as risk management requires it
  • 35. Remember… Not all records are in an archive Not all archives contain records
  • 36. Keep or destroy? • Archives Act 1983 has no proactive requirement to destroy – NAA approves retention rules for records • Privacy Act 1988 requires reasonable steps to destroy personal information once it is no longer required • Discussion point – • Why destroy information? • Why keep information?
  • 37. Information collection lifecycle Commissioning Creation Maintenance/ Migration Disposal/ Loss Transferral/ Duplication
  • 38. Archive management To unlock value of archive to users, key requirements are: • Identification – Persistent, actionable identifiers • Discoverability – Indexing, cataloguing, and access • Preservation – Integrity, authenticity, and readability • Decision support – Evaluating relevance and currency
  • 39. Exercise 4: Information collections • Consider the listed collections of information • Which are records, archives and/or backups? • Which hold information relevant to Privacy Act? • Trace 2 of these information collections from commissioning through to final disposal.