1 of 48
Document Engineering
STC-PMC 2007
17 March 2007
Robert J. Glushko
glushko@ischool.berkeley.edu
2 of 48
Plan for Today's Lecture
Document Engineering in the News
The Data vs Document Divide
Documents and Processes -- Yin and Yang
Design Patterns in Document Engineering
Synthesizing Four Modeling Approaches
3 of 48
Who Is This Guy?
Adjunct Professor at UC Berkeley "Information School" since 2002
(www.sims.berkeley.edu/~glushko/)
Came to Berkeley from Silicon Valley; founded or co-founded three
companies in the 1990s
Hypertext Engineering
Passage Systems
Veo -> Commerce One
4 of 48
What Is Document Engineering?
A methodology for specifying, designing, and deploying the information
models and repositories that enable document-centric applications
A synthesis of information and systems analysis, business process
modeling, electronic publishing, and service-oriented architecture
It has much in common with Information Architecture, but extends its
scope beyond web site and web application design
5 of 48
Intel, Wal-Mart and others Push Electronic
Health Records
6 of 48
Tsunami Aid Delayed by Incomplete
Shipping & Customs Documents
7 of 48
Salesforce.com Connects its Front End to
Back Ends
8 of 48
Global Shippers Give Customers Real-Time
Cargo Info
9 of 48
FedEx Kinko's Announces Web-based
Printing, Tracking Services
10 of 48
Law Firm Business Models Disrupted by
Document Assembly Software
11 of 48
The Common Themes in These News Items
Enormous opportunities for transforming legacy documents and
processes
New business processes are created / coordinated / choreographed
via the management and exchange of electronic documents
Standards / patterns for documents and business processes are
essential
Information technology and business processes are co-evolving
12 of 48
Is THIS a Typical Document?
13 of 48
Or is THIS a Typical Document?
14 of 48
Or Maybe THIS is a Typical Document?
15 of 48
Contrasting Methodologies for Documents
and Data
Documents and data have been studied by two different disciplines,
with methods of analysis that have had little intersection
Document-centric analysis
Data-centric analysis
16 of 48
Document Analysis
Documents are Artifacts or Renditions that combine content, structure
and appearance
The goal of document analysis is a model of a document's content and
structure that is separate from its presentational characteristics
This model of the document and those in its equivalence class is called
a markup language or document schema or document type
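To make the separation of content and structure from presentation concrete, here is a minimal sketch in Python. The `Memo` type and the two render functions are hypothetical names chosen for illustration; they are not from the talk. One model carries only content and structure, and each rendition adds its own appearance.

```python
from dataclasses import dataclass
from html import escape
from typing import Tuple


@dataclass(frozen=True)
class Memo:
    """Content and structure only: no fonts, colors, or layout."""
    sender: str
    recipient: str
    subject: str
    paragraphs: Tuple[str, ...]


def render_text(memo: Memo) -> str:
    """One rendition of the model: plain text."""
    header = f"From: {memo.sender}\nTo: {memo.recipient}\nSubject: {memo.subject}"
    return header + "\n\n" + "\n\n".join(memo.paragraphs)


def render_html(memo: Memo) -> str:
    """Another rendition of the same model: HTML."""
    body = "".join(f"<p>{escape(p)}</p>" for p in memo.paragraphs)
    return f"<h1>{escape(memo.subject)}</h1>{body}"


memo = Memo("Alice", "Bob", "Q&A schedule", ("Room 5 at noon.",))
print(render_text(memo))
print(render_html(memo))
```

Both renditions belong to the same equivalence class because they share the `Memo` model; only the presentation differs.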
17 of 48
Data-Centric Analysis
The goal is to understand and describe the properties of, and
relationships between, information components or objects.
This understanding is represented in conceptual models that organize
the components efficiently to support a broad range of contexts or
applications.
The conceptual model is also typically called a schema, but this is
generally meant to be a "database schema" rather than a "document
schema"
18 of 48
The Data vs. Documents Divide
19 of 48
But A Catalog Is Data (Document)
20 of 48
And a Reference Book is Document (Data)
21 of 48
So it's a Continuum: The Document Type
Spectrum
22 of 48
Can There Be A Bridge?
23 of 48
Spanning the Data/Document Divide
Document Engineering harmonizes the terminology of document analysis
and data analysis and emphasizes what the two have in common rather
than highlighting their differences
Identifying the presentational, content, and structural components and
defining their relationships to each other
Identifying "good" content components
Designing, describing, and organizing components to facilitate their
reuse
Assembling hierarchical document models that organize components
according to the requirements of a specific context for information
exchange
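The idea of reusable components assembled into context-specific document models can be sketched as follows. All type names here (`Address`, `Party`, `LineItem`, `PurchaseOrder`, `Invoice`) are illustrative assumptions, not models taken from the talk or the book.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Address:
    """A reusable content component."""
    street: str
    city: str
    country: str


@dataclass
class Party:
    """A reusable component built from another component."""
    name: str
    address: Address


@dataclass
class LineItem:
    sku: str
    quantity: int
    unit_price: float


@dataclass
class PurchaseOrder:
    """Document model assembled for the ordering context."""
    buyer: Party
    seller: Party
    items: List[LineItem] = field(default_factory=list)


@dataclass
class Invoice:
    """A different context that reuses the same components."""
    seller: Party
    buyer: Party
    items: List[LineItem] = field(default_factory=list)

    def total(self) -> float:
        return sum(i.quantity * i.unit_price for i in self.items)


acme = Party("Acme Books", Address("1 Main St", "Berkeley", "US"))
reader = Party("Pat Reader", Address("9 Elm St", "Philadelphia", "US"))
items = [LineItem("BOOK-1", 2, 10.0), LineItem("BOOK-2", 1, 5.5)]

order = PurchaseOrder(buyer=reader, seller=acme, items=items)
invoice = Invoice(seller=acme, buyer=reader, items=items)
print(invoice.total())
```

The two document models differ only in how they arrange the shared components, which is what makes the components worth designing carefully.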
24 of 48
And Business Processes Involve The Entire
Spectrum of Documents
25 of 48
How Should We Understand the
Relationships Between Documents and
Processes?
Business activities involve documents and the processes that produce
and consume them
By understanding the information in the documents, we learn what
kinds of processes are possible
By understanding the processes, we learn what kinds of information
are needed
26 of 48
A Process-Centric Depiction
27 of 48
A Document-Centric Depiction
28 of 48
So How Can We Understand
Documents/Data/Processes In a Systematic
Way?
A focus on processes progressively refines a broadly scoped
description of business activities
"Making it all work" from a business perspective
Inherently a "top down" approach
A focus on documents and data emphasizes the information objects
and flows in a domain and the requirements for implementing a system
in it:
"Making it all work" from a technical perspective
Inherently a "bottom up" approach
29 of 48
Documents and Processes -- Yin and Yang
30 of 48
Design Patterns in Document Engineering
The essence of Document Engineering is its systematic approach for
discovering and exploiting the relationships between patterns of
different types
Business model or organizational patterns
Business process patterns
Business information patterns
31 of 48
Document Exchange Patterns
Businesses have long dealt with each other by exchanging documents
We use concepts like "supply chains" and "distribution channels" as
metaphors for the coordinated or choreographed flow of information
and materials/products between businesses
These are complex patterns composed from the document exchange
pattern
The processes that comprise these patterns are "glued together" by
overlapping information components in the documents
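A rough way to see the "glue" is to list the information components in each document and intersect them. The document names and component names below are assumptions made for illustration only.

```python
# Each document is described by the set of information components it carries.
# These component lists are hypothetical, not taken from a real standard.
DOCUMENTS = {
    "order": {"buyer", "seller", "line_items", "ship_to"},
    "ship_notice": {"seller", "ship_to", "line_items", "carrier", "tracking_id"},
    "invoice": {"buyer", "seller", "line_items", "amount_due"},
}


def glue(first: str, second: str) -> list:
    """Return the components two documents share, in sorted order."""
    return sorted(DOCUMENTS[first] & DOCUMENTS[second])


print(glue("order", "ship_notice"))
print(glue("order", "invoice"))
```

Where the intersection is empty, nothing in one document lets a process locate or validate the other, so the two exchanges cannot be chained.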
32 of 48
Document Exchange (Physical Model)
33 of 48
Document Exchange (Conceptual Model)
34 of 48
The Drop Shipment Pattern
Customer selects a book from an online bookstore
Customer pays with credit card
Book arrives via express shipper two days later
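Behind those three customer-visible steps, a drop shipment is a sequence of document exchanges among several parties. The sketch below is an assumption-laden illustration: the participant names and document names are hypothetical and do not reproduce the figure from the talk.

```python
from typing import List, NamedTuple


class Exchange(NamedTuple):
    sender: str
    receiver: str
    document: str


# A hypothetical choreography for the online bookstore example.
DROP_SHIPMENT: List[Exchange] = [
    Exchange("customer", "bookstore", "order"),
    Exchange("bookstore", "card_processor", "payment_authorization_request"),
    Exchange("card_processor", "bookstore", "payment_authorization"),
    Exchange("bookstore", "distributor", "drop_ship_order"),
    Exchange("distributor", "shipper", "shipping_request"),
    Exchange("shipper", "customer", "delivery_notice"),
]


def participants(exchanges: List[Exchange]) -> List[str]:
    """List each party once, in order of first appearance."""
    seen: List[str] = []
    for ex in exchanges:
        for party in (ex.sender, ex.receiver):
            if party not in seen:
                seen.append(party)
    return seen


def visible_to(party: str, exchanges: List[Exchange]) -> List[str]:
    """Documents that a given party sends or receives."""
    return [ex.document for ex in exchanges if party in (ex.sender, ex.receiver)]


print(participants(DROP_SHIPMENT))
print(visible_to("customer", DROP_SHIPMENT))
```

The customer takes part in only two of the six exchanges, which is why the transaction looks simple from the outside.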
35 of 48
The Virtual Store as Choreographed
Document Exchanges
36 of 48
Overlapping Information Models in the
Virtual Store
37 of 48
Patterns in the "Model Matrix"
38 of 48
"Meeting in the Middle"
To "bridge the gap between strategy and implementation" we need
models that "meet in the middle"
Reaching the middle from the top down ensures that a business model
is feasible
Reaching the middle from the bottom up ensures that we are designing
and optimizing the activities that add the most value
39 of 48
"Meeting in the Middle" -- 4 Modeling
Approaches
40 of 48
The Document Engineering Approach
41 of 48
So Document Engineering Isn't About XML
XML is a useful technology for Document Engineering, but using XML
doesn't make you a document engineer
The best thing about XML is the ease with which you can create a new
vocabulary for a particular type of document
XML is just the syntax in which we encode document models... what
really matters is how we modeled the documents
42 of 48
Creating Models is Easy, But Creating
GOOD Models is Hard
The worst thing about XML is the same as the best thing – the ease
with which you can create a new vocabulary
No way around the classical problems of classification and naming we
know from philosophy, linguistics, cognitive psychology, and
information science
XML is NOT "self-describing"
There are often multiple vocabularies for the same or related domains
and especially for the common information models that are used in
more than one domain
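A small sketch of why XML is not "self-describing": two fragments can carry the same information under different tag names, and a parser sees only the names. The two vocabularies and the `TO_COMMON` mapping below are invented for illustration; the parsing uses Python's standard `xml.etree.ElementTree`.

```python
import xml.etree.ElementTree as ET

# Two hypothetical vocabularies for the same information.
DOC_A = "<Order><BuyerName>Acme</BuyerName><Total>25.50</Total></Order>"
DOC_B = "<PO><Customer>Acme</Customer><Amount>25.50</Amount></PO>"

# The shared meaning lives in a mapping that people must agree on;
# nothing in the XML syntax supplies it.
TO_COMMON = {
    "BuyerName": "buyer",
    "Customer": "buyer",
    "Total": "amount",
    "Amount": "amount",
}


def tags(xml_text: str) -> list:
    """The element names a parser sees, with no meaning attached."""
    return [child.tag for child in ET.fromstring(xml_text)]


def to_common(xml_text: str) -> dict:
    """Translate a document into the shared conceptual model."""
    return {TO_COMMON[child.tag]: child.text for child in ET.fromstring(xml_text)}


print(tags(DOC_A), tags(DOC_B))
print(to_common(DOC_A) == to_common(DOC_B))
```

The documents only become comparable once the mapping exists, and building that mapping is the modeling work the slide refers to.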
43 of 48
A Checklist for Describing Projects and
Case Studies
D -- data types and document types
O -- organizational processes
C -- context (types of products or services, industry, geography,
regulatory considerations)
U -- user types and special user requirements
M -- models, patterns, or standards that apply
E -- enterprises and ecosystems (e.g., trading communities, standards
bodies)
N -- the needs (business case) driving the enterprise(s)
T -- technology constraints and opportunities
44 of 48
D-O-C-U-M-E-N-T in the Document
Engineering Approach
45 of 48
Summary: Document Engineering's Big
Ideas
"Document Engineering" is evolving as a synthesis of information and
systems analysis, business process modeling, electronic publishing,
and service-oriented architecture
Best practices in Document Engineering require the reuse of
information and process patterns
Business activity always involves both "narrative" and "transactional"
documents – so analysis and design methods must span this
"Document Type Spectrum"
46 of 48
Document Engineering - The Book
47 of 48
Acknowledgments
Much of this material comes from a book called Document
Engineering: Modeling for Business Informatics and Web Services by
Robert Glushko & Tim McGrath
Three years of students at the University of California, Berkeley have
contributed to its development through courses and research projects
with the first author
The methodology has been significantly refined through its use by the
library content team of the Universal Business Language initiative, led
by the second author
48 of 48
But Wait, There's More
I'm easy to find: just "google" Glushko and you find me and the rocket
guy
From my home page you can locate the syllabus and lecture notes for
the "Document Engineering and Information Architecture" course at UC
Berkeley
You can also find my "Doc or Die" blog where I occasionally post about
"Document Engineering in the News" or "Semantics in the Wild" stories
THANKS FOR INVITING ME

