SlideShare a Scribd company logo
Free GIS and Interoperability GIS Open Source, interoperabilità e cultura del dato  nei SIAT della Pubblica Amministrazione [GIS Open Source, interoperability and the 'culture of data'   in the spatial data warehouses of the Public Administration] GFOSS'04 ITC-irst, 16 Nov 2004 (last revised 10 2005) M. Neteler neteler at itc it http://mpa.itc.it  ITC-irst, Povo (Trento), Italy
The need for Interoperability The problem nowadays data have to be exchanged across often very heterogeneous groups
the personal choice of application software/operating system should not affect the data exchange
data exchange standards are available
limited awareness  for the need of interoperability
limited implementation of interoperability in processes and software
commonly used file formats let to believe in interoperability: “false friends”
What are Standardization & Interoperability? Standardization versus Interoperability  Standardization:  Written/published document describing data formats, models etc. Example Office Standards: ASCII, HTML, XML, ... Example GIS Standards: GML, ISO 08211, ISO/IEC 15444-1, WMS etc. Only  published standards  are acceptable. Interoperability:  More than application of standardization, it also comprises the   interpretation of the standard  (sometimes definitions are incomplete)
Interoperability? The two dimensions of Interoperability  Longitudinal Interoperability:  time - long term storage Data shall be readable over time (years, decades, ...). This is of particular interest for data of public administration and long-term projects. Transversal Interoperability:  sharing data between users Data shall be readable across user communities, independent from software or operating system used (freedom of software choice). Again, this is of particular interest for data of public administration and long-term projects.
Part I: Office Interoperability
Example: MS-Word .DOC format Are WORD.doc files a suitable for data exchange? the format is undocumented, to some extend it was reverse-engineered -> does not support transversal interoperability
the format is regularly changed (Word 1, 2, 95, 97, NT, 2000, XP, ...   also named WinWORD 6, 8, 10,...) -> does not support longitudinal interoperability
Prone to MS-Windows macro viruses
severe security/privacy issues (example next slide) - DOC files contain sensitive information about user (unrelated   to the contents) - deleted text may still be legible outside of MS-Word -> contents cannot be completely verified
Example: MS-Word .DOC format -  security/privacy issues  Descrambling a WORD.doc file Your unique MS-Windows user ID (or similar):    PID_GUIDäAN{714738E3-FF4C-11D3-ZD7C-00E0281D67A7} This makes your (anonymous) document  traceable .
Sometimes delete text is still visible (think of re-using an existing WORD file) A famous example:  In February 2003, the British government of Tony Blair published a  dossier on  Iraq's security and intelligence organizations . This dossier was cited by  Colin Powell in his address to the United Nations the same month. Dr. Glen Rangwala, a lecturer in politics at Cambridge University, quickly  discovered that much of the material in the dossier was actually plagiarized from a U.S. researcher on Iraq. http://www.computerbytesman.com/privacy/blair.htm What you may find:
Descrambling a WORD.doc file: The British Iraq dossier 2003 1/2 http://nytimes.com Example: MS-Word .DOC format -  security/privacy issues
[neteler@dandre2 gfoss04]$  tr -d [:cntrl:] < blair.doc ÐÏࡱá>þÿz|þÿÿÿyÿ  [...] -xxxxí-o#o#{'?^,k6®äí-* RûuËÂG (É-$IRAQ  ITS INFRASTRUCTURE OF CONCEALMENT, DECEPTION AND INTIMIDATIONThis report draws upon a number of sources, including intelligence material, and shows how the Iraqi regime is constructed to have,  and to keep, WMD, and is now engaged in a campaign of obstruction of the  United Nations Weapons Inspectors. [...] [`azbhh§h»h?h-i/isjÿÿ cic22 JC:\DOCUME~1\ phamill \LOCALS~1\Temp\AutoRecovery save of Iraq - security.asd cic22 JC:\DOCUME~1\ phamill \LOCALS~1\Temp\AutoRecovery save of Iraq -  security.asd cic22 JC:\DOCUME~1\ phamill \LOCALS~1\Temp\AutoRecovery save of Iraq -  security.asd JPratt C:\TEMP\Iraq - security.doc JPratt A:\Iraq - security.doc ablackshaw!C:\ ABlackshaw \Iraq - security.docablackshaw#C:\ ABlackshaw \A;Iraq - security.doc ablackshaw A:\Iraq - security.doc MKhan C:\TEMP\Iraq - security.doc MKhan (C:\WINNT\Profiles\mkhan\Desktop\Iraq.docþÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ PjÿzXVÿ*uzLl_ÿbêzLl_ [...] jP@GTimes New Roman5SymbolG&ArialHelveticaA&Arial Narrow?&Arial Black&quot;qÐh_r&Òr&aõq#JV,?RVW,º!¥À??20døi?fÿÿCIraq- ITS INFRASTRUCTURE OF CONCEALMENT, DECEPTION AND INTIMIDATIONdefaultMKhanþÿàòùOh«+'³Ù0? ìø 4DPlx?¬?äDIraq- ITS INFRASTRUCTURE OF CONCEALMENT, DECEPTION AND INTIMIDATIONraqdefaultefaefaNormal.dotN MKhan .d4ha Microsoft Word 8.0 C@ÒIk@n)§ÈÂ@&quot;ZöfËÂ@døèuËÂ#JVþÿÕÍÕ [...] http://www.computerbytesman.com/privacy/blair.htm Weapons of mass destruction Descrambling a WORD.doc file: The British Iraq dossier 2003 2/2 Example: MS-Word .DOC format -  security/privacy issues
Example: MS-Excel .XLS format Are EXCEL.xls files a suitable for data exchange? the format is undocumented, to some extend it was reverse-engineered -> does not support transversal interoperability
the format is regularly changed (Excel 95, 97, NT, 2000, ...) -> does not support longitudinal interoperability
Prone to MS-Windows viruses
Limitation: max. 65535 lines in a table (2 16 )
Auto-conversion feature risky: Some fields/columns are automatically changed to date-time format (see example next slides) -> risk of accidental data damage high
Example: MS-Excel .XLS format – accidental data damage The “Human Genome Project” case 1/3 In 2004 scientists discovered that some gene names were being changed  inadvertently to non-gene names. Citation: “ A little detective work traced  the problem to default date format conversions and  floating-point format conversions  in the very useful  Excel  program package.  The date conversions affect  at least 30 gene names ; the floating-point conversions  affect at least 2,000 if Riken identifiers are included.  These conversions are  irreversible ; the original gene names cannot be recovered. A default date conversion feature in Excel (Microsoft Corp., Redmond, WA) was  altering gene names that it considered to look like dates . For example, the tumor  suppressor DEC1 [Deleted in Esophageal Cancer 1] [3] was being converted  to '1-DEC.' ” Cited after: B.R. Zeeberg, J. Riss, D.W. Kane, K.J. Bussey, E. Uchio, W.M. Linehan,  J.C. Barrett and J.N. Weinstein, BMC Bioinformatics 2004, 5:80 http://dx.doi.org/10.1186/1471-2105-5-80
The “Human Genome Project” case 2/3 Example: MS-Excel .XLS format – accidental data damage http://dx.doi.org/10.1186/1471-2105-5-80
The “Human Genome Project” case 3/3 Example: MS-Excel .XLS format – accidental data damage http://dx.doi.org/10.1186/1471-2105-5-80

More Related Content

PDF
GIS Standards and Interoperability
PDF
Going for GOLD - Adventures in Open Linked Metadata
PPTX
The Semantic Data Web, Sören Auer, University of Leipzig
PDF
A Survey on Heterogeneous Data Exchange using Xml
PPTX
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
PDF
Standardizing for Open Data
PDF
Wed roman tut_open_datapub
PPT
GIS Standards and Interoperability
Going for GOLD - Adventures in Open Linked Metadata
The Semantic Data Web, Sören Auer, University of Leipzig
A Survey on Heterogeneous Data Exchange using Xml
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Standardizing for Open Data
Wed roman tut_open_datapub

What's hot (20)

PPTX
LOD2 Webinar Series: 3rd relase of the Stack
PPT
LOD2 Webinar Series: D2R and Sparqlify
PPT
Semantics and Web 3.0
PPTX
Linked Open Data
PPTX
Linked services: Connecting services to the Web of Data
PDF
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
PPTX
Linked data life cycles
ODP
Lod2 review meeting
PPTX
What can linked data do for digital libraries
PDF
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
PDF
An introduction to Linked (Open) Data
PDF
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
PDF
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
PDF
LDOW2015 Position Talk and Discussion
PDF
Linked Open Data Principles, Technologies and Examples
PDF
Industry Ontologies: Case Studies in Creating and Extending Schema.org
LOD2 Webinar Series: 3rd relase of the Stack
LOD2 Webinar Series: D2R and Sparqlify
Semantics and Web 3.0
Linked Open Data
Linked services: Connecting services to the Web of Data
LOD2 Webinar Series Classification and Quality Analysis with DL Learner and ORE
Linked data life cycles
Lod2 review meeting
What can linked data do for digital libraries
Brief State of the Art - Semantic Web technologies for geospatial data - Mode...
An introduction to Linked (Open) Data
Nicoletta Fornara and Fabio Marfia | Modeling and Enforcing Access Control Ob...
LOD2 Plenary Vienna 2012: WP3 - Knowledge Base Creation, Enrichment and Repair
LDOW2015 Position Talk and Discussion
Linked Open Data Principles, Technologies and Examples
Industry Ontologies: Case Studies in Creating and Extending Schema.org
Ad

Viewers also liked (8)

PDF
Free and Open Source GIS
ODP
Free GIS Software meets zoonotic diseases: From raw data to ecological indica...
PPT
Free GIS Resources
PPT
Free open source gis
ODP
Setting the Table for Open Source GIS
PPT
Free GIS in the Classroom
KEY
GIS in Geography
PPT
Open Source GIS
Free and Open Source GIS
Free GIS Software meets zoonotic diseases: From raw data to ecological indica...
Free GIS Resources
Free open source gis
Setting the Table for Open Source GIS
Free GIS in the Classroom
GIS in Geography
Open Source GIS
Ad

Similar to The need of Interoperability in Office and GIS formats (20)

PDF
FOSS and Open Standards for Digital Sovereignty
PDF
Opw presentation of open office-lgma
PDF
Berlin 6 Open Access Conference: Christian Zier
PDF
Migrating to Free Software: a Reference Protocol for LibreOffce
ODP
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
PDF
HCII2014 presentation
ODP
Office OpenXML: a technical approach for OOo.
PPTX
Oceangraphic data formats
ODP
Open Standards0607
PDF
Advice to LibreOffice migration Example in Fukuoka, Japan
PPT
Officexml
PPTX
Data management principles
PPTX
Open standards and open data
PDF
LISA OASIS-feb2011
PDF
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
ODP
Reliable interoperation between OpenOffice & MS office by UOML
ODP
Reliable interoperation between OpenOffice & MS office by UOML
PPT
Free and Open Source Software technology: General Overview
PPT
Free and Open Source Software technology: General Overview
PDF
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
FOSS and Open Standards for Digital Sovereignty
Opw presentation of open office-lgma
Berlin 6 Open Access Conference: Christian Zier
Migrating to Free Software: a Reference Protocol for LibreOffce
A Technical Comparison: ISO/IEC 26300 vs Microsoft Office Open XML
HCII2014 presentation
Office OpenXML: a technical approach for OOo.
Oceangraphic data formats
Open Standards0607
Advice to LibreOffice migration Example in Fukuoka, Japan
Officexml
Data management principles
Open standards and open data
LISA OASIS-feb2011
OpenAIRE webinar: Principles of Research Data Management, with S. Venkatarama...
Reliable interoperation between OpenOffice & MS office by UOML
Reliable interoperation between OpenOffice & MS office by UOML
Free and Open Source Software technology: General Overview
Free and Open Source Software technology: General Overview
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?

More from Markus Neteler (17)

ODP
OSGeo ireland keynote 2017 by Markus Neteler
ODP
10 years of OSGeo - Happy Birthday
ODP
GRASS GIS 7 capabilities: a graphical overview
ODP
OSGeo: projects, incubation and infrastructure
PDF
Tracking emerging diseases from space: Geoinformatics for human health
PDF
Deriving environmental indicators from massive spatial time series using open...
PDF
GRASS GIS 7: your reliable geospatial number cruncher
ODP
Vom Laptop zum Großrechner: Neues in GRASS GIS 7
ODP
News in GRASS GIS7. Plenary talk at FOSS4G-CEE 2013, Romania
ODP
Scaling up globally: 30 years of FOSS4G development. Keynote at FOSS4G-CEE 20...
PDF
GRASS GIS lightening talk at FOSS4G 2006
PDF
GRASS GIS e Sextante
ODP
25 Years of GRASS GIS
PDF
From a niche to a global user community: Open Source GIS and OSGeo
ODP
GRASS and OSGeo: a framework for archeology
ODP
The GRASS GIS software (with QGIS) - GIS Seminar
ODP
Community based software development: The GRASS GIS project
OSGeo ireland keynote 2017 by Markus Neteler
10 years of OSGeo - Happy Birthday
GRASS GIS 7 capabilities: a graphical overview
OSGeo: projects, incubation and infrastructure
Tracking emerging diseases from space: Geoinformatics for human health
Deriving environmental indicators from massive spatial time series using open...
GRASS GIS 7: your reliable geospatial number cruncher
Vom Laptop zum Großrechner: Neues in GRASS GIS 7
News in GRASS GIS7. Plenary talk at FOSS4G-CEE 2013, Romania
Scaling up globally: 30 years of FOSS4G development. Keynote at FOSS4G-CEE 20...
GRASS GIS lightening talk at FOSS4G 2006
GRASS GIS e Sextante
25 Years of GRASS GIS
From a niche to a global user community: Open Source GIS and OSGeo
GRASS and OSGeo: a framework for archeology
The GRASS GIS software (with QGIS) - GIS Seminar
Community based software development: The GRASS GIS project

Recently uploaded (20)

PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Approach and Philosophy of On baking technology
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Big Data Technologies - Introduction.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Unlocking AI with Model Context Protocol (MCP)
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Approach and Philosophy of On baking technology
NewMind AI Weekly Chronicles - August'25 Week I
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Understanding_Digital_Forensics_Presentation.pptx
Encapsulation_ Review paper, used for researhc scholars
Diabetes mellitus diagnosis method based random forest with bat algorithm
The AUB Centre for AI in Media Proposal.docx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
cuic standard and advanced reporting.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Encapsulation theory and applications.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Advanced methodologies resolving dimensionality complications for autism neur...
MIND Revenue Release Quarter 2 2025 Press Release
Big Data Technologies - Introduction.pptx

The need of Interoperability in Office and GIS formats

  • 1. Free GIS and Interoperability GIS Open Source, interoperabilità e cultura del dato nei SIAT della Pubblica Amministrazione [GIS Open Source, interoperability and the 'culture of data' in the spatial data warehouses of the Public Administration] GFOSS'04 ITC-irst, 16 Nov 2004 (last revised 10 2005) M. Neteler neteler at itc it http://mpa.itc.it ITC-irst, Povo (Trento), Italy
  • 2. The need for Interoperability The problem nowadays data have to be exchanged across often very heterogeneous groups
  • 3. the personal choice of application software/operating system should not affect the data exchange
  • 4. data exchange standards are available
  • 5. limited awareness for the need of interoperability
  • 6. limited implementation of interoperability in processes and software
  • 7. commonly used file formats let to believe in interoperability: “false friends”
  • 8. What are Standardization & Interoperability? Standardization versus Interoperability Standardization: Written/published document describing data formats, models etc. Example Office Standards: ASCII, HTML, XML, ... Example GIS Standards: GML, ISO 08211, ISO/IEC 15444-1, WMS etc. Only published standards are acceptable. Interoperability: More than application of standardization, it also comprises the interpretation of the standard (sometimes definitions are incomplete)
  • 9. Interoperability? The two dimensions of Interoperability Longitudinal Interoperability: time - long term storage Data shall be readable over time (years, decades, ...). This is of particular interest for data of public administration and long-term projects. Transversal Interoperability: sharing data between users Data shall be readable across user communities, independent from software or operating system used (freedom of software choice). Again, this is of particular interest for data of public administration and long-term projects.
  • 10. Part I: Office Interoperability
  • 11. Example: MS-Word .DOC format Are WORD.doc files a suitable for data exchange? the format is undocumented, to some extend it was reverse-engineered -> does not support transversal interoperability
  • 12. the format is regularly changed (Word 1, 2, 95, 97, NT, 2000, XP, ... also named WinWORD 6, 8, 10,...) -> does not support longitudinal interoperability
  • 13. Prone to MS-Windows macro viruses
  • 14. severe security/privacy issues (example next slide) - DOC files contain sensitive information about user (unrelated to the contents) - deleted text may still be legible outside of MS-Word -> contents cannot be completely verified
  • 15. Example: MS-Word .DOC format - security/privacy issues Descrambling a WORD.doc file Your unique MS-Windows user ID (or similar): PID_GUIDäAN{714738E3-FF4C-11D3-ZD7C-00E0281D67A7} This makes your (anonymous) document traceable .
  • 16. Sometimes delete text is still visible (think of re-using an existing WORD file) A famous example: In February 2003, the British government of Tony Blair published a dossier on Iraq's security and intelligence organizations . This dossier was cited by Colin Powell in his address to the United Nations the same month. Dr. Glen Rangwala, a lecturer in politics at Cambridge University, quickly discovered that much of the material in the dossier was actually plagiarized from a U.S. researcher on Iraq. http://www.computerbytesman.com/privacy/blair.htm What you may find:
  • 17. Descrambling a WORD.doc file: The British Iraq dossier 2003 1/2 http://nytimes.com Example: MS-Word .DOC format - security/privacy issues
  • 18. [neteler@dandre2 gfoss04]$ tr -d [:cntrl:] < blair.doc ÐÏࡱá>þÿz|þÿÿÿyÿ [...] -xxxxí-o#o#{'?^,k6®äí-* RûuËÂG (É-$IRAQ ITS INFRASTRUCTURE OF CONCEALMENT, DECEPTION AND INTIMIDATIONThis report draws upon a number of sources, including intelligence material, and shows how the Iraqi regime is constructed to have, and to keep, WMD, and is now engaged in a campaign of obstruction of the United Nations Weapons Inspectors. [...] [`azbhh§h»h?h-i/isjÿÿ cic22 JC:\DOCUME~1\ phamill \LOCALS~1\Temp\AutoRecovery save of Iraq - security.asd cic22 JC:\DOCUME~1\ phamill \LOCALS~1\Temp\AutoRecovery save of Iraq - security.asd cic22 JC:\DOCUME~1\ phamill \LOCALS~1\Temp\AutoRecovery save of Iraq - security.asd JPratt C:\TEMP\Iraq - security.doc JPratt A:\Iraq - security.doc ablackshaw!C:\ ABlackshaw \Iraq - security.docablackshaw#C:\ ABlackshaw \A;Iraq - security.doc ablackshaw A:\Iraq - security.doc MKhan C:\TEMP\Iraq - security.doc MKhan (C:\WINNT\Profiles\mkhan\Desktop\Iraq.docþÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿÿ PjÿzXVÿ*uzLl_ÿbêzLl_ [...] jP@GTimes New Roman5SymbolG&ArialHelveticaA&Arial Narrow?&Arial Black&quot;qÐh_r&Òr&aõq#JV,?RVW,º!¥À??20døi?fÿÿCIraq- ITS INFRASTRUCTURE OF CONCEALMENT, DECEPTION AND INTIMIDATIONdefaultMKhanþÿàòùOh«+'³Ù0? ìø 4DPlx?¬?äDIraq- ITS INFRASTRUCTURE OF CONCEALMENT, DECEPTION AND INTIMIDATIONraqdefaultefaefaNormal.dotN MKhan .d4ha Microsoft Word 8.0 C@ÒIk@n)§ÈÂ@&quot;ZöfËÂ@døèuËÂ#JVþÿÕÍÕ [...] http://www.computerbytesman.com/privacy/blair.htm Weapons of mass destruction Descrambling a WORD.doc file: The British Iraq dossier 2003 2/2 Example: MS-Word .DOC format - security/privacy issues
  • 19. Example: MS-Excel .XLS format Are EXCEL.xls files a suitable for data exchange? the format is undocumented, to some extend it was reverse-engineered -> does not support transversal interoperability
  • 20. the format is regularly changed (Excel 95, 97, NT, 2000, ...) -> does not support longitudinal interoperability
  • 22. Limitation: max. 65535 lines in a table (2 16 )
  • 23. Auto-conversion feature risky: Some fields/columns are automatically changed to date-time format (see example next slides) -> risk of accidental data damage high
  • 24. Example: MS-Excel .XLS format – accidental data damage The “Human Genome Project” case 1/3 In 2004 scientists discovered that some gene names were being changed inadvertently to non-gene names. Citation: “ A little detective work traced the problem to default date format conversions and floating-point format conversions in the very useful Excel program package. The date conversions affect at least 30 gene names ; the floating-point conversions affect at least 2,000 if Riken identifiers are included. These conversions are irreversible ; the original gene names cannot be recovered. A default date conversion feature in Excel (Microsoft Corp., Redmond, WA) was altering gene names that it considered to look like dates . For example, the tumor suppressor DEC1 [Deleted in Esophageal Cancer 1] [3] was being converted to '1-DEC.' ” Cited after: B.R. Zeeberg, J. Riss, D.W. Kane, K.J. Bussey, E. Uchio, W.M. Linehan, J.C. Barrett and J.N. Weinstein, BMC Bioinformatics 2004, 5:80 http://dx.doi.org/10.1186/1471-2105-5-80
  • 25. The “Human Genome Project” case 2/3 Example: MS-Excel .XLS format – accidental data damage http://dx.doi.org/10.1186/1471-2105-5-80
  • 26. The “Human Genome Project” case 3/3 Example: MS-Excel .XLS format – accidental data damage http://dx.doi.org/10.1186/1471-2105-5-80
  • 27. Suggestions for “Office” data interoperability Text files: ASCII, HTML, RTF, XML, Latex Postscript/PDF for read-only documents
  • 28. Tables: CSV, xBase (dBase), XML
  • 31. Suggestions for “Office” data interoperability Automated conversion tools can be used to provide all formats Text files: ASCII, HTML, RTF, XML Postscript/PDF
  • 32. Tables: CSV, xBase (dBase), XML
  • 34. Bibliography: BibTex Converters (examples): OpenOffice.org [1]
  • 39. Bibtex2html [5], (Endnote) [1] http://OpenOffice.org itself uses XML as own standard format [2] http://wvware.sourceforge.net/ [3] http://www.klaban.torun.pl/prog/pg2xbase/ [4] http://www.scripps.edu/~cdputnam/software/bibutils/bibutils.html [5] http://www.lri.fr/~filliatr/bibtex2html/
  • 40. OASIS: “Office” data interoperability Promotion of Open Document Exchange Format Proposed and implemented new open standard format: OASIS OpenDocument XML format
  • 41. The OASIS OpenDocument format [1] is a vendor and implementation independent file format which guarantees freedom and independence
  • 42. E.g., OpenOffice.org uses OASIS as default format from version 2.0 onwards as well as KOffice , StarOffice software and other vendors The OASIS OpenDocument file format is one of the file formats recommended by the European Commision [2] [1] http://www.oasis-open.org/committees/tc_home.php?wg_abbrev=office [2] http://europa.eu.int/idabc/en/document/3439
  • 43.  
  • 44. GIS Standards and Organizations GIS data sets are more than geometry: Metadata - geographic reference - colors, display attributes etc - history of data modifications 1990 1992 2004 1994 1997 http://www.opengeospatial.org
  • 45. GIS Interoperability: GDAL and OGR libraries Data abstraction GDAL http://www.gdal.org Abstraction layer ENVI GeoTIFF SAR GRASS ECW HDF4 JPEG2000 MrSID ArcGRID Metadata - Number of bands - Color table - ... - Coordinate system - Projection 40 Frmts EPSG Codes PROJ.4
  • 46. GIS Interoperability: GDAL and OGR libraries Data abstraction OGR http://www.gdal.org/ogr/ Metadata - Coordinate system - Projection Abstraction layer EPSG Codes ArcCover MITAB Oracle SHAPE PostGIS Geodatabase DGN 20 Frmts
  • 47. GIS Data formats and support question GDAL Development: Raster formats Direct fundings: - Atlantis (ENVISAT, MFF, HKV Blobs) - eCognition Germany (FUJI BAS Format) - Los Alamos Nat. Labs (FITS) - OPeNDAP Inc. (OPeNDAP/DODS) - PeopleSoft ( ERDAS LAN ) - Safe Software (USGS SDTS, ISO8211 support) - Yukon Department of Environment (USGS DEM) Public formats/Open documents/Reverse engineered - ERDAS Imagine ( IMG ) - ERMAPPER ( ECW ) - ESRI formats ( ArcGrid ) - GDAL Virtual Format - JasPer ( JPEG2000 ); Kakadu (GeoJP2 interface for JPEG2000 = ISO/IEC 15444-1) - LizardTech ( MrSID , JPEG2000 ) - NOAA (AVHRR data)
  • 48. GIS Data formats and support question OGR Development: Vector formats Direct fundings: - DM Solutions Group and GoMOOS ( SQLite RDBMS, Comma Sep. Values CSV ) - OPeNDAP Inc. (OPeNDAP/DODS) - Safe Software (FMEObjects) - SRC, LLC ( Oracle Spatial ) Public formats/Open documents/Reverse engineered - ESRI ( SHAPE , ArcCoverage ) - GML - IHO S-57 - MapInfo ( TAB and MIF/MID ) - Microsoft ( ODBC OGR) - Microstation ( DGN ) - MySQL (non-spatial data) OGR - OGDI Vectors (VMAP) - OGR Virtual Format - PostgreSQL/PostGIS - SDTS - UK Ordnance Survey (NTF) - U.S. Census (TIGER)
  • 49. GIS formats Why so many formats? No big problem! Application specific requirements, which partially contradict each other high compression rate
  • 50. small runtime storage requirements
  • 53. easy access to pixels
  • 55. Hardware-/CPU-independence “Good software” can handle numerous formats.
  • 56. Software patents and rights of third parties: future traps ?!
  • 57. GIS formats and Software Patents How software patents affect GIS users LZW (Lempel Ziv Welch) Compression Used in many raster formats (e.g. GIF)
  • 58. Integrated into GRASS before it became patent, later replaced by Zlib Deflate
  • 59. Unisys started to charge for usage after waiting some years MrSID (Multi-resolution Seamless Image Database) wavelet based image file format
  • 60. three patents covering both the image compression and on the fly image decompression technology
  • 61. GDAL support MrSID but requires MrSID SDK license ECW (ERMAPPER Compressed Wavelets) Patent pending
  • 62. GPL released source code available (of patented code?) JPEG 2000 Situation not very clear
  • 63. Summary The personal choice of application software/operating system should not affect the data exchange longitudinal and transversal interoperability must be granted
  • 64. Only documented formats may be used
  • 65. There is no excuse: start to use interoperable formats today
  • 66. GIS interoperability is at a better state than Office documents interoperability
  • 67. Interoperability awareness needs to be promoted : today and in future
  • 68. License of this document Document home: http://mpa.itc.it/gfoss04/neteler_gfoss04_interoperability2005.pdf This work is licensed under a Creative Commons License. http://creativecommons.org/licenses/by-sa/2.0/deed.en “ Free GIS and Interoperability”, © 2004-2005 Markus Neteler [ OpenOffice SXI file available upon request: neteler at itc it neteler at osgeo org ] License details: Attribution-ShareAlike 2.0 You are free: to copy, distribute, display, and perform the work
  • 70. to make commercial use of the work
  • 71. Under the following conditions: Attribution. You must give the original author credit. Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under a license identical to this one. For any reuse or distribution, you must make clear to others the license terms of this work. Any of these conditions can be waived if you get permission from the copyright holder. Your fair use and other rights are in no way affected by the above.