SlideShare a Scribd company logo
Developing a statistical methodology to improve classification and mapping of seabed type from deep water Multi-Beam Echo Sounder (MBES) data   Helen Caughey, Kazi Ishtiak Ahmed, Paul Harris,  Peter Hung, Urška Demšar, Sean McLoone,  A Stewart Fotheringham, Xavier Monteys, Ronan O’Toole Presented by: Helen Caughey National Centre for Geocomputation  National University of Ireland, Maynooth helen.m.caughey @ nuim.ie GISRUK 2010, University College London, 15th April 2010
Overview of Work Collaboration between StratAG and  G eological  S urvey of  I reland/ M arine  I nstitute/INFOMAR programme. Focuses on seabed type mapping from data acquired with a  M ulti- B eam  E cho  S ounder (MBES). The work consists of 2 main phases; PHASE 1 ; the use of spatial and statistical analysis to decide which datasets can be integrated and how. PHASE 2 ; consists of finding the best automatic classification methodology for integrated data and evaluation of classification quality against ground truth and/or existing seabed maps.
MBES Working Principles MBES raw data – backscatter measurements Image compensation to remove effects of sonar angle and range 132 Full Feature Vectors (FFVs, statistical descriptors) for each patch Interpolated acoustic image (pixels) Interpolated acoustic image (patches) Data acquired on receiver Narrow beam width Wider swath Both Bathymetry and backscatter Port Starboard
Data used was collected over 3 years (2000, 2001 & 2002) and at 3 different pulse lengths (2000, 5000 & 15000 ms). Systems recalibrated at the beginning of each survey season. Each survey yielded a number of backscatter features, five of which we have extracted;  Q  (a quantile measure),  P  (the ‘pace’ textural feature),  C  (a ‘contrast’ feature),  M  (the mean) and  S  (the standard deviation). Bathymetry data is also available for the same zone 3 area.
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Removal of pulse length 2000 from analysis
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Identification of spatially overlapping data subsets Removal of pulse length 2000 from analysis 6 overlapping, spatially diverse, areas identified from PL 5000 & 15000 (each overlap also has multiple sub areas)
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Identification of spatially overlapping data subsets Removal of pulse length 2000 from analysis 6 overlapping, spatially diverse, areas identified from PL 5000 & 15000 (each overlap also has multiple sub areas) “ Pseudo Pairwise” overlapping data subsets were then identified and refined automatically in the R statistical computing environment
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Summary Statistics and distributions
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Summary Statistics and distributions Classical hypothesis testing (F,  t  and K-S tests) Differences in Spatial Autocorrelation; investigated using estimated variograms  ( M ethods  o f  M oments Variograms)   and modelled variograms  ( Re stricted  M aximum  L ikelihood Variograms)  Correlation and error analyses; Scatterplots & weighted error diagnostics, Weighted correlations, Robust & weighted correlations
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Unlikely these datasets should be joined; OL4 OL5 OL7 OL8 Likely that these datasets could be joined; OL6 OL9
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Focused scale investigations
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
Outcomes & Recommendations (Phase 1) OUTCOMES FOR PHASE 2 Based on these statistical tests and results we would suggest that the datasets  should not  be joined together. This does not however rule out joining the dataset if they are rescaled based on the knowledge we have gained about the datasets and their relationships. If they are rescaled then we can incorporate a measure of the uncertainty into the final classification.  RECOMMENDATIONS TO GSI/MI/INFOMAR In the past they have joined datasets assuming that there is no variance between them.  These tests have shown this is not the case and therefore we have recommended that datasets not be joined together without a method of validation prior to joining. More care needs to be taken in the recalibration of the survey systems each year. More emphasis needs to be placed on some level of overlap from one year to next in order to have the necessary data to validate any joins.
Phase 2: Data classification processes Proceed with classification attempts without joining the data. PCA and  K  means as preliminary approach. Other neural networks and quality threshold clustering. Next step – Data Classification
Thank you! Questions? Acknowledgements: Contacts: [email_address] ncg.nuim.ie www.stratag.ie

More Related Content

PDF
test
PDF
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
PPSX
7 srtm paleochannels_aeromagnetics data
PPT
Understanding Map Integration Using GIS Software Poster_ff
PPTX
Lexie.IGARSS11.v3.pptx
PDF
Active learning algorithms in seismic facies classification
PDF
A comparison of classification techniques for seismic facies recognition
DOCX
GIS-resume
test
GIS in Public Health Research: Understanding Spatial Analysis and Interpretin...
7 srtm paleochannels_aeromagnetics data
Understanding Map Integration Using GIS Software Poster_ff
Lexie.IGARSS11.v3.pptx
Active learning algorithms in seismic facies classification
A comparison of classification techniques for seismic facies recognition
GIS-resume

What's hot (19)

PPTX
Geographical information system
PDF
Dn33686693
PDF
FUTURE TRENDS OF SEISMIC ANALYSIS
PPT
Design Process Using Hierarchical Spatial Reasoning Theory And Gis
PDF
Poster RDAP13: Provenance of Figures in the Global Change Information System
PPTX
Presentation adv gis 08 01-2014
PDF
The latest trends in academic research on GIS
PDF
BREWSTER_K_poster
PDF
Graham_project_2013 VBS
PDF
Risk governance for traffic accidents by Geostatistical Analyst methods
PPTX
Hong Liu CanWEA presentation
PDF
Radar and optical remote sensing data evaluation and fusion; a case study for...
PDF
Novel Framework for Navigation using Enhanced Fuzzy Approach with Sliding Mod...
PPTX
Beyond good enough? Spatial Data Quality and OpenStreetMap data
PDF
A LINEAR MODELING APPROACH ON LTGP SIGNALS AND SEISMIC ACTIVITY
PPTX
PDF
B00624300_AlfredoConetta_EGM716_MAUP_Projectc
PDF
Space Tug Rendezvous
PDF
Weather factor landslide hazard
Geographical information system
Dn33686693
FUTURE TRENDS OF SEISMIC ANALYSIS
Design Process Using Hierarchical Spatial Reasoning Theory And Gis
Poster RDAP13: Provenance of Figures in the Global Change Information System
Presentation adv gis 08 01-2014
The latest trends in academic research on GIS
BREWSTER_K_poster
Graham_project_2013 VBS
Risk governance for traffic accidents by Geostatistical Analyst methods
Hong Liu CanWEA presentation
Radar and optical remote sensing data evaluation and fusion; a case study for...
Novel Framework for Navigation using Enhanced Fuzzy Approach with Sliding Mod...
Beyond good enough? Spatial Data Quality and OpenStreetMap data
A LINEAR MODELING APPROACH ON LTGP SIGNALS AND SEISMIC ACTIVITY
B00624300_AlfredoConetta_EGM716_MAUP_Projectc
Space Tug Rendezvous
Weather factor landslide hazard
Ad

Viewers also liked (9)

PDF
co:op-READ-Convention Marburg - Annegret Wenz-Haubfleisch
PDF
Simon Barchard, RESON
PDF
MBES_ Taicheng Report
PDF
2013 PLSC Track, Hydrographic Surveying and GIS by Carlos Caceres
PDF
U.S. Law of the Sea Cruise to Map and Sample the US Arctic Ocean Margin
PPTX
Vocabulary semana 2
PPT
Electronics aids to navigation
PPSX
Hydrographic surveying
PDF
eisan_tr_final_submission
co:op-READ-Convention Marburg - Annegret Wenz-Haubfleisch
Simon Barchard, RESON
MBES_ Taicheng Report
2013 PLSC Track, Hydrographic Surveying and GIS by Carlos Caceres
U.S. Law of the Sea Cruise to Map and Sample the US Arctic Ocean Margin
Vocabulary semana 2
Electronics aids to navigation
Hydrographic surveying
eisan_tr_final_submission
Ad

Similar to 5A_ 2_Developing a statistical methodology to improve classification and mapping of seabed type from deep water multi-beam echo sounder data (20)

PDF
Spatial data analysis
PDF
Geostatistical based Digital Soil Mapping
PDF
Spatial data analysis 1
PDF
Predictive geospatial analytics using principal component regression
PDF
A framework for outlier detection in
PDF
Traffic Outlier Detection by Density-Based Bounded Local Outlier Factors
PDF
Time series forecasting of solid waste generation in arusha city tanzania
PPTX
Uses of GIS in Survey Data Collection
PDF
C054
PDF
Modelbased Geostatistics Springer Series In Statistics 1st Edition Peter J Di...
PDF
On Data Quality Assurance and Conflation Entanglement in Crowdsourcing for En...
PDF
Paper 5094-43_5
PPTX
Sample of slides for Statistics for Geography and Environmental Science
PDF
Classification accuracy analyses using Shannon’s Entropy
PPTX
Statistics for Geography and Environmental Science: an introductory lecture c...
PDF
Drsp dimension reduction for similarity matching and pruning of time series ...
PDF
AGILE_FinalDay_RobinFrew
PPT
Individual movements and geographical data mining. Clustering algorithms for ...
PDF
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
PDF
Oversampling technique in student performance classification from engineering...
Spatial data analysis
Geostatistical based Digital Soil Mapping
Spatial data analysis 1
Predictive geospatial analytics using principal component regression
A framework for outlier detection in
Traffic Outlier Detection by Density-Based Bounded Local Outlier Factors
Time series forecasting of solid waste generation in arusha city tanzania
Uses of GIS in Survey Data Collection
C054
Modelbased Geostatistics Springer Series In Statistics 1st Edition Peter J Di...
On Data Quality Assurance and Conflation Entanglement in Crowdsourcing for En...
Paper 5094-43_5
Sample of slides for Statistics for Geography and Environmental Science
Classification accuracy analyses using Shannon’s Entropy
Statistics for Geography and Environmental Science: an introductory lecture c...
Drsp dimension reduction for similarity matching and pruning of time series ...
AGILE_FinalDay_RobinFrew
Individual movements and geographical data mining. Clustering algorithms for ...
Clustering heterogeneous categorical data using enhanced mini batch K-means ...
Oversampling technique in student performance classification from engineering...

More from GISRUK conference (20)

PPTX
8A_1_To vote or not to vote
PPTX
7B_3_Matterhorn on the horizon
PPTX
7B_2_Topological consistent generalization of openstreetmap
PPTX
7A_4_Gps data collection setting for pedestrian activity modelling
PPTX
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
PPTX
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
PPTX
4B_3_Automatically generating keywods for georeferenced imaged
PPTX
4B_1_How many volunteers does it take to map an area well
PPTX
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
PPTX
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
PPTX
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
PPTX
1A_3_A geodemographic classification of london primary schools
PPT
UK Map Challenge Aidan Slingsby
PPT
SP_4 Supporting spatial negotiations in land use planning
PPT
SP_3 Automatic identification of high streets and classification of urban lan...
PPT
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
PPT
9A_2_Automatic classification of retail spaces from a large scale topographc ...
PPT
9A_1_On automatic mapping of environmental data using adaptive general regres...
PPT
8B_4_Exploring the usability of geographic information
PPT
8B_2_Using sound to represent uncertainty in address locations
8A_1_To vote or not to vote
7B_3_Matterhorn on the horizon
7B_2_Topological consistent generalization of openstreetmap
7A_4_Gps data collection setting for pedestrian activity modelling
5A_3_GIS based spatial modelling for improving the sustainability of aggregat...
5A_1_Land evaluation techniques comparing fuzzy ahp with ideal point methods
4B_3_Automatically generating keywods for georeferenced imaged
4B_1_How many volunteers does it take to map an area well
4A_1_Uncertainty in the 2001 output area classification for the census of eng...
3A_4_Applying network analysis to quantify accessibility to urban greenspace ...
3A_2_Modelling health-harming behaviours in a socially ranked geographic space
1A_3_A geodemographic classification of london primary schools
UK Map Challenge Aidan Slingsby
SP_4 Supporting spatial negotiations in land use planning
SP_3 Automatic identification of high streets and classification of urban lan...
9B_1_Trust in web gis a preliminary investigation of the environment agencys ...
9A_2_Automatic classification of retail spaces from a large scale topographc ...
9A_1_On automatic mapping of environmental data using adaptive general regres...
8B_4_Exploring the usability of geographic information
8B_2_Using sound to represent uncertainty in address locations

5A_ 2_Developing a statistical methodology to improve classification and mapping of seabed type from deep water multi-beam echo sounder data

  • 1. Developing a statistical methodology to improve classification and mapping of seabed type from deep water Multi-Beam Echo Sounder (MBES) data Helen Caughey, Kazi Ishtiak Ahmed, Paul Harris, Peter Hung, Urška Demšar, Sean McLoone, A Stewart Fotheringham, Xavier Monteys, Ronan O’Toole Presented by: Helen Caughey National Centre for Geocomputation National University of Ireland, Maynooth helen.m.caughey @ nuim.ie GISRUK 2010, University College London, 15th April 2010
  • 2. Overview of Work Collaboration between StratAG and G eological S urvey of I reland/ M arine I nstitute/INFOMAR programme. Focuses on seabed type mapping from data acquired with a M ulti- B eam E cho S ounder (MBES). The work consists of 2 main phases; PHASE 1 ; the use of spatial and statistical analysis to decide which datasets can be integrated and how. PHASE 2 ; consists of finding the best automatic classification methodology for integrated data and evaluation of classification quality against ground truth and/or existing seabed maps.
  • 3. MBES Working Principles MBES raw data – backscatter measurements Image compensation to remove effects of sonar angle and range 132 Full Feature Vectors (FFVs, statistical descriptors) for each patch Interpolated acoustic image (pixels) Interpolated acoustic image (patches) Data acquired on receiver Narrow beam width Wider swath Both Bathymetry and backscatter Port Starboard
  • 4. Data used was collected over 3 years (2000, 2001 & 2002) and at 3 different pulse lengths (2000, 5000 & 15000 ms). Systems recalibrated at the beginning of each survey season. Each survey yielded a number of backscatter features, five of which we have extracted; Q (a quantile measure), P (the ‘pace’ textural feature), C (a ‘contrast’ feature), M (the mean) and S (the standard deviation). Bathymetry data is also available for the same zone 3 area.
  • 5. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Removal of pulse length 2000 from analysis
  • 6. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 7. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Identification of spatially overlapping data subsets Removal of pulse length 2000 from analysis 6 overlapping, spatially diverse, areas identified from PL 5000 & 15000 (each overlap also has multiple sub areas)
  • 8. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 9. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Identification of spatially overlapping data subsets Removal of pulse length 2000 from analysis 6 overlapping, spatially diverse, areas identified from PL 5000 & 15000 (each overlap also has multiple sub areas) “ Pseudo Pairwise” overlapping data subsets were then identified and refined automatically in the R statistical computing environment
  • 10. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 11. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Summary Statistics and distributions
  • 12. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 13. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Summary Statistics and distributions Classical hypothesis testing (F, t and K-S tests) Differences in Spatial Autocorrelation; investigated using estimated variograms ( M ethods o f M oments Variograms) and modelled variograms ( Re stricted M aximum L ikelihood Variograms) Correlation and error analyses; Scatterplots & weighted error diagnostics, Weighted correlations, Robust & weighted correlations
  • 14. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 15. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 16. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 17. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 18. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Unlikely these datasets should be joined; OL4 OL5 OL7 OL8 Likely that these datasets could be joined; OL6 OL9
  • 19. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible? Focused scale investigations
  • 20. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 21. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 22. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 23. Phase 1: Data integration based on spatial and statistical analysis; Is data integration possible?
  • 24. Outcomes & Recommendations (Phase 1) OUTCOMES FOR PHASE 2 Based on these statistical tests and results we would suggest that the datasets should not be joined together. This does not however rule out joining the dataset if they are rescaled based on the knowledge we have gained about the datasets and their relationships. If they are rescaled then we can incorporate a measure of the uncertainty into the final classification. RECOMMENDATIONS TO GSI/MI/INFOMAR In the past they have joined datasets assuming that there is no variance between them. These tests have shown this is not the case and therefore we have recommended that datasets not be joined together without a method of validation prior to joining. More care needs to be taken in the recalibration of the survey systems each year. More emphasis needs to be placed on some level of overlap from one year to next in order to have the necessary data to validate any joins.
  • 25. Phase 2: Data classification processes Proceed with classification attempts without joining the data. PCA and K means as preliminary approach. Other neural networks and quality threshold clustering. Next step – Data Classification
  • 26. Thank you! Questions? Acknowledgements: Contacts: [email_address] ncg.nuim.ie www.stratag.ie