DISPLAYING SURNAME
DATA WITHIN NOVA
SCOTIA
MAJOR GENEALOGICAL RESEARCH
PROJECT FOR ADVANCED GIS
Jon Thomas
28 May 2014
TOPICS
 Historical Context of Nova Scotia (where, who, etc)
 Data Preparation (Ancestry Microfilms, CSV organization)
 Using CartoDB to store Surname Data in CSV format
 Using a Leaflet template to query and present data for
individual surnames
 Using ArcMap to present cartographic data
 Prepare data within ArcMap to use with AGOL and Poster
OVERVIEW
 This projected consisted of genealogical research to
create a database for three Nova Scotia counties.
 The goal was to help show a geographic output in
regards to the distribution of surnames and their
frequencies within particular communities
 This will be beneficial to those who are interested in
finding possible home communities for their ancestors
(it is possible that students great-grandparents are in
this database)
 May benefit historical groups looking to analyze
temporal change in distribution of common surnames.
HISTORICAL CONTEXT
 Historical groups and trends affecting this study area:
 English (New England Planters, Yorkshiremen in Cumberland
Co., Loyalists, etc.)
 Scottish (Highlanders mostly within Pictou Co.): 1700’s –
1800’s
 Ulster-Scots (Colchester Co.), After 1760.
 French-Swiss Huguenots (Northumberland Shore within all
three counties). Via Lunenburg, 1770’s.
DATA PREP - USING VOTER LISTS
 Only three counties could be processed due to time restrictions.
 Census lists are not legally attainable after 1921.
 Voter lists are readily available from 1935 onwards on Ancestry.ca
 Provided in microfilm format using Optical Character Recognition
(OCR)technology, which is indexed for users to query data.
 The data can often be incorrect:
 Surnames with Mac as a prefix are often incorrectly indexed.
 Incorrect identification of letters such as c and e, s and z, etc. (Dclancy vs. Delaney,
Mackensic vs. MacKenzie, Eraser vs. Fraser, with many other examples).
 Several missing counties.
 Limitations: No record of Natives, no voters under 21.
CREATING A DATABASE
 An xlsx was used originally, which was converted to a csv so
that it could be uploaded to CartoDB
 An excel formula was used to return the last name of each
individual
 =IF(ISERROR(FIND(" ",A2)),A2,RIGHT(A2,LEN(A2) -
FIND("~",SUBSTITUTE(A2," ","~",LEN(A2)-LEN(SUBSTITUTE(A2," ",""))))))
 Problematic names: names with Jr, two words (ex.: Van
Buskirk).
 Finalized fields: PERSONID, Raw_Name (which was later
excluded to conserve space), LASTNAME, LOCATION, COUNTY,
LATITUDE, LONGITUDE.
HOW DOES THE VOTER LIST DATA APPEAR?
• The data is incorrectly transcribed as Miss Oeorgle Alkens (Miss George Aikens
is correct).
CARTODB
 CartoDB is an online mapping service, which allows users to
upload a CSV, Esri shapefile, among other files.
 Users can store 5 MB of data, use 5 unique tables, and have
up to 10,000 map views per month for free.
 Users can manipulate the map by using SQL queries to display
desired data.
 CartoCSS language is editable to help style data, while users
can modify an info window, and use a visualization wizard to
alter the display properties.
 Problem: A slight cost is necessary for larger datasets, which
will likely be encountered with province-wide genealogical data.
CARTODB IN ACTION
Querying instances per community of Mc/MacNutt
LEAFLET AND CARTODB
 Open-source JavaScript library, which is very lightweight (only about 33 KB).
 Can be used for several mapping sources, including Esri and Google.
 Utilizes HTML5 and CSS3 codes.
 For this project, a name search is used to select any record with the
specified surname, along with a dropdown to filter the results by county.
 A JavaScript function was added, which queried the CSV in CartoDB based
on locations within a specific county.
 The spiderfy option allows users to click on a location at a suitable level,
with all of the symbols spreading outward from the central point.
 Note: A thank-you to Ed Symons for helping demonstrate a leaflet template.
LEAFLET AND CARTODB
Fraser was the 2nd most commonly occurring name within the three
counties in 1940, with over 1200 records. However, Fraser was the
most commonly occurring name in Pictou County (over 1000).
ANOTHER EXAMPLE:
Showing the symbology of an individual
record and the spiderfy option
PROCESSING THE DATA IN ARCMAP: LOCATION
 The original xlsx was used.
 A Locations feature class was created by referencing two
feature classes from the nscccods2 SDE geodatabase: places
and roads. This feature class was used largely for gathering
coordinates to use for the CartoDB CSV.
 Points were placed near the community or between two
communities if the electoral district consisted of both
communities.
 Estimations were made, based on looking at addresses for
individuals, if a community could not be found.
PROCESSING THE DATA IN ARCMAP: PART 2
 As mentioned, the Locations shapefile was best used as a
reference for coordinates.
 The spreadsheet was spatially represented in ArcMap by
displaying XY fields as Latitude and Longitude, and was saved
as a feature class.
 The frequency tool was performed to help show the frequency
of every surname within each community, leaving 15,000
individual records for the 135 communities.
 The surnames were ordered by both frequency and community,
meaning that the top five values could easily be selected and
exported into a new feature class.
TOP SURNAMES
Smith - Cumberland Langille - Colchester Fraser – Pictou
USING ET GEO WIZARDS TO STYLIZE SURNAME
DATA
 The new feature class would have displayed all five surnames per location
for one point.
 The Build Thiessen Polygon Surface was used to develop a polygon that
contained the community/location coordinate.
 Helped determine placement of the random points and allowed the
cartographer to refer to a spatial region that a community had influence
over.
 Random Points in Polygons tool could be used to generate five points
per polygon, creating a total of 675 new points. The ID field had to be
multiplied by 10 to create 5 points for each.
 Both fields were exported and converted to a csv with FME Quick Translator
to reorder by location and be assigned a uniqueID.
 These tables were brought back into ArcMap, and joined based on the
matching ObjectIDs.
HOW DOES THIS LOOK?
 Result of Thiessen Polygon and Random Point Tools. The communities are
symbolized with the larger symbols, while the random points are blue.
LABELING THE FEATURES
 Three new label classes were created to indicate the frequency
of surnames within a community:
 Greater than than 50 within a community, 20 to 50 within a
community, and less than 20 within a community.
 Proportional labelling symbols were applied.
 An effort was made to only have labels on-land and keep them
a reasonable distance from county boundaries.
 Labeling was difficult for some coastal communities, as the
font was represented as being larger than the geographical
area.
POINTS EDITED FOR LABELING
 In this example, the labeling is very basic and has no settings applied.
 The labels have been moved to enhance visibility, but more work will be needed.
 The points largely follow the Thiessen polygons.
FONT STYLES APPLIED
EXCEPTION…
Due to the high population density, large number of communities,
and very common surnames, the Thiessen Polygon strategy does not
work well for Central Pictou Co.
ANNOTATIONS
 Annotations allow the data to be easily
placed in any location. Users can move
annotations in an edit session.
 For this dataset, there may be many
unplaced labels. These labels can still
be drawn and edited in the same style
as the other annotations.
 Individual annotations can be edited to
appear however the user would like
them to appear.
 Label classes can also have their
original settings overridden if, for some
reason, the user wants to change
them.
 Overlapping labels can be used with
annotations.
OTHER CARTOGRAPHIC ADDITIONS
 DEM
 Shapefile with dissolved counties to symbolize
coastline (copied layer 3 times, different outline
widths).
 Community names
 Labeled counties, water bodies
HARD COPY MAP SNIPPET
USING AGOL TO SHOW SURNAME FREQUENCIES
 The inspiration for this map was Mapping the Emerald Isle: a
geo-genealogy of Irish Surnames found here
http://storymaps.esri.com/stories/ireland/
 ArcGIS Online can be used to display label text, but must be
converted to a shapefile first.
 The data can be loaded into and used within the Find, Edit,
Filter downloadable application, but this application has a more
limited functionality than the Irish Surname story map.
STEPS NEEDED TO CREATE AGOL-FRIENDLY
LABELS: 1
 Labels converted to annotations, containing limited information.
 The annotations and the original top five surname feature class were joined
to link the feature ID of the annotation with the original object ID of a
feature.
 The Feature Outline Masks tool was used to create unique features.
 Coordinate system: WGS 1984 Web Mercator Auxiliary Sphere. This will
allow the features to draw correctly.
 The margin around the feature must be zero.
 Mask kind is set to exact; the mask will only include the annotation.
 A mask was only created for placed features; this is useful for lower-
scale zoom levels.
 All features were transferred, which will allow the user to rejoin the
original feature class to these new features based on the Object ID and
the Feature ID.
STEPS NEEDED TO CREATE AGOL-FRIENDLY
LABELS: 2
 The Top 5 Surnames feature class was joined to the annotations, using the
FeatureID (annotation) and the ObjectID (Top5Names).
 The Top 5 Surnames feature was joined to the output of the Feature Outline
Masks tool (using the FeatureID and ObjectID, again).
 By doing so, features in AGOL will have meaningful attributes, such as
community, county, and frequency.
 Each feature will need to be created four times, which will account for four
different zoom levels (based on the ArcGIS - Google Maps – Bing online
zoom levels): 1:144,448, 1:288,895, 1:577,791, and 1:1,155,581.
 Minimum and Maximum zoom levels need to be set at each step.
This map shows how the features look when two feature
classes are not controlled by zoom settings.
PUBLISHING TO AGOL
 Create a new document with only the four shapefiles that will
be used on AGOL.
 Publish as a feature service.
 Turn off tiled mapping.
 Mark as exception: Layer does not have a feature template set,
and Map is being published with data copied to the server
using data frame full extent. These will not affect the output.
USING A CONFIGURABLE APP FROM AGOL
 The Find, Filter, Edit application from Esri allows
users to easily query and select data.
 This application can be downloaded and configured
to suit the user’s needs.
 The web map created from the Surname feature
service was used.
 The filter and edit features were not useful for this
assignment (changing the data is not desirable).
 The 144K layer was used.
 The LASTNAME field was used in the find field
section.
 The result fields are the LASTNAME, Location,
Frequency, and County.
 A zoom level of 13 was applied for selected features.
AGOL APPLICATION IN ACTION
 https://cogsnscc.maps.arcgis.com/apps/Solutions/s2.html?appid=6
11860d390a7423a900f09e7036cb06e
CONCLUSIONS AND LIMITATIONS
 Useful for family identification
 There are several possibilities for this data, as long as
there is a geography
 Programming knowledge may come in handy to further
the research for this project
 Ancestry: Missing counties, time-consuming, incorrect
transcriptions
 Possible add-in: ethnic origin symbolization (would
require extensive research for each name)
 Leaflet/CartoDB: cost-effective options for genealogy
groups who cannot afford ArcGIS
SOURCES
 www.ancestry.ca
 www.leafletjs.com
 www.cartodb.com
 http://www.chebucto.ns.ca/heritage/FSCNS/Scots_NS/About_Clans/HtySctNS.html
 http://freepages.genealogy.rootsweb.ancestry.com/~goudied/ulster-scot2canada.html
 http://www.blupete.com/Hist/NovaScotiaBk2/Part1/Ch05.htm
 http://www.rootsweb.ancestry.com/~canns/lunenburg/memorial.html
 http://www.thecanadianencyclopedia.ca/en/article/nova-scotia/
 http://blogs.esri.com/esri/arcgis/2011/11/21/adding-labels-to-arcgis-online-web-maps-part-
1/
 http://downloads2.esri.com/MappingCenter2007/resources/publications/Field_DesigningAnIri
shSurnamesMapUsingGIS.pdf
QUESTIONS?

More Related Content

PDF
rworldmap: A New R package for Mapping Global Data
PDF
Kaitie Watson Lab 2 2013
PPTX
R programming language in spatial analysis
PPTX
Charlotte Gallant Presentation
PPTX
CMacMullen_CubaPresentation
PPTX
Developing a Tutorial for Grouping Analysis in ArcGIS
PPTX
The Richard McCully Aerial Photograph Collection Presentation
PPT
GIS Research at UCL
rworldmap: A New R package for Mapping Global Data
Kaitie Watson Lab 2 2013
R programming language in spatial analysis
Charlotte Gallant Presentation
CMacMullen_CubaPresentation
Developing a Tutorial for Grouping Analysis in ArcGIS
The Richard McCully Aerial Photograph Collection Presentation
GIS Research at UCL

Similar to Nova Scotia Surnames and Mapping Methods (20)

PPTX
RGS Annual Conference Presentation
PDF
Looking into the past - feature extraction from historic maps using Python, O...
PDF
4. empirical and practical issues
PPT
Analysing and Visualising Social Change: The Case of Surnames
PDF
unitiv-spacialdataanalysis-200423132043.pdf
PPTX
TYBSC IT PGIS Unit IV Spacial Data Analysis
PPT
3A_3_Informing population genetics through spatial analysis of surnames
PPTX
Linking data without common identifiers
PPT
Regional Science Presentation
PPTX
Geographic information system and remote sensing
PPT
Introduction_to_QGIS_Revision, read before
PDF
Putting Folklore in Its Place
PDF
PLOTCON NYC: Behind Every Great Plot There's a Great Deal of Wrangling
PDF
Chapter Four.pdf
PPTX
final ppt
PDF
The recovery of netherlands geographic information system (nlgis 2)
 
PPTX
New analytical methods for geocomputation - Guy Lansley, UCL
PDF
CensusGIV - Geographic Information Visualisation of Census Data
ODP
Geospatial Data in R
RGS Annual Conference Presentation
Looking into the past - feature extraction from historic maps using Python, O...
4. empirical and practical issues
Analysing and Visualising Social Change: The Case of Surnames
unitiv-spacialdataanalysis-200423132043.pdf
TYBSC IT PGIS Unit IV Spacial Data Analysis
3A_3_Informing population genetics through spatial analysis of surnames
Linking data without common identifiers
Regional Science Presentation
Geographic information system and remote sensing
Introduction_to_QGIS_Revision, read before
Putting Folklore in Its Place
PLOTCON NYC: Behind Every Great Plot There's a Great Deal of Wrangling
Chapter Four.pdf
final ppt
The recovery of netherlands geographic information system (nlgis 2)
 
New analytical methods for geocomputation - Guy Lansley, UCL
CensusGIV - Geographic Information Visualisation of Census Data
Geospatial Data in R
Ad

More from COGS Presentations (20)

PDF
Karman vortices
PPTX
Adams Hunt Lawrence May 2016
PPTX
Presentation: Fee & Brigley
PPTX
Presentation Brake & Scott
PPTX
Sutherland final presentation
PPTX
Base mapping of the St.Mary's District
PPTX
Interactive Web Map of New Zealand Earthquakes
PPTX
Exploring Halifax Attractions using the Esri Runtime SDK for Android
PPTX
The Processing of the 1920's Survey Sheets of the City of Saint John, NB for ...
PPTX
Coastal erosion
PPTX
Various frontslides2016
PDF
Southwest mongolia multispectral program title
PPTX
Model for Prioritizing Catchments for Terrestrial Liming in NS
PPTX
Remote Sensing Field Camp 2016
PPTX
Trying to decipher fort beausejour
PDF
PPTX
Using ArcMap’s Network Analyst to Model Emergency Service Response Routes Dur...
PPTX
The essentials for life at cogs
PPTX
Automated change detection in grass gis
PPTX
Online Mapping Support - Age Advantage Association
Karman vortices
Adams Hunt Lawrence May 2016
Presentation: Fee & Brigley
Presentation Brake & Scott
Sutherland final presentation
Base mapping of the St.Mary's District
Interactive Web Map of New Zealand Earthquakes
Exploring Halifax Attractions using the Esri Runtime SDK for Android
The Processing of the 1920's Survey Sheets of the City of Saint John, NB for ...
Coastal erosion
Various frontslides2016
Southwest mongolia multispectral program title
Model for Prioritizing Catchments for Terrestrial Liming in NS
Remote Sensing Field Camp 2016
Trying to decipher fort beausejour
Using ArcMap’s Network Analyst to Model Emergency Service Response Routes Dur...
The essentials for life at cogs
Automated change detection in grass gis
Online Mapping Support - Age Advantage Association
Ad

Recently uploaded (20)

PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
Journal of Dental Science - UDMY (2021).pdf
PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PDF
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
English Textual Question & Ans (12th Class).pdf
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
DOCX
Cambridge-Practice-Tests-for-IELTS-12.docx
PPTX
Education and Perspectives of Education.pptx
PPTX
Introduction to pro and eukaryotes and differences.pptx
PDF
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PDF
Mucosal Drug Delivery system_NDDS_BPHARMACY__SEM VII_PCI.pdf
AI-driven educational solutions for real-life interventions in the Philippine...
FORM 1 BIOLOGY MIND MAPS and their schemes
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
Journal of Dental Science - UDMY (2021).pdf
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
English Textual Question & Ans (12th Class).pdf
Environmental Education MCQ BD2EE - Share Source.pdf
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
Cambridge-Practice-Tests-for-IELTS-12.docx
Education and Perspectives of Education.pptx
Introduction to pro and eukaryotes and differences.pptx
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
Unit 4 Computer Architecture Multicore Processor.pptx
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
Mucosal Drug Delivery system_NDDS_BPHARMACY__SEM VII_PCI.pdf

Nova Scotia Surnames and Mapping Methods

  • 1. DISPLAYING SURNAME DATA WITHIN NOVA SCOTIA MAJOR GENEALOGICAL RESEARCH PROJECT FOR ADVANCED GIS Jon Thomas 28 May 2014
  • 2. TOPICS  Historical Context of Nova Scotia (where, who, etc)  Data Preparation (Ancestry Microfilms, CSV organization)  Using CartoDB to store Surname Data in CSV format  Using a Leaflet template to query and present data for individual surnames  Using ArcMap to present cartographic data  Prepare data within ArcMap to use with AGOL and Poster
  • 3. OVERVIEW  This projected consisted of genealogical research to create a database for three Nova Scotia counties.  The goal was to help show a geographic output in regards to the distribution of surnames and their frequencies within particular communities  This will be beneficial to those who are interested in finding possible home communities for their ancestors (it is possible that students great-grandparents are in this database)  May benefit historical groups looking to analyze temporal change in distribution of common surnames.
  • 4. HISTORICAL CONTEXT  Historical groups and trends affecting this study area:  English (New England Planters, Yorkshiremen in Cumberland Co., Loyalists, etc.)  Scottish (Highlanders mostly within Pictou Co.): 1700’s – 1800’s  Ulster-Scots (Colchester Co.), After 1760.  French-Swiss Huguenots (Northumberland Shore within all three counties). Via Lunenburg, 1770’s.
  • 5. DATA PREP - USING VOTER LISTS  Only three counties could be processed due to time restrictions.  Census lists are not legally attainable after 1921.  Voter lists are readily available from 1935 onwards on Ancestry.ca  Provided in microfilm format using Optical Character Recognition (OCR)technology, which is indexed for users to query data.  The data can often be incorrect:  Surnames with Mac as a prefix are often incorrectly indexed.  Incorrect identification of letters such as c and e, s and z, etc. (Dclancy vs. Delaney, Mackensic vs. MacKenzie, Eraser vs. Fraser, with many other examples).  Several missing counties.  Limitations: No record of Natives, no voters under 21.
  • 6. CREATING A DATABASE  An xlsx was used originally, which was converted to a csv so that it could be uploaded to CartoDB  An excel formula was used to return the last name of each individual  =IF(ISERROR(FIND(" ",A2)),A2,RIGHT(A2,LEN(A2) - FIND("~",SUBSTITUTE(A2," ","~",LEN(A2)-LEN(SUBSTITUTE(A2," ",""))))))  Problematic names: names with Jr, two words (ex.: Van Buskirk).  Finalized fields: PERSONID, Raw_Name (which was later excluded to conserve space), LASTNAME, LOCATION, COUNTY, LATITUDE, LONGITUDE.
  • 7. HOW DOES THE VOTER LIST DATA APPEAR? • The data is incorrectly transcribed as Miss Oeorgle Alkens (Miss George Aikens is correct).
  • 8. CARTODB  CartoDB is an online mapping service, which allows users to upload a CSV, Esri shapefile, among other files.  Users can store 5 MB of data, use 5 unique tables, and have up to 10,000 map views per month for free.  Users can manipulate the map by using SQL queries to display desired data.  CartoCSS language is editable to help style data, while users can modify an info window, and use a visualization wizard to alter the display properties.  Problem: A slight cost is necessary for larger datasets, which will likely be encountered with province-wide genealogical data.
  • 9. CARTODB IN ACTION Querying instances per community of Mc/MacNutt
  • 10. LEAFLET AND CARTODB  Open-source JavaScript library, which is very lightweight (only about 33 KB).  Can be used for several mapping sources, including Esri and Google.  Utilizes HTML5 and CSS3 codes.  For this project, a name search is used to select any record with the specified surname, along with a dropdown to filter the results by county.  A JavaScript function was added, which queried the CSV in CartoDB based on locations within a specific county.  The spiderfy option allows users to click on a location at a suitable level, with all of the symbols spreading outward from the central point.  Note: A thank-you to Ed Symons for helping demonstrate a leaflet template.
  • 11. LEAFLET AND CARTODB Fraser was the 2nd most commonly occurring name within the three counties in 1940, with over 1200 records. However, Fraser was the most commonly occurring name in Pictou County (over 1000).
  • 12. ANOTHER EXAMPLE: Showing the symbology of an individual record and the spiderfy option
  • 13. PROCESSING THE DATA IN ARCMAP: LOCATION  The original xlsx was used.  A Locations feature class was created by referencing two feature classes from the nscccods2 SDE geodatabase: places and roads. This feature class was used largely for gathering coordinates to use for the CartoDB CSV.  Points were placed near the community or between two communities if the electoral district consisted of both communities.  Estimations were made, based on looking at addresses for individuals, if a community could not be found.
  • 14. PROCESSING THE DATA IN ARCMAP: PART 2  As mentioned, the Locations shapefile was best used as a reference for coordinates.  The spreadsheet was spatially represented in ArcMap by displaying XY fields as Latitude and Longitude, and was saved as a feature class.  The frequency tool was performed to help show the frequency of every surname within each community, leaving 15,000 individual records for the 135 communities.  The surnames were ordered by both frequency and community, meaning that the top five values could easily be selected and exported into a new feature class.
  • 15. TOP SURNAMES Smith - Cumberland Langille - Colchester Fraser – Pictou
  • 16. USING ET GEO WIZARDS TO STYLIZE SURNAME DATA  The new feature class would have displayed all five surnames per location for one point.  The Build Thiessen Polygon Surface was used to develop a polygon that contained the community/location coordinate.  Helped determine placement of the random points and allowed the cartographer to refer to a spatial region that a community had influence over.  Random Points in Polygons tool could be used to generate five points per polygon, creating a total of 675 new points. The ID field had to be multiplied by 10 to create 5 points for each.  Both fields were exported and converted to a csv with FME Quick Translator to reorder by location and be assigned a uniqueID.  These tables were brought back into ArcMap, and joined based on the matching ObjectIDs.
  • 17. HOW DOES THIS LOOK?  Result of Thiessen Polygon and Random Point Tools. The communities are symbolized with the larger symbols, while the random points are blue.
  • 18. LABELING THE FEATURES  Three new label classes were created to indicate the frequency of surnames within a community:  Greater than than 50 within a community, 20 to 50 within a community, and less than 20 within a community.  Proportional labelling symbols were applied.  An effort was made to only have labels on-land and keep them a reasonable distance from county boundaries.  Labeling was difficult for some coastal communities, as the font was represented as being larger than the geographical area.
  • 19. POINTS EDITED FOR LABELING  In this example, the labeling is very basic and has no settings applied.  The labels have been moved to enhance visibility, but more work will be needed.  The points largely follow the Thiessen polygons.
  • 21. EXCEPTION… Due to the high population density, large number of communities, and very common surnames, the Thiessen Polygon strategy does not work well for Central Pictou Co.
  • 22. ANNOTATIONS  Annotations allow the data to be easily placed in any location. Users can move annotations in an edit session.  For this dataset, there may be many unplaced labels. These labels can still be drawn and edited in the same style as the other annotations.  Individual annotations can be edited to appear however the user would like them to appear.  Label classes can also have their original settings overridden if, for some reason, the user wants to change them.  Overlapping labels can be used with annotations.
  • 23. OTHER CARTOGRAPHIC ADDITIONS  DEM  Shapefile with dissolved counties to symbolize coastline (copied layer 3 times, different outline widths).  Community names  Labeled counties, water bodies
  • 24. HARD COPY MAP SNIPPET
  • 25. USING AGOL TO SHOW SURNAME FREQUENCIES  The inspiration for this map was Mapping the Emerald Isle: a geo-genealogy of Irish Surnames found here http://storymaps.esri.com/stories/ireland/  ArcGIS Online can be used to display label text, but must be converted to a shapefile first.  The data can be loaded into and used within the Find, Edit, Filter downloadable application, but this application has a more limited functionality than the Irish Surname story map.
  • 26. STEPS NEEDED TO CREATE AGOL-FRIENDLY LABELS: 1  Labels converted to annotations, containing limited information.  The annotations and the original top five surname feature class were joined to link the feature ID of the annotation with the original object ID of a feature.  The Feature Outline Masks tool was used to create unique features.  Coordinate system: WGS 1984 Web Mercator Auxiliary Sphere. This will allow the features to draw correctly.  The margin around the feature must be zero.  Mask kind is set to exact; the mask will only include the annotation.  A mask was only created for placed features; this is useful for lower- scale zoom levels.  All features were transferred, which will allow the user to rejoin the original feature class to these new features based on the Object ID and the Feature ID.
  • 27. STEPS NEEDED TO CREATE AGOL-FRIENDLY LABELS: 2  The Top 5 Surnames feature class was joined to the annotations, using the FeatureID (annotation) and the ObjectID (Top5Names).  The Top 5 Surnames feature was joined to the output of the Feature Outline Masks tool (using the FeatureID and ObjectID, again).  By doing so, features in AGOL will have meaningful attributes, such as community, county, and frequency.  Each feature will need to be created four times, which will account for four different zoom levels (based on the ArcGIS - Google Maps – Bing online zoom levels): 1:144,448, 1:288,895, 1:577,791, and 1:1,155,581.  Minimum and Maximum zoom levels need to be set at each step. This map shows how the features look when two feature classes are not controlled by zoom settings.
  • 28. PUBLISHING TO AGOL  Create a new document with only the four shapefiles that will be used on AGOL.  Publish as a feature service.  Turn off tiled mapping.  Mark as exception: Layer does not have a feature template set, and Map is being published with data copied to the server using data frame full extent. These will not affect the output.
  • 29. USING A CONFIGURABLE APP FROM AGOL  The Find, Filter, Edit application from Esri allows users to easily query and select data.  This application can be downloaded and configured to suit the user’s needs.  The web map created from the Surname feature service was used.  The filter and edit features were not useful for this assignment (changing the data is not desirable).  The 144K layer was used.  The LASTNAME field was used in the find field section.  The result fields are the LASTNAME, Location, Frequency, and County.  A zoom level of 13 was applied for selected features.
  • 30. AGOL APPLICATION IN ACTION  https://cogsnscc.maps.arcgis.com/apps/Solutions/s2.html?appid=6 11860d390a7423a900f09e7036cb06e
  • 31. CONCLUSIONS AND LIMITATIONS  Useful for family identification  There are several possibilities for this data, as long as there is a geography  Programming knowledge may come in handy to further the research for this project  Ancestry: Missing counties, time-consuming, incorrect transcriptions  Possible add-in: ethnic origin symbolization (would require extensive research for each name)  Leaflet/CartoDB: cost-effective options for genealogy groups who cannot afford ArcGIS
  • 32. SOURCES  www.ancestry.ca  www.leafletjs.com  www.cartodb.com  http://www.chebucto.ns.ca/heritage/FSCNS/Scots_NS/About_Clans/HtySctNS.html  http://freepages.genealogy.rootsweb.ancestry.com/~goudied/ulster-scot2canada.html  http://www.blupete.com/Hist/NovaScotiaBk2/Part1/Ch05.htm  http://www.rootsweb.ancestry.com/~canns/lunenburg/memorial.html  http://www.thecanadianencyclopedia.ca/en/article/nova-scotia/  http://blogs.esri.com/esri/arcgis/2011/11/21/adding-labels-to-arcgis-online-web-maps-part- 1/  http://downloads2.esri.com/MappingCenter2007/resources/publications/Field_DesigningAnIri shSurnamesMapUsingGIS.pdf