SlideShare a Scribd company logo
© Concept Searching 2017
Why Most Migration Projects Fail –
Don’t Be a Statistic
Michael Paye
Chief Technology Officer
Concept Searching
mikep@conceptsearching.com
www.conceptsearching.com
marketing@conceptsearching.com
Twitter @conceptsearch
Robert Piddocke
Vice President of Channel and Business
Development
Concept Searching
robertp@conceptsearching.com
© Concept Searching 2017
Robert Piddocke – Vice President of Channel and Business
Development is passionate about information management and
governance. He has worked for several information management
companies and assisted with a number of migration projects.
In addition, he is an information retrieval geek, who has authored
two books on SharePoint Search.
Michael Paye – Chief Technology Officer at Concept
Searching has been the driving force behind many of the
company's recent innovations, including the SharePoint Add-in and
hybrid search products. He has a wealth of experience across the
Microsoft platform and related technologies, and oversees all
product development.
© Concept Searching 2017
Agenda
• Who we are and what we do
• Success rate and other fun facts
• The business requirements
• Best practices – data profiling
• How long is this going to take?
• Records management
• Cloud migration
• Metadata and auto-classification
• The process
• Case study
• Takeaways
© Concept Searching 2017
• Company founded in 2002
• Product launched in 2003
• Focus on management of structured and unstructured information
• Profitable, debt free
• Technology Platform
• Delivered as a web service
• Automatic concept identification, content tagging, auto-classification,
taxonomy management
• Only statistical vendor that can extract conceptual metadata
• 8 years KMWorld ‘100 Companies that Matter in Knowledge Management’
8 years KMWorld ‘Trend Setting Product’
• Authority to Operate enterprise wide US Air Force, NETCON US Army,
and Canadian SLSA
• Client base: Fortune 500/1000 organizations in Healthcare,
Financial Services, Manufacturing, Energy, Professional Services,
Pharmaceutical, Public sector and DoD
• Microsoft Gold Certification in Application Development
• Member of SharePoint PAC and TAP programs
• Deployed as a full trust Add-in for all versions of SharePoint on-premises
and SharePoint Online, including the latest vNext dedicated platform and the
government cloud
The Global Leader in
Managed Metadata Solutions
© Concept Searching 2017
Concept Searching’s technology platforms deliver
semantic metadata generation, auto-classification and
taxonomy/Term Store management, and are fully
integrated with all versions of SharePoint on-premises,
Microsoft Online/Office 365, and OneDrive for Business
What Do We Do?
These infrastructure platforms integrate not only with
SharePoint but also other content repositories, search
engines and file shares, enabling our clients to add
structure and manage their enterprise content,
regardless of environment
The resulting classification metadata is used by clients
to deliver ‘intelligent metadata solutions’ in areas such
as enhanced search, migration, data privacy, records
management, policy enforcement, compliance, text
analytics, and business and social collaboration
© Concept Searching 2017
Unique Approach – Compound Term Processing
• Remains unique in the industry
• Ability to identify and correctly weight
multi-word concepts in unstructured text
6
Concept Searching
provides Automatic
Concept Term Extraction
Triple
Baseball
Three
Heart
Organ
Center
Bypass
Highway
Avoid
© Concept Searching 2017
Success Rate and Other Fun Facts
• Only 16% of migration projects were on time and on budget
• 64% were late
• Estimate for overtime overruns was 40%
• 37% were over budget
• More than half of survey respondents blamed poor, inadequate or
unrealistic scoping caused budget overruns
(Bloor Research Data Migration White Paper)
MIGRATION IS AN OPPORTUNITY
© Concept Searching 2017
Large Scale Projects Don’t Do Any Better
• Only 16% of these did not experience either time or cost overruns
• 19% of projects did not have a separate budget or a timescale for data
migration, so these could not be measured
• Of the remainder
• 3% went over budget but were delivered on time
• 46% went over time but not over budget
• 51% went over both time and budget
• How can you run over time but not go over budget?
• Appears to be easier to exceed timescales and stay within budget than
to do the reverse
© Concept Searching 2017
The Business Requirements
Critical First Steps
• Identify what information exists
• Determine what may need
migrating
• Determine how the business
needs to use the information
• Is migration the right response?
• How to maintain business
continuity
• Who needs access and how quickly?
• How is the information used?
• How do you need to read, edit, reuse,
print, publish, share or make use of the
information?
• Is it being kept because of regulatory
or historical preservation?
• Is the content still within the retention
period?
• How long do you need to keep it?
• Can it be archived or deleted?
© Concept Searching 2017
There is a gross underestimation of just how complex and
challenging a data migration can become. It is often perceived as a
data ‘shunt and grunt’ exercise, tagged on to the end of the much
higher visibility target implementation.
Data Migration Pro
© Concept Searching 2017
Best Practices – Data Profiling
• The quality of your project planning will influence any application that
uses metadata after the migration
• Search, data protection, records management, eDiscovery, text analytics
• 90% do not use data profiling* – ?
• Include quality analysis
• Almost all of you use hand coding
• Reactive or proactive?
• Data profiling by hand is typically unfeasible documents
(Bloor Research)
© Concept Searching 2017
Best Practices – Data Profiling
• What content is moving?
• What content can we get rid of? ROT
• How can it be grouped?
• What content requires special handling – privacy, undeclared records?
• What content requires changes?
• Most likely the metadata
• How volatile is the content?
• Data profiling outputs include lists that detail
• Content to be migrated in logical groups
• Content that requires special handling
• Content that will require changes along with the scope
• Do it before time and budget estimates
© Concept Searching 2017
© Concept Searching 2017
Content Optimization
• 1% of corporate information is on litigation hold, 5% is in a records
retention category, 25% has current business value
• This means 69% of data most organizations keep can, and should,
be deleted (CGOC)
• 60% of documents are obsolete (eLaw)
• 3%-5% of files are lost or misplaced – annual losses to a Fortune 1000
company with one million files is $5M (Survey reported in Information Week)
• Companies typically misfile 2% - 7% of their records (New York City
Chapter, ARMA International)
• Costs $4-$7 to recreate it
• US managers spend an average of 4 weeks per year searching for
misfiled, mislabeled, untracked, or lost papers (Cuadro Associates)
• 90% of content is never accessed after creation
© Concept Searching 2017
Best Practices – Content Optimization
Content Optimization
• De-duplication
• Versioning
• Identification of records that were never declared
• Identification of content containing security
vulnerabilities
• Elimination of redundant, obsolete, and trivial content
• Addresses noncompliant content and content that
should be archived
• Can be used to auto-classify content from diverse
internal or external repositories
95% of your data is unstructured
Annual growth rate of
unstructured content is 120%
© Concept Searching 2017
Migration of Records
ISO standard 15489-1:2001
Defines records management as the field of management responsible for
the efficient and systematic control of the creation, receipt, maintenance,
use, and disposition of records, including the processes for capturing and
maintaining evidence of and information about business activities and
transactions in the form of records
© Concept Searching 2017
Best of Breed? Hardly
Solutions can be categorized into five broad categories
1. Preserving the original technology used to create or store the records
2. Emulating the original technology on new platforms
3. Migrating the software necessary to retrieve, deliver, and use the records
4. Migrating the records to up-to-date formats
5. Converting records to standard forms
Each has pros and cons
© Concept Searching 2017
Fundamental Problems in Records Management?
• The problem of content migration is not specific to records systems –
it is a universal problem
• Hampers the ability to manage content over a period of time, impacting
data privacy, eDiscovery, litigation
• The first generation of electronic records management system
specifications – everything from the US DoD 5015.2 that came out in
1998 to MoReq2 that came out in 2008 – did not attempt to tackle the
problem
• They told vendors what types of metadata to put into their products –
but they did not tell vendors how to implement that metadata
© Concept Searching 2017
© Concept Searching 2017
Cloud Migration
1. Planning and assessment
2. Duplicate environments
3. Staff training
4. Third party migration tools
5. Leases and licenses
6. Migration consulting
© Concept Searching 2017
Exchange
• Emails are opened by only one third of recipients
• About 90% is junk
• Still must be managed
• Classification of emails and attachments as content is created or ingested
• Identifies unprotected security vulnerabilities, as well as confidential
information as defined by an organization
• Removes breach from search
• Sends to a secure repository, or person
• Disables download unless authorized
• Added value for compliance officers who
can identify vulnerabilities within the content
• Can migrate content to other content sources
• Retains existing security
AIIM notes that 87% of
organizations are concerned
about cloud chaos.
The desire to shift strategies to
embrace newer content
management tools has
refocused attention around
.migration and disposition.
© Concept Searching 2017
Reducing the Quality of Metadata
The process of migration is lossy
• Enormous amount of time to move metatada
• Most will not manually map content to new system counterpart metadata
• What are the approaches?
• Maintain the original application
• Cost of licenses fees and support
• Users will need to go to the old repository to retrieve information –
sooner or later they will forget
• Try to connect search policies to new and old system and execute search
• Poor search results and ranking
• Leave old application, fix new only
• Poor ranking
• Users need access to old system
© Concept Searching 2017
Auto-classification in Intelligent Migration
Solution
• Content migration is an opportunity to bring governance to existing
processes
• Ability to address high value content, content that should be deleted,
duplicate documents, records that were never declared, unprotected
privacy exposures
• Normalize systems that may have fallen out of compliance
• Opportunity to get rid of dark data, ROT, content garbage
• File shares to file shares, file shares to SharePoint, SharePoint to
SharePoint, custom action from any other repository – .NET code and
web services, plug-in architecture to custom develop content sources and
destination sources
After the migration, content is organized in a hierarchical format, managed internally,
and is comprised of multi-term metadata that is unique to each discrete piece of content
© Concept Searching 2017
How Long Is This Going to Take?
Specifically for a migration to the web, but most are applicable to an
on-premises environment
• Content needs to be rewritten for the web because it is not up to a certain
standard or has been copied directly from traditional media
• Content needs to be reorganized to fit a new information architecture
• Content is old or irrelevant and hasn’t been touched in the last five years
• Most of the content is no longer performing and not attracting many visitors
• No recent content quality analysis based on usage and conversion exists
• Project managers underestimate effort to manually convert and enter content
• Content is not fully separated from presentations, making automation difficult
• Metadata around content needs to be regenerated because classifications or
other systems have changed – retagging content becomes a major pain point
• Persistency of URLs can become a problem after automated content
migration – URLs can change and follow different patterns, turning defining
and implementing redirect systems into a difficult new project
© Concept Searching 2017
© Concept Searching 2017
Situation:
• Global automotive organization
• 40,000 users
Challenge:
• Migration of over 20 million documents from SharePoint on-premises to
the Office 365 dedicated vNext platform
• Improved enterprise search and collaboration across 30 content sources
• Simplified access to information for a variety of stakeholders
Solution:
• conceptClassifier for Office 365 platform
Benefits:
• Cost reduction – decommission of 50 on-premises servers to 5
• Content now auto-classified and searchable in the cloud
• Ease of access to information
• Improved business production
Case Study – Not the Norm
© Concept Searching 2017
How Did the Process Work?
• Create a taxonomy
• Taxonomy designed for subject-matter experts
• Easy to use
• Begin auto-classification
• Update taxonomy using the taxonomy prompt ‘Suggest Clues from Class’
• Reiterate for content optimization, security breaches, records
• Index and classify the whole corpus in alignment with business
requirements
© Concept Searching 2017
What Was the Result?
• Reduced on-premises servers from 50 servers to 5
• Achieved immediate improvements in enterprise search and eDiscovery,
enabled concept based searching
• Accomplished in two weeks
• Successful migration of 20 million documents
© Concept Searching 2017
Takeaways
• See migration as an OPPORTUNITY
• Reduce your risk through content optimization
• Planning is a key component
• Failure due to poor planning and budget overruns
© Concept Searching 2017
Thank You
Michael Paye
Chief Technology Officer
Concept Searching
mikep@conceptsearching.com
www.conceptsearching.com
marketing@conceptsearching.com
Twitter @conceptsearch
Robert Piddocke
Vice President of Channel and Business
Development
Concept Searching
robertp@conceptsearching.com

More Related Content

PPTX
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
PDF
Going Meta in SharePoint – Tricks of the Trade
PDF
Using Metadata-Driven Taxonomies to Solve Business Problems
PDF
[AIIM16] Implementing Automated Retention at the European Central Bank
PDF
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
PDF
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
PPT
Concept Searching Webinar P
PDF
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...
Overcoming Capability Gaps in Information Transparency, Knowledge Management,...
Going Meta in SharePoint – Tricks of the Trade
Using Metadata-Driven Taxonomies to Solve Business Problems
[AIIM16] Implementing Automated Retention at the European Central Bank
SharePoint Saturday London - The Nuts and Bolts of Metadata Tagging and Taxon...
The Nuts and Bolts of Metadata Tagging and Taxonomies Made Easy Webinar
Concept Searching Webinar P
FEDSPUG Meeting: Intelligent Metadata and Auto-classification in Records Mana...

What's hot (20)

PPTX
InfoDNA Everteam houston breakfast 06.29.17
PDF
How To Drive Intelligent Migration Webinar
PPTX
Climbing the Slippery Slope of SharePoint Migrations Webinar
PDF
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
PDF
How Global Records Management Practices and Standards Are Evolving for Busine...
PPTX
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
PDF
Automation of document management paul fenton webinar
PDF
[AIIM16] A Look behind the Curtain: The State of the Industry Report
PDF
The Economic Value of Data: A New Revenue Stream for Global Custodians
PPTX
Content marketing for human action
PDF
You Need a Data Catalog. Do You Know Why?
PPTX
Get doing GDPR right now! IRMS May 2018
PPTX
[Webinar Slides] 3 Steps to Organizing, Finding, and Governing Your Information
PDF
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
PDF
Introduction to Business Process Management
PPTX
Strata NYC 2015 - Transamerica and INFA v1
PPTX
Zen and the Art of Datanauting
PDF
LDM Webinar: Data Modeling & Metadata Management
PPTX
Cleaning up Redundant, Obsolete and Trivial Data to Reclaim Capacity and Mana...
PDF
Post-Mainframe Managed Services
InfoDNA Everteam houston breakfast 06.29.17
How To Drive Intelligent Migration Webinar
Climbing the Slippery Slope of SharePoint Migrations Webinar
Real-World Data Governance Webinar: Big Data Governance - What Is It and Why ...
How Global Records Management Practices and Standards Are Evolving for Busine...
Exploring Automatic Metadata Generation Based on SharePoint Term Sets
Automation of document management paul fenton webinar
[AIIM16] A Look behind the Curtain: The State of the Industry Report
The Economic Value of Data: A New Revenue Stream for Global Custodians
Content marketing for human action
You Need a Data Catalog. Do You Know Why?
Get doing GDPR right now! IRMS May 2018
[Webinar Slides] 3 Steps to Organizing, Finding, and Governing Your Information
Data Integration, Access, Flow, Exchange, Transfer, Load And Extract Architec...
Introduction to Business Process Management
Strata NYC 2015 - Transamerica and INFA v1
Zen and the Art of Datanauting
LDM Webinar: Data Modeling & Metadata Management
Cleaning up Redundant, Obsolete and Trivial Data to Reclaim Capacity and Mana...
Post-Mainframe Managed Services
Ad

Similar to Why Most Migration Projects Fail – Don’t Be a Statistic Webinar (20)

PDF
Is Your Content Migration Strategy Garbage In, Garbage Out? Webinar
PPTX
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
PDF
Why You Need Metadata-Driven Records Management Webinar
PDF
Eliminate the 49% of Documents that Contain Data Breaches Webinar
PDF
Why You Need Intelligent Metadata and Auto-classification in Records Management
PDF
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
PDF
Using Metadata and Classification in Records Management
PDF
Metadata-Driven Cleanup of Files, Content, and Email Webinar
PDF
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
PDF
A Roadmap to Data Migration Success
PPTX
Migrating data: How to reduce risk
PDF
KMWorld Martin Briefing
PPTX
The #1 Success Factor for Data Migration Projects
PDF
Metadata Matters: Business Critical Metadata
DOCX
Data Migration_ Process, Risks and Differences.docx
PPTX
#EuropeanSP--11 Strategic Considerations for SharePoint Migrations
PDF
SharePoint and Office 365 State of the Market Survey Results Webinar
PDF
Data Migration in Malta and Libya
PDF
Tackling the ticking time bomb – Data Migration and the hidden risks
PDF
5 Unexpected Risks of a Data Migration.pdf
Is Your Content Migration Strategy Garbage In, Garbage Out? Webinar
Compliance, Security, Migration, Systems Management – All Fixed by Microsoft?
Why You Need Metadata-Driven Records Management Webinar
Eliminate the 49% of Documents that Contain Data Breaches Webinar
Why You Need Intelligent Metadata and Auto-classification in Records Management
84% of Migration Projects Fail – Getting it Right in SharePoint Webinar
Using Metadata and Classification in Records Management
Metadata-Driven Cleanup of Files, Content, and Email Webinar
Eliminating End User Tagging – Minimizing Organizational Risk and Improving B...
A Roadmap to Data Migration Success
Migrating data: How to reduce risk
KMWorld Martin Briefing
The #1 Success Factor for Data Migration Projects
Metadata Matters: Business Critical Metadata
Data Migration_ Process, Risks and Differences.docx
#EuropeanSP--11 Strategic Considerations for SharePoint Migrations
SharePoint and Office 365 State of the Market Survey Results Webinar
Data Migration in Malta and Libya
Tackling the ticking time bomb – Data Migration and the hidden risks
5 Unexpected Risks of a Data Migration.pdf
Ad

More from Concept Searching, Inc (18)

PDF
ARMA NOVA’s Auto-Categorization Showcase
PDF
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
PDF
Drowning in Data and Starving for Information
PDF
Why You Need Intelligent Metadata and Auto-classification in Records Management
PPTX
Going Meta – How to use Metadata in SharePoint
PDF
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
PDF
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
PDF
Going Meta – How to Use Metadata in SharePoint and Office 365
PPTX
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
PDF
ECM or CLM? A Fight to the Finish Webinar
PDF
ARMA Calgary Spring Seminar: The Nuts and Bolts of Metadata Tagging and Taxon...
PDF
Collaboration Can Be Dangerous Webinar
PDF
Groundbreaking and Game-changing Enterprise Search Webinar
PDF
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
PPTX
The Value of Adding Managed Metadata to Microsoft Online Search
PDF
How To Implement Engineering Search Within Your Organization Webinar
PPTX
conceptTermStoreManager Demo On Demand
PDF
Optimize and Organize Your Content with conceptClassifier for File Shares
ARMA NOVA’s Auto-Categorization Showcase
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Drowning in Data and Starving for Information
Why You Need Intelligent Metadata and Auto-classification in Records Management
Going Meta – How to use Metadata in SharePoint
Enough Talk – Solving GDPR Problems Through Metadata-Driven Compliance Webinar
What You Don’t Know May Hurt You – Achieving Insight and Knowledge Discovery
Going Meta – How to Use Metadata in SharePoint and Office 365
SharePoint Saturday Toronto - Going Meta – How to Use Metadata in SharePoint ...
ECM or CLM? A Fight to the Finish Webinar
ARMA Calgary Spring Seminar: The Nuts and Bolts of Metadata Tagging and Taxon...
Collaboration Can Be Dangerous Webinar
Groundbreaking and Game-changing Enterprise Search Webinar
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
The Value of Adding Managed Metadata to Microsoft Online Search
How To Implement Engineering Search Within Your Organization Webinar
conceptTermStoreManager Demo On Demand
Optimize and Organize Your Content with conceptClassifier for File Shares

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
cuic standard and advanced reporting.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Electronic commerce courselecture one. Pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Cloud computing and distributed systems.
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Machine learning based COVID-19 study performance prediction
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
NewMind AI Weekly Chronicles - August'25 Week I
Spectral efficient network and resource selection model in 5G networks
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Reach Out and Touch Someone: Haptics and Empathic Computing
Unlocking AI with Model Context Protocol (MCP)
Advanced methodologies resolving dimensionality complications for autism neur...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
cuic standard and advanced reporting.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Spectroscopy.pptx food analysis technology
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
The AUB Centre for AI in Media Proposal.docx
Electronic commerce courselecture one. Pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Cloud computing and distributed systems.
20250228 LYD VKU AI Blended-Learning.pptx
sap open course for s4hana steps from ECC to s4
Machine learning based COVID-19 study performance prediction
MIND Revenue Release Quarter 2 2025 Press Release
NewMind AI Weekly Chronicles - August'25 Week I

Why Most Migration Projects Fail – Don’t Be a Statistic Webinar

  • 1. © Concept Searching 2017 Why Most Migration Projects Fail – Don’t Be a Statistic Michael Paye Chief Technology Officer Concept Searching mikep@conceptsearching.com www.conceptsearching.com marketing@conceptsearching.com Twitter @conceptsearch Robert Piddocke Vice President of Channel and Business Development Concept Searching robertp@conceptsearching.com
  • 2. © Concept Searching 2017 Robert Piddocke – Vice President of Channel and Business Development is passionate about information management and governance. He has worked for several information management companies and assisted with a number of migration projects. In addition, he is an information retrieval geek, who has authored two books on SharePoint Search. Michael Paye – Chief Technology Officer at Concept Searching has been the driving force behind many of the company's recent innovations, including the SharePoint Add-in and hybrid search products. He has a wealth of experience across the Microsoft platform and related technologies, and oversees all product development.
  • 3. © Concept Searching 2017 Agenda • Who we are and what we do • Success rate and other fun facts • The business requirements • Best practices – data profiling • How long is this going to take? • Records management • Cloud migration • Metadata and auto-classification • The process • Case study • Takeaways
  • 4. © Concept Searching 2017 • Company founded in 2002 • Product launched in 2003 • Focus on management of structured and unstructured information • Profitable, debt free • Technology Platform • Delivered as a web service • Automatic concept identification, content tagging, auto-classification, taxonomy management • Only statistical vendor that can extract conceptual metadata • 8 years KMWorld ‘100 Companies that Matter in Knowledge Management’ 8 years KMWorld ‘Trend Setting Product’ • Authority to Operate enterprise wide US Air Force, NETCON US Army, and Canadian SLSA • Client base: Fortune 500/1000 organizations in Healthcare, Financial Services, Manufacturing, Energy, Professional Services, Pharmaceutical, Public sector and DoD • Microsoft Gold Certification in Application Development • Member of SharePoint PAC and TAP programs • Deployed as a full trust Add-in for all versions of SharePoint on-premises and SharePoint Online, including the latest vNext dedicated platform and the government cloud The Global Leader in Managed Metadata Solutions
  • 5. © Concept Searching 2017 Concept Searching’s technology platforms deliver semantic metadata generation, auto-classification and taxonomy/Term Store management, and are fully integrated with all versions of SharePoint on-premises, Microsoft Online/Office 365, and OneDrive for Business What Do We Do? These infrastructure platforms integrate not only with SharePoint but also other content repositories, search engines and file shares, enabling our clients to add structure and manage their enterprise content, regardless of environment The resulting classification metadata is used by clients to deliver ‘intelligent metadata solutions’ in areas such as enhanced search, migration, data privacy, records management, policy enforcement, compliance, text analytics, and business and social collaboration
  • 6. © Concept Searching 2017 Unique Approach – Compound Term Processing • Remains unique in the industry • Ability to identify and correctly weight multi-word concepts in unstructured text 6 Concept Searching provides Automatic Concept Term Extraction Triple Baseball Three Heart Organ Center Bypass Highway Avoid
  • 7. © Concept Searching 2017 Success Rate and Other Fun Facts • Only 16% of migration projects were on time and on budget • 64% were late • Estimate for overtime overruns was 40% • 37% were over budget • More than half of survey respondents blamed poor, inadequate or unrealistic scoping caused budget overruns (Bloor Research Data Migration White Paper) MIGRATION IS AN OPPORTUNITY
  • 8. © Concept Searching 2017 Large Scale Projects Don’t Do Any Better • Only 16% of these did not experience either time or cost overruns • 19% of projects did not have a separate budget or a timescale for data migration, so these could not be measured • Of the remainder • 3% went over budget but were delivered on time • 46% went over time but not over budget • 51% went over both time and budget • How can you run over time but not go over budget? • Appears to be easier to exceed timescales and stay within budget than to do the reverse
  • 9. © Concept Searching 2017 The Business Requirements Critical First Steps • Identify what information exists • Determine what may need migrating • Determine how the business needs to use the information • Is migration the right response? • How to maintain business continuity • Who needs access and how quickly? • How is the information used? • How do you need to read, edit, reuse, print, publish, share or make use of the information? • Is it being kept because of regulatory or historical preservation? • Is the content still within the retention period? • How long do you need to keep it? • Can it be archived or deleted?
  • 10. © Concept Searching 2017 There is a gross underestimation of just how complex and challenging a data migration can become. It is often perceived as a data ‘shunt and grunt’ exercise, tagged on to the end of the much higher visibility target implementation. Data Migration Pro
  • 11. © Concept Searching 2017 Best Practices – Data Profiling • The quality of your project planning will influence any application that uses metadata after the migration • Search, data protection, records management, eDiscovery, text analytics • 90% do not use data profiling* – ? • Include quality analysis • Almost all of you use hand coding • Reactive or proactive? • Data profiling by hand is typically unfeasible documents (Bloor Research)
  • 12. © Concept Searching 2017 Best Practices – Data Profiling • What content is moving? • What content can we get rid of? ROT • How can it be grouped? • What content requires special handling – privacy, undeclared records? • What content requires changes? • Most likely the metadata • How volatile is the content? • Data profiling outputs include lists that detail • Content to be migrated in logical groups • Content that requires special handling • Content that will require changes along with the scope • Do it before time and budget estimates
  • 14. © Concept Searching 2017 Content Optimization • 1% of corporate information is on litigation hold, 5% is in a records retention category, 25% has current business value • This means 69% of data most organizations keep can, and should, be deleted (CGOC) • 60% of documents are obsolete (eLaw) • 3%-5% of files are lost or misplaced – annual losses to a Fortune 1000 company with one million files is $5M (Survey reported in Information Week) • Companies typically misfile 2% - 7% of their records (New York City Chapter, ARMA International) • Costs $4-$7 to recreate it • US managers spend an average of 4 weeks per year searching for misfiled, mislabeled, untracked, or lost papers (Cuadro Associates) • 90% of content is never accessed after creation
  • 15. © Concept Searching 2017 Best Practices – Content Optimization Content Optimization • De-duplication • Versioning • Identification of records that were never declared • Identification of content containing security vulnerabilities • Elimination of redundant, obsolete, and trivial content • Addresses noncompliant content and content that should be archived • Can be used to auto-classify content from diverse internal or external repositories 95% of your data is unstructured Annual growth rate of unstructured content is 120%
  • 16. © Concept Searching 2017 Migration of Records ISO standard 15489-1:2001 Defines records management as the field of management responsible for the efficient and systematic control of the creation, receipt, maintenance, use, and disposition of records, including the processes for capturing and maintaining evidence of and information about business activities and transactions in the form of records
  • 17. © Concept Searching 2017 Best of Breed? Hardly Solutions can be categorized into five broad categories 1. Preserving the original technology used to create or store the records 2. Emulating the original technology on new platforms 3. Migrating the software necessary to retrieve, deliver, and use the records 4. Migrating the records to up-to-date formats 5. Converting records to standard forms Each has pros and cons
  • 18. © Concept Searching 2017 Fundamental Problems in Records Management? • The problem of content migration is not specific to records systems – it is a universal problem • Hampers the ability to manage content over a period of time, impacting data privacy, eDiscovery, litigation • The first generation of electronic records management system specifications – everything from the US DoD 5015.2 that came out in 1998 to MoReq2 that came out in 2008 – did not attempt to tackle the problem • They told vendors what types of metadata to put into their products – but they did not tell vendors how to implement that metadata
  • 20. © Concept Searching 2017 Cloud Migration 1. Planning and assessment 2. Duplicate environments 3. Staff training 4. Third party migration tools 5. Leases and licenses 6. Migration consulting
  • 21. © Concept Searching 2017 Exchange • Emails are opened by only one third of recipients • About 90% is junk • Still must be managed • Classification of emails and attachments as content is created or ingested • Identifies unprotected security vulnerabilities, as well as confidential information as defined by an organization • Removes breach from search • Sends to a secure repository, or person • Disables download unless authorized • Added value for compliance officers who can identify vulnerabilities within the content • Can migrate content to other content sources • Retains existing security AIIM notes that 87% of organizations are concerned about cloud chaos. The desire to shift strategies to embrace newer content management tools has refocused attention around .migration and disposition.
  • 22. © Concept Searching 2017 Reducing the Quality of Metadata The process of migration is lossy • Enormous amount of time to move metatada • Most will not manually map content to new system counterpart metadata • What are the approaches? • Maintain the original application • Cost of licenses fees and support • Users will need to go to the old repository to retrieve information – sooner or later they will forget • Try to connect search policies to new and old system and execute search • Poor search results and ranking • Leave old application, fix new only • Poor ranking • Users need access to old system
  • 23. © Concept Searching 2017 Auto-classification in Intelligent Migration Solution • Content migration is an opportunity to bring governance to existing processes • Ability to address high value content, content that should be deleted, duplicate documents, records that were never declared, unprotected privacy exposures • Normalize systems that may have fallen out of compliance • Opportunity to get rid of dark data, ROT, content garbage • File shares to file shares, file shares to SharePoint, SharePoint to SharePoint, custom action from any other repository – .NET code and web services, plug-in architecture to custom develop content sources and destination sources After the migration, content is organized in a hierarchical format, managed internally, and is comprised of multi-term metadata that is unique to each discrete piece of content
  • 24. © Concept Searching 2017 How Long Is This Going to Take? Specifically for a migration to the web, but most are applicable to an on-premises environment • Content needs to be rewritten for the web because it is not up to a certain standard or has been copied directly from traditional media • Content needs to be reorganized to fit a new information architecture • Content is old or irrelevant and hasn’t been touched in the last five years • Most of the content is no longer performing and not attracting many visitors • No recent content quality analysis based on usage and conversion exists • Project managers underestimate effort to manually convert and enter content • Content is not fully separated from presentations, making automation difficult • Metadata around content needs to be regenerated because classifications or other systems have changed – retagging content becomes a major pain point • Persistency of URLs can become a problem after automated content migration – URLs can change and follow different patterns, turning defining and implementing redirect systems into a difficult new project
  • 26. © Concept Searching 2017 Situation: • Global automotive organization • 40,000 users Challenge: • Migration of over 20 million documents from SharePoint on-premises to the Office 365 dedicated vNext platform • Improved enterprise search and collaboration across 30 content sources • Simplified access to information for a variety of stakeholders Solution: • conceptClassifier for Office 365 platform Benefits: • Cost reduction – decommission of 50 on-premises servers to 5 • Content now auto-classified and searchable in the cloud • Ease of access to information • Improved business production Case Study – Not the Norm
  • 27. © Concept Searching 2017 How Did the Process Work? • Create a taxonomy • Taxonomy designed for subject-matter experts • Easy to use • Begin auto-classification • Update taxonomy using the taxonomy prompt ‘Suggest Clues from Class’ • Reiterate for content optimization, security breaches, records • Index and classify the whole corpus in alignment with business requirements
  • 28. © Concept Searching 2017 What Was the Result? • Reduced on-premises servers from 50 servers to 5 • Achieved immediate improvements in enterprise search and eDiscovery, enabled concept based searching • Accomplished in two weeks • Successful migration of 20 million documents
  • 29. © Concept Searching 2017 Takeaways • See migration as an OPPORTUNITY • Reduce your risk through content optimization • Planning is a key component • Failure due to poor planning and budget overruns
  • 30. © Concept Searching 2017 Thank You Michael Paye Chief Technology Officer Concept Searching mikep@conceptsearching.com www.conceptsearching.com marketing@conceptsearching.com Twitter @conceptsearch Robert Piddocke Vice President of Channel and Business Development Concept Searching robertp@conceptsearching.com