Collaboration ate my Intranet
Managing Unstructured Content with Search Powered Solutions
ALMOND LABS
SATURDAY FEB 23, 2013



2/25/2013   PRANAV SHARMA [@EPRANAV] | ALMOND LABS [@ALMONDLABS]   1
Almond Labs Identity Service Application
                                 Reach across barriers to your extended organization




Pranav Sharma
@ePranav

Co-Founder, Almond Labs
@AlmondLabs




Thanks To Our Sponsors!




SharePoint User Group
SharePoint
  ◦   End Users
  ◦   Administrators
  ◦   Architects
  ◦   Developers
  ◦   IT Pros

Meetings: 2nd Tuesday of the
month, Microsoft Malvern, 5:30-8 pm

WEB: www.TriStateSharePoint.org
EMAIL: info@TriStateSharePoint.org
TWITTER: @tristateSP



SharePoint Network
Are you an independent consultant or remote worker who deals with
SharePoint, Office or Office365?
Do you sometimes feel cut off from the rest of the SharePoint world?
Do you need help with technical or business issues, or just want the
chance to socialize with others?


If so, then the SharePoint Network might be for you!
www.SharePointNetwork.org




Managing Unstructured Content with Search Powered Solutions
Traditional approach
  ◦ Static taxonomy mirroring organization structure
  ◦ Users are expected to browse to content
     ◦ Presentation template (Marketing)
     ◦ Vacation policy (HR)

Problems with Traditional approach
  ◦ Users have to invest a lot of thought into where their desired content lives
  ◦ Barrier to entry for users to contribute, if users are not familiar with
    taxonomy
Search Powered Solutions
  ◦ Allows for organic growth & collaboration
  ◦ Allows users to discover content instead of expecting them to browse
  ◦ Enables users to quickly identify relevant content



Continuous Crawl
Parallel search crawls keep
content fresh without
manual crawl schedules.




Refiners
Provide drill-down behavior based on search result metadata.
Refiners now cover the entire result set (as opposed to only the first 200 results)
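
A minimal Python sketch of why the scope of refiner counts matters. The sample data and the `refiner_counts` helper are hypothetical and are not SharePoint APIs; the sketch only contrasts counting a metadata value over the first 200 hits with counting it over the whole result set.

```python
from collections import Counter

def refiner_counts(results, prop, limit=None):
    """Count the values of one metadata property across search results.

    limit=None counts over the entire result set; limit=200 mimics
    refiners computed from only the first 200 results.
    """
    scope = results if limit is None else results[:limit]
    return Counter(r[prop] for r in scope if prop in r)

# Hypothetical result set: 250 hits, and the last 50 are PDFs.
results = [{"FileType": "docx"}] * 200 + [{"FileType": "pdf"}] * 50

shallow = refiner_counts(results, "FileType", limit=200)  # first 200 only
deep = refiner_counts(results, "FileType")                # entire result set
```

With the 200-result cutoff the `pdf` value never shows up as a refiner, even though 50 matching documents exist further down the result set.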




Document Preview Panels
For identifying content in the browser. Allows users to stay on the search page while locating
the correct document

PDF Support
Query Rules
Allow for customizing result sets based on “intent.” Allows for promotion or highlighting of
content based on rules around the search query
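
The difference between an exact-match keyword and an intent-based rule can be sketched in a few lines of Python. Everything here is illustrative: the function names, the intent terms, and the intranet URL are assumptions and not part of SharePoint, where rules are configured through the search administration UI.

```python
def best_bet(query, keywords):
    """Search Keywords style: an exact query match promotes one link."""
    return keywords.get(query.strip().lower())

def query_rule_blocks(query, rules):
    """Query Rules style: each rule is (intent_terms, block_name).

    A rule fires when any intent term appears in the query, and it
    contributes a whole block of results rather than a single link.
    """
    words = set(query.lower().split())
    return [block for terms, block in rules if words & terms]

# Hypothetical configuration.
keywords = {"image library": "http://intranet.example/images"}
rules = [({"image", "images", "photo", "photos", "picture"}, "Image results")]

promoted = best_bet("team photos", keywords)      # no exact match
blocks = query_rule_blocks("team photos", rules)  # intent rule still fires
```

The keyword only helps when the user types the exact phrase, while the rule reacts to any query that looks like a request for images.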

Query Suggestions
Dynamic feature in SharePoint 2013, but you can also set explicit suggestions that will be
offered to the user for common queries
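
The speaker notes for this slide mention that suggestions are derived from logged (query, result) click pairs and that six clicks are involved. A rough Python sketch of one reading of that rule follows; the threshold semantics, the function name, and the sample log are assumptions, not the actual timer job logic.

```python
from collections import Counter

CLICK_THRESHOLD = 6  # taken from the speaker notes; exact semantics are assumed

def suggestions_from_clicks(click_log, threshold=CLICK_THRESHOLD):
    """Return queries whose results were clicked at least `threshold` times.

    click_log is a list of (query, clicked_result) pairs.
    """
    clicks_per_query = Counter(query.lower() for query, _ in click_log)
    return sorted(q for q, n in clicks_per_query.items() if n >= threshold)

# Hypothetical click log: six clicks for one query, two for another.
click_log = [("vacation policy", f"doc{i}") for i in range(6)]
click_log += [("expense form", "doc9")] * 2

suggested = suggestions_from_clicks(click_log)
```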

Result Sources
“Containers” for search results that hold results from specific sources and/or are filtered by
specific criteria.
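
For a remote source, the speaker notes give a source URL containing the `{searchTerms}` query variable. A small Python sketch of how such a template could be expanded is shown below; the `expand_source_url` helper is hypothetical, and the variable casing in the template is normalized to `{searchTerms}` here.

```python
from urllib.parse import quote_plus

# Source URL from the speaker notes, with the query variable as a placeholder.
TEMPLATE = "http://www.bing.com/search?q={searchTerms}&format=rss&Market=en-Us"

def expand_source_url(template, search_terms):
    """Replace the {searchTerms} query variable with the URL-encoded query."""
    return template.replace("{searchTerms}", quote_plus(search_terms))

url = expand_source_url(TEMPLATE, "vacation policy")
```

The user's query is encoded before substitution so that spaces and special characters do not break the remote request.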

Result Type Display Templates
Highlight different types of results differently in the same result set


Search Schema
Searchable

Queryable

Retrievable

Refinable

Sortable
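
The difference between the searchable and queryable settings is easy to mix up, so here is a toy Python model of the behavior described in the speaker notes. The `matches` function and the schema dictionaries are hypothetical; retrievable, refinable, and sortable are not modeled.

```python
def matches(item, query, schema):
    """Toy model of the searchable and queryable settings.

    schema maps a managed property name to the set of enabled settings.
    A 'prop:value' query requires the property to be queryable; a
    free-text query only looks inside properties marked searchable.
    """
    if ":" in query:
        prop, value = query.split(":", 1)
        if "queryable" not in schema.get(prop, set()):
            return False
        return value.lower() in str(item.get(prop, "")).lower()
    return any(
        query.lower() in str(item.get(prop, "")).lower()
        for prop, settings in schema.items()
        if "searchable" in settings
    )

item = {"title": "Q1 Proposal", "author": "Jane Smith"}
searchable_author = {"title": {"searchable"}, "author": {"searchable", "queryable"}}
queryable_only = {"title": {"searchable"}, "author": {"queryable"}}

free_text_hit = matches(item, "Smith", searchable_author)    # author is in the full-text index
free_text_miss = matches(item, "Smith", queryable_only)      # author is not searchable
fielded_hit = matches(item, "author:Smith", queryable_only)  # explicit property query
```

Leaving a property queryable but not searchable keeps it out of free-text matching, which is one way to reduce the noise the notes mention.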




Entity Extraction
Extract metadata from
crawled content without
explicit content
classification
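
The speaker notes describe company name extraction as being based on an inclusion list. A minimal Python sketch of that idea follows; the `extract_companies` helper and the list entries are hypothetical, and real extraction happens inside the content processing component rather than in user code.

```python
import re

def extract_companies(text, inclusion_list):
    """Return inclusion-list entries found in text as whole words, ignoring case."""
    found = []
    for name in inclusion_list:
        pattern = r"\b" + re.escape(name) + r"\b"
        if re.search(pattern, text, re.IGNORECASE):
            found.append(name)
    return found

inclusion_list = ["Almond Labs", "Contoso"]  # hypothetical dictionary entries
text = "Proposal prepared by almond labs for the Philadelphia office."

companies = extract_companies(text, inclusion_list)
```

The extracted values could then be stored as metadata on the item without anyone having tagged the document by hand.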




Content Aggregation
Improvements in the freshness of search results allow for
search-driven content aggregation: “rollups” and “summary views”




Community Site




My Sites




Logical Architecture
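
The speaker notes for this slide say the index is the sum of its partitions, that each partition can have replicas, and that queries are load balanced. The Python sketch below illustrates that relationship under stated assumptions: the class, the server names, and the round-robin policy are all hypothetical and do not describe how SharePoint actually routes queries.

```python
from itertools import cycle

class Partition:
    """One index partition served by one or more identical replicas."""

    def __init__(self, docs, replica_names):
        self.docs = docs
        self._next_replica = cycle(replica_names)  # assumed round-robin balancing

    def search(self, term):
        replica = next(self._next_replica)
        hits = [d for d in self.docs if term.lower() in d.lower()]
        return replica, hits

def search_index(partitions, term):
    """The full index is the union of its partitions, so query all of them."""
    served_by, results = [], []
    for partition in partitions:
        replica, hits = partition.search(term)
        served_by.append(replica)
        results.extend(hits)
    return served_by, results

# One partition with a replica on each of two hypothetical application servers.
index = [Partition(["Vacation policy", "Expense form"], ["App1", "App2"])]
first = search_index(index, "policy")
second = search_index(index, "policy")
```

Both calls return the same hits, but consecutive queries are answered by different replicas, which is how additional replicas can raise query throughput.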
Physical/Virtual Architecture
  ◦ Redundant search components
  ◦ Mirrored search index to increase query throughput
  ◦ Isolated distributed cache service
  ◦ SQL 2012 AlwaysOn for database redundancy

VM Host 1 and VM Host 2

Web Tier
  ◦ Web Server 1: Query Processing, Distributed Cache
  ◦ Web Server 2: Query Processing, Distributed Cache
  ◦ Office Web Apps Server

Application Tier
  ◦ Application Server 1: Crawl, Content Processing, Index Partition 0 replica
  ◦ Application Server 2: Crawl, Content Processing, Index Partition 0 replica
  ◦ Other Services

Database Tier
  ◦ Database Server 1: Crawl Db, Link Db, Content Db 1 to 4, Configuration Db, all other SharePoint databases
  ◦ Database Server 2: redundant copies of all databases using SQL 2012 AlwaysOn



Editor's Notes

  • #4: Please encourage folks to visit the sponsors in the lobby. Everyone should have a sponsor bingo card. If they get initials from each sponsor they can be entered in our drawings to win great prizes, including Kindles, and a Surface RT.
  • #5: This is the local Philly SharePoint user group. We cover a range of topics for all audiences.
  • #6: This is a local group that is just getting started as an adjunct to the user group. As the slide says, it is a support and social group for independent and remote workers in the Philly area. More information is available on the website.
  • #7: Findability, ease of use
  • #8: Addresses two fundamental issues with full/incremental crawls:
      ◦ Crawls can run in parallel
      ◦ One deep change will not result in degraded freshness on all following changes
      ◦ Only for SharePoint sources
      ◦ The interval of 15 minutes is the default value, but you can change it with the PowerShell cmdlet Set-SPEnterpriseSearchCrawlContentSource
      ◦ Use crawler impact rules to set the maximum number of simultaneous requests
  • #9: Unified core search engine combining SP & FAST
      ◦ Deep result counts (disabled by default). Studies showed that while it may be nice to see them, they weren’t actually used that much by users.
      ◦ Available refiners == Managed Properties
      ◦ /_catalogs/masterpage/Display Templates/Filters/
      ◦ Can drive refiners from term store managed metadata. “Faceted Navigation” (need to catalog-enable a library)
      ◦ Knowledge Library
  • #10: Need Office Web Apps in the environment for document previews (separate box)
  • #11: PDF indexing support OOTB (iFilter)
      ◦ Item_PDF.html, ~sitecollection/_catalogs/masterpage/Display Templates/Search/Item_Site_HoverPanel.js
  • #12: Replacement for best bets
      ◦ Search Keywords < Query Rules
      ◦ Search Keywords say “when the query is ‘image library’, promote a link to our image library.”
      ◦ Query Rules say “when it looks like the user wants images, show useful image results.”
      ◦ The power of Query Rules is this generality. You can infer what the user wants; instead of just promoting specific results, they can show whole other blocks of results relevant to the user’s query.
  • #13: User clicks results 6 times (not the same results)
      ◦ Clicks are logged, and (query, result) entry pairs are added to the Analytics Link Store DB, specifically to the MSSQLogPageClick table
      ◦ Timer job “Prepare Query Suggestions” creates suggestions daily
      ◦ Manual import: Central Admin > Search Service Application
      ◦ Import-SPEnterpriseSearchPopularQueries -ResultSource (Search Result Source where suggestions show)
  • #14: Scopes replaced by result sources
      ◦ Easy to search remote sources, including other farms
      ◦ Able to create complex search queries
      ◦ Query Builder tool: able to use any combination of managed property inclusion/exclusion
      ◦ Query testing from within the Query Builder tool
      ◦ Site & site collection admins can manage & configure result sources for their site collection
      ◦ {searchTerms} is a query variable
      ◦ http://www.bing.com/search?q={searchterms}&format=rss&Market=en-Us
  • #15: Render different items differently: Person, Proposal, Client Information, photos, videos
      ◦ Quickly find what you’re looking for
      ◦ Filter based on result source or type of content (Query rules)
  • #16: Crawled properties are automatically created when SharePoint encounters data in a column
      ◦ Managed properties drive facets
      ◦ Searchable (adds noise): Enables querying against the content of the managed property. The content of this managed property is included in the full-text index. For example, if the property is "author", a simple query for "Smith" returns items containing the word "Smith" and items whose author property contains "Smith".
      ◦ Queryable: Enables querying against the specific managed property. The managed property field name must be included in the query, either specified in the query itself or included in the query programmatically. If the managed property is "author", the query must contain "author:Smith".
      ◦ Retrievable: Enables the content of this managed property to be returned in search results. Enable this setting for managed properties that are relevant to present in search results.
      ◦ Refinable: Use as a refiner
      ◦ Sortable: Sort the result set on the server
  • #17: Company name extraction based on inclusion list
      ◦ Custom entity extraction: example SharePoint feature(?)
  • #18: Basics: Select from Documents, Content Types, Tags, Videos, more
      ◦ Refiners: Filters like department, author, file type/extension
      ◦ Settings: Performance (priority), query rules, page loading
      ◦ Number of items to show
      ◦ Display template: Control, Item
      ◦ Property mappings
  • #19: Forum experience to categorize and cultivate discussions with a broad group of peopleModerators manage, Visitors can joinPortal rolls it all up
  • #21: Crawl– Crawls content sources; content and metadata. Connectors/Protocol handlers. Scalable. Uses crawl DBA) Crawl DB – Tracking/Historicaly info such as last crawl time, last crawl id, type of updateContent Processing (*) – Artifacts for index. Document Parsing & Property Mapping. Linguistics processing, entity extraction. Links/URLs for Links DB where Analytics adds relevanceAnalytics (*) – Search & Usage analytics. Improve relevance, recommendations (search reports & deep links)B) Link DB – Unprocessed Link info from content processing and search click dataC) Alanytics reporting db – results of usage analysis & statisics info for search reportsIndex – 1 for each index replica. Stored in index file. Responds to queries (load balanced). Index = Sum (Partitions). Replicas for each partition possibleQuery Processing – Analyzes & Processes queries & results. Queries are optimized for precision, recall, relevancy.Search administration – run system processes essential to searche