SlideShare a Scribd company logo
“ The use of visual analytics tools for unstructured content analysis” David Whitehead TBA, Vancouver Visual Analytics
Visual Analytics
My Proposal Project Investigate adopting visual analytics tools for unstructured content analysis and provide these tools as a service for TBA's and IS's in support of client and sector needs.  Stage 1: needs analysis What unstructured content analysis does CISTI perform?  What types of client questions could/should we use VA tools to help answer?  How would different CISTI teams make use of VA tools?  What experience does CISTI have with VA and unstructured content analysis tools? Stage 2: survey available tools What tools are available?  What are the tradeoffs for each tool?  How well do they meet CISTI's needs?  Stage 3: pilot study Select and deploy 1 or 2 VA tools for use in a pilot study  Select and train up to 3 IS/TBA teams in the use of Visual Analysis techniques and tools  Each team to use VA tools to work with at least 2 client projects  Teams to report on the effectiveness, usability, strengths and weaknesses of the selected VA tools.  Clients to feedback on the usefulness of the VA tool analysis/findings  Deliverables Unstructured content analysis tools needs assessment Survey of available visual analytics tools Train 1 or 2 IS/TBA teams on Visual Analytics techniques Pilot Study report
My Proposal Project Investigate adopting visual analytics tools for unstructured content analysis and provide these tools as a service for TBA's and IS's in support of client and sector needs.  Stage 1: needs analysis What unstructured content analysis does CISTI perform?  What types of client questions could/should we use VA tools to help answer?  How would different CISTI teams make use of VA tools?  What experience does CISTI have with VA and unstructured content analysis tools? Stage 2: survey available tools What tools are available?  What are the tradeoffs for each tool?  How well do they meet CISTI's needs?  Stage 3: pilot study Select and deploy 1 or 2 VA tools for use in a pilot study  Select and train up to 3 IS/TBA teams in the use of Visual Analysis techniques and tools  Each team to use VA tools to work with at least 2 client projects  Teams to report on the effectiveness, usability, strengths and weaknesses of the selected VA tools.  Clients to feedback on the usefulness of the VA tool analysis/findings  Deliverables Unstructured content analysis tools needs assessment Survey of available visual analytics tools Train 1 or 2 IS/TBA teams on Visual Analytics techniques Pilot Study report OVERLY AMBITIOUS
Reality One Visual Analytics tool: “ Starlight” from Futurepoint Systems Two projects: Catalog of the BC Wireless Industry Finding an appropriate partner for an enterprise USB Flash drive solution
I tried one Visual Analytics tool out on two projects: Catalog of the BC Wireless Industry Finding an appropriate partner for an enterprise USB Flash drive solution Reality Practical
Why Visual Analytics I don’t look for terrorists, but I do reduce risk
Visual Analytics “ People use visual analytics tools and techniques to synthesize information and derive insight from massive, dynamic, ambiguous, and often conflicting data; detect the expected and discover the unexpected; provide timely, defensible, and understandable assessments; and communicate assessment effectively for action.” From: http://en.wikipedia.org/wiki/Visual_analytics
Visual Analytics Visual Analytics Sounds a lot like CISTI! synthesize information  derive insight from massive, dynamic, ambiguous, conflicting data detect the expected and discover the unexpected provide timely, defensible, and understandable assessments communicate assessment effectively for action
Why explore Visual Analytics Clients come to IS’s and TBA’s expecting "the magic answers" and "a crystal ball“ Our success depends on our ability to meet that un-written brand promise, as unrealistic as it may be. IS’s and TBA’s need to be the experts in information retrieval and analysis, including sophisticated analysis of unstructured content. IS’s and TBA’s have the expertise to interpret and convey the results to business clients.
Visual Analytics Turns This Into This 1 - 10 of about 3,660,000 for  enterprise   usb   content   distribution
Demo Incredibly Amazing Demo
Multiple Views
Multiple Views
Subject Views
Good Clustering of Data Helps identify key topics for analysis Quickly eliminates results of no value Provides some confidence in the uniqueness of a solution Allows broader coverage than manual approaches
Bad Deeper Analysis requires a lot of preparation and data manipulation Getting the data from here to something consumable is hard Requires a lot of learning to get meaningful results Best used by experts in a domain
Awesome! Data input tools Automatically crawl sites Convert unstructured  documents into data Look for multiple parameters simultaneously Automatically extract entities like people, places and companies
Test 1 Enterprise USB Flash Application Test Traditional Approach Visual Analytics Approach 7 hours  Read 53 web pages Identified 20 potential partners Narrowed to 4 likely partners Identified 2 key competing approaches 2 hours Analyzed about 600 web pages Identified 1 likely partner Identified 7 competing approaches
Test 2 Wireless Industry Profile Traditional Approach Visual Analytics Approach 1 month  Read about 1000 web pages Classified 93 pre-defined technologies across 277 mobile industry firms in 2 cities (Vancouver & Ottawa) 8 hours 1 st  web crawler pass Analyzed about 11000 web pages starting with links to 137 companies Identified 123 separate technologies
General Observations Analytics result in much faster generation of garbage out Many trials needed to get the automation right Once right – automation dramatically helps in large environmental scans. The two techniques complement each other well, help to double check findings Helps find more outliers Helps identify useful items to look for Investigative techniques are essential to drawing real conclusions Very hard to communicate the process and the comparative value of results using the tools alone.
Visual Analytics “ Visual Analytics is the integration of interactive visualization with analysis techniques to answer a growing range of questions in science, business, and analysis. It can attack certain problems whose size, complexity, and need for closely coupled human and machine analysis may make them otherwise intractable.”   From: http://en.wikipedia.org/wiki/Visual_analytics
Why learn more… Anyone can use Google… VA tools are the tools used by leaders in large scale information analysis such as government security forces Visual Analytics help make sense of information overload Making sense of information overload is the essential skill for information analysts. Information overload is every industries problem VA tools are moving out of the security world into the realm of business and scientific analysis expertise in applying tools like VA to analysis tasks in the fields of business and science is what will set CISTI IS’s and TBA’s apart.

More Related Content

PDF
2015 Forrester Report
PDF
Forrester on Big Data
PDF
Empirical discovery concept model
PPTX
Iterative Discovery and Analysis: Workflow / Activity and Capability Model
PDF
Data analytics course with technologies
PDF
Self-service analytics risk_September_2016
PPTX
Making Predictive Analytics Practical: How Marketing Can Drive Engagement
2015 Forrester Report
Forrester on Big Data
Empirical discovery concept model
Iterative Discovery and Analysis: Workflow / Activity and Capability Model
Data analytics course with technologies
Self-service analytics risk_September_2016
Making Predictive Analytics Practical: How Marketing Can Drive Engagement

What's hot (20)

PDF
H2O World - Machine Learning for non-data scientists
PPTX
Data Quality Analytics: Understanding what is in your data, before using it
PPT
Streamlined Product Evaluation
PPT
Micropanel Webinar 8 24-11
PDF
Developing an Analytical Mindset – Becoming an Analytical Competitor
PPTX
Extending Enterprise Search at AstraZeneca
PDF
Elsevier
PPTX
Supporting innovation in insurance with randomized experimentation
PPTX
Emvigo Data Visualization - E Commerce Deck
PDF
Big data in action - Watson in banking Wealth management
PDF
Getting Results through Data-driven Procurement
PPTX
Max diff scaling for research access(4)
PDF
Text/Content Analytics 2011: User Perspectives on Solutions and Providers
PPTX
Next Gen Clinical Data Sciences
PDF
Meetup7 integration microservices_machine_learning
PDF
CATI surveys in a CX environment
PDF
Bigdataanalytics
PPTX
An Industry Perspective on Subjectivity, Sentiment, and Social
PDF
What’s next for healthcare information technology innovation?
PPTX
Automation Isn't Enough: You Need Robotics or AI
H2O World - Machine Learning for non-data scientists
Data Quality Analytics: Understanding what is in your data, before using it
Streamlined Product Evaluation
Micropanel Webinar 8 24-11
Developing an Analytical Mindset – Becoming an Analytical Competitor
Extending Enterprise Search at AstraZeneca
Elsevier
Supporting innovation in insurance with randomized experimentation
Emvigo Data Visualization - E Commerce Deck
Big data in action - Watson in banking Wealth management
Getting Results through Data-driven Procurement
Max diff scaling for research access(4)
Text/Content Analytics 2011: User Perspectives on Solutions and Providers
Next Gen Clinical Data Sciences
Meetup7 integration microservices_machine_learning
CATI surveys in a CX environment
Bigdataanalytics
An Industry Perspective on Subjectivity, Sentiment, and Social
What’s next for healthcare information technology innovation?
Automation Isn't Enough: You Need Robotics or AI
Ad

Viewers also liked (20)

PDF
in*Bug: Software Defect Analytics
PPTX
Life Science Analytics
PDF
Integrating Structure and Analytics with Unstructured Data
PDF
Digital analytics: Visualization (Lecture 5)
PDF
Web analytics using R
PPTX
Inside Out of A Web Analyst Mind
PDF
A STUDY ON CHALLENGES & OPPORTUNITIES FOR FREIGHT FORWARDERS IN INDIA AND EXI...
PPTX
Using Unstructured Text Data to Stay Ahead of Market Trends and Quantify Cust...
DOCX
Biometrics research paper
PDF
Future of Visitor Audience segmentation
PDF
The what, why and how of web analytics testing
PPT
Problem And Prospectus Of Export House
PDF
Advanced analytics proposal review guide
ODP
Hadoop at aadhaar
PDF
IBM Watson Content Analytics Redbook
PPTX
Digital analytics with R - Sydney Users of R Forum - May 2015
PDF
Design to Differentiate An Approach to Test, Target and Learn
PDF
Getting Started with Unstructured Data
PDF
Advanced Defect Management
PPTX
Models of audience segmentation
in*Bug: Software Defect Analytics
Life Science Analytics
Integrating Structure and Analytics with Unstructured Data
Digital analytics: Visualization (Lecture 5)
Web analytics using R
Inside Out of A Web Analyst Mind
A STUDY ON CHALLENGES & OPPORTUNITIES FOR FREIGHT FORWARDERS IN INDIA AND EXI...
Using Unstructured Text Data to Stay Ahead of Market Trends and Quantify Cust...
Biometrics research paper
Future of Visitor Audience segmentation
The what, why and how of web analytics testing
Problem And Prospectus Of Export House
Advanced analytics proposal review guide
Hadoop at aadhaar
IBM Watson Content Analytics Redbook
Digital analytics with R - Sydney Users of R Forum - May 2015
Design to Differentiate An Approach to Test, Target and Learn
Getting Started with Unstructured Data
Advanced Defect Management
Models of audience segmentation
Ad

Similar to Proposal 12 - Visual Analytics (20)

PPTX
Designing Guidelines for Visual Analytics System to Augment Organizational An...
PPTX
Bigdata analytics
PDF
Visual analytics
PPTX
Automated BI Modernizations
PDF
See Your Data In A New Way. SAS Visual Analytics
PDF
SAS Visual Analytics
PPTX
Bi 4.0 Migration Strategy and Best Practices
PDF
Visualisation and forecasting on IT capacity planning data
PDF
Tech trends 2011
PPTX
Sbi_VA
PDF
SAS Visual Analytics Overview
PDF
Andy Kirk Malofiej 20 Presentation
PDF
Visual Analytics
PDF
When Worlds Collide: Intelligence, Analytics and Operations
PPT
Enhancing AT through ID Techniques
PDF
“Big Picture”: Mixed-Initiative Visual Analytics of Big Data (VINCI 2013 Keyn...
PDF
BI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
PDF
Data Dinner Parties
PDF
Are you getting the most out of your data?
PPTX
Business Visualization: Dashboard & Storyboarding
Designing Guidelines for Visual Analytics System to Augment Organizational An...
Bigdata analytics
Visual analytics
Automated BI Modernizations
See Your Data In A New Way. SAS Visual Analytics
SAS Visual Analytics
Bi 4.0 Migration Strategy and Best Practices
Visualisation and forecasting on IT capacity planning data
Tech trends 2011
Sbi_VA
SAS Visual Analytics Overview
Andy Kirk Malofiej 20 Presentation
Visual Analytics
When Worlds Collide: Intelligence, Analytics and Operations
Enhancing AT through ID Techniques
“Big Picture”: Mixed-Initiative Visual Analytics of Big Data (VINCI 2013 Keyn...
BI congres 2014-4: thinking out of the box - Jos Cools - Crosspoint
Data Dinner Parties
Are you getting the most out of your data?
Business Visualization: Dashboard & Storyboarding

Recently uploaded (20)

PPTX
Spectroscopy.pptx food analysis technology
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPT
Teaching material agriculture food technology
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
Spectroscopy.pptx food analysis technology
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Spectral efficient network and resource selection model in 5G networks
Chapter 3 Spatial Domain Image Processing.pdf
Programs and apps: productivity, graphics, security and other tools
“AI and Expert System Decision Support & Business Intelligence Systems”
Encapsulation_ Review paper, used for researhc scholars
Review of recent advances in non-invasive hemoglobin estimation
Understanding_Digital_Forensics_Presentation.pptx
Teaching material agriculture food technology
Reach Out and Touch Someone: Haptics and Empathic Computing
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Diabetes mellitus diagnosis method based random forest with bat algorithm
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Building Integrated photovoltaic BIPV_UPV.pdf

Proposal 12 - Visual Analytics

  • 1. “ The use of visual analytics tools for unstructured content analysis” David Whitehead TBA, Vancouver Visual Analytics
  • 3. My Proposal Project Investigate adopting visual analytics tools for unstructured content analysis and provide these tools as a service for TBA's and IS's in support of client and sector needs. Stage 1: needs analysis What unstructured content analysis does CISTI perform? What types of client questions could/should we use VA tools to help answer? How would different CISTI teams make use of VA tools? What experience does CISTI have with VA and unstructured content analysis tools? Stage 2: survey available tools What tools are available? What are the tradeoffs for each tool? How well do they meet CISTI's needs? Stage 3: pilot study Select and deploy 1 or 2 VA tools for use in a pilot study Select and train up to 3 IS/TBA teams in the use of Visual Analysis techniques and tools Each team to use VA tools to work with at least 2 client projects Teams to report on the effectiveness, usability, strengths and weaknesses of the selected VA tools. Clients to feedback on the usefulness of the VA tool analysis/findings Deliverables Unstructured content analysis tools needs assessment Survey of available visual analytics tools Train 1 or 2 IS/TBA teams on Visual Analytics techniques Pilot Study report
  • 4. My Proposal Project Investigate adopting visual analytics tools for unstructured content analysis and provide these tools as a service for TBA's and IS's in support of client and sector needs. Stage 1: needs analysis What unstructured content analysis does CISTI perform? What types of client questions could/should we use VA tools to help answer? How would different CISTI teams make use of VA tools? What experience does CISTI have with VA and unstructured content analysis tools? Stage 2: survey available tools What tools are available? What are the tradeoffs for each tool? How well do they meet CISTI's needs? Stage 3: pilot study Select and deploy 1 or 2 VA tools for use in a pilot study Select and train up to 3 IS/TBA teams in the use of Visual Analysis techniques and tools Each team to use VA tools to work with at least 2 client projects Teams to report on the effectiveness, usability, strengths and weaknesses of the selected VA tools. Clients to feedback on the usefulness of the VA tool analysis/findings Deliverables Unstructured content analysis tools needs assessment Survey of available visual analytics tools Train 1 or 2 IS/TBA teams on Visual Analytics techniques Pilot Study report OVERLY AMBITIOUS
  • 5. Reality One Visual Analytics tool: “ Starlight” from Futurepoint Systems Two projects: Catalog of the BC Wireless Industry Finding an appropriate partner for an enterprise USB Flash drive solution
  • 6. I tried one Visual Analytics tool out on two projects: Catalog of the BC Wireless Industry Finding an appropriate partner for an enterprise USB Flash drive solution Reality Practical
  • 7. Why Visual Analytics I don’t look for terrorists, but I do reduce risk
  • 8. Visual Analytics “ People use visual analytics tools and techniques to synthesize information and derive insight from massive, dynamic, ambiguous, and often conflicting data; detect the expected and discover the unexpected; provide timely, defensible, and understandable assessments; and communicate assessment effectively for action.” From: http://en.wikipedia.org/wiki/Visual_analytics
  • 9. Visual Analytics Visual Analytics Sounds a lot like CISTI! synthesize information derive insight from massive, dynamic, ambiguous, conflicting data detect the expected and discover the unexpected provide timely, defensible, and understandable assessments communicate assessment effectively for action
  • 10. Why explore Visual Analytics Clients come to IS’s and TBA’s expecting "the magic answers" and "a crystal ball“ Our success depends on our ability to meet that un-written brand promise, as unrealistic as it may be. IS’s and TBA’s need to be the experts in information retrieval and analysis, including sophisticated analysis of unstructured content. IS’s and TBA’s have the expertise to interpret and convey the results to business clients.
  • 11. Visual Analytics Turns This Into This 1 - 10 of about 3,660,000 for enterprise usb content distribution
  • 16. Good Clustering of Data Helps identify key topics for analysis Quickly eliminates results of no value Provides some confidence in the uniqueness of a solution Allows broader coverage than manual approaches
  • 17. Bad Deeper Analysis requires a lot of preparation and data manipulation Getting the data from here to something consumable is hard Requires a lot of learning to get meaningful results Best used by experts in a domain
  • 18. Awesome! Data input tools Automatically crawl sites Convert unstructured documents into data Look for multiple parameters simultaneously Automatically extract entities like people, places and companies
  • 19. Test 1 Enterprise USB Flash Application Test Traditional Approach Visual Analytics Approach 7 hours Read 53 web pages Identified 20 potential partners Narrowed to 4 likely partners Identified 2 key competing approaches 2 hours Analyzed about 600 web pages Identified 1 likely partner Identified 7 competing approaches
  • 20. Test 2 Wireless Industry Profile Traditional Approach Visual Analytics Approach 1 month Read about 1000 web pages Classified 93 pre-defined technologies across 277 mobile industry firms in 2 cities (Vancouver & Ottawa) 8 hours 1 st web crawler pass Analyzed about 11000 web pages starting with links to 137 companies Identified 123 separate technologies
  • 21. General Observations Analytics result in much faster generation of garbage out Many trials needed to get the automation right Once right – automation dramatically helps in large environmental scans. The two techniques complement each other well, help to double check findings Helps find more outliers Helps identify useful items to look for Investigative techniques are essential to drawing real conclusions Very hard to communicate the process and the comparative value of results using the tools alone.
  • 22. Visual Analytics “ Visual Analytics is the integration of interactive visualization with analysis techniques to answer a growing range of questions in science, business, and analysis. It can attack certain problems whose size, complexity, and need for closely coupled human and machine analysis may make them otherwise intractable.” From: http://en.wikipedia.org/wiki/Visual_analytics
  • 23. Why learn more… Anyone can use Google… VA tools are the tools used by leaders in large scale information analysis such as government security forces Visual Analytics help make sense of information overload Making sense of information overload is the essential skill for information analysts. Information overload is every industries problem VA tools are moving out of the security world into the realm of business and scientific analysis expertise in applying tools like VA to analysis tasks in the fields of business and science is what will set CISTI IS’s and TBA’s apart.