SlideShare a Scribd company logo
2
Most read
3
Most read
4
Most read
SNJB’S LATE SAU K. B. JAIN COE, CHANDWAD
DEPARTMENT OF COMPUTER ENGINEERING
ACADEMIC YEAR 2020-21
SUBJECT: DATAANALYTICS
CASE STUDY ON GINA
By
Name: Divya Prafull Wani
Roll No:41
Class: BE Computer
Date:16-08-2020
Case StudyOn GINA
1
GINA
■ Global Innovation Network and Analysis.
■ The GINA case study provides an example of how a team applied the Data Analytics
Lifecycle to analyze innovation data at EMC.
■ GINA is a group of senior technologists located in centers of excellence around the world.
■ The GINA team thought its approach would provide a means to share ideas globally and
increase knowledge sharing among members who may be separated geographically.
■ It planned to create a data repository containing both structured and unstructured data to
accomplish 3 main goals .
1. Store Formal and informal data.
2. Track research from global technologists.
3. Mine the data for patterns and insights to improve the teams operations and strategy.
Case StudyOn GINA 2
Phase 1 : Discovery
■ Team Members and Roles
 Business user, project sponsor, project manager -Vice President from Office of CTO.
 BI analyst – person from IT
 Data Engineer and DBA – people from IT
 Data Scientist – distinguished engineer.
■ The data fell into two categories
 5 years of idea submissions from internal innovation contests.
 Minutes an notes representing innovation and research activity from around the world.
■ The data fell into two categories
 5 years of idea submissions from internal innovation contests.
 Minutes an notes representing innovation and research activity from around the world.
■ Hypothesis grouped into 2 categories
 Descriptive analytics of what is happening to spark further creativity, collaboration, an asset generation
 Predictive analytics to advise executive management of where it should be investing in the future.
Case StudyOn GINA 3
Phase 2 : Data Preparation
■ Set up anAnalytical Sandbox to store and experiment on the data.
■ Discovered that certain data needed conditioning and normalization and that missing
datasets were critical.
■ Team recognized that poor quality data could impact subsequent steps.
■ They discovered many names were misspelled and problems with extra spaces.
■ Important to determine what level of data quality and cleanliness was sufficient for
the project being undertaken.
Case StudyOn GINA 4
Phase 3: Model Planning
■ Included following considerations :
 Identify the right milestones to achieve the goals
 Trace how people move ideas from each ,milestone towards the goal.
 Once this is done, trace ideas that die and others that reach the goal.Compare the
journeys of ideas that make it and those that do not.
 Compare times and outcomes using a few different methods.These could be as simple
as t-tests or perhaps involve different types of classificationAlgorithms.
Case StudyOn GINA 5
Phase 4 : Model Building
■ The GINA team employed several analytical methods.This included work by the data
scientist using Natural Language Processing (NLP) techniques on the textual
descriptions of the innovation Roadmap ideas.
■ Social Network Analysis using R and Rstudio.
■ Developed SocialGraphs andVisualizations.
Case StudyOn GINA 6
Social Graph Data
Submitters and Finalists
and Graph of top
innovation influencers
• Fig shows socai graphs that portray
relationships between idea submitters within
GINA.
• Each colour represents an innovator from a
different country.
• The large dots with red circles around them
represent hubs.
• A hub represents a person with high
connectivity and a high “betweeness” score.
• The team usedTableau software for data
visualization and exploration and used the
Pivotal Greenplum database as the main data
repository and analytics engine.
Case StudyOn GINA 7
Phase 5 : Communicate Results
■ This project was considered successful in identifying boundary spanners and hidden
innovators.
■ The GINA project promoted knowledge sharing related to innovation an researchers
spanning multiple areas within the company and outside of it.
■ The GINA also enables EMC to cultivate additional property leads to research topics
and provided opportunities to forge relationships with universities for joint academic
research in the fields of Data Science and Big Data.
■ Study was successful in identifying hidden innovators.(found high density in Cork,
Ireland)
■ The CTO office launched longitudinal studies.
Case StudyOn GINA 8
Phase 6: Operationalize
■ Deployment was not really discussed
■ Key Findings
 Need more data in future
 Some data were sensitive
 A parallel initiative needs to be created to improve basic BI activities.
 A mechanism is needed to continually reevaluate the model after deployment.
Case StudyOn GINA 9
Analytic Plan from the EMC GINA
Project
Case StudyOn GINA 10
Reference
■ Data-science-and-big-data-analy-nieizv_book
■ https://bhavanakhivsara.wordpress.com/subjects/data-analytics/
Case StudyOn GINA 11
ThankYou!!
Case StudyOn GINA 12

More Related Content

PPTX
Case study
PPTX
Apple Vs Samsung: Patent War
PPTX
Blue Eyes Technology PPT
PPTX
Diabetes Mellitus
PPTX
Hypertension
PPTX
Republic Act No. 11313 Safe Spaces Act (Bawal Bastos Law).pptx
PPTX
Power Point Presentation on Artificial Intelligence
Case study
Apple Vs Samsung: Patent War
Blue Eyes Technology PPT
Diabetes Mellitus
Hypertension
Republic Act No. 11313 Safe Spaces Act (Bawal Bastos Law).pptx
Power Point Presentation on Artificial Intelligence

What's hot (20)

PPTX
Case study on gina(gobal innovation network and analysis)
PPTX
Data Science
PPTX
Data mining , Knowledge Discovery Process, Classification
PPTX
Data Mining: Application and trends in data mining
PPTX
Data visualization
PPTX
Big Data Analytics
PDF
Data Visualization in Data Science
PPT
Introduction to Data Mining
PPTX
Kdd process
PDF
IoT Architecture
PPT
Data mining
PPTX
Data analytics presentation- Management career institute
PPTX
Computer vision
PPTX
Computer communication networks chapter 1 ppt (vtu odd sem EC)
PPTX
Machine Learning and Artificial Intelligence
PPTX
Introduction to Data Mining
PPTX
Machine Learning ppt.pptx
PPTX
introduction to data science
PPTX
Data cleansing
PPTX
Data Mining: What is Data Mining?
Case study on gina(gobal innovation network and analysis)
Data Science
Data mining , Knowledge Discovery Process, Classification
Data Mining: Application and trends in data mining
Data visualization
Big Data Analytics
Data Visualization in Data Science
Introduction to Data Mining
Kdd process
IoT Architecture
Data mining
Data analytics presentation- Management career institute
Computer vision
Computer communication networks chapter 1 ppt (vtu odd sem EC)
Machine Learning and Artificial Intelligence
Introduction to Data Mining
Machine Learning ppt.pptx
introduction to data science
Data cleansing
Data Mining: What is Data Mining?
Ad

Similar to Case Study on GINA(Global Innovation Network and Analysis) based on Data Analytic Life Cycle (20)

PPTX
IT Capstone Report Fall 2022.pptx
PDF
Toward supporting decision-making under uncertainty in digital humanities wit...
PDF
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
PDF
Data Mining Applications And Feature Scope Survey
PDF
Democratizing co-production of thematic co-explorations for Citizen Observato...
PDF
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
PDF
A Survey on Big Data Analytics
PPTX
Fealing - Improving indicators to inform policy
PDF
1. Overview_of_data_analytics (1).pdf
PDF
Fundamentals of data mining and its applications
PDF
Ojcst vol12 n4_p_132-146
PDF
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
PPTX
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
PDF
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
PDF
Software Analytics
PDF
Business Research Methods 9th Edition Zikmund Solutions Manual
PDF
Business Research Methods 9th Edition Zikmund Solutions Manual
PPTX
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
PDF
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
PDF
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
IT Capstone Report Fall 2022.pptx
Toward supporting decision-making under uncertainty in digital humanities wit...
New Data Science Framework for Analysing and Mining Big Data - Charith Silva
Data Mining Applications And Feature Scope Survey
Democratizing co-production of thematic co-explorations for Citizen Observato...
Customer Research For Product Managers - Dawn of The Data Age Lecture Series
A Survey on Big Data Analytics
Fealing - Improving indicators to inform policy
1. Overview_of_data_analytics (1).pdf
Fundamentals of data mining and its applications
Ojcst vol12 n4_p_132-146
An Evaluation Of Big Data Analytics Projects And The Project Predictive Analy...
Data Analytics in Industry Verticals, Data Analytics Lifecycle, Challenges of...
CODATA: Open Data, FAIR Data and Open Science/Simon Hodson
Software Analytics
Business Research Methods 9th Edition Zikmund Solutions Manual
Business Research Methods 9th Edition Zikmund Solutions Manual
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
2b A-Using Big Data for the Sustainable Development Goals 10222015.pdf
Apidays Paris 2023 - Crafting Sustainable Bytes for a Greener Digital Future,...
Ad

More from divyawani2 (6)

PDF
Case Study on Cray T3E Architecture
PDF
Case study on Transaction in Grocery Store
PPTX
Case study on smart card (embeded system) based on IOT
PPTX
Automatic Water Dispenser using IOT
PPTX
Case study on automatic washing machine based on Internet of Things(IOT)
PPTX
Flutter technology Based on Web Development
Case Study on Cray T3E Architecture
Case study on Transaction in Grocery Store
Case study on smart card (embeded system) based on IOT
Automatic Water Dispenser using IOT
Case study on automatic washing machine based on Internet of Things(IOT)
Flutter technology Based on Web Development

Recently uploaded (20)

PPTX
Introduction to machine learning and Linear Models
PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Business Analytics and business intelligence.pdf
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
Computer network topology notes for revision
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
Database Infoormation System (DBIS).pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
IB Computer Science - Internal Assessment.pptx
Introduction to machine learning and Linear Models
ISS -ESG Data flows What is ESG and HowHow
Business Analytics and business intelligence.pdf
oil_refinery_comprehensive_20250804084928 (1).pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Reliability_Chapter_ presentation 1221.5784
Computer network topology notes for revision
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
climate analysis of Dhaka ,Banglades.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Introduction-to-Cloud-ComputingFinal.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
.pdf is not working space design for the following data for the following dat...
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
Database Infoormation System (DBIS).pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
IB Computer Science - Internal Assessment.pptx

Case Study on GINA(Global Innovation Network and Analysis) based on Data Analytic Life Cycle

  • 1. SNJB’S LATE SAU K. B. JAIN COE, CHANDWAD DEPARTMENT OF COMPUTER ENGINEERING ACADEMIC YEAR 2020-21 SUBJECT: DATAANALYTICS CASE STUDY ON GINA By Name: Divya Prafull Wani Roll No:41 Class: BE Computer Date:16-08-2020 Case StudyOn GINA 1
  • 2. GINA ■ Global Innovation Network and Analysis. ■ The GINA case study provides an example of how a team applied the Data Analytics Lifecycle to analyze innovation data at EMC. ■ GINA is a group of senior technologists located in centers of excellence around the world. ■ The GINA team thought its approach would provide a means to share ideas globally and increase knowledge sharing among members who may be separated geographically. ■ It planned to create a data repository containing both structured and unstructured data to accomplish 3 main goals . 1. Store Formal and informal data. 2. Track research from global technologists. 3. Mine the data for patterns and insights to improve the teams operations and strategy. Case StudyOn GINA 2
  • 3. Phase 1 : Discovery ■ Team Members and Roles  Business user, project sponsor, project manager -Vice President from Office of CTO.  BI analyst – person from IT  Data Engineer and DBA – people from IT  Data Scientist – distinguished engineer. ■ The data fell into two categories  5 years of idea submissions from internal innovation contests.  Minutes an notes representing innovation and research activity from around the world. ■ The data fell into two categories  5 years of idea submissions from internal innovation contests.  Minutes an notes representing innovation and research activity from around the world. ■ Hypothesis grouped into 2 categories  Descriptive analytics of what is happening to spark further creativity, collaboration, an asset generation  Predictive analytics to advise executive management of where it should be investing in the future. Case StudyOn GINA 3
  • 4. Phase 2 : Data Preparation ■ Set up anAnalytical Sandbox to store and experiment on the data. ■ Discovered that certain data needed conditioning and normalization and that missing datasets were critical. ■ Team recognized that poor quality data could impact subsequent steps. ■ They discovered many names were misspelled and problems with extra spaces. ■ Important to determine what level of data quality and cleanliness was sufficient for the project being undertaken. Case StudyOn GINA 4
  • 5. Phase 3: Model Planning ■ Included following considerations :  Identify the right milestones to achieve the goals  Trace how people move ideas from each ,milestone towards the goal.  Once this is done, trace ideas that die and others that reach the goal.Compare the journeys of ideas that make it and those that do not.  Compare times and outcomes using a few different methods.These could be as simple as t-tests or perhaps involve different types of classificationAlgorithms. Case StudyOn GINA 5
  • 6. Phase 4 : Model Building ■ The GINA team employed several analytical methods.This included work by the data scientist using Natural Language Processing (NLP) techniques on the textual descriptions of the innovation Roadmap ideas. ■ Social Network Analysis using R and Rstudio. ■ Developed SocialGraphs andVisualizations. Case StudyOn GINA 6
  • 7. Social Graph Data Submitters and Finalists and Graph of top innovation influencers • Fig shows socai graphs that portray relationships between idea submitters within GINA. • Each colour represents an innovator from a different country. • The large dots with red circles around them represent hubs. • A hub represents a person with high connectivity and a high “betweeness” score. • The team usedTableau software for data visualization and exploration and used the Pivotal Greenplum database as the main data repository and analytics engine. Case StudyOn GINA 7
  • 8. Phase 5 : Communicate Results ■ This project was considered successful in identifying boundary spanners and hidden innovators. ■ The GINA project promoted knowledge sharing related to innovation an researchers spanning multiple areas within the company and outside of it. ■ The GINA also enables EMC to cultivate additional property leads to research topics and provided opportunities to forge relationships with universities for joint academic research in the fields of Data Science and Big Data. ■ Study was successful in identifying hidden innovators.(found high density in Cork, Ireland) ■ The CTO office launched longitudinal studies. Case StudyOn GINA 8
  • 9. Phase 6: Operationalize ■ Deployment was not really discussed ■ Key Findings  Need more data in future  Some data were sensitive  A parallel initiative needs to be created to improve basic BI activities.  A mechanism is needed to continually reevaluate the model after deployment. Case StudyOn GINA 9
  • 10. Analytic Plan from the EMC GINA Project Case StudyOn GINA 10