SlideShare a Scribd company logo
Intro to
Network Analysis in Gephi
victor.Blaer@gmail.com
Disklaimer
The right mindset
Network visualization is messy. You get into technical difficulties and have to search the internet and peoples minds for
solutions. The data itself is often inconsistent and you need to find ways to still get insights from it. The whole process is
full of trial and error with many dead ends. There is no undo for most actions in network visualization. If you like a specific
view on the data, better export it, because with the press of a button it may look totally different and even if you start with
the same file and do the same things, the result will probably look different. It is possible to re-create certain views, but it
takes a lot of time.
Get ready to get your hands dirty and spend hours without getting anywhere and keep going or starting fresh the next day.
Over time it gets better. You will learn other peoples tricks and develop your own to get faster to useful results. You will
experience the bliss of data that finally makes sense and the satisfaction when you share a beautiful visualization with
others.
Keep going, share your process, save often and have fun.
Goals
Super Duper Quick intro to Gephi
Basic Network Lingo to sound like the kool kids
Nodes, Edges and the Restaurant at the End of the Universe
Visualizing Co-workers (no, not that way.)
IF (Time_Left) > 5 mins THEN “sigma.js” ELSE END.
What is Gephi?
Created in 2008 by 4 French computing engineers
Free java based open source software to analyse and visualize
networks
Get the audience involved
Let’s put the FUN into FUNdamentals
Interactive exercise
Definitions-ish
Ω Nodes – represent the entity you’re analysing
Ω Edges – represent the connections between them
Ω Degree – the number of edges/lines coming and
going from the circle things
Ω Modularity – the different type of communities within
your network (i.e. workmates, sports teams,
etc.
Understanding the Interface of Gephi
Ω Three Different Views: Overview, Data Laboratory and
Preview
Ω The Overview is used to work interactively with the graph.
Ω The Data Laboratory gives you the raw data as a table.
Ω The Preview is used to generate beautiful looking versions of
the graph. Each view accesses the same graph. If you change
something in the Overview it’s changed in the Data Laboratory
and the Preview as well.
Overview
This is where you spend most of
the time interactively exploring a
graph. You can define how nodes
and edges look, use different
algorithms to calculate the layout
of the graph, use filters to work
only with certain parts of the graph
and calculate various statistics.
Main Interface
Layout The layout area gives you access to
various algorithms, you can use to
calculate the layout of your graph. Each
algorithm has it advantages and
disadvantages. For social network
graphs ForceAtlas 2 or OpenOrd is a
good starting point. Some algorithm finish
themselves, some need to be stopped
once you like the result. Some algorithms
can be run after others and only do
minimal adjustments (expansion,
contraction, noverlap, label adjust) others
will completely change how the graph
looks like. You can click on each setting
of each algorithm to get additional
information of what it does.
Helps with spreading nodes
out (I use 10 )
Statistics & Filters
While the statistics options
aren’t self explaining, they all
work the same. You click on
Run and get a results page
displayed. As long as you don’t
want to write a paper on a
graph, you probably only need
Modularity, which identifies
sub-communities
Filters are cool
Graph
In the center of the Overview view is the
Graph. You can move around by holding the
right mouse button, zoom with your scroll
wheel and select/drag/color/??? nodes by
clicking left on them.
Each of the settings on the side and bottom
has a mouse over tooltip. Be careful with the
settings at the bottom left. The first one, the
magnifying glass is useful, when you get lost,
because it centers the view on your graph.
The three settings below reset colors and
sizes. Irreversible. The buttons and the
bottom help you to make the graph more
readable or exploreable at all, if it’s to big.
Turning of the display of edges helps a lot in
such cases.
Data Laboratory
The Data Laboratory gives you the
raw data. You can switch between
nodes and edges. Through the
configuration you can set if you
want to see everything or only the
things that are visible in the
Overview. For example if you
activated a certain filter, you
probably want to only see the things
that fit to the filter. You can export
the data or import it and other
things. Sometimes you don’t need
to do anything here, especially with
a nicely prepared data set.
Preview
Once you are happy with your graph, you can use the Preview to render it. There are different presets
and many settings to render it as you want it. At the bottom left you have the option to export it as
SVG, PDF or PNG. With the Preview ratio, you can set to only render a percentage of the whole
graph. This helps a lot if you need to find the right settings for a big graph which takes some minutes
to hours to be render completely. In the Preview you move around by holding the left mouse button
while moving. Think of it as grabbing the whole image and moving it. You can also zoom in and out.
But you can choose nodes or do other things you can do in the Overview.
OK, now we got the lingo,
let’s get some data!
1-Nodes/Actors
2-Edges (how
they’re connected)
Step 1- After install, click new project
Step 2: In this example we are going to import that data from CSV files and we are going to
use them for ease of use. Once you click New Project you will get the following Screen,
then click Data Laboratory for importing data.
Step 3: Once you Open the Data Laboratory pane now you click Import Spreadsheet. First
import the as table: “Nodes table” with browsing the nodes.csv. Then click next and Finish.
Nota Bene – Node file
Make sure that the CSV file containing
nodes needs to include a column named
“ID” containing unique node indentifiers.
Which will result in the following once you
click the finish button.
Step 4: Now you again click Import Spreadsheet. First import the as table: “Edges table”
with browsing the Edges.csv. Then click next and Finish.
Nota Bene – Edges file
The edge list CSV should include columns
titles “Source” & “Target”
containing the node IDs of the start and
end node for each edge, as well as any
other attributes you would like to include.
The results will be available once you click on
the Edges in the Data Table as given below:
Steps 5: Now when you click the overview
button right below the toolbar you can see the
following network diagram created.
Step 6: We can use the Layouts to make it
look little better using the Force Atlas 2 and
Label Adjust to look clean and better.
Introduction to Network Analysis in Gephi
Turn On Labels
Run Statistics Stuff
1 node with 5 degrees, 2 with 4 and 3
with 3 degrees
To Summarize – Went from (Node) &
(Edge) to
Male
Fem
ale
Ambi
guou
s
Colouring
nodes by
gender
Playing with different Layout algorithms is
a good idea as well
Twitter Data set from interwebs (my viz)
Sigma.js – export plugin
Cheatsheet

More Related Content

PPTX
Data analytics
PPTX
Presentation on Big Data
PDF
Data warehousing
PPT
Social Network Analysis
PDF
Data cleansing project
PPTX
Basics Gephi Tutorial
PDF
PDF
Building Decision Tree model with numerical attributes
Data analytics
Presentation on Big Data
Data warehousing
Social Network Analysis
Data cleansing project
Basics Gephi Tutorial
Building Decision Tree model with numerical attributes

What's hot (20)

PDF
Data Visualization
PDF
Data Mining
PPTX
Tableau
PPTX
Introduction to Data Visualization
PDF
Analytics, Business Intelligence, and Data Science - What's the Progression?
PPTX
Tableau slideshare
PPT
Data Warehouse Basic Guide
PPTX
OLAP & DATA WAREHOUSE
PDF
Web Intelligence - Tutorial1
PPT
Generic Graph And Psets
PDF
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
PPTX
kinds of analytics
PPTX
Data Mining: Data processing
PPTX
Tableau Visual analytics complete deck 2
PDF
Data Science Project Lifecycle
PPTX
Chapter 1 big data
PPTX
Exploratory data analysis
PDF
Decision tree
Data Visualization
Data Mining
Tableau
Introduction to Data Visualization
Analytics, Business Intelligence, and Data Science - What's the Progression?
Tableau slideshare
Data Warehouse Basic Guide
OLAP & DATA WAREHOUSE
Web Intelligence - Tutorial1
Generic Graph And Psets
DI&A Slides: Descriptive, Prescriptive, and Predictive Analytics
kinds of analytics
Data Mining: Data processing
Tableau Visual analytics complete deck 2
Data Science Project Lifecycle
Chapter 1 big data
Exploratory data analysis
Decision tree
Ad

Viewers also liked (11)

PDF
Gephi Consortium Presentation
PDF
Gephi short introduction
PPTX
Gephi bbc
ODP
Use case for Using Gephi for Social Network Analysis of facebook
PDF
Gephi : dynamic features
PPTX
Interactive visualization and exploration of network data with Gephi
PDF
SP1: Exploratory Network Analysis with Gephi
PPTX
Gephi Plugin Developer Workshop
PPTX
Social Network Analysis Using Gephi
PPT
Visualize your Twitter network
PDF
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Gephi Consortium Presentation
Gephi short introduction
Gephi bbc
Use case for Using Gephi for Social Network Analysis of facebook
Gephi : dynamic features
Interactive visualization and exploration of network data with Gephi
SP1: Exploratory Network Analysis with Gephi
Gephi Plugin Developer Workshop
Social Network Analysis Using Gephi
Visualize your Twitter network
Network Visualization guest lecture at #DataVizQMSS at @Columbia / #SNA at PU...
Ad

Similar to Introduction to Network Analysis in Gephi (20)

PDF
Advanced excel 2010 & 2013 updated Terrabiz
PDF
Remix Your Data: Visualizing Library Instruction Statistics
PDF
datavisualization-5thUnit.pdf
PPTX
Module 2 Introduction to SA.pptx
PDF
Tutorials.pdfNDBDNBDAN,D NKDB NC cnc nm cCN
DOCX
manual.docx
PDF
Data Visualisation
PPS
2D CAD Module by gonzalochris
DOCX
High score plus guide (3)
PDF
社會網絡分析UCINET Quick Start Guide
PPTX
5 Hacks to create awesome Presentations
PDF
Microsoft Project 2013 Quickstart
DOCX
Step-by-Step - Windows 7
PDF
9 Sketchup Tips for beter 3d modeling workflow
PPTX
Tableau Basic Questions
PPTX
Powerpoint slide
PPT
Freemind
PDF
AtlasCamp 2013: ADG / Lean UX
PDF
Exp31sp1 at a-glance-en
Advanced excel 2010 & 2013 updated Terrabiz
Remix Your Data: Visualizing Library Instruction Statistics
datavisualization-5thUnit.pdf
Module 2 Introduction to SA.pptx
Tutorials.pdfNDBDNBDAN,D NKDB NC cnc nm cCN
manual.docx
Data Visualisation
2D CAD Module by gonzalochris
High score plus guide (3)
社會網絡分析UCINET Quick Start Guide
5 Hacks to create awesome Presentations
Microsoft Project 2013 Quickstart
Step-by-Step - Windows 7
9 Sketchup Tips for beter 3d modeling workflow
Tableau Basic Questions
Powerpoint slide
Freemind
AtlasCamp 2013: ADG / Lean UX
Exp31sp1 at a-glance-en

Recently uploaded (20)

PPTX
1_Introduction to advance data techniques.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Computer network topology notes for revision
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Fluorescence-microscope_Botany_detailed content
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
1_Introduction to advance data techniques.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Data_Analytics_and_PowerBI_Presentation.pptx
Business Acumen Training GuidePresentation.pptx
Quality review (1)_presentation of this 21
IB Computer Science - Internal Assessment.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
.pdf is not working space design for the following data for the following dat...
Introduction-to-Cloud-ComputingFinal.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Database Infoormation System (DBIS).pptx
Computer network topology notes for revision
IBA_Chapter_11_Slides_Final_Accessible.pptx
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Fluorescence-microscope_Botany_detailed content
Galatica Smart Energy Infrastructure Startup Pitch Deck

Introduction to Network Analysis in Gephi

  • 1. Intro to Network Analysis in Gephi victor.Blaer@gmail.com
  • 2. Disklaimer The right mindset Network visualization is messy. You get into technical difficulties and have to search the internet and peoples minds for solutions. The data itself is often inconsistent and you need to find ways to still get insights from it. The whole process is full of trial and error with many dead ends. There is no undo for most actions in network visualization. If you like a specific view on the data, better export it, because with the press of a button it may look totally different and even if you start with the same file and do the same things, the result will probably look different. It is possible to re-create certain views, but it takes a lot of time. Get ready to get your hands dirty and spend hours without getting anywhere and keep going or starting fresh the next day. Over time it gets better. You will learn other peoples tricks and develop your own to get faster to useful results. You will experience the bliss of data that finally makes sense and the satisfaction when you share a beautiful visualization with others. Keep going, share your process, save often and have fun.
  • 3. Goals Super Duper Quick intro to Gephi Basic Network Lingo to sound like the kool kids Nodes, Edges and the Restaurant at the End of the Universe Visualizing Co-workers (no, not that way.) IF (Time_Left) > 5 mins THEN “sigma.js” ELSE END.
  • 4. What is Gephi? Created in 2008 by 4 French computing engineers Free java based open source software to analyse and visualize networks
  • 5. Get the audience involved Let’s put the FUN into FUNdamentals Interactive exercise
  • 6. Definitions-ish Ω Nodes – represent the entity you’re analysing Ω Edges – represent the connections between them Ω Degree – the number of edges/lines coming and going from the circle things Ω Modularity – the different type of communities within your network (i.e. workmates, sports teams, etc.
  • 7. Understanding the Interface of Gephi Ω Three Different Views: Overview, Data Laboratory and Preview Ω The Overview is used to work interactively with the graph. Ω The Data Laboratory gives you the raw data as a table. Ω The Preview is used to generate beautiful looking versions of the graph. Each view accesses the same graph. If you change something in the Overview it’s changed in the Data Laboratory and the Preview as well.
  • 8. Overview This is where you spend most of the time interactively exploring a graph. You can define how nodes and edges look, use different algorithms to calculate the layout of the graph, use filters to work only with certain parts of the graph and calculate various statistics.
  • 10. Layout The layout area gives you access to various algorithms, you can use to calculate the layout of your graph. Each algorithm has it advantages and disadvantages. For social network graphs ForceAtlas 2 or OpenOrd is a good starting point. Some algorithm finish themselves, some need to be stopped once you like the result. Some algorithms can be run after others and only do minimal adjustments (expansion, contraction, noverlap, label adjust) others will completely change how the graph looks like. You can click on each setting of each algorithm to get additional information of what it does. Helps with spreading nodes out (I use 10 )
  • 11. Statistics & Filters While the statistics options aren’t self explaining, they all work the same. You click on Run and get a results page displayed. As long as you don’t want to write a paper on a graph, you probably only need Modularity, which identifies sub-communities
  • 13. Graph In the center of the Overview view is the Graph. You can move around by holding the right mouse button, zoom with your scroll wheel and select/drag/color/??? nodes by clicking left on them. Each of the settings on the side and bottom has a mouse over tooltip. Be careful with the settings at the bottom left. The first one, the magnifying glass is useful, when you get lost, because it centers the view on your graph. The three settings below reset colors and sizes. Irreversible. The buttons and the bottom help you to make the graph more readable or exploreable at all, if it’s to big. Turning of the display of edges helps a lot in such cases.
  • 14. Data Laboratory The Data Laboratory gives you the raw data. You can switch between nodes and edges. Through the configuration you can set if you want to see everything or only the things that are visible in the Overview. For example if you activated a certain filter, you probably want to only see the things that fit to the filter. You can export the data or import it and other things. Sometimes you don’t need to do anything here, especially with a nicely prepared data set.
  • 15. Preview Once you are happy with your graph, you can use the Preview to render it. There are different presets and many settings to render it as you want it. At the bottom left you have the option to export it as SVG, PDF or PNG. With the Preview ratio, you can set to only render a percentage of the whole graph. This helps a lot if you need to find the right settings for a big graph which takes some minutes to hours to be render completely. In the Preview you move around by holding the left mouse button while moving. Think of it as grabbing the whole image and moving it. You can also zoom in and out. But you can choose nodes or do other things you can do in the Overview.
  • 16. OK, now we got the lingo, let’s get some data!
  • 18. Step 1- After install, click new project
  • 19. Step 2: In this example we are going to import that data from CSV files and we are going to use them for ease of use. Once you click New Project you will get the following Screen, then click Data Laboratory for importing data.
  • 20. Step 3: Once you Open the Data Laboratory pane now you click Import Spreadsheet. First import the as table: “Nodes table” with browsing the nodes.csv. Then click next and Finish. Nota Bene – Node file Make sure that the CSV file containing nodes needs to include a column named “ID” containing unique node indentifiers.
  • 21. Which will result in the following once you click the finish button.
  • 22. Step 4: Now you again click Import Spreadsheet. First import the as table: “Edges table” with browsing the Edges.csv. Then click next and Finish. Nota Bene – Edges file The edge list CSV should include columns titles “Source” & “Target” containing the node IDs of the start and end node for each edge, as well as any other attributes you would like to include.
  • 23. The results will be available once you click on the Edges in the Data Table as given below:
  • 24. Steps 5: Now when you click the overview button right below the toolbar you can see the following network diagram created.
  • 25. Step 6: We can use the Layouts to make it look little better using the Force Atlas 2 and Label Adjust to look clean and better.
  • 29. 1 node with 5 degrees, 2 with 4 and 3 with 3 degrees
  • 30. To Summarize – Went from (Node) & (Edge) to
  • 32. Playing with different Layout algorithms is a good idea as well
  • 33. Twitter Data set from interwebs (my viz)