SlideShare a Scribd company logo
Visualization of evolutionary
cascades of messages using
       force-directed graphs
                            Artjom Kurapov
                   Supervisor: Helena Kruus

               Master’s thesis defense, 9 may 2011
Agenda

   Background
   Practical work
       Pling.ee,opensource Gephi
       Web-tool demo and twitter
Background

   Types of networks
   Properties / areas of application
   Research interest
Topics crossroads
Goals

   Visualize social networks (preferably in Estonia)
   Compare friends and messages topology
   Try to mine data visually using cascades

                                      A



                                            C



                                      B            D
Pling
Pling – Qualitative measure
                                 Friends   Messages
Average clustering coefficient   0.135     0.043
Average degree                   4.313     2.202
GCC diameter                     20        38
Average GCC diameter             5.38      13.009
Topic and interface matters

   Out of 18.6 mln messages - no (clearly visible)
    cascade

Possibly because
 89% private

 86% sent using phone
Javascript tool

   Up to 1000 nodes
   Can add nodes on the fly
   Navigation and filtering
   Properties calculation
   Recursive algorithm
Twitter

   Friendship and message network mined
   218 users / 12643 messages, 6.89% retweets
                     100000
                      10000
                       1000
                        100
                         10
                          1
                              0   2   3   4   5   7   8
Thank you
Questions?

More Related Content

PDF
NUHope_Jan_2015
PPTX
The Mobile Future
PDF
Η δικτύωση των δημοσίων βιβλιοθηκών την εποχή του "Καλλικράτη"
PPTX
Writing J27
PDF
Forum May 2011 The NYS ALBETAC at NYU
PPT
DCLT Forum May 2012:'Trip to china' By Ying Huang
PPT
A Brief History
PPTX
Celebrating Mexicans
NUHope_Jan_2015
The Mobile Future
Η δικτύωση των δημοσίων βιβλιοθηκών την εποχή του "Καλλικράτη"
Writing J27
Forum May 2011 The NYS ALBETAC at NYU
DCLT Forum May 2012:'Trip to china' By Ying Huang
A Brief History
Celebrating Mexicans

Viewers also liked (19)

PDF
Ensamble coral como momento de arendizaje
PPTX
Influenza diego
PPT
Wisdom Circles Presentation09
DOC
Rngnthn t2
PPTX
Songs & chants in the chinese classroom nclc 2011
PDF
Spain V Miguel Hernandez
PPTX
Edu expo anonymous peer review
PDF
6114 k2 pemkab. kuningan
PPTX
Online marketing trends in the UK
PPT
Jorge Delgado Work
PPTX
Presentation1
PPS
12 checex
PDF
Forum may 2011 yun zhang's presentation
PDF
Forum May 2011 Bing Qiu Getting Tenure
PPT
How effective is the combination of your main
PDF
Sociala Medier - Hot Eller Möjlighet
PPT
Joanne Wang: Teaching Math Provides Students with Authentic Exposure and COnt...
ODP
Cambridge Solutions E Assessment
Ensamble coral como momento de arendizaje
Influenza diego
Wisdom Circles Presentation09
Rngnthn t2
Songs & chants in the chinese classroom nclc 2011
Spain V Miguel Hernandez
Edu expo anonymous peer review
6114 k2 pemkab. kuningan
Online marketing trends in the UK
Jorge Delgado Work
Presentation1
12 checex
Forum may 2011 yun zhang's presentation
Forum May 2011 Bing Qiu Getting Tenure
How effective is the combination of your main
Sociala Medier - Hot Eller Möjlighet
Joanne Wang: Teaching Math Provides Students with Authentic Exposure and COnt...
Cambridge Solutions E Assessment
Ad

Similar to Visualization of evolutionary cascades of messages using force-directed graphs (20)

PDF
Breaking the Barrier: Interactive Election Campaign Communication
PDF
Four degrees of separation
PDF
Information Visualization Project
PPTX
2013 NodeXL Social Media Network Analysis
PDF
How to measure Twitter
PDF
Twitter for CS10 @ Berkeley (Spring 2011)
PPTX
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
PPTX
Finnish twittercensus 2013, statistics for Twitter in Finland
PPTX
20121010 marc smith - mapping collections of connections in social media with...
PDF
Twitter: Social Network Or News Medium?
PDF
Twitter: Social Network Or News Medium?
PDF
The anatomy of the Facebook social graph
PDF
[HCII2011] Mining Social Relationships in Micro-blogging systems
PDF
Unleashing Twitter Data for Fun and Insight
PDF
Unleashing twitter data for fun and insight
PPTX
Brave New Task: User Account Matching
PPTX
LSS'11: Charting Collections Of Connections In Social Media
PPTX
20111103 con tech2011-marc smith
PPTX
Real Time Analytics for Big Data - A twitter inspired case study
Breaking the Barrier: Interactive Election Campaign Communication
Four degrees of separation
Information Visualization Project
2013 NodeXL Social Media Network Analysis
How to measure Twitter
Twitter for CS10 @ Berkeley (Spring 2011)
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
Finnish twittercensus 2013, statistics for Twitter in Finland
20121010 marc smith - mapping collections of connections in social media with...
Twitter: Social Network Or News Medium?
Twitter: Social Network Or News Medium?
The anatomy of the Facebook social graph
[HCII2011] Mining Social Relationships in Micro-blogging systems
Unleashing Twitter Data for Fun and Insight
Unleashing twitter data for fun and insight
Brave New Task: User Account Matching
LSS'11: Charting Collections Of Connections In Social Media
20111103 con tech2011-marc smith
Real Time Analytics for Big Data - A twitter inspired case study
Ad

More from Артём Курапов (8)

PPTX
Scaling GraphQL Subscriptions
PDF
Variety of automated tests
PPTX
PPTX
Php storm intro
PPTX
PPTX
В облаке AWS
PPTX
Devclub hääletamine
PPTX
OAuthоризация и API социальных сетей
Scaling GraphQL Subscriptions
Variety of automated tests
Php storm intro
В облаке AWS
Devclub hääletamine
OAuthоризация и API социальных сетей

Recently uploaded (20)

PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Machine learning based COVID-19 study performance prediction
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
KodekX | Application Modernization Development
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
20250228 LYD VKU AI Blended-Learning.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Big Data Technologies - Introduction.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Empathic Computing: Creating Shared Understanding
Unlocking AI with Model Context Protocol (MCP)
Machine learning based COVID-19 study performance prediction
Dropbox Q2 2025 Financial Results & Investor Presentation
KodekX | Application Modernization Development
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf

Visualization of evolutionary cascades of messages using force-directed graphs

  • 1. Visualization of evolutionary cascades of messages using force-directed graphs Artjom Kurapov Supervisor: Helena Kruus Master’s thesis defense, 9 may 2011
  • 2. Agenda  Background  Practical work  Pling.ee,opensource Gephi  Web-tool demo and twitter
  • 3. Background  Types of networks  Properties / areas of application  Research interest
  • 5. Goals  Visualize social networks (preferably in Estonia)  Compare friends and messages topology  Try to mine data visually using cascades A C B D
  • 7. Pling – Qualitative measure Friends Messages Average clustering coefficient 0.135 0.043 Average degree 4.313 2.202 GCC diameter 20 38 Average GCC diameter 5.38 13.009
  • 8. Topic and interface matters  Out of 18.6 mln messages - no (clearly visible) cascade Possibly because  89% private  86% sent using phone
  • 9. Javascript tool  Up to 1000 nodes  Can add nodes on the fly  Navigation and filtering  Properties calculation  Recursive algorithm
  • 10. Twitter  Friendship and message network mined  218 users / 12643 messages, 6.89% retweets 100000 10000 1000 100 10 1 0 2 3 4 5 7 8

Editor's Notes

  • #3: So, first a little introduction in the field,then some large dataset research I’ve done,Then personally made browser tool. A small demo, features and issues faced.And a small twitter dataset results
  • #4: Networks are everywhere. Most of us here study technological and information networks. But there are also biochemical, ecological and most interestingly – social networks which influence our daily life. These include sexual connections, friendship networks, citations or any kind of social behavior associated with it. In fact if you go strict about it, then citation is not really social behavior, since its directed and doesn’t imply talking to the real person. So its more like network of document dependencies. So it is important how you define connection and objects.Networks have different properties, some of which I list in the paper. And of course some of them are relevant only in one field, like bipartite graphs are only needed if you want to visualize them. Or cliques if you want to use clique analysis done.There are also different research interests. Like drawing, or how networks evolve, or how do they break apart, or where does traffic goes through, or how do can we do all kind of graph puzzles. Like graph search, coloring or solve travelling salesman problems.
  • #5: So to visualize such network and its processes, one needs to see surroundings in this field – like sociology with its laws of diffusion and prefferential attachment, likenetwork properties, drawing algorithms and its complexity, and ofcourse work that has been done before – both theoretical and practical as existing software.
  • #6: As a thesis goal, I suggest mining data through frequency analysis of messages and making a network topology map. That means that we want a graph representation of a network,We want both friendships and messages datasets,And then we want to see how they correlate and lead to higher forms of messages – cascades.And my hypothesis is that cascades are parts of social thought. Thus evolutionary cascades are linked cascades across multiple topics.
  • #7: So I have studied Estonian social network pling.ee which belongs to Elisa Eesti AS and has 75 thousands users on the left as friendship network and 12 thousand on the right as message network. As you can see its different, and assortative mixing is present. This means that we have red nodes is here are russian and blue are estonian users. This was read from the messages and symbols they used.
  • #8: So the numbers differ as well.. As you can see since it was a small portion of messages, the network is rather young and has bigger diameter. A the same time average degree is smaller which is natural, since people don’t talk to all of their friends. And clustering coefficient is also smaller, which is partially dependent on that degree tendency.
  • #9: The bad news for me was that I was not able to find a single cascade. Possibly because only around 14% were sent from the browser and there were no explicit resharing function in the interface. But comparing it to twitter – people there invented RT themselves. Most likely it’s the topic of discussion that didn’t stimulate sharing, since 89% of talks were private and almost all are teenagers discussing their love life.
  • #10: So to study cascades and make visualization, I’ve tried building own tool that is written in javascript and can draw small datasets along with its analysis.I’ve also done two dataset extractions from twitter.Its browser based, can do navigation.
  • #11: From 12 thousand messages, around 7% can be considered as a direct cascade. But there may be more, since I didn’t take into account normal posts with directed form, that can also lead to smaller forms of cascades.On the graph you can see how depth of the retweet depends on its number in the dataset.(demo here)
  • #13: I don’t talk about evolutionary network, because I study static snapshots here, but in general network does evolve from disconnected components into GCC. But it depends on a network. For example buyers in electronic shops, even though they may suggest products, don’t always lead to new customers with connection. So customers are not connected to anyone. On the other hand, there may be certain clusters in case there is some sort of affiliate network campaign.P – polynomial complexityT(n) = O (n^k)NP – nondeterministic polynomial complexity. Nondeterministic automata can have multiple decision paths from a single state.“NP complete” problems don’t have a polynomial time algorithm.“NP hard” are at least as hard as NP-complete.2. Yes, in social networks GCC diameter is maximal at first stages of network evolution, and decreases over time. I’m not so sure about other network types. Because social networks do get denser.. Since each new node can connect to 0,1 or all nodes, alpha is So in lowest case they grow linearly with exponent equal to 1, meaning like a tree.. In other case they can grow quadratically, with exponent equal to 2, they each new node basically connects to all other nodes. So the more people join in, the more friends can know the other end of the graph. Thus – smaller diameter.If you think of technological networks, then I don’t think making a wiring from japan to brasil is so easy.3. Markov centrality is one of the ways one can find most influential nodes in the network. Although its very complex to compute, my work also lists others centrality measures. And I think that4. Cascade analysis and data mining is still hand work.5. I used Fruchterman-Reingold and Yifan Hu algorithms for local forces and for adaptive cooling. I’ve added my own version of recursive force summing and presented it in the work.