SlideShare a Scribd company logo
Detecting Communities in Science Blogs Christina K. Pikas [email_address]     http://terpconnect.umd.edu/~cpikas/ScienceBlogging
Problem Area eScience includes using electronic tools both for conducting science and for communicating about science There are an abundance of tools both online and offline  to help scientists communicate Lots of scientists and members of the interested public  maintain blogs (~2500?) Ultimate Questions: Why?  With whom are scientists communicating?  What are scientists communicating about?  What is the value to the scientists and to science?
Specific Problem Addressed What is the nature of the  science blogosphere? What is its shape? Who are the central participants? What is the connectivity? Where are the potential information flows?
Outline Background Methods Data gathering Analysis Results Discussion
Background: Blogs Defined by format Individual posts, with permanent URLs Comments Links In content In blogroll In comments and trackbacks Community develops around single blogs and among blogs through commenting
Posts Links to Static Pages Links and automatically generated content http://dorigo.wordpress.com/
Access to posts by search  and older posts using  the calendar A list of most recent posts is automatically generated
A list of categories the blogger used to describe his posts.  Clicking will list all of the posts in that category. The blogroll is a list of blogs the author reads or endorses to some extent. Access to the older posts  by month.
The individual post page looks a lot like the blog home page
But with Comments, which may be signed with the the commenter’s URL And a form to leave your own comment. Typically your e-mail will not appear on the site
Background: Social Network Analysis Uses connections between actors to understand potential flows of information and influence Uses graph theoretic methods to find Central or prestigious actors Cohesive subgroups including communities
Methods: Sample Selection Operational Definition of  Science Blog Blogs maintained by  scientists  that deal with  any aspect of being a scientist Blogs about  scientific topics  by non-scientists Omitted Primarily political speech Ones maintained by corporations Non-English language
Methods: Data Gathering Two Networks: Links and Commenters Link Data (Blogroll) Used seed list developed in previous study  using directories and searches Snowball sampled using links from blogrolls Visited and copied links Commenter Data Selected most central blogs from blogroll data Used Perl scripts to pull the commenter URLs  from each of the last 10 posts
Methods: Analysis Used social network analysis  and graphing software Examined graph and calculated basic descriptive statistics Found centrality and prestige measures Degree:  the links in and out Betweenness: the number of shortest paths that flow through that node Closeness: short paths to other nodes
Methods: Analysis Located cohesive subgroups Link methods Components LS Sets Clustering methods Community detection techniques Newman-Girvan Spin Glass
Results: Link Analysis (Blogroll) One large component There were 1091 nodes, 6621 arcs Diameter is 9 In-degree ranges from 1 to 292, with the median in-degree of 3, and mean 6 10 of the top 20 blogs by in-degree are authored  or co-authored by women 4 of the top 5 blogs by closeness are authored or co-authored by women
 
Results: Commenter 5 components, the largest with 911,  others with 11 or fewer nodes 938 nodes (starting with the 46), 1152 arcs The largest component has a diameter of 5
 
Discussion: Links (Blogroll) Most of the blogs were connected in one dense component A result of the diffusion of blogs? There were a few very central blogs, and then many less central Typical skewed distribution The community of women scientists merits further study
Discussion: Commenters Analysis easily located a notorious commenter who leaves incendiary comments on physics and chemistry blogs High out-degree, no links in Traffic on the women scientist blogs is more uniform, with frequent comments that are widely distributed among the blogs Indicates a different use
Take Home Messages   The science blogosphere is densely connected with many opportunities for influence and information diffusion Communities tend to form within disciplinary boundaries An exception is the community of women scientist bloggers who are  from many different disciplines
Acknowledgements Thanks to Dr. Jen Golbeck for supervising this work as part of an independent study Thanks also to Dr. Alan Neustadtl for SNA advice Dr. Dagobert Soergel for research advice
Christina K. Pikas Doctoral Student University of Maryland College of Information Studies [email_address] http://terpconnect.umd.edu/~cpikas/ScienceBlogging

More Related Content

PPT
Sunbelt05
PPTX
Network analyses of psychological science
PPT
Using Social Media to Increase your Research Impact
PPTX
Automated arXiv feeds on Twitter: On the role of bots in scholarly communication
PDF
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
PPT
In Search of Australian Blogs: Determining the Extent of the Contemporary Aus...
PPT
A Social Network System based on an Ontology in the Korea Institute of Orient...
PPTX
Cite track presentation
Sunbelt05
Network analyses of psychological science
Using Social Media to Increase your Research Impact
Automated arXiv feeds on Twitter: On the role of bots in scholarly communication
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
In Search of Australian Blogs: Determining the Extent of the Contemporary Aus...
A Social Network System based on an Ontology in the Korea Institute of Orient...
Cite track presentation

What's hot (20)

PDF
Using Google Scholar to get similar paper to your class/gradation project
PPTX
An Introduction to NodeXL for Social Scientists
PPTX
Linking and referencing
PPTX
Searching for Communities: a Facebook Way
PPT
Blogs for Information Management
PDF
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
PPTX
Promoting your research using social media
PDF
Social media impact after publishing
PPTX
Disciplinary Differences in Twitter Scholarly Communication
PDF
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
PPTX
Carma internet research module preparing for manuscript submission
PPTX
Open science: your questions answered
PPTX
Academic social networks site as networked socio-technical systems for schola...
PPTX
Social Network Analysis - an Introduction (minus the Maths)
PPT
Elsevier - Labs on Line
PPTX
Enhancing the Visibility and Impact of Your Research
PPTX
Are the scientists on to something altmetrics 6 16
PPTX
Using social media to promote your research
ODP
Talk before you type: coordination in Wikipedia
PPT
Bearcat Search: Implementing Federated Searching at the Newman Library
Using Google Scholar to get similar paper to your class/gradation project
An Introduction to NodeXL for Social Scientists
Linking and referencing
Searching for Communities: a Facebook Way
Blogs for Information Management
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
Promoting your research using social media
Social media impact after publishing
Disciplinary Differences in Twitter Scholarly Communication
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
Carma internet research module preparing for manuscript submission
Open science: your questions answered
Academic social networks site as networked socio-technical systems for schola...
Social Network Analysis - an Introduction (minus the Maths)
Elsevier - Labs on Line
Enhancing the Visibility and Impact of Your Research
Are the scientists on to something altmetrics 6 16
Using social media to promote your research
Talk before you type: coordination in Wikipedia
Bearcat Search: Implementing Federated Searching at the Newman Library
Ad

Similar to Detecting Communities in Science Blogs (20)

PPTX
Thefutureofsciencebloggingopenscilogsslideshare 141123163616-conversion-gate01
PPTX
The Future of Science Blogging - One Perspective, #SciLogs14
PPTX
Scientist vs
PPT
Can Science Blogging Enhance Your Research Life?
PPTX
Social media for science communication - URMA Presentation
PPTX
IScico Module 3 - Let's collaborate!
PPTX
Social media for science communication campus communicators
PPTX
All the Science That’s Fit to Blog - A Dissertation Talk
PPTX
Social Media for Scientists
PDF
Comcient1213-2 Let's collaborate
PDF
Science Blogs
PDF
Science dissemination 2.0: Social media for researchers
PDF
What can science networking online do for you
PPTX
2013 beacon-congress-social-media
PPTX
20140121 ctm-comcient
PPTX
Ic20152 - Relevant references in science dissemination
PPTX
Social Media for Scientists
PPTX
BGI training lecture: Scott Edmunds - Science 2.0, why new developments on th...
PPTX
Discussing research with the public in the blogosphere
PPTX
Science Blogging
Thefutureofsciencebloggingopenscilogsslideshare 141123163616-conversion-gate01
The Future of Science Blogging - One Perspective, #SciLogs14
Scientist vs
Can Science Blogging Enhance Your Research Life?
Social media for science communication - URMA Presentation
IScico Module 3 - Let's collaborate!
Social media for science communication campus communicators
All the Science That’s Fit to Blog - A Dissertation Talk
Social Media for Scientists
Comcient1213-2 Let's collaborate
Science Blogs
Science dissemination 2.0: Social media for researchers
What can science networking online do for you
2013 beacon-congress-social-media
20140121 ctm-comcient
Ic20152 - Relevant references in science dissemination
Social Media for Scientists
BGI training lecture: Scott Edmunds - Science 2.0, why new developments on th...
Discussing research with the public in the blogosphere
Science Blogging
Ad

More from Christina Pikas (20)

PDF
Pikas blogs for personal knowledge management (pkm) final for asist 04
POTX
Using Bibliometrics to Keep Up with the Joneses
PDF
Pikas using bibliometrics to make sense of research proposals
PPTX
The role of new information and communication technologies in information and...
PPTX
Library support for metrics: What can and should we do?
PPTX
Pikas bibliometricsfor21may2015
PPT
Pikas research day 2014 poster
PPTX
Pikas casci talk 11262013 final
PPT
Pikas 2013 research day poster 03022013
PPTX
Pikas research day 2011
PPTX
Pikas research day 2011 presentation
PPTX
Pikas xtreme reference 2010 final
PPTX
Pikas Psp Presentation February 2010
PPTX
DRAFT PSP Presentation February 2010
PPT
Updating The Standard Model
PPT
Pikas Developments In Sci Tech Search Web Search U 08
PPT
Pikas Asist2007 PIM Senior Engineers Final
PPT
Pikas Maryland Tech Day 2007
PPT
Pikas Sla07 Physicists Uses Of Blogs
PDF
Innovative Libraries: Tales from the Stacks
Pikas blogs for personal knowledge management (pkm) final for asist 04
Using Bibliometrics to Keep Up with the Joneses
Pikas using bibliometrics to make sense of research proposals
The role of new information and communication technologies in information and...
Library support for metrics: What can and should we do?
Pikas bibliometricsfor21may2015
Pikas research day 2014 poster
Pikas casci talk 11262013 final
Pikas 2013 research day poster 03022013
Pikas research day 2011
Pikas research day 2011 presentation
Pikas xtreme reference 2010 final
Pikas Psp Presentation February 2010
DRAFT PSP Presentation February 2010
Updating The Standard Model
Pikas Developments In Sci Tech Search Web Search U 08
Pikas Asist2007 PIM Senior Engineers Final
Pikas Maryland Tech Day 2007
Pikas Sla07 Physicists Uses Of Blogs
Innovative Libraries: Tales from the Stacks

Recently uploaded (20)

PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Cloud computing and distributed systems.
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Big Data Technologies - Introduction.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Approach and Philosophy of On baking technology
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Electronic commerce courselecture one. Pdf
Understanding_Digital_Forensics_Presentation.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Encapsulation theory and applications.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Cloud computing and distributed systems.
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Big Data Technologies - Introduction.pptx
A Presentation on Artificial Intelligence
The Rise and Fall of 3GPP – Time for a Sabbatical?
Chapter 3 Spatial Domain Image Processing.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Encapsulation_ Review paper, used for researhc scholars
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Spectral efficient network and resource selection model in 5G networks
Approach and Philosophy of On baking technology
NewMind AI Weekly Chronicles - August'25 Week I
Electronic commerce courselecture one. Pdf

Detecting Communities in Science Blogs

  • 1. Detecting Communities in Science Blogs Christina K. Pikas [email_address]     http://terpconnect.umd.edu/~cpikas/ScienceBlogging
  • 2. Problem Area eScience includes using electronic tools both for conducting science and for communicating about science There are an abundance of tools both online and offline to help scientists communicate Lots of scientists and members of the interested public maintain blogs (~2500?) Ultimate Questions: Why? With whom are scientists communicating? What are scientists communicating about? What is the value to the scientists and to science?
  • 3. Specific Problem Addressed What is the nature of the science blogosphere? What is its shape? Who are the central participants? What is the connectivity? Where are the potential information flows?
  • 4. Outline Background Methods Data gathering Analysis Results Discussion
  • 5. Background: Blogs Defined by format Individual posts, with permanent URLs Comments Links In content In blogroll In comments and trackbacks Community develops around single blogs and among blogs through commenting
  • 6. Posts Links to Static Pages Links and automatically generated content http://dorigo.wordpress.com/
  • 7. Access to posts by search and older posts using the calendar A list of most recent posts is automatically generated
  • 8. A list of categories the blogger used to describe his posts. Clicking will list all of the posts in that category. The blogroll is a list of blogs the author reads or endorses to some extent. Access to the older posts by month.
  • 9. The individual post page looks a lot like the blog home page
  • 10. But with Comments, which may be signed with the the commenter’s URL And a form to leave your own comment. Typically your e-mail will not appear on the site
  • 11. Background: Social Network Analysis Uses connections between actors to understand potential flows of information and influence Uses graph theoretic methods to find Central or prestigious actors Cohesive subgroups including communities
  • 12. Methods: Sample Selection Operational Definition of Science Blog Blogs maintained by scientists that deal with any aspect of being a scientist Blogs about scientific topics by non-scientists Omitted Primarily political speech Ones maintained by corporations Non-English language
  • 13. Methods: Data Gathering Two Networks: Links and Commenters Link Data (Blogroll) Used seed list developed in previous study using directories and searches Snowball sampled using links from blogrolls Visited and copied links Commenter Data Selected most central blogs from blogroll data Used Perl scripts to pull the commenter URLs from each of the last 10 posts
  • 14. Methods: Analysis Used social network analysis and graphing software Examined graph and calculated basic descriptive statistics Found centrality and prestige measures Degree: the links in and out Betweenness: the number of shortest paths that flow through that node Closeness: short paths to other nodes
  • 15. Methods: Analysis Located cohesive subgroups Link methods Components LS Sets Clustering methods Community detection techniques Newman-Girvan Spin Glass
  • 16. Results: Link Analysis (Blogroll) One large component There were 1091 nodes, 6621 arcs Diameter is 9 In-degree ranges from 1 to 292, with the median in-degree of 3, and mean 6 10 of the top 20 blogs by in-degree are authored or co-authored by women 4 of the top 5 blogs by closeness are authored or co-authored by women
  • 17.  
  • 18. Results: Commenter 5 components, the largest with 911, others with 11 or fewer nodes 938 nodes (starting with the 46), 1152 arcs The largest component has a diameter of 5
  • 19.  
  • 20. Discussion: Links (Blogroll) Most of the blogs were connected in one dense component A result of the diffusion of blogs? There were a few very central blogs, and then many less central Typical skewed distribution The community of women scientists merits further study
  • 21. Discussion: Commenters Analysis easily located a notorious commenter who leaves incendiary comments on physics and chemistry blogs High out-degree, no links in Traffic on the women scientist blogs is more uniform, with frequent comments that are widely distributed among the blogs Indicates a different use
  • 22. Take Home Messages   The science blogosphere is densely connected with many opportunities for influence and information diffusion Communities tend to form within disciplinary boundaries An exception is the community of women scientist bloggers who are from many different disciplines
  • 23. Acknowledgements Thanks to Dr. Jen Golbeck for supervising this work as part of an independent study Thanks also to Dr. Alan Neustadtl for SNA advice Dr. Dagobert Soergel for research advice
  • 24. Christina K. Pikas Doctoral Student University of Maryland College of Information Studies [email_address] http://terpconnect.umd.edu/~cpikas/ScienceBlogging