SlideShare a Scribd company logo
SiocLog: Providing IRC Discussion Logs as Linked Data  Tuukka Hastrup 1 , Uldis Bojars 2  and  John G. Breslin 2, 3 1  University of Jyväskylä, Finland 2  DERI, NUI Galway, Ireland  3  School of Engineering and Informatics, NUI Galway, Ireland
Motivation IRC conversations are quite disconnected from the Web and even from other IRC channels and networks Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere This may be useful to people who do not use IRC, by those on other networks, or simply by people who leave and rejoin a channel
Motivation (2) SIOC provides a framework for linking social media contributions to other content and Linked Data resources, and IRC can become part of that framework We also need mechanisms to link the IRC contributions to the people who made them, hence the use of Web ID
Background We will begin by introducing the various areas relevant to this system: IRC Linked Data SIOC Web ID
Internet Relay Chat (IRC) Instant messaging / internet chat is a major form of social interaction online It is often disconnected from the Web: Due to the different protocols involved Due to its real-time nature / lack of persistent storage IRC was one of the earliest chat systems It has an important role amongst open-source communities, web communities, and even geeks! Hundreds of thousands of users online at any time
Linked Data Building a “Web of Data” to enhance the current Web Exposing, sharing and connecting data about things via dereferenceable URIs Linking datasets together that were not previously connected, for example: Music and people Real-world things and places The Linking Open Data (LOD) effort aims to link various open datasets together (DBpedia, GeoNames, etc.)
Semantically-Interlinked Online Communities (SIOC) An effort from DERI, NUI Galway to discover how we can create / establish ontologies on the Semantic Web Goal of the SIOC ontology is to address interoperability issues on the (Social) Web http://sioc-project.org/ SIOC has been adopted in a framework of 50 applications or modules deployed on over 400 sites Various domains: Web 2.0, enterprise information integration, HCLS, e-government
 
Some of the SIOC core ontology classes and properties
Some examples of where SIOC is already use (about 50 implementations / applications)
Web ID A Web ID is a web address that identifies a person as a Linked Data item A Web ID should also lead to a document with more information about that person (e.g. FOAF, other RDF) For more information, see the definition in this paper: Ching-Man Au Yeung, Ilaria Liccardi, Kanghao Lu, Oshani Seneviratne, Tim Berners-Lee, “ Decentralization: The Future of Online Social Networking ”, W3C Workshop on Future of Social Networking
Design
Mapping IRC identifiers to URIs on the Web irc://freenode (IRC Network) irc://freenode/%23 channel (Channel) No identifier (Message) irc://freenode/ persona ,isuser (Chat Persona) http://irc.sioc-project.org/#freenode http://irc.sioc-project.org/ channel #channel http://irc.sioc-project.org/ channel /0000-00-00 #00:00:00.00 http://irc.sioc-project.org/users/ persona #user
Some of the internal and external links
Browsing the Linked Data
Creating a link between a user account on IRC and a personal profile Claiming a Web ID creates a link [black] between a user account (a sioc:User that created a sioc:Post in a sioct:ChatChannel) and a person (foaf:Person) The person can manually verify this: By pointing back to the sioc:User from their foaf:Person definition [grey]
Web IDs in SiocLog A Web ID can be claimed using mttlbot Can claim using standard IRC services /msg nickserv set property webid SomeWebID
Implementation 2000 lines of Python source code 1000 lines of Zope/TAL HTML templates Twisted, SimpleTAL and Redland libraries Four major components: IRC interface, data analysis, data integration, Web
Implementation (2) IRC interface: Discussion logger / persona monitor on Twisted Data analysis: Process logs, a filters pipeline, sinks for stats / output Data integration: Queries for external Linked Data (personal profiles) Web interface: Requests via CGI, publishes as HTML and RDF
Finding the names of friends of an IRC persona with SPARQL semwebquery –sparql &quot;SELECT ?name WHERE { ?person foaf:holdsAccount <http://irc.sioc-project.org/users/melvster#user> . ?person foaf:knows ?friend . ?friend foaf:name ?name . }&quot;
Validation 291 chat personas on five channels 22,418 chat messages 51 chat personas have associated Web IDs claimed using mttlbot (2/3) or nickserv (1/3) 44 of those have a valid associated RDF document Scalable (projected 4 million triples in 10 years) SiocLog data being consumed by the “Towards linked sensor data for Hackystat” project SiocLog interfaces to FOAF Me for new profile creation
Future work Extend to instant messaging and private messaging Study of IRC communities where users and content are distributed across channels and networks
Acknowledgements We would like to thank Science Foundation Ireland for their support under grant  SFI/08/CE/I1380 (Líon 2) Thanks also to Benja Fallenstein and Dan Brickley for their insights
Summary IRC conversations are quite disconnected from the Web and even from other IRC channels and networks Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere SIOC provides a framework for interlinking social media to other content and Linked Data, and IRC has been integrated as a part of that framework We also used mechanisms to link IRC contributions to the people who made them via Web ID and FOAF

More Related Content

PPSX
Internet2007
PDF
IETF's Role and Mandate in Internet Governance by Mohit Batra
PPSX
Internet
PPTX
Internetnuovo1
PPSX
Internet va
PPSX
Internet in Vc
PDF
Dw communication
PPTX
Activity 11 common online terminologies
Internet2007
IETF's Role and Mandate in Internet Governance by Mohit Batra
Internet
Internetnuovo1
Internet va
Internet in Vc
Dw communication
Activity 11 common online terminologies

What's hot (15)

PPTX
Activity 13 common online terminologies
PDF
Computer networking Dr. Jayarama Reddy
PPT
TID Chapter 2 Introduction To Internet
PPTX
Activity 13 common online terminologies
PPT
web 1.0, 2.0, 3.0
PPTX
Internet and its uses
DOCX
Internet
PPTX
Web 3.0 & IoT (English)
PPTX
Internet and Its Uses
PPTX
Introduction to Internet By Anamika
PPTX
Chapter1a McHaney 2nd edition
PPTX
Bittorrent - Piracy Sailing on Web 2.0
PPTX
Common Online Terminologies
PPT
What is the organisational difference between Internet & Intranet?
PPTX
Introduction to internet
Activity 13 common online terminologies
Computer networking Dr. Jayarama Reddy
TID Chapter 2 Introduction To Internet
Activity 13 common online terminologies
web 1.0, 2.0, 3.0
Internet and its uses
Internet
Web 3.0 & IoT (English)
Internet and Its Uses
Introduction to Internet By Anamika
Chapter1a McHaney 2nd edition
Bittorrent - Piracy Sailing on Web 2.0
Common Online Terminologies
What is the organisational difference between Internet & Intranet?
Introduction to internet
Ad

Viewers also liked (7)

PPT
eMarketer Webinar: Tips for Reaching & Engaging the Elusive Millennial
PPT
eMarketer Webinar: Integrating Search and Display—Tactics for More Effective ...
PPTX
1. Bloody Web Developers
PDF
Growing Galway's Startup Community
PDF
Startup Community: What Galway Can Do Next
PDF
Adding More Semantics to the Social Web
PDF
eMarketer Webinar: Cross-Device Targeting--What to Watch for in 2017
eMarketer Webinar: Tips for Reaching & Engaging the Elusive Millennial
eMarketer Webinar: Integrating Search and Display—Tactics for More Effective ...
1. Bloody Web Developers
Growing Galway's Startup Community
Startup Community: What Galway Can Do Next
Adding More Semantics to the Social Web
eMarketer Webinar: Cross-Device Targeting--What to Watch for in 2017
Ad

Similar to SiocLog: Providing IRC Discussion Logs as Linked Data (20)

PPT
Data Accessibility and Me: Introducing SIOC, FOAF and the Linked Data Web
PPT
Enhancing the Web Experience
PPT
DataPortability and Me: Introducing SIOC, FOAF and the Semantic Web
PPT
SIOC: Semantic Web for Social Media Sites
PPT
The SIOC Project
PPT
Interlinking Online Communities and Enriching Social Software with the Semant...
PPT
Weaving SIOC into the Web of Linked Data
PPTX
Breaking Down Walls in Enterprise with Social Semantics
PPT
Data Portability with SIOC and FOAF
PPT
JeromeDL Tutorial
PPTX
The Social Semantic Web
PPT
Geo-annotations in Semantic Digital Libraries
PPTX
Fa13 7718-ch3-kim
DOCX
Sup documentation
PPTX
What happened to the Semantic Web?
PDF
unit-5 (2).pdf
ODP
PPT
myExperiment @ Nettab
PPTX
New ICT Trends and Issues of Librarianship
PPTX
Activity9 common online terminologies
Data Accessibility and Me: Introducing SIOC, FOAF and the Linked Data Web
Enhancing the Web Experience
DataPortability and Me: Introducing SIOC, FOAF and the Semantic Web
SIOC: Semantic Web for Social Media Sites
The SIOC Project
Interlinking Online Communities and Enriching Social Software with the Semant...
Weaving SIOC into the Web of Linked Data
Breaking Down Walls in Enterprise with Social Semantics
Data Portability with SIOC and FOAF
JeromeDL Tutorial
The Social Semantic Web
Geo-annotations in Semantic Digital Libraries
Fa13 7718-ch3-kim
Sup documentation
What happened to the Semantic Web?
unit-5 (2).pdf
myExperiment @ Nettab
New ICT Trends and Issues of Librarianship
Activity9 common online terminologies

More from John Breslin (20)

PDF
Ireland: Island of Innovation and Entrepreneurship
PDF
Old Ireland in Colour
PDF
A Balanced Routing Algorithm for Blockchain Offline Channels using Flocking
PDF
Collusion Attack from Hubs in the Blockchain Offline Channel Network
PDF
Collaborative Leadership to Increase the Northern & Western Region’s Innovati...
PDF
TRICS: Teaching Researchers and Innovators how to Create Startups
PDF
Entrepreneurship is in Our DNA
PDF
Galway City Innovation District
PDF
Innovation Districts and Innovation Hubs
PDF
Disciplined mHealth Entrepreneurship
PDF
Searching for Startups
PPTX
Intellectual Property: Protecting Ideas, Designs and Brands in the Real World...
PDF
Innovation and Entrepreneurship: Tips, Tools and Tricks
PDF
Communities and Tech: Build Which and What Will Come?
PDF
Data Analytics and Industry-Academic Partnerships: An Irish Perspective
PPT
“I Like” - Analysing Interactions within Social Networks to Assert the Trustw...
PPT
John Breslin at the Innovation Academy
PDF
Engaging Citizens in Research and Innovation: Opportunities and Challenges Af...
PDF
dW/dI: How a small country like Ireland can make a big difference to the worl...
PDF
Social Media for Charities
Ireland: Island of Innovation and Entrepreneurship
Old Ireland in Colour
A Balanced Routing Algorithm for Blockchain Offline Channels using Flocking
Collusion Attack from Hubs in the Blockchain Offline Channel Network
Collaborative Leadership to Increase the Northern & Western Region’s Innovati...
TRICS: Teaching Researchers and Innovators how to Create Startups
Entrepreneurship is in Our DNA
Galway City Innovation District
Innovation Districts and Innovation Hubs
Disciplined mHealth Entrepreneurship
Searching for Startups
Intellectual Property: Protecting Ideas, Designs and Brands in the Real World...
Innovation and Entrepreneurship: Tips, Tools and Tricks
Communities and Tech: Build Which and What Will Come?
Data Analytics and Industry-Academic Partnerships: An Irish Perspective
“I Like” - Analysing Interactions within Social Networks to Assert the Trustw...
John Breslin at the Innovation Academy
Engaging Citizens in Research and Innovation: Opportunities and Challenges Af...
dW/dI: How a small country like Ireland can make a big difference to the worl...
Social Media for Charities

Recently uploaded (20)

PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Tartificialntelligence_presentation.pptx
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
TLE Review Electricity (Electricity).pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
project resource management chapter-09.pdf
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
cloud_computing_Infrastucture_as_cloud_p
PPTX
The various Industrial Revolutions .pptx
PDF
STKI Israel Market Study 2025 version august
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Enhancing emotion recognition model for a student engagement use case through...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Tartificialntelligence_presentation.pptx
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
A comparative study of natural language inference in Swahili using monolingua...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
TLE Review Electricity (Electricity).pptx
Assigned Numbers - 2025 - Bluetooth® Document
project resource management chapter-09.pdf
Final SEM Unit 1 for mit wpu at pune .pptx
cloud_computing_Infrastucture_as_cloud_p
The various Industrial Revolutions .pptx
STKI Israel Market Study 2025 version august
A contest of sentiment analysis: k-nearest neighbor versus neural network
NewMind AI Weekly Chronicles – August ’25 Week III
Group 1 Presentation -Planning and Decision Making .pptx
Getting started with AI Agents and Multi-Agent Systems
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Web App vs Mobile App What Should You Build First.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf

SiocLog: Providing IRC Discussion Logs as Linked Data

  • 1. SiocLog: Providing IRC Discussion Logs as Linked Data Tuukka Hastrup 1 , Uldis Bojars 2 and John G. Breslin 2, 3 1 University of Jyväskylä, Finland 2 DERI, NUI Galway, Ireland 3 School of Engineering and Informatics, NUI Galway, Ireland
  • 2. Motivation IRC conversations are quite disconnected from the Web and even from other IRC channels and networks Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere This may be useful to people who do not use IRC, by those on other networks, or simply by people who leave and rejoin a channel
  • 3. Motivation (2) SIOC provides a framework for linking social media contributions to other content and Linked Data resources, and IRC can become part of that framework We also need mechanisms to link the IRC contributions to the people who made them, hence the use of Web ID
  • 4. Background We will begin by introducing the various areas relevant to this system: IRC Linked Data SIOC Web ID
  • 5. Internet Relay Chat (IRC) Instant messaging / internet chat is a major form of social interaction online It is often disconnected from the Web: Due to the different protocols involved Due to its real-time nature / lack of persistent storage IRC was one of the earliest chat systems It has an important role amongst open-source communities, web communities, and even geeks! Hundreds of thousands of users online at any time
  • 6. Linked Data Building a “Web of Data” to enhance the current Web Exposing, sharing and connecting data about things via dereferenceable URIs Linking datasets together that were not previously connected, for example: Music and people Real-world things and places The Linking Open Data (LOD) effort aims to link various open datasets together (DBpedia, GeoNames, etc.)
  • 7. Semantically-Interlinked Online Communities (SIOC) An effort from DERI, NUI Galway to discover how we can create / establish ontologies on the Semantic Web Goal of the SIOC ontology is to address interoperability issues on the (Social) Web http://sioc-project.org/ SIOC has been adopted in a framework of 50 applications or modules deployed on over 400 sites Various domains: Web 2.0, enterprise information integration, HCLS, e-government
  • 8.  
  • 9. Some of the SIOC core ontology classes and properties
  • 10. Some examples of where SIOC is already use (about 50 implementations / applications)
  • 11. Web ID A Web ID is a web address that identifies a person as a Linked Data item A Web ID should also lead to a document with more information about that person (e.g. FOAF, other RDF) For more information, see the definition in this paper: Ching-Man Au Yeung, Ilaria Liccardi, Kanghao Lu, Oshani Seneviratne, Tim Berners-Lee, “ Decentralization: The Future of Online Social Networking ”, W3C Workshop on Future of Social Networking
  • 13. Mapping IRC identifiers to URIs on the Web irc://freenode (IRC Network) irc://freenode/%23 channel (Channel) No identifier (Message) irc://freenode/ persona ,isuser (Chat Persona) http://irc.sioc-project.org/#freenode http://irc.sioc-project.org/ channel #channel http://irc.sioc-project.org/ channel /0000-00-00 #00:00:00.00 http://irc.sioc-project.org/users/ persona #user
  • 14. Some of the internal and external links
  • 16. Creating a link between a user account on IRC and a personal profile Claiming a Web ID creates a link [black] between a user account (a sioc:User that created a sioc:Post in a sioct:ChatChannel) and a person (foaf:Person) The person can manually verify this: By pointing back to the sioc:User from their foaf:Person definition [grey]
  • 17. Web IDs in SiocLog A Web ID can be claimed using mttlbot Can claim using standard IRC services /msg nickserv set property webid SomeWebID
  • 18. Implementation 2000 lines of Python source code 1000 lines of Zope/TAL HTML templates Twisted, SimpleTAL and Redland libraries Four major components: IRC interface, data analysis, data integration, Web
  • 19. Implementation (2) IRC interface: Discussion logger / persona monitor on Twisted Data analysis: Process logs, a filters pipeline, sinks for stats / output Data integration: Queries for external Linked Data (personal profiles) Web interface: Requests via CGI, publishes as HTML and RDF
  • 20. Finding the names of friends of an IRC persona with SPARQL semwebquery –sparql &quot;SELECT ?name WHERE { ?person foaf:holdsAccount <http://irc.sioc-project.org/users/melvster#user> . ?person foaf:knows ?friend . ?friend foaf:name ?name . }&quot;
  • 21. Validation 291 chat personas on five channels 22,418 chat messages 51 chat personas have associated Web IDs claimed using mttlbot (2/3) or nickserv (1/3) 44 of those have a valid associated RDF document Scalable (projected 4 million triples in 10 years) SiocLog data being consumed by the “Towards linked sensor data for Hackystat” project SiocLog interfaces to FOAF Me for new profile creation
  • 22. Future work Extend to instant messaging and private messaging Study of IRC communities where users and content are distributed across channels and networks
  • 23. Acknowledgements We would like to thank Science Foundation Ireland for their support under grant SFI/08/CE/I1380 (Líon 2) Thanks also to Benja Fallenstein and Dan Brickley for their insights
  • 24. Summary IRC conversations are quite disconnected from the Web and even from other IRC channels and networks Often there is valuable and needed information in an IRC chat that cannot be linked to people, topics or events, or in general referenced from elsewhere SIOC provides a framework for interlinking social media to other content and Linked Data, and IRC has been integrated as a part of that framework We also used mechanisms to link IRC contributions to the people who made them via Web ID and FOAF