@twitter Mining #Microblogs Using #Semantic Technologies
Selver Softic, Martin Ebner, Herbert Mühlburger, Thomas Altmann, Behnam Taraghi
Web 2.0 - well known story
• Web 2.0 technologies brought users closer to Web …
• Wikis, Blogs, Forums …
• Podcasts, RSS, XML …
• … then users started to generate content …
Source: http://mediabistro.com
From Web to Social Web
• Result = a vast amount of information
• Text, Pictures, Audio, Videos …
• Communication, networking, exchange of data
• Web became more personal
• Cultural, geographical and social borders disappeared
Source: http://www.ignitesocialmedia.com
Social Media Boom!
Swap2010 twitter mining using semantic web technologies and linked data
Social sites are data silos
Source: www.pidgintech.com
But still disconnected?
Source: www.pidgintech.com
Data is still captured in a walled garden!
Statements
• Social Web relies on users and communication among them
• While communicating, users produce or consume content
• Social sites are data silos rich in variety of information
• This information could be interesting for: monitoring of trends, advertising, statistics, reputation, news broadcasting, tagging …
• This data is captured in a walled garden!
Questions
• How to use this data to gain more useful insights?
• What are the advantages of online (offline) search on such data, and how to reach it in a uniform way?
• Is it possible to structure, connect and expose the data so that it can be used by humans and machines more efficiently?
• What would an architecture look like for this issue?
Social Web Trends
• Microblogging
• Social Bookmarking
• Social Networking
• Social Marketing
• Sharing Photos, Videos …
Source: http://socialwebresearch.com
Microblogs
• Microblogs
  • Used for communication, publishing and information exchange
  • Simple for processing
  • Information generated by many different users
  • Social user relations
  • Tripartite communication structure
  • Variety of information
  • No boundaries by culture, location or technology (mobile users)
• Twitter
  • Most popular
  • Large amount of data
  • But limited
According to http://an.kaist.ac.kr/traces/WWW2010.html: 41.7 million user profiles, 1.47 billion social relations, 4,262 trending topics, and 106 million tweets
Semantic aspects and Twitter
• Twitter
  • User relations
  • Tweets as short information artefacts
  • Communication with tripartite pattern
  • Time-related information
• Vocabularies
  • SIOC, FOAF, Dublin Core
Linked Data and Twitter
• Twitter contains information on:
  • People, Organisations, Locations, Trends …
• LOD Cloud contains billions of triples about:
  • Geolocations, data about science, government, common knowledge, persons, news …
• Vocabularies
  • MOAT, CommonTag
Architecture model
Acquisition - Grabeeter
Grabeeter
• Search in your Tweets
• Filter your Tweets by date
• Search in your Tweets offline using the Grabeeter Client
• Filter your Tweets offline using the Grabeeter Client
• Grabeeter provides an API
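The search and date-filter features listed above can be pictured with a small offline sketch. This is not the Grabeeter client or its API, whose interface the slides do not describe; `TWEETS`, `search_tweets` and the second sample tweet are hypothetical, and only the field names follow the tweet attributes shown later in the deck.

```python
from datetime import date

# Hypothetical local archive. Only the first entry comes from the deck;
# the second is made-up sample data for illustration.
TWEETS = [
    {"screen_name": "selvers", "created": "2010-08-19",
     "text": "Sitting in Prater #vienna, launch party. Nice"},
    {"screen_name": "selvers", "created": "2010-09-02",
     "text": "Reading about linked data"},
]


def search_tweets(tweets, keyword=None, since=None, until=None):
    """Return tweets that contain `keyword` (case-insensitive) and whose
    creation date lies within the inclusive range [since, until]."""
    hits = []
    for tweet in tweets:
        created = date.fromisoformat(tweet["created"])
        if since is not None and created < since:
            continue
        if until is not None and created > until:
            continue
        if keyword is not None and keyword.lower() not in tweet["text"].lower():
            continue
        hits.append(tweet)
    return hits
```

Calling `search_tweets(TWEETS, keyword="vienna")` returns only the Prater tweet, and passing `since=date(2010, 9, 1)` restricts the result to the later entry.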
Triplification Module
Tweet fields: Author, Date, Content, Receiver
<tweet url="http://grabeeter.tugraz.at/tweet/199272" text="Sitting in Prater #vienna, launch party. Nice" screen_name="selvers" created="2010-08-19" twitterUrl="http://twitter.com/selvers/status/21606926237"/>
Components: RDF Store, Triplifier
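A minimal sketch of what a triplifier for the tweet record above might do, assuming the attribute names from the slide and the SIOC and Dublin Core terms the deck lists. The function names `literal` and `triplify` are hypothetical, the output is plain N-Triples-style lines rather than the authors' actual serialisation, and the string escaping is deliberately minimal.

```python
import xml.etree.ElementTree as ET

# Vocabulary namespaces (SIOC, SIOC types, Dublin Core terms, RDF).
SIOC = "http://rdfs.org/sioc/ns#"
SIOCT = "http://rdfs.org/sioc/types#"
DCTERMS = "http://purl.org/dc/terms/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# The sample tweet from the slide; plain twitter.com and
# grabeeter.tugraz.at host names are assumed here.
TWEET_XML = ('<tweet url="http://grabeeter.tugraz.at/tweet/199272" '
             'text="Sitting in Prater #vienna, launch party. Nice" '
             'screen_name="selvers" created="2010-08-19" '
             'twitterUrl="http://twitter.com/selvers/status/21606926237"/>')


def literal(value):
    """Quote a string as a literal (escapes backslashes and quotes only)."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def triplify(tweet_xml):
    """Turn one <tweet/> element into a list of N-Triples-style lines."""
    tweet = ET.fromstring(tweet_xml)
    post = f"<{tweet.get('twitterUrl')}>"
    # Assumption: the creator URI is derived from the screen name.
    creator = f"<http://twitter.com/{tweet.get('screen_name')}/>"
    return [
        f"{post} <{RDF_TYPE}> <{SIOCT}MicroblogPost> .",
        f"{post} <{SIOC}content> {literal(tweet.get('text'))} .",
        f"{post} <{SIOC}has_creator> {creator} .",
        f"{post} <{DCTERMS}created> {literal(tweet.get('created'))} .",
    ]
```

`triplify(TWEET_XML)` yields four statements about the tweet: its type, content, creator and creation date. A real pipeline would more likely hand these to an RDF library and store than build strings by hand.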
Triplification Module
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix sioc: <http://rdfs.org/sioc/ns#> .
@prefix sioct: <http://rdfs.org/sioc/types#> .
@prefix dcterms: <http://purl.org/dc/terms/> .

<http://twitter.com/selvers/status/21606926237> rdf:type sioct:MicroblogPost ;
    sioc:content "Sitting in Prater #vienna, launch party. Nice" ;
    sioc:has_creator <http://twitter.com/selvers/> ;
    foaf:maker <http://grabeeter.tugraz.at/foaf/selvers/> ;
    dcterms:created "2010-08-19" ;
    owl:sameAs <http://grabeeter.tugraz.at/tweet/199272> .

<http://twitter.com/selvers/> rdf:type foaf:Person ;
    foaf:name "Selver Softic" ;
    foaf:depiction <http://a0.twimg.com/profile_images/905118560/f9e4b6eba.13070201_3_normal.jpg> ;
    foaf:knows <http://twitter.com/hmuehlburger/> ;
    foaf:knows <http://twitter.com/mhausenblas/> ;
    foaf:knows <http://twitter.com/mebner/> .
…
Interlinking Module
• Hashtags (People, Organisation, Locations)
• MOAT, CommonTag
• Later: NLP-processed content, SILK Framework

SELECT ?post ?content ?maker ?name
WHERE {
  ?post rdf:type sioct:MicroblogPost ;
        foaf:maker ?maker ;
        sioc:content ?content .
  ?maker foaf:name ?name .
  FILTER(regex(?content, "#vienna"))
}

Classifier
tag:tagName "vienna" ;
moat:tagMeaning <http://dbpedia.org/resource/Vienna> ;
tag:taggedResource <http://twitter.com/selvers/status/21606926237> .
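The hashtag-based interlinking step could look roughly like the sketch below. It extracts hashtags from the tweet content and attaches a candidate DBpedia resource to each one. The rule "capitalise the tag and append it to the DBpedia resource namespace" is only an illustrative assumption, not the classifier from the slides, which relies on MOAT and CommonTag; the function names are hypothetical, and the dictionary keys simply reuse the property names shown above.

```python
import re

# Matches a '#' followed by word characters, capturing the tag text.
HASHTAG = re.compile(r"#(\w+)")


def hashtags(text):
    """Return the lower-cased hashtags found in a tweet, without the '#'."""
    return [tag.lower() for tag in HASHTAG.findall(text)]


def candidate_meaning(tag):
    """Naive assumption: the capitalised tag names a DBpedia resource."""
    return f"http://dbpedia.org/resource/{tag.capitalize()}"


def tag_post(post_uri, text):
    """Build one tagging record per hashtag in the tweet text."""
    return [
        {
            "tagName": tag,
            "tagMeaning": candidate_meaning(tag),
            "taggedResource": post_uri,
        }
        for tag in hashtags(text)
    ]
```

For the sample tweet this produces a single record linking the tag "vienna" to `http://dbpedia.org/resource/Vienna`. Ambiguous tags would need a real disambiguation step, for example the NLP processing or the SILK Framework mentioned on the slide.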
Analysis
Conclusions & Outlook
• Current state-of-the-art technologies suffice to realise the proposed architecture paradigm
• Interlinking with LOD Cloud (Tweet-O-Sphere)
• Involving NLP methods
  • Sentiment classification
  • (Re)Tagging of Tweets
• Providing SPARQL Endpoint + Lookup Service as research interface
• Social Semantic Web Apps
Questions?
