SlideShare a Scribd company logo
Web2.0 and 3.0,  Social networks and Journalists  Mining  information  from  social networking sites Journalists are increasingly turning to social networks to look for case studies, contacts and expert opinion. But searching social networks can be frustrating and time consuming.
Used carefully,  google  can be far more effective use google’s advanced operators to source only from social network profiles and pages using specific search terms This technique can also allow you to pin-point specific information in profiles.
Google’s  advanced operators Google  allows various ‘advanced’ operators. These are typed directly into the  Google  Search field. Used correctly and with care, they can be far more effective than using the ‘advanced’ search page.
A crash course in advanced operators: That search will look for pages that include the terms ‘patient’, ‘help’ and MRSA - but only in the UK’s Health Protection Agency
The operator ends with a colon, and then no space. These are the most important operators for what we are discussing today. site: www.hpa .org.uk - restricts the search only to HPA pages inurl: privacy - restricts the search only to pages that have the word ‘privacy’ in the url intitle: semantic - restricts the search only to pages with ‘semantic’ in the title of the page
link: www.hpa .org.uk - restricts the search only to pages that link to the HPA filetype: pdf - restricts the search only to pdf documents allintitle :privacy research - will return pages that have both ‘privacy’ and ‘research’ in the title. info: www.hpa .org.uk accesses Google information about that site such as similar sites and site that link to it Advanced operators are extremely powerful and can be used to access information on website servers for example.
The technique varies depending on the social network We’ll start by looking at bebo.com Bebo profiles usually have a url that looks like this: http://www.bebo.com/Profile.jsp?MemberId=xxxxxxx The url normally contains the terms: ‘profile’ and ‘memberid’ Using google’s advanced operators we can include these terms in our search strings to search only within bebo profiles.
This search string: site:bebo.com inurl:memberid inurl:bebo Returned around 34 million hits in  October 2008
Imagine you are looking for people who work for Pfizer in bebo.com Search the bebo.com site and you’ll get around  85 hits
But Search in google using this string: site:.bebo.com inurl:memberid inurl:bebo pfizer And you get  1940  hits.
And many of those include open profiles from people who work for Pfizer
Search for the term  “ tomb-stoning” in bebo and you get 3 people.
But use this string in google: And you get 98 profiles of people who claim to ‘tomb stone’ site:.bebo.com inurl:memberid inurl:bebo  “ tomb-stoning”
Friendster.com  requires a login to search profiles and within the  Friendster  pages But you can get around this barrier using search engine operators combined with other search terms....
Returns nearly 7 million hits. For example, this search: inurl:profiles inurl:friendster
Searching within those results for ‘Oslo’ Initially, google only returns 2 results. When google hides many ‘similar pages’ you need to  ‘repeat the search with omitted results included’
When we do that, google returns  2,260  profiles from people in Oslo or who mention Oslo
Livejournal You can search LiveJournal communities and members via the ‘explore’ page. For example, imagine you are writing a story about the hospital acquired infection - MRSA.  You can search for ‘mrsa’ in the liverjournal search field.
And we get 3 matches for communities interested in MRSA. And 18 matches for users.
LiveJournal ‘community’ pages normally have URLs structured like this: http://community.livejournal.com/zen_within/ And LiveJournal ‘user’ pages normally have URLs structured like this: http://username.livejournal.com/XXXXX Using the same tactics as before, we can use Google’s advanced operators to search livejournal’s pages more effectively.
In October, this search: inurl:livejournal site:livejournal.com  returned more than 55 million hits. And this search: inurl:livejournal site:livejournal.com mrsa  returned more than 2,480 hits.
You can go on refining your results using similar tactics. For example, this search: inurl:livejournal site:livejournal.com mrsa inurl:community returns 373 results for ‘community’ pages only.
Including the UKs ‘Cynical Nurse’ community: Where there is a thread on MRSA
Myspace Myspace profiles usually have a url that looks like this: http://profile.myspace.com/index.cfm?fuseaction=user.viewprofile&friendid=xxxxx The url normally contains the terms: ‘fuseaction’ and ‘viewprofile’ Using these terms to explore Myspace content using Google: site:myspace.com inurl:fuseaction Returns million 17 million hits in October 2008.
Use Google as an extra tool to search Myspace.  For example, if you search for ‘MRSA’ under ‘people’ in Myspace, you get 49 profiles. But this search in Google:  site:myspace.com inurl:viewprofile MRSA  returns 2890 results.
Linkedin Linkedin  is generally seen as the professional social network for business people. But it is very difficult to search or view any profiles unless you are a member
As a member, if I search for ‘Pfizer’ under ‘people’ I get only 20 hits
But here are some of the 290,000 hits I obtained using: site:linkedin.com pfizer in Google And many of those are Pfizer employees...
Here is the ‘public profile’ Pfizer’s Associate Director of Global Regulatory Affairs. This gives her current position, previous experience, education.  Other profiles give interests. This ‘public’ listing can be found in Google when you enter specific names.  But this technique allows you to search using company names or job titles etc.
Imagine you are doing a story on the highly controversial ‘pro-anorexia’ sites and ‘pro-ana’ trend. Often linked to ‘thinspo’ sites.  Search for those terms in bebo and you get roughly 55 references in “people” - many of those are closed profiles.
Use this search term in google: site:.bebo.com inurl:memberid inurl:bebo pro-ana OR pro-anorexia and you get more than 600 hits
With links to pro-ana websites, Potential case studies and anecdotes And other leads and links
Using:  inurl:livejournal site:livejournal inurl:community pro-ana We can explore Livejournal’s community sites that are pro-ana or campaigning against pro-ana. Adding in other terms to narrow focus
We get 115 hits in Livejournal Community pages that mention  London . Some of those are potential leads.  By adding  ‘London’  to the search string
Be  flexible  with these tactics Try  different  strategies with different social networks Hone  your results by adding additional search terms Use Google’s  ‘search within results’  option to  drill  down further
Using these tactics In May this year I set myself the target of: finding  personal information  related to somene under  16 years of age, someone’s precise location; and, personal information related to someone’s work. In 10 minutes I was able to find:
- the  mobile number  of a  15-year-old girl  in South London;  - the  address  of where a  17-year-old waitress  works in Kent; and,  - the  e-mail address  and  salary  of an  Accenture  employee. These kind of privacy blunders litter sites such as Bebo.com, Myspace.com and Facebook and the debate about how best to protect people from identity theft has intensified as social networking has exploded in popularity.
Related tactics prove so successful at reaching sensitive, personal information that journalism.co.uk wrote to the Press Complaints Commission.  We are likely to do so again. See demonstration.

More Related Content

PPTX
What is the new facebook search engine
PDF
Conflicting Content Your biggest nightmare
PDF
DIY basic Facebook data mining
PDF
Facebook Manual
PPTX
Recruiting using Social media Cpl
PPTX
Ms tech credible sources
PPT
Com 495 Discussion Leader
PPT
Pipl
What is the new facebook search engine
Conflicting Content Your biggest nightmare
DIY basic Facebook data mining
Facebook Manual
Recruiting using Social media Cpl
Ms tech credible sources
Com 495 Discussion Leader
Pipl

What's hot (20)

PDF
Nsa responds 3 snowden media run-bys with usg
PDF
Search Engine Optimization - Aykut Aslantaş
PDF
How to Search Twitter
PPTX
Conservative party - Web analysis
PPTX
Newsgathering and monitoring the social web
PPTX
News gathering & social media monitoring platforms
PPTX
Dad's Garage Digital Media Audit
PPTX
3/9/11 Boston Area SharePoint Users Group Meeting
PPTX
10/12/11 Boston Area SharePoint Users Group Meeting
PPTX
Boston Area SharePoint User Group 10/21/10 Meeting
PDF
News-gathering and Monitoring | by: Menna El-hosary
PDF
SearchLoveLondon 2019 - Faisal Anderson - Spying on Google: Using Log File An...
PPTX
7/13/11 Boston Area SharePoint Users Group Meeting
PDF
8/11/10 Boston Area SharePoint Users Group meeting
PPTX
SEO & the effect of social media in search
PPTX
Boston Area SharePoint User Group 11/10/10 Meeting
PPTX
BASPUG Meeting deck from 7/11/12
PPTX
4/13/11 Boston Area SharePoint Users Group Meeting
PPTX
Boston Area SharePoint Users Group November 9th, 2011 Meeting
PPTX
Finding stories by newsgathering and monitoring on social web .pptx
Nsa responds 3 snowden media run-bys with usg
Search Engine Optimization - Aykut Aslantaş
How to Search Twitter
Conservative party - Web analysis
Newsgathering and monitoring the social web
News gathering & social media monitoring platforms
Dad's Garage Digital Media Audit
3/9/11 Boston Area SharePoint Users Group Meeting
10/12/11 Boston Area SharePoint Users Group Meeting
Boston Area SharePoint User Group 10/21/10 Meeting
News-gathering and Monitoring | by: Menna El-hosary
SearchLoveLondon 2019 - Faisal Anderson - Spying on Google: Using Log File An...
7/13/11 Boston Area SharePoint Users Group Meeting
8/11/10 Boston Area SharePoint Users Group meeting
SEO & the effect of social media in search
Boston Area SharePoint User Group 11/10/10 Meeting
BASPUG Meeting deck from 7/11/12
4/13/11 Boston Area SharePoint Users Group Meeting
Boston Area SharePoint Users Group November 9th, 2011 Meeting
Finding stories by newsgathering and monitoring on social web .pptx
Ad

Viewers also liked (9)

PPT
Introduction to Data Mining
PPTX
Data Mining: Graph mining and social network analysis
PDF
Social Data Mining
PPTX
Data Mining: Graph mining and social network analysis
PPTX
Social media mining PPT
PPTX
Social Media Mining - Chapter 3 (Network Measures)
PPTX
Data mining for social media
PDF
Data mining in social network
PPTX
Curiosity Bits Python Tutorial: Mining Facebook Fan Page - getting posts and ...
Introduction to Data Mining
Data Mining: Graph mining and social network analysis
Social Data Mining
Data Mining: Graph mining and social network analysis
Social media mining PPT
Social Media Mining - Chapter 3 (Network Measures)
Data mining for social media
Data mining in social network
Curiosity Bits Python Tutorial: Mining Facebook Fan Page - getting posts and ...
Ad

Similar to Journalists and the Social Web 1 (20)

PPTX
Advanced Social Networking: Library and Online Tools for Job Seekers
PPT
Research
PPT
PDF
Advanced Search: A Journalism Tools Session
DOCX
Linked in of everything possible v2
DOCX
Linked in of everything possible v2
PDF
Advanced Recruiting Techniques
PDF
Advance Recruiting Technique
PPT
SEO and Social Search on Facebook, LinkedIn, and Twitter
PPT
SEO and Social Search on Facebook, LinkedIn, and Twitter by Scott Wilder
PPT
Lesson 4: Researching & The Internet
PPTX
Information update may 2011
PPT
Academic Skills 4
PPTX
Search engine optimization
PDF
QSG v1.1
PDF
Source Candidates using Facebook
PDF
What is Graph Search?
PPT
Think Like Google - What You Need to Know About SEO
PPT
SEO is broken – giving way to social search
PPT
Job huntingdec2011
Advanced Social Networking: Library and Online Tools for Job Seekers
Research
Advanced Search: A Journalism Tools Session
Linked in of everything possible v2
Linked in of everything possible v2
Advanced Recruiting Techniques
Advance Recruiting Technique
SEO and Social Search on Facebook, LinkedIn, and Twitter
SEO and Social Search on Facebook, LinkedIn, and Twitter by Scott Wilder
Lesson 4: Researching & The Internet
Information update may 2011
Academic Skills 4
Search engine optimization
QSG v1.1
Source Candidates using Facebook
What is Graph Search?
Think Like Google - What You Need to Know About SEO
SEO is broken – giving way to social search
Job huntingdec2011

Recently uploaded (20)

PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
SOPHOS-XG Firewall Administrator PPT.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Electronic commerce courselecture one. Pdf
PPTX
1. Introduction to Computer Programming.pptx
PPTX
Tartificialntelligence_presentation.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Approach and Philosophy of On baking technology
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Encapsulation theory and applications.pdf
PPT
Teaching material agriculture food technology
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Empathic Computing: Creating Shared Understanding
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
Encapsulation_ Review paper, used for researhc scholars
Assigned Numbers - 2025 - Bluetooth® Document
Per capita expenditure prediction using model stacking based on satellite ima...
SOPHOS-XG Firewall Administrator PPT.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Electronic commerce courselecture one. Pdf
1. Introduction to Computer Programming.pptx
Tartificialntelligence_presentation.pptx
A Presentation on Artificial Intelligence
Approach and Philosophy of On baking technology
Dropbox Q2 2025 Financial Results & Investor Presentation
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Encapsulation theory and applications.pdf
Teaching material agriculture food technology
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Building Integrated photovoltaic BIPV_UPV.pdf
MYSQL Presentation for SQL database connectivity
Empathic Computing: Creating Shared Understanding
20250228 LYD VKU AI Blended-Learning.pptx

Journalists and the Social Web 1

  • 1. Web2.0 and 3.0, Social networks and Journalists Mining information from social networking sites Journalists are increasingly turning to social networks to look for case studies, contacts and expert opinion. But searching social networks can be frustrating and time consuming.
  • 2. Used carefully, google can be far more effective use google’s advanced operators to source only from social network profiles and pages using specific search terms This technique can also allow you to pin-point specific information in profiles.
  • 3. Google’s advanced operators Google allows various ‘advanced’ operators. These are typed directly into the Google Search field. Used correctly and with care, they can be far more effective than using the ‘advanced’ search page.
  • 4. A crash course in advanced operators: That search will look for pages that include the terms ‘patient’, ‘help’ and MRSA - but only in the UK’s Health Protection Agency
  • 5. The operator ends with a colon, and then no space. These are the most important operators for what we are discussing today. site: www.hpa .org.uk - restricts the search only to HPA pages inurl: privacy - restricts the search only to pages that have the word ‘privacy’ in the url intitle: semantic - restricts the search only to pages with ‘semantic’ in the title of the page
  • 6. link: www.hpa .org.uk - restricts the search only to pages that link to the HPA filetype: pdf - restricts the search only to pdf documents allintitle :privacy research - will return pages that have both ‘privacy’ and ‘research’ in the title. info: www.hpa .org.uk accesses Google information about that site such as similar sites and site that link to it Advanced operators are extremely powerful and can be used to access information on website servers for example.
  • 7. The technique varies depending on the social network We’ll start by looking at bebo.com Bebo profiles usually have a url that looks like this: http://www.bebo.com/Profile.jsp?MemberId=xxxxxxx The url normally contains the terms: ‘profile’ and ‘memberid’ Using google’s advanced operators we can include these terms in our search strings to search only within bebo profiles.
  • 8. This search string: site:bebo.com inurl:memberid inurl:bebo Returned around 34 million hits in October 2008
  • 9. Imagine you are looking for people who work for Pfizer in bebo.com Search the bebo.com site and you’ll get around 85 hits
  • 10. But Search in google using this string: site:.bebo.com inurl:memberid inurl:bebo pfizer And you get 1940 hits.
  • 11. And many of those include open profiles from people who work for Pfizer
  • 12. Search for the term “ tomb-stoning” in bebo and you get 3 people.
  • 13. But use this string in google: And you get 98 profiles of people who claim to ‘tomb stone’ site:.bebo.com inurl:memberid inurl:bebo “ tomb-stoning”
  • 14. Friendster.com requires a login to search profiles and within the Friendster pages But you can get around this barrier using search engine operators combined with other search terms....
  • 15. Returns nearly 7 million hits. For example, this search: inurl:profiles inurl:friendster
  • 16. Searching within those results for ‘Oslo’ Initially, google only returns 2 results. When google hides many ‘similar pages’ you need to ‘repeat the search with omitted results included’
  • 17. When we do that, google returns 2,260 profiles from people in Oslo or who mention Oslo
  • 18. Livejournal You can search LiveJournal communities and members via the ‘explore’ page. For example, imagine you are writing a story about the hospital acquired infection - MRSA. You can search for ‘mrsa’ in the liverjournal search field.
  • 19. And we get 3 matches for communities interested in MRSA. And 18 matches for users.
  • 20. LiveJournal ‘community’ pages normally have URLs structured like this: http://community.livejournal.com/zen_within/ And LiveJournal ‘user’ pages normally have URLs structured like this: http://username.livejournal.com/XXXXX Using the same tactics as before, we can use Google’s advanced operators to search livejournal’s pages more effectively.
  • 21. In October, this search: inurl:livejournal site:livejournal.com returned more than 55 million hits. And this search: inurl:livejournal site:livejournal.com mrsa returned more than 2,480 hits.
  • 22. You can go on refining your results using similar tactics. For example, this search: inurl:livejournal site:livejournal.com mrsa inurl:community returns 373 results for ‘community’ pages only.
  • 23. Including the UKs ‘Cynical Nurse’ community: Where there is a thread on MRSA
  • 24. Myspace Myspace profiles usually have a url that looks like this: http://profile.myspace.com/index.cfm?fuseaction=user.viewprofile&friendid=xxxxx The url normally contains the terms: ‘fuseaction’ and ‘viewprofile’ Using these terms to explore Myspace content using Google: site:myspace.com inurl:fuseaction Returns million 17 million hits in October 2008.
  • 25. Use Google as an extra tool to search Myspace. For example, if you search for ‘MRSA’ under ‘people’ in Myspace, you get 49 profiles. But this search in Google: site:myspace.com inurl:viewprofile MRSA returns 2890 results.
  • 26. Linkedin Linkedin is generally seen as the professional social network for business people. But it is very difficult to search or view any profiles unless you are a member
  • 27. As a member, if I search for ‘Pfizer’ under ‘people’ I get only 20 hits
  • 28. But here are some of the 290,000 hits I obtained using: site:linkedin.com pfizer in Google And many of those are Pfizer employees...
  • 29. Here is the ‘public profile’ Pfizer’s Associate Director of Global Regulatory Affairs. This gives her current position, previous experience, education. Other profiles give interests. This ‘public’ listing can be found in Google when you enter specific names. But this technique allows you to search using company names or job titles etc.
  • 30. Imagine you are doing a story on the highly controversial ‘pro-anorexia’ sites and ‘pro-ana’ trend. Often linked to ‘thinspo’ sites. Search for those terms in bebo and you get roughly 55 references in “people” - many of those are closed profiles.
  • 31. Use this search term in google: site:.bebo.com inurl:memberid inurl:bebo pro-ana OR pro-anorexia and you get more than 600 hits
  • 32. With links to pro-ana websites, Potential case studies and anecdotes And other leads and links
  • 33. Using: inurl:livejournal site:livejournal inurl:community pro-ana We can explore Livejournal’s community sites that are pro-ana or campaigning against pro-ana. Adding in other terms to narrow focus
  • 34. We get 115 hits in Livejournal Community pages that mention London . Some of those are potential leads. By adding ‘London’ to the search string
  • 35. Be flexible with these tactics Try different strategies with different social networks Hone your results by adding additional search terms Use Google’s ‘search within results’ option to drill down further
  • 36. Using these tactics In May this year I set myself the target of: finding personal information related to somene under 16 years of age, someone’s precise location; and, personal information related to someone’s work. In 10 minutes I was able to find:
  • 37. - the mobile number of a 15-year-old girl in South London; - the address of where a 17-year-old waitress works in Kent; and, - the e-mail address and salary of an Accenture employee. These kind of privacy blunders litter sites such as Bebo.com, Myspace.com and Facebook and the debate about how best to protect people from identity theft has intensified as social networking has exploded in popularity.
  • 38. Related tactics prove so successful at reaching sensitive, personal information that journalism.co.uk wrote to the Press Complaints Commission. We are likely to do so again. See demonstration.