SlideShare a Scribd company logo
When Search Becomes Research – Part I




Query design




Anat Ben-David
Science, Technology and Society, Bar-Ilan University
anatbd@gmail.com

Erik Borra
Digital Methods Initiative, University of Amsterdam
erik@digitalmethods.net
When Search Becomes Research




Turning Google into a research tool




“We look at Google search results and see society, instead of
Google” (Rogers, Stevenson, Weltevrede 2009)
Words/keywords




When words become “issue language”




Actors and their terminology: Keywords as positioning efforts
Note program, anti-program as well as efforts at net neutrality (Cf.
Akrich & Latour, 1992)
Keywords and source sets




“side by sidedness” offical / non-official … different kinds of actors
(living side by side) in an issue map. (Rogers, 2004)
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
example:
Program / anti-program



How is the issue of “google street view” and privacy is being treated when google-
related sites are excluded from the search?




1. “Google street view” +privacy                                        33,100,000 results
2. “Google street view” +privacy site:google.*                               1,840 results
3. “Google street view” +privacy –site:google.*                        36,000,000 results
search/research




Search operators and syntax




Use, for example: +, ~, OR, NOT, SITE:, “”
See, for example: http://www.googleguide.com/category/query-
input/
example:
Program / anti-program




A query design example using advanced operators:




• “~Cellular Phone” + “brain tumor” + “not associated”
• “~Cellular Phone” + “brain tumor” + “270%”
• Compare two queries across different actors. Add “site:.edu”,
  “site:.com”, .”site:.org”, etc.
search/research




Research protocol For using Google


Google Settings:
• For the “universal Google” go to http://google.com/ncr or http://
google.com/intl/en
• Log out of your Gmail account
• Google preferences:
     Set interface and search language
     SafeSearch: Off
     Google Instant: Off
     Nr of Results: 100 per page
search/research




Research protocol For using Google


Clean browser
• Log out
• Clear cookies and the browser’s search history
• Or: create a “research browser” (i.e. install a new one)


“Turning off search history personalization”
example:
Program / anti-program




A query design example using advanced operators:




• “~Cellular Phone” + “brain tumor” + “not associated”
• “~Cellular Phone” + “brain tumor” + “270%”
• Compare two queries across different actors. Add “site:.edu”,
  “site:.com”, .”site:.org”, etc.
Example:
nationality of issues: Rights types




Can the search engine be repurposed to show which rights are specific per country?


Method

1. Query the term "rights" in national terminology per different Google
country (e.g. ‘droits’ in .fr, ‘rechten’ in .nl)

2. Fetch the top 10 unique rights types.

3. Visualize top 10 issues per country and mark unique issues.


https://wiki.digitalmethods.net/Dmi/NationalityofIssues
Body Text




Body text
search/research




Research protocol




Saving results for verification and retrieval
• “Save page as” in the browser, name files and folder consistently
• Collect right types in spreadsheet (incl. translation)
• Merge results and collect saved files in one place
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
DMI Workshop: When Search Becomes Research
search/research




Questions and related tools




Using Lippmannian Device aka Google scraper:
Resonance of controversial terms

• What are the relevant issues in the controversy?
• Where do controversial terms resonate?
EXAMPLE:
CLIMATE CHANGE SKEPTICS




Where do the skeptics get “air time”? Where are their audiences?


BBC cancels ‘Planet Relief’ program about environmental issues

“The only reason why this became an issue is that there is a small but
vociferous group of climate ‘skeptics’ lobbying agains taking action”

- BBC News, 5 september 2007




https://wiki.digitalmethods.net/Dmi/ClimateChangeSkeptics
example:
climate change skeptics




Query design: What are the sources?




Top 100 results for the query “climate change”




http://www.google.com/search?q="climate+change"&num=100
example:
climate change skeptics




Query design: What are the issues?




Derive list of climate change skeptics
Sources: motherjones.com, wikipedia.org, heartland.org

Compare the three lists and retain the skeptics that are mentioned in
at least two of the lists
example:
climate change skeptics




Skeptics

S. Fred Singer
Robert Balling
Sallie Baliunas
Patrick Michaels
Richard Lindzen
Steven Milloy
Timothy Ball
Paul Driessen
Willie Soon
Sherwood B. Idso
Frederick Seitz
example:
climate change skeptics




Google Scraper: Batch query Google




http://tools.issuecrawler.net/beta/scrapeGoogle

Enter sources in the top box
Enter keywords in the bottom box (mind the quotes)
Click “scrape Google”
Warning: excessive usage will bring this tool down
        Make sure to pay attention to query design

Body Text




Body text
http://www.google.com/search?num=100&q=%22S.+Fred+Singer
%22+site%3Aipcc.ch

http://www.google.com/search?num=100&q=%22Patrick+Michaels
%22+site%3Aipcc.ch
Body Text




Body text
Body Text




Body text
Body Text




Body text
Body Text




Body text
Body Text




Body text
Body Text




Body text
Body Text




Body text
Body Text




Body text
Body Text




Body text
Recap




Steps in method




Question
Query design
Google or Google Scraper
Tag clouds

More Related Content

PDF
Elsevier/Maryland Publishing Connect - 14_0331 (pdf)
PPTX
Reproducible Research in the Humanities
PDF
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
PPTX
Copyright edtc6340.66 april_canales#4
PPTX
Agile Science Ignite Talk Given at Health Foo 2013
PPTX
Online Search Techniques-Boolean Searching
PPTX
Google Master:Gain Research Power
PDF
Scientific Misconduct
Elsevier/Maryland Publishing Connect - 14_0331 (pdf)
Reproducible Research in the Humanities
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Copyright edtc6340.66 april_canales#4
Agile Science Ignite Talk Given at Health Foo 2013
Online Search Techniques-Boolean Searching
Google Master:Gain Research Power
Scientific Misconduct

Viewers also liked (8)

PDF
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
PDF
Digital Methods Tool Medley
PDF
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
PDF
Rogers studyingpoliticalissues mar2014_optimized_ii_
PDF
Digital Methods Summer School 2014 Tool Medley
PDF
Digital Methods Summer School 2015 Tool Medley
PDF
Rogers data days_2014_slides_opti
PPTX
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Digital Methods Tool Medley
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Rogers studyingpoliticalissues mar2014_optimized_ii_
Digital Methods Summer School 2014 Tool Medley
Digital Methods Summer School 2015 Tool Medley
Rogers data days_2014_slides_opti
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Ad

Similar to DMI Workshop: When Search Becomes Research (20)

PDF
Digital Literacy: Learning How to Search and Evaluate Information
PPT
SLA Summer 2008
PPT
Search Analytics: Conversations with Your Customers
PPT
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
PPT
Search Analytics for Fun and Profit
PPTX
RESEARCHING YOUR TOPIC_edit.pptx
PPTX
Learning How to Search and Evaluate Information
PPTX
Google, Products and Information Seraching
PPT
Google and google scholar
PPT
Google and google scholar
PDF
Creating & managing your scholarly web presence
PPTX
Google Tips and Tricks: ISTE 2014 Presentation
PDF
Evolution of Search
PDF
Beyond User Research
PPTX
Exploring Google Dorks for Ethical Hacking.pptx
PPTX
Big Data in NATO and Your Role
PPT
Finding information on the Web - methodology
PPTX
Basic Engineering Design (Part 2): Researching the Need
PDF
Gafe purde cse session - NEWEST
PPT
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
Digital Literacy: Learning How to Search and Evaluate Information
SLA Summer 2008
Search Analytics: Conversations with Your Customers
Using Search Analytics to Diagnose What’s Ailing your Information Architecture
Search Analytics for Fun and Profit
RESEARCHING YOUR TOPIC_edit.pptx
Learning How to Search and Evaluate Information
Google, Products and Information Seraching
Google and google scholar
Google and google scholar
Creating & managing your scholarly web presence
Google Tips and Tricks: ISTE 2014 Presentation
Evolution of Search
Beyond User Research
Exploring Google Dorks for Ethical Hacking.pptx
Big Data in NATO and Your Role
Finding information on the Web - methodology
Basic Engineering Design (Part 2): Researching the Need
Gafe purde cse session - NEWEST
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
Ad

More from Digital Methods Initiative (20)

PDF
Query Design for Digital Methods by Richard Rogers
PDF
Digital Methods by Richard Rogers
PDF
The Birth of Social Media Methods
PPTX
Interactive visualization and exploration of network data with Gephi
PDF
National Tracking Ecologies - Digital Methods Summer School 2013
PDF
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
PDF
Repurposing Wikipedia: Wikipedia as data set and analytical device
PDF
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
PDF
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
PDF
Digital Methods Summer School 2013 Tool Medley
PDF
Hashtag lifelines
KEY
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
PDF
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
KEY
Web Flags Summer School 2012
PDF
Dmi12 workshops - crawling and scraping
PDF
Digital Methods Tool Medley. Digital Methods Summer School 2012
PDF
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
PDF
DMI Workshop: Crawling and Scraping
PDF
DMI Workshop: Data visualization. Analytical clouding.
KEY
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
Query Design for Digital Methods by Richard Rogers
Digital Methods by Richard Rogers
The Birth of Social Media Methods
Interactive visualization and exploration of network data with Gephi
National Tracking Ecologies - Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Repurposing Wikipedia: Wikipedia as data set and analytical device
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Digital Methods Summer School 2013 Tool Medley
Hashtag lifelines
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Web Flags Summer School 2012
Dmi12 workshops - crawling and scraping
Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
DMI Workshop: Crawling and Scraping
DMI Workshop: Data visualization. Analytical clouding.
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...

Recently uploaded (20)

PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Pre independence Education in Inndia.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
01-Introduction-to-Information-Management.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Classroom Observation Tools for Teachers
PPTX
Institutional Correction lecture only . . .
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
master seminar digital applications in india
Microbial disease of the cardiovascular and lymphatic systems
Final Presentation General Medicine 03-08-2024.pptx
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Pre independence Education in Inndia.pdf
Renaissance Architecture: A Journey from Faith to Humanism
PPH.pptx obstetrics and gynecology in nursing
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
01-Introduction-to-Information-Management.pdf
TR - Agricultural Crops Production NC III.pdf
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Week 4 Term 3 Study Techniques revisited.pptx
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Abdominal Access Techniques with Prof. Dr. R K Mishra
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Classroom Observation Tools for Teachers
Institutional Correction lecture only . . .
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
master seminar digital applications in india

DMI Workshop: When Search Becomes Research

  • 1. When Search Becomes Research – Part I Query design Anat Ben-David Science, Technology and Society, Bar-Ilan University anatbd@gmail.com Erik Borra Digital Methods Initiative, University of Amsterdam erik@digitalmethods.net
  • 2. When Search Becomes Research Turning Google into a research tool “We look at Google search results and see society, instead of Google” (Rogers, Stevenson, Weltevrede 2009)
  • 3. Words/keywords When words become “issue language” Actors and their terminology: Keywords as positioning efforts Note program, anti-program as well as efforts at net neutrality (Cf. Akrich & Latour, 1992)
  • 4. Keywords and source sets “side by sidedness” offical / non-official … different kinds of actors (living side by side) in an issue map. (Rogers, 2004)
  • 10. example: Program / anti-program How is the issue of “google street view” and privacy is being treated when google- related sites are excluded from the search? 1. “Google street view” +privacy 33,100,000 results 2. “Google street view” +privacy site:google.* 1,840 results 3. “Google street view” +privacy –site:google.* 36,000,000 results
  • 11. search/research Search operators and syntax Use, for example: +, ~, OR, NOT, SITE:, “” See, for example: http://www.googleguide.com/category/query- input/
  • 12. example: Program / anti-program A query design example using advanced operators: • “~Cellular Phone” + “brain tumor” + “not associated” • “~Cellular Phone” + “brain tumor” + “270%” • Compare two queries across different actors. Add “site:.edu”, “site:.com”, .”site:.org”, etc.
  • 13. search/research Research protocol For using Google Google Settings: • For the “universal Google” go to http://google.com/ncr or http:// google.com/intl/en • Log out of your Gmail account • Google preferences: Set interface and search language SafeSearch: Off Google Instant: Off Nr of Results: 100 per page
  • 14. search/research Research protocol For using Google Clean browser • Log out • Clear cookies and the browser’s search history • Or: create a “research browser” (i.e. install a new one) “Turning off search history personalization”
  • 15. example: Program / anti-program A query design example using advanced operators: • “~Cellular Phone” + “brain tumor” + “not associated” • “~Cellular Phone” + “brain tumor” + “270%” • Compare two queries across different actors. Add “site:.edu”, “site:.com”, .”site:.org”, etc.
  • 16. Example: nationality of issues: Rights types Can the search engine be repurposed to show which rights are specific per country? Method 1. Query the term "rights" in national terminology per different Google country (e.g. ‘droits’ in .fr, ‘rechten’ in .nl) 2. Fetch the top 10 unique rights types. 3. Visualize top 10 issues per country and mark unique issues. https://wiki.digitalmethods.net/Dmi/NationalityofIssues
  • 18. search/research Research protocol Saving results for verification and retrieval • “Save page as” in the browser, name files and folder consistently • Collect right types in spreadsheet (incl. translation) • Merge results and collect saved files in one place
  • 23. search/research Questions and related tools Using Lippmannian Device aka Google scraper: Resonance of controversial terms • What are the relevant issues in the controversy? • Where do controversial terms resonate?
  • 24. EXAMPLE: CLIMATE CHANGE SKEPTICS Where do the skeptics get “air time”? Where are their audiences? BBC cancels ‘Planet Relief’ program about environmental issues “The only reason why this became an issue is that there is a small but vociferous group of climate ‘skeptics’ lobbying agains taking action” - BBC News, 5 september 2007 https://wiki.digitalmethods.net/Dmi/ClimateChangeSkeptics
  • 25. example: climate change skeptics Query design: What are the sources? Top 100 results for the query “climate change” http://www.google.com/search?q="climate+change"&num=100
  • 26. example: climate change skeptics Query design: What are the issues? Derive list of climate change skeptics Sources: motherjones.com, wikipedia.org, heartland.org Compare the three lists and retain the skeptics that are mentioned in at least two of the lists
  • 27. example: climate change skeptics Skeptics S. Fred Singer Robert Balling Sallie Baliunas Patrick Michaels Richard Lindzen Steven Milloy Timothy Ball Paul Driessen Willie Soon Sherwood B. Idso Frederick Seitz
  • 28. example: climate change skeptics Google Scraper: Batch query Google http://tools.issuecrawler.net/beta/scrapeGoogle Enter sources in the top box Enter keywords in the bottom box (mind the quotes) Click “scrape Google”
  • 29. Warning: excessive usage will bring this tool down Make sure to pay attention to query design Body Text Body text
  • 40. Recap Steps in method Question Query design Google or Google Scraper Tag clouds