SlideShare a Scribd company logo
Messing with DataTony Hirst,Dept of Communication and Systems,The Open University#lak11
How can non-programmers do the sort of things that we might normally think we need a developer to do?
“Data Is A Dish Best Served Raw”http://eagereyes.org/
ND
DiscoveryAcquisition as dataRepresentationCleansing(Visual) Analysis
APIs, screenscraping and standardised document formats
RepresentationStructured OR or unstructuredHuman readable (text) or opaque document format?
Google Spreadsheets as a database
Lak11 ws-messing withdata
=QUERY('ISO Country Codes'!A2:B268,                 "select A,B where A contains 'FRANCE' limit 1")))
Lak11 ws-messing withdata
Many Eyes
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Visualising Networks
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Lak11 ws-messing withdata
[ Context – resource discovery in special interest areas ]
http://flic.kr/p/9mvfHh
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Lak11 ws-messing withdata
Put the data to work: Google CSEblogroll
Lak11 ws-messing withdata
More views ondelicious…
Graph of folk who follow three or more people (ish) who tweeted a link to soti/rin blog post
Lak11 ws-messing withdata
Data Cleansing
Google Refine
Stanford Data Wrangler
Google Analytics
http://code.google.com/apis/analytics/docs/gdata/gdataExplorer.htmlhttp://bit.ly/fzj9yv
Lak11 ws-messing withdata
OUseful.info

More Related Content

PDF
Web Scraping
PPTX
Web scraping
PPT
New information for new journalists pt2: data
PDF
Web Data Extraction: A Crash Course
PDF
Fairhair.ai – alan turing institute june '17 (public)
PPTX
Farirhair.ai: AI platform to mine competitive intelligence from billions of u...
PDF
Semantic Web Introduction - a perspective of data annotations
PPTX
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis
Web Scraping
Web scraping
New information for new journalists pt2: data
Web Data Extraction: A Crash Course
Fairhair.ai – alan turing institute june '17 (public)
Farirhair.ai: AI platform to mine competitive intelligence from billions of u...
Semantic Web Introduction - a perspective of data annotations
Big Data Cloud June 3rd Meetup - Presentation by Mark Davis

Similar to Lak11 ws-messing withdata (20)

PDF
Data Tools cosystem_for_non_programmers
PDF
Data tools ecosystem for non-programmers
PDF
Data Science: Harnessing Open Data for High Impact Solutions
PDF
Data Journalism 2: cleaning, combining, communicating
PDF
The data we want
PPTX
Onlineinfo2012 - Scraping
PPTX
Data Liberation - Tony Hirst
PPT
Why Open Data?
PDF
Keeping Identity Graphs In Sync With Apache Spark
PPTX
Google refine from a business perspective
PPTX
Google refine from a business perspective
PPTX
Google refine tutotial
PDF
Data Journalism 2: Interrogating, Visualising and Mashing
PPTX
Big data 101
PPTX
Google refine tutotial
PPTX
Google refine tutotial
PDF
Managing Connected Big Data in Art with Neo4j Graph Database - Lorenzo Speran...
PPTX
Department of Commerce App Challenge: Big Data Dashboards
PDF
Introduction to Open Data and Data Science
PPTX
Google refine tutotial
Data Tools cosystem_for_non_programmers
Data tools ecosystem for non-programmers
Data Science: Harnessing Open Data for High Impact Solutions
Data Journalism 2: cleaning, combining, communicating
The data we want
Onlineinfo2012 - Scraping
Data Liberation - Tony Hirst
Why Open Data?
Keeping Identity Graphs In Sync With Apache Spark
Google refine from a business perspective
Google refine from a business perspective
Google refine tutotial
Data Journalism 2: Interrogating, Visualising and Mashing
Big data 101
Google refine tutotial
Google refine tutotial
Managing Connected Big Data in Art with Neo4j Graph Database - Lorenzo Speran...
Department of Commerce App Challenge: Big Data Dashboards
Introduction to Open Data and Data Science
Google refine tutotial
Ad

More from Tony Hirst (20)

PPTX
15 in 20 research fiesta
PPTX
Dev8d jupyter
PPTX
Ili 16 robot
PDF
Jupyternotebooks ou.pptx
PDF
Virtual computing.pptx
PPTX
ouseful-parlihacks
PDF
Gors appropriate
PPTX
Gors appropriate
PPTX
Robotlab jupyter
PDF
Fco open data in half day th-v2
PPTX
Notes on the Future - ILI2015 Workshop
PPTX
Community Journalism Conf - hyperlocal data wire
PPTX
Residential school 2015_robotics_interest
PPTX
Data Mining - Separating Fact From Fiction - NetIKX
PPTX
Week4
PPTX
A Quick Tour of OpenRefine
PPTX
Conversations with data
PPTX
Data reuse OU workshop bingo
PPTX
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
PDF
Lincoln jun14datajournalism
15 in 20 research fiesta
Dev8d jupyter
Ili 16 robot
Jupyternotebooks ou.pptx
Virtual computing.pptx
ouseful-parlihacks
Gors appropriate
Gors appropriate
Robotlab jupyter
Fco open data in half day th-v2
Notes on the Future - ILI2015 Workshop
Community Journalism Conf - hyperlocal data wire
Residential school 2015_robotics_interest
Data Mining - Separating Fact From Fiction - NetIKX
Week4
A Quick Tour of OpenRefine
Conversations with data
Data reuse OU workshop bingo
Inspiring content - You Don't Need Big Data to Tell Good Data Stories
Lincoln jun14datajournalism
Ad

Recently uploaded (20)

PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Computing-Curriculum for Schools in Ghana
PDF
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
PDF
Trump Administration's workforce development strategy
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PDF
A systematic review of self-coping strategies used by university students to ...
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Empowerment Technology for Senior High School Guide
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
IGGE1 Understanding the Self1234567891011
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
1_English_Language_Set_2.pdf probationary
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PPTX
History, Philosophy and sociology of education (1).pptx
Final Presentation General Medicine 03-08-2024.pptx
Computing-Curriculum for Schools in Ghana
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
Trump Administration's workforce development strategy
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
A systematic review of self-coping strategies used by university students to ...
Final Presentation General Medicine 03-08-2024.pptx
Empowerment Technology for Senior High School Guide
Orientation - ARALprogram of Deped to the Parents.pptx
Supply Chain Operations Speaking Notes -ICLT Program
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Paper A Mock Exam 9_ Attempt review.pdf.
IGGE1 Understanding the Self1234567891011
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
1_English_Language_Set_2.pdf probationary
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
A powerpoint presentation on the Revised K-10 Science Shaping Paper
History, Philosophy and sociology of education (1).pptx

Lak11 ws-messing withdata