SlideShare a Scribd company logo
PHPUK Data Web Data Science Data Mining BIG BIG BIG BIG
Agenda BIG DATA WEB
BIG DATA SCIENCE
BIG DATA MINING
SUMMARY
com
Web2.0 Moores Law Economics Bandwidth SOCIAL open/share
 
Data web Web 2.0 + mobile Cloud Computing Data Science
In practice Existing data
Always working
Every webpage personalized
DataWeb Summary Data -  expanding fast rate
Economic free cloud
Personalization real time
Science applied to society
Data Science What is data science?
Data lifecycle
Case studies
What is data science? Combines three areas
Engineering
Mathematics Statistics ML
Communication PP to infographic, product, API
Data lifecycle Comes from? Data conditioning Scale Tell a story Intelligence
Case Studies Range of perspectives
Cloudera
Bitly
LinkedIN
e-commerce
Cloudera Jeff Hammerbacher
http://jeffhammerbacher.com/
Video
http://www.cloudera.com/?resource=orbitz-ideas-jeff-hammerbacher-evolving-new-analytical-platform-apache-hadoop
Enterprise side – Dataspaces
Bitly Hilary Mason

More Related Content

PPTX
Cloud computing Sustainabilty
PPT
dataportability barcampScotland2008
PPTX
Roy zemi museums
PPTX
Pablo henrique de oliveira
PPTX
Presentación
PPTX
Yu & Big Pechakucha
PPTX
Sumie, Natsumo, Rina Pechakucha
PPTX
Rapee & Jo Pechakucha
Cloud computing Sustainabilty
dataportability barcampScotland2008
Roy zemi museums
Pablo henrique de oliveira
Presentación
Yu & Big Pechakucha
Sumie, Natsumo, Rina Pechakucha
Rapee & Jo Pechakucha

Similar to Big dataweb, science, mining (20)

PPSX
Intro to Data Science Big Data
PPTX
Big dataorig
PPTX
BI, AI/ML, Use Cases, Business Impact and how to get started
PDF
00-01 DSnDA.pdf
PDF
PPTX
Big Data and the Art of Data Science
PPTX
Introduction to data science
PPTX
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
PDF
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
PPTX
In-Depth Data Analytics
PPTX
Big Data and Data Science: The Technologies Shaping Our Lives
PDF
Big Data & Social Analytics presentation
PPTX
What is Data Science
PPTX
Introduction to Big Data and Data Science
PDF
EDF2013: Big Data Tutorial: Marko Grobelnik
PDF
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
PPT
Big data
PDF
Data science-Introductions-Real World Application
PDF
Data Scientist Toolbox
PDF
DMTM 2015 - 02 Data Mining
Intro to Data Science Big Data
Big dataorig
BI, AI/ML, Use Cases, Business Impact and how to get started
00-01 DSnDA.pdf
Big Data and the Art of Data Science
Introduction to data science
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
In-Depth Data Analytics
Big Data and Data Science: The Technologies Shaping Our Lives
Big Data & Social Analytics presentation
What is Data Science
Introduction to Big Data and Data Science
EDF2013: Big Data Tutorial: Marko Grobelnik
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
Big data
Data science-Introductions-Real World Application
Data Scientist Toolbox
DMTM 2015 - 02 Data Mining
Ad

More from James Littlejohn (20)

PDF
LKNhealth.org
ODP
Introduction to Idea
ODP
IofT Edinburgh Meetup + blockchain science health wearable
PPTX
Vision for a health blockchain
ODP
Dsensor.org peer to peer science
ODP
Peer to Peer Science - Dsensor.org
ODP
ProjectSAFE London
ODP
MAIDSAFE Installer DEMO Project SAFE London
ODP
Dsensor.org Programmable Science
ODP
Dapps for Web Developers Aberdeen Techmeetup
ODP
Currency money & post money
ODP
Hands on BDD Javascript
ODP
QS Techmeetup Aberdeen
ODP
Open Source Free(DOM)
ODP
MightyMeetup Webapps talk
ODP
Wanttobe.org.uk
PPT
LifestyleLinking Open Source Project
PPTX
comparetheuniversities
PPT
Volunteer report card - charity hack
PPT
beginners guide to semantic web barcamGlasgow2
LKNhealth.org
Introduction to Idea
IofT Edinburgh Meetup + blockchain science health wearable
Vision for a health blockchain
Dsensor.org peer to peer science
Peer to Peer Science - Dsensor.org
ProjectSAFE London
MAIDSAFE Installer DEMO Project SAFE London
Dsensor.org Programmable Science
Dapps for Web Developers Aberdeen Techmeetup
Currency money & post money
Hands on BDD Javascript
QS Techmeetup Aberdeen
Open Source Free(DOM)
MightyMeetup Webapps talk
Wanttobe.org.uk
LifestyleLinking Open Source Project
comparetheuniversities
Volunteer report card - charity hack
beginners guide to semantic web barcamGlasgow2
Ad

Recently uploaded (20)

PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Approach and Philosophy of On baking technology
PPT
Teaching material agriculture food technology
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Big Data Technologies - Introduction.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
KodekX | Application Modernization Development
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Machine learning based COVID-19 study performance prediction
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Encapsulation theory and applications.pdf
PPTX
Cloud computing and distributed systems.
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
MIND Revenue Release Quarter 2 2025 Press Release
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Approach and Philosophy of On baking technology
Teaching material agriculture food technology
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Big Data Technologies - Introduction.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KodekX | Application Modernization Development
Programs and apps: productivity, graphics, security and other tools
Machine learning based COVID-19 study performance prediction
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Encapsulation theory and applications.pdf
Cloud computing and distributed systems.
Understanding_Digital_Forensics_Presentation.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
MIND Revenue Release Quarter 2 2025 Press Release

Big dataweb, science, mining

Editor's Notes

  • #2: Welcome thank for invite, background, assumed read profile First talk, as an entrepreneur through n in at the deepend always good, make sure you learn to swim fast.
  • #3: 10min set the science 10min what is data science and review of characters in the industry, what saying whats being leartn, OPEN source 20 hands on code. 10 min Q&A
  • #4: Started with a dot, physists tells you a big bang! Data story began. .com commercial, transaction focus, e-commerce automation, mechanical Burst – thinking continueium – web2.0
  • #5: Send it to friends, family, share openess story build application Infractucture – econonmics Pace of data – bandwidth FB Zuckerberg open share ration growing faster than more law. What happens as we cycle through this and speeds up – DATA web web2.0 squared web3.0 . . . .
  • #7: Open and share accellerate (privacy debate – wont go there) How is could difference from moore law, that plus more – hadoop, more to go in the cloud, don t want per hour, want what I need, NOSQL, data portability etc. Data science- what does it all mean?
  • #9: Re cap and make conclusions
  • #10: Live state of physics 1800 Chairman Google Community rallying around Data Science, strataconf. Structure, local meetups How does data live? Characters in the industry, I ve been reading about, useful to link to post get started.
  • #11: Add three hats graphics yellow hard hat, prof hat and marketing hat! Dave mccure!
  • #12: Data Flow Clean keep up to date include new? (big problem? If data with answer is not included, doesn 't matter how smart you DM is !) Algorithm – magic Present -communicate, API portable, feedback loop, etc
  • #13: Range of business, infrastructure hadoop cloudera, business linkedin, amazon e-commerce, health everything LL me Link into data mining,
  • #14: Infrastructure stack
  • #15: Cross source view of world
  • #17: Amazon and ebay talk tomorrow keynote
  • #18: Yahoo meetup James Sarwoski Wisdom of the Crowd book, prediction markets, choice bet with money better, what if replace bet with money with bet with your life? Need to measure life? Set hypthosis – test. Need curiosity to apply ideas Smart on our own – smarter networked? Only live life in real time Lots of 'path' already worn
  • #19: Next push of the web? Start up to existing need skill set, education market adopting to skill up work place Picture of a cat, = curiosity
  • #21: Picuture small med large show different level of granularity of data What hypothsisi are you trying to ask? Lets go and see what each is usfeul for?
  • #22: Show live site stats Need to get screen shot
  • #23: Got chrome or FF Code open files Story show class of data lifecycle, clean, make wise, UI API RDF Example, choices made, two words limit 50 FREQUENCY PLAYING GOT image assumption try and crowd source everything, getting start, re start once started Use Couch DB to show top50 May change two words or limit to 100? Trade off with speed We know what the answer will look like? Just getting there. Not always awere choice made, frequency of matching, weights attached 'Rule' be consistent Could be better but is quantums better than what we have Learn by doing ie learn be accident! 'play god slide'
  • #33: Dave winer not so much data for and against, to be use to make what we need.
  • #34: Speak on conf. On future of language, our job to pursudate in data science ie this direction