ITEC, Klagenfurt University, Austria
What is a Hype and  Where Can I Get One? Mathias Lux [email_address] Department for Information Technology, Klagenfurt University, Austria
What is this about … Power Laws & Pareto Distributions Just a Theory? Conclusions ITEC, Klagenfurt University, Austria by betta_design http://www.flickr.com/photos/betta_design/2200198472/
The Long Tail Common for certain distributions Zipf‘s Law Power Law Pareto Distribution  In Web 2 Context Chris Anderson … ITEC, Klagenfurt University, Austria maitland 82 - http://www.flickr.com/photos/maitland82/346065497/
Zipf‘s Law Few events occur often, many occur rarely P n  ~ 1/n a  ... Frequency of the n th  ranked item, a close to 1. Prominent examples Ranking of words in documents Ranking of cities and their size Ranking of movies and sold cinema tickets …  and many more ITEC, Klagenfurt University, Austria
Zipf‘s Law Plot of the word frequency in Wikipedia Most popular:  the, of, and from http://en.wikipedia.org/wiki/Zipf's_law ITEC, Klagenfurt University, Austria
Pareto Distribution 80:20 Rule Economics Continous (Zipf is discrete) Practical issues Time Management, … ITEC, Klagenfurt University, Austria
Power Law Made famous by Albert Barabasi Scale free networks (web, power supply, …) In-degree  of web sites, etc. Defines actually a class of distributions f(x)=a*x^b + e Pareto and Zipf are part of the group  ITEC, Klagenfurt University, Austria
How to detect a power law? Simple empirical tests Draw points on a log-log plot Is it a „straight line“? ITEC, Klagenfurt University, Austria
How to detect a power law? Statistical Means E.g. KS-Test, Chi-Square Test Open  research  issue … See e.g. Clauset, A., Shalizi, C.R., Newman, M.E.J.:  Power-law distributions in empirical data. arXiv:0706.1062v1 (2007) ITEC, Klagenfurt University, Austria
A note on plots … ITEC, Klagenfurt University, Austria Taken from phun.org, tnx to enzo nadrag
A note on statistical means … http://www.phun.org/newspics/funny_friday/2538.jpg  tnx  to Enzo Nadrag ITEC, Klagenfurt University, Austria
Zipf, Pareto & Power Law: Conclusions They emerge when people are involved They have interesting characteristics Mean has virtually no information Area under the curve (cp. amazon’s long tail strategy) Power laws emerge somehow … Multiple generative models (preferntial attachement, memory kernels, etc.) No one knows for sure ITEC, Klagenfurt University, Austria
Is this just theory? Basically: YES! But there are related practical questions  Are you using Flickr? How many “interesting” photos did you publish? How many views do your photos have? Imagine you publish a video on YouTube What are the chances that your video is a big hit? How to “help out” the process of getting a big hit? Can one distinguish between hit or flop? ITEC, Klagenfurt University, Austria
Is this just theory? (2) More related practical questions  Do you have a website? How to “flat out” resource popularity? How select popular resources (e.g. for caching, adaptation, preprocessing)? ITEC, Klagenfurt University, Austria
Big hits on YouTube ITEC, Klagenfurt University, Austria ©  2007 by Aigner Thomas and Oraze Manuel
Getting popular … Starting with the first view (user) Some other users find the same resource They point other to it Blogging, Digging, word of mouth Multiplicator of information – cp. Metcalfe’s law Number of views (users) “explodes” ITEC, Klagenfurt University, Austria
Some graphs … ITEC, Klagenfurt University, Austria Data from del.icio.us Shows  bookmarks / day relative user count
Observations There is an initial bend in the curve The mean user # at the bend is rather small  Around 50 There are outliers Google Video was doomed  to be a success ITEC, Klagenfurt University, Austria
Conclusions If there is a bend … Chances are better for a big hit. Time is still an issue Slow start, long vs. short hype, etc. Resources without this bend: Better Chances that they are shelf warmers  Decision support for portfolio adaptation ITEC, Klagenfurt University, Austria
The Flickr way Flickr defined “Interestingness” Patented  combining views, comments, age, etc. Interesting photos are presented Users see new photos Not all photos (2.000 new / minute, checked Feb. 1 2008, ~ 11.oo UTC) They have no “big hit” ITEC, Klagenfurt University, Austria Kudos given to Horst Gutmann and Marian Kogler
The YouTube way Smaller resource data base than Flickr Around 45 videos a day (65.000 a day) But a lot more views (data Feb. 1st, 08) 73.245.607 for „Evolution of Dance“  20 most viewed have > 30M views Not obvious counter strategy  Might not (yet) be necessary ITEC, Klagenfurt University, Austria
Digg Assumption: Diggs also follow a power law Quite reasonable … How to avoid the Digg- effect? Digg has a mirror … ITEC, Klagenfurt University, Austria
Thanks ... ... for your  attention You are interested? Then talk to me … ITEC, Klagenfurt University, Austria by Gexydaf http://www.flickr.com/photos/gexydaf/2208215419/
Mathias Lux Affiliation Klagenfurt University, ITEC Contact mathias @ juggle.at http://www.semanticmetadata.net ITEC, Klagenfurt University, Austria

More Related Content

PDF
Gamification of Community Information Systems
PDF
The Legacy of ROLE - Where are we at the workplace?
PDF
The Legacy and the Future of Research Networks in Technology-Enhanced Learning
ODP
Actividad Terminal (Parte II)
PDF
Why information architects are needed in the kitchen?
PPTX
CBMI 2013 Presentation: User Intentions in Multimedia
PDF
User Intentions or "The other end of the camera ..."
PDF
Bignell_&_Cain_(2007)
Gamification of Community Information Systems
The Legacy of ROLE - Where are we at the workplace?
The Legacy and the Future of Research Networks in Technology-Enhanced Learning
Actividad Terminal (Parte II)
Why information architects are needed in the kitchen?
CBMI 2013 Presentation: User Intentions in Multimedia
User Intentions or "The other end of the camera ..."
Bignell_&_Cain_(2007)

Similar to Power Laws Popularity And Interestingness (16)

PDF
Tutorial 6 (web graph attributes)
PPTX
powerlaws.pptx
PPT
The Fundamentals of Market Share | Kyle Findlay
PPT
PPT
Intro To Power Laws (March 2008)
PPTX
SThAR_corporate_presentation_19-09-2009
PPT
Role of Scaling in Developing an Understanding of How Systems Work OR the dan...
PPTX
2010 06-08 chania stochastic web modelling - copy
PPTX
Power Laws and Rich-Get-Richer Phenomena
PPTX
Cascade Project
PDF
Master Minds on Data Science - Arno Siebes
PDF
Graph Evolution Models
PPTX
DWDM_UNIT5.pptxgggggggggggggggggggggggggg
PPTX
Big Data & The Role Analytics Can Play In Our Organizations
PPTX
Social Media Mining - Chapter 4 (Network Models)
PPTX
Probabilistic Interestingness Measures - An Introduction with Bayesian Belief...
Tutorial 6 (web graph attributes)
powerlaws.pptx
The Fundamentals of Market Share | Kyle Findlay
Intro To Power Laws (March 2008)
SThAR_corporate_presentation_19-09-2009
Role of Scaling in Developing an Understanding of How Systems Work OR the dan...
2010 06-08 chania stochastic web modelling - copy
Power Laws and Rich-Get-Richer Phenomena
Cascade Project
Master Minds on Data Science - Arno Siebes
Graph Evolution Models
DWDM_UNIT5.pptxgggggggggggggggggggggggggg
Big Data & The Role Analytics Can Play In Our Organizations
Social Media Mining - Chapter 4 (Network Models)
Probabilistic Interestingness Measures - An Introduction with Bayesian Belief...
Ad

More from dermotte (9)

PPTX
Invited Talk OAGM Workshop Salzburg, May 2015
PDF
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
PPTX
Why did you record this video?
PPTX
Content based image retrieval with LIRe
PPTX
Ohne LIRe keine Bildsuche
PDF
Callisto: Content Based Tag Recommendation for Images
PDF
Visual Information Retrieval
PDF
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
PPT
Aspects of broad folksonomies
Invited Talk OAGM Workshop Salzburg, May 2015
LIRE presentation at the ACM Multimedia Open Source Software Competition 2013
Why did you record this video?
Content based image retrieval with LIRe
Ohne LIRe keine Bildsuche
Callisto: Content Based Tag Recommendation for Images
Visual Information Retrieval
Using Visual Features to Improve Tag Suggestions in Image Sharing Sites :: pr...
Aspects of broad folksonomies
Ad

Recently uploaded (20)

PDF
OpenACC and Open Hackathons Monthly Highlights July 2025
PDF
A proposed approach for plagiarism detection in Myanmar Unicode text
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
DOCX
search engine optimization ppt fir known well about this
PDF
A review of recent deep learning applications in wood surface defect identifi...
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Enhancing plagiarism detection using data pre-processing and machine learning...
PDF
Five Habits of High-Impact Board Members
PDF
Getting started with AI Agents and Multi-Agent Systems
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
sustainability-14-14877-v2.pddhzftheheeeee
PPTX
2018-HIPAA-Renewal-Training for executives
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Credit Without Borders: AI and Financial Inclusion in Bangladesh
PDF
Consumable AI The What, Why & How for Small Teams.pdf
PDF
Improvisation in detection of pomegranate leaf disease using transfer learni...
PPTX
Benefits of Physical activity for teenagers.pptx
PPTX
TEXTILE technology diploma scope and career opportunities
OpenACC and Open Hackathons Monthly Highlights July 2025
A proposed approach for plagiarism detection in Myanmar Unicode text
A contest of sentiment analysis: k-nearest neighbor versus neural network
search engine optimization ppt fir known well about this
A review of recent deep learning applications in wood surface defect identifi...
Chapter 5: Probability Theory and Statistics
Enhancing plagiarism detection using data pre-processing and machine learning...
Five Habits of High-Impact Board Members
Getting started with AI Agents and Multi-Agent Systems
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Module 1.ppt Iot fundamentals and Architecture
sustainability-14-14877-v2.pddhzftheheeeee
2018-HIPAA-Renewal-Training for executives
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
1 - Historical Antecedents, Social Consideration.pdf
Credit Without Borders: AI and Financial Inclusion in Bangladesh
Consumable AI The What, Why & How for Small Teams.pdf
Improvisation in detection of pomegranate leaf disease using transfer learni...
Benefits of Physical activity for teenagers.pptx
TEXTILE technology diploma scope and career opportunities

Power Laws Popularity And Interestingness

  • 2. What is a Hype and Where Can I Get One? Mathias Lux [email_address] Department for Information Technology, Klagenfurt University, Austria
  • 3. What is this about … Power Laws & Pareto Distributions Just a Theory? Conclusions ITEC, Klagenfurt University, Austria by betta_design http://www.flickr.com/photos/betta_design/2200198472/
  • 4. The Long Tail Common for certain distributions Zipf‘s Law Power Law Pareto Distribution In Web 2 Context Chris Anderson … ITEC, Klagenfurt University, Austria maitland 82 - http://www.flickr.com/photos/maitland82/346065497/
  • 5. Zipf‘s Law Few events occur often, many occur rarely P n ~ 1/n a ... Frequency of the n th ranked item, a close to 1. Prominent examples Ranking of words in documents Ranking of cities and their size Ranking of movies and sold cinema tickets … and many more ITEC, Klagenfurt University, Austria
  • 6. Zipf‘s Law Plot of the word frequency in Wikipedia Most popular: the, of, and from http://en.wikipedia.org/wiki/Zipf's_law ITEC, Klagenfurt University, Austria
  • 7. Pareto Distribution 80:20 Rule Economics Continous (Zipf is discrete) Practical issues Time Management, … ITEC, Klagenfurt University, Austria
  • 8. Power Law Made famous by Albert Barabasi Scale free networks (web, power supply, …) In-degree of web sites, etc. Defines actually a class of distributions f(x)=a*x^b + e Pareto and Zipf are part of the group ITEC, Klagenfurt University, Austria
  • 9. How to detect a power law? Simple empirical tests Draw points on a log-log plot Is it a „straight line“? ITEC, Klagenfurt University, Austria
  • 10. How to detect a power law? Statistical Means E.g. KS-Test, Chi-Square Test Open research issue … See e.g. Clauset, A., Shalizi, C.R., Newman, M.E.J.: Power-law distributions in empirical data. arXiv:0706.1062v1 (2007) ITEC, Klagenfurt University, Austria
  • 11. A note on plots … ITEC, Klagenfurt University, Austria Taken from phun.org, tnx to enzo nadrag
  • 12. A note on statistical means … http://www.phun.org/newspics/funny_friday/2538.jpg tnx to Enzo Nadrag ITEC, Klagenfurt University, Austria
  • 13. Zipf, Pareto & Power Law: Conclusions They emerge when people are involved They have interesting characteristics Mean has virtually no information Area under the curve (cp. amazon’s long tail strategy) Power laws emerge somehow … Multiple generative models (preferntial attachement, memory kernels, etc.) No one knows for sure ITEC, Klagenfurt University, Austria
  • 14. Is this just theory? Basically: YES! But there are related practical questions Are you using Flickr? How many “interesting” photos did you publish? How many views do your photos have? Imagine you publish a video on YouTube What are the chances that your video is a big hit? How to “help out” the process of getting a big hit? Can one distinguish between hit or flop? ITEC, Klagenfurt University, Austria
  • 15. Is this just theory? (2) More related practical questions Do you have a website? How to “flat out” resource popularity? How select popular resources (e.g. for caching, adaptation, preprocessing)? ITEC, Klagenfurt University, Austria
  • 16. Big hits on YouTube ITEC, Klagenfurt University, Austria © 2007 by Aigner Thomas and Oraze Manuel
  • 17. Getting popular … Starting with the first view (user) Some other users find the same resource They point other to it Blogging, Digging, word of mouth Multiplicator of information – cp. Metcalfe’s law Number of views (users) “explodes” ITEC, Klagenfurt University, Austria
  • 18. Some graphs … ITEC, Klagenfurt University, Austria Data from del.icio.us Shows bookmarks / day relative user count
  • 19. Observations There is an initial bend in the curve The mean user # at the bend is rather small Around 50 There are outliers Google Video was doomed to be a success ITEC, Klagenfurt University, Austria
  • 20. Conclusions If there is a bend … Chances are better for a big hit. Time is still an issue Slow start, long vs. short hype, etc. Resources without this bend: Better Chances that they are shelf warmers Decision support for portfolio adaptation ITEC, Klagenfurt University, Austria
  • 21. The Flickr way Flickr defined “Interestingness” Patented combining views, comments, age, etc. Interesting photos are presented Users see new photos Not all photos (2.000 new / minute, checked Feb. 1 2008, ~ 11.oo UTC) They have no “big hit” ITEC, Klagenfurt University, Austria Kudos given to Horst Gutmann and Marian Kogler
  • 22. The YouTube way Smaller resource data base than Flickr Around 45 videos a day (65.000 a day) But a lot more views (data Feb. 1st, 08) 73.245.607 for „Evolution of Dance“ 20 most viewed have > 30M views Not obvious counter strategy Might not (yet) be necessary ITEC, Klagenfurt University, Austria
  • 23. Digg Assumption: Diggs also follow a power law Quite reasonable … How to avoid the Digg- effect? Digg has a mirror … ITEC, Klagenfurt University, Austria
  • 24. Thanks ... ... for your attention You are interested? Then talk to me … ITEC, Klagenfurt University, Austria by Gexydaf http://www.flickr.com/photos/gexydaf/2208215419/
  • 25. Mathias Lux Affiliation Klagenfurt University, ITEC Contact mathias @ juggle.at http://www.semanticmetadata.net ITEC, Klagenfurt University, Austria