The document discusses beautifying data in the real world. It cites an estimate that the volume of data on the internet would reach nearly 1,000 exabytes by 2015, and it covers open notebook science, crowdsourced data, and the challenges of real-world data, such as noise and barriers to presentation. It also examines unique identifiers for chemicals and options for analyzing data. The document proposes using semantic web technologies such as RDF and SPARQL to build knowledge from beautified data and to uncover non-obvious relationships. It demonstrates data visualization through services such as Google Docs and Second Life.