Web log analysis is a standard procedure on most sites. As the number of visits grow this is one of the first practical applications of Big Data systems. The goal of the presentation is to demonstrate, on an example, how to build a system to analyse web logs. As a basic tool I'm suggestion Cloudera CDH, as a tool for data collection StreamSets, and for keeping I suggest parallel storage in two formats: Tab separated for analysis, and ElasticSearch with Kibana front-end for quick insights and dashboards.
Related topics: