This document discusses how Eway uses big data and cloud computing. It describes how Eway collects raw transaction data using Google Cloud Compute Engine instances, converts the data to Apache Parquet files for efficient storage and querying, uploads the files to Google Cloud Storage, and then explores the data using tools like Google Cloud Datalab and Grafana/PowerBI for analytics and visualization. The goals are to generate business and operational insights, power product features, and monitor systems. Principles for building such a data system include keeping it simple, avoiding duplication, having single responsibilities, and focusing on scalability and cost efficiency.
Related topics: