Twitter replicates petabytes of data from its Hadoop clusters to Google Cloud Storage daily using a data replicator architecture. The replicator copies data in hourly/daily partitions from source clusters to destination clusters and Google Cloud Storage, maintaining consistent access for users through a unified file system abstraction and data access layer. This replication enables unlocking analytics tools on Google Cloud Platform and provides a backup of Twitter data on cloud object storage.