The document outlines lessons learned from operating 2,000 event-driven microservices at Wix. It addresses challenges such as failures when producing messages, out-of-order processing, and sending large payloads. It describes the tools and techniques implemented for resilience, including the use of Kafka and the development of Greyhound, which provides a resilient producer and consumer retries. Overall, the experience has made the microservices better decoupled, more resilient, and more scalable while maintaining data consistency.