SlideShare a Scribd company logo
Building a Healthy Elasticsearch Ecosystem
—— How Grab use ES and cloud technologies among teams with
different use cases
1
• About Grab
• Beyond Logs
• Among Tech Family
• Inside Cloud
Grab: Building a Healthy Elasticsearch Ecosystem
Grab: Building a Healthy Elasticsearch Ecosystem
Beyond Logs
—— Solving challenging use cases with ES
Confidential - For Internal Use Only -Confidential - For Internal Use Only -
> 15
Online
Service
~ 10
Tech Family
Multiple challenging use cases
Use case types
Search Logs Monitoring Support
> 24
Prod Cluster
7
Solving interesting real life problems with Elasticsearch
8
Our Challenges
CBD area there are a lot of one-way lanes
Pickup / dropoff is usually single lane
Point of Interest Problem
9
Our Challenges
Limited time for waiting and high cost if we miss it
Point of Interest Problem
CBD area there are a lot of one-way lanes
Pickup / dropoff is usually single lane
10
Point of Interest Problem
ES Solution
Flatten ES data as much as possible.
Avoid nested query
Store hierarchy in relational database
Adding cache to decrease latency for search
Our Challenges
Limited time for waiting and high cost if we miss it
CBD area there are a lot of one-way lanes
Pickup / dropoff is usually single lane
11
Food Merchant Operational Hours Challenge
Merchant has different operational hours
Our Challenges
12
Food Merchant Operation Time Challenge
Multiple ops-hours for different meal times
Ops-hours and service might be different everyday
Our Challenges
Merchant has different operational hours (ops-hours)
13
Food Merchant Operation Time Challenge
Time Fragment, search fragment with term query
Range data type and use range query.
ES Solution
Multiple ops-hours for different meal times
Ops-hours and service might be different everyday
Our Challenges
Merchant has different operational hours (ops-hours)
14
Food Merchant Operation Time Challenge
Delivery radius varies for different merchants
Jakarta
Terrain impact the delivery difficulty
Sagaing
Geoshapes datatype,geoshape search
Time Fragment, search fragment with term query
Range data type and use range query.
ES Solution
Multiple ops-hours for different meal times
Ops-hours and service might be different everyday
Our Challenges
Merchant has different operational hours (ops-hours)
Among Tech Family
—— Connect different teams with ES
Confidential - For Internal Use Only -Confidential - For Internal Use Only -
> 15
Services
~ 10
Tech Families
User Group Demographic
Engineer DA/DS Ops CE
> 6
User Groups
Solving search problem in different case
Different production requirement
Customised solution for different services
Access control between department
Deal with different user pain point.
Consider different learning curve
Provide relational data for DA / DS
Automation for Ops team
Grab ES Summary
17
Fraud Engineer
—— Online fraud detection
—— They have huge amount of data
—— They focus on searching in a large
portion of data
—— They would require scalability from ES.
ENGINEER
Service Engineer
—— They mainly handle online service.
—— Usually they have strong technical
background
—— The focus on stability, observability,
log management for troubleshoot.
—— The service QPS always grow with
business.
Our Customers
18
Our Customers
CE
Customer Service
—— Real-time user history inquiry
—— Easy and user friendly tools
—— Low QPS but latency sensitive
19
OPS
DBA
—— Troubleshooting by searching logs
—— Alert / Monitoring other database
—— Internal tool usage
SRE
—— Deploy timeline to track last changes
—— Storing application log
Our Customers
20
DATA ANALYST
Product
—— Historical data searching
Data analyst
—— Daily report from data warehouse
—— Very complex aggregation and query
Our Customers
21
INFO SEC
User activities
—— Audit internal user activity
—— Audit user DB access
—— Audit query executed on the database
Our Customers
22
DATA ENGINEER
Production -> Data Warehouse
—— Data transform and clean up
—— Deal with complex SQL from DA, PA.
—— Daily report generated from data lake.
Our Customers
Cluster Health?
How to protect from bad read
—— Reads will affect each other.
—— Bad query is not controllable
and can not be throttled
How to manage read
—— Separate critical / non-critical
read.
Access Control?
Security concern
—— User should only access their cluster.
—— Separate service user and individual user
management.
What’s the concern?
Manage user group
—— Different users should see different content.
—— Access will be defined based on user group
Data stratification
Hot-cold layer (Read Separate)
—— Cold layer -> offline query /
aggregation.
—— Downstream pipeline from
hot to cold layer.
API service (Read Priority)
—— Micro service help other
engineer focus on business logic.
Our Solution
—— Role base access
control
Our Solution
Access Category
Issue different VPNs
—— Control the amount of people
to access the subnet.
Jumpcloud
—— Integrate individual user
access with company user group.
—— Automate the user access
application process
26
Service
How do we solve all these problem step by step?
27
Service
Stability:
—— Load balancer to keep up time.
—— ECE container service to keep
node up 24hrs.
Monitoring:
—— Shared Monitoring:
Cloudwatch, adjacent monitoring
tools for L1 troubleshoot.
Divide and conquer
28
Eng & CE
Security Control:
—— Accessing using engineer vpn.
—— Apply access by User group.
—— Kibana provide easy query UI
which reduce difficulty to use.
Doorman:
—— Provide temporary cluster
access
—— Engineer own the
access management
Divide and conquer
29
Ops & SRE
Monitoring:
—— ECE cluster metric monitoring.
—— Ingest to ELK to store service
logs
Management:
—— Automation tools: Protal, deal
with cluster creation, index/
mapping management.
Divide and conquer
30
Analyst
Data Warehouse:
—— Build down stream data
pipeline.
—— Data transform on ES data.
—— Provide SQL to access data
Load Testing:
—— Get production data from
warehouse for load testing.
—— Require InfoSec approval
Divide and conquer
Inside Cloud
—— Leveraging cloud technology
How we evolve
How we evolve
2017
Open Source stage
Heavy operation
—— AWS EC2, apply for weeks.
—— IP world, login and start the
service
—— Oncall nightmare, login to
recover nodes.
Limited cluster
—— 1 critical production cluster
—— 1 none-critical cluster.
—— Several playground clusters.
How we evolve
2017
Exploring stage
1 Month 1 cluster
—— More team are exploring ES.
—— Need to explore a manageable solution.
AWS ES
—— A manageable solution.
—— Save oncall’s life with managed
infrastructure.
—— Several playground clusters.
2018Q1
How we evolve
2017
ECE small deployment
ECE as our next solution
—— Manageable private infrastructure
—— Use Elasticsearch as what we learn on
the official website.
—— Learning how to use ECE.
Mgrating to ECE
—— Data transfer script
—— Lots of communication with service
team on migration steps.
2018Q1
2018Q3
How we evolve
2017
ECE Median deployment
From 3 clusters to 30 clusters
—— Scale ECE to get stable performance
—— Upgrade ECE to get newest features.
—— Almost 0 downtime for 1 year.
—— We learn a lot from Elasticsearch
support.
Integrate ECE with Grab
—— Automate ECE operation
—— Integrate ECE with our Portal
—— Integrate ECE with Cloud service
2018Q1
2018Q3
2019Q3
37
Coupling Useful Cloud Service with ES
Enhance
—— ES provide invert
index, which facilitate
adding index to dataset
—— Export RDS logs to
troubleshoot
—— Add cache for ES
search layer
—— Build ES down
stream
—— Indexing
—— Logs
—— Search cache
—— Format
—— Analysis
—— Pipeline
Health
Data Flow
DB cooperation
Multi-layer Monitoring
38
RDS
Database generate
slowlog, error logs. Find
the problem by seeing
this with Cloudwath. 

However, it’s lack of
searching capability and
visualization.
Relation DB
CW
Cloud monitoring
can collect logs from
multiple services and
store in one monitoring
service.

Still, the UI is not
good for searching and
lack of customized
alerting.
Cloud
Monitoring
Lambda
Every log will
trigger a piece of code,
which handle the RDS
log to ES or slack.

Its depends on
event types, then it will
decide the way to deal
with logs.
Serverless
ES
Pipeline send logs
/ events to ES. If it’s
indexed properly, users
should be able to use
Kibana to search the
logs.
Elasticsearch
Alerting
Watcher alerting
did a lot of work to
integrate with those
popular alerts tools. We
can config the actions
to send the alerts
properly.
Alerts
Coupling Useful Cloud Service with ES - Example
Self-service Platform
—— Next stage of providing search in Grab
Thank You
谢谢
Arkun
Terima Kasih
Khob Chai
ขอขอบคุณ
Cảm ơn bạn
eက#$ဇ&$တင)ပ+တယ)

More Related Content

PPTX
Blockchain in Audit
PDF
►TOP 13 • Blockchain Use Cases
PDF
PPTX
Spotlight on Financial Services with Calypso and SAP ASE
PPTX
The Power of Python :: How It Can Help With Technical SEO | Bristol SEO May 2...
PPTX
Mail flow in Exchange Online
PPTX
Blockchain 101 + Use Cases + Why Blockchain As a Service
PDF
What is a blockchain?
Blockchain in Audit
►TOP 13 • Blockchain Use Cases
Spotlight on Financial Services with Calypso and SAP ASE
The Power of Python :: How It Can Help With Technical SEO | Bristol SEO May 2...
Mail flow in Exchange Online
Blockchain 101 + Use Cases + Why Blockchain As a Service
What is a blockchain?

What's hot (15)

PPTX
On-Page SEO Techniques for 2022
PDF
Webinar digitally transforming healthcare with blockchain
PDF
Blue Ribbon Mastermind Presentation
PPT
Cryptocurrency
PDF
Scaling Sales: Growth Strategies Of The Fastest Growing Internet Retailers
PPT
Change Cycle
PDF
Mobile Is Eating the World (2015)
PPTX
Murex training | Murex online video tutorial
DOCX
Types of blockchain
PPTX
Bitcoin
PDF
SEO + Content Marketing Best Practices
PDF
Salesforce Marketing Cloud: Creating 1:1 Journeys
PDF
Conversion Conference Las Vegas 2017
PDF
Mass Conversions | Product Mix | Conversion Optimization
PDF
Key Benefits Of Salesforce Mobile Applications
On-Page SEO Techniques for 2022
Webinar digitally transforming healthcare with blockchain
Blue Ribbon Mastermind Presentation
Cryptocurrency
Scaling Sales: Growth Strategies Of The Fastest Growing Internet Retailers
Change Cycle
Mobile Is Eating the World (2015)
Murex training | Murex online video tutorial
Types of blockchain
Bitcoin
SEO + Content Marketing Best Practices
Salesforce Marketing Cloud: Creating 1:1 Journeys
Conversion Conference Las Vegas 2017
Mass Conversions | Product Mix | Conversion Optimization
Key Benefits Of Salesforce Mobile Applications
Ad

Similar to Grab: Building a Healthy Elasticsearch Ecosystem (20)

PDF
Better Search and Business Analytics at Southern Glazer’s Wine & Spirits
PDF
Bandwidth: Use Cases for Elastic Cloud on Kubernetes
PDF
Five Years of EC2 Distilled
PPTX
Achieve big data analytic platform with lambda architecture on cloud
PDF
Observability at scale: Hear from the Elastic Cloud SRE team
PDF
Building data intensive applications
PDF
Taking Care of Business at Office Depot with Elastic Cloud Enterprise
PPTX
Migrating enterprise workloads to AWS
PDF
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
PPTX
analytic engine - a common big data computation service on the aws
PDF
Using AWS Elasticsearch for fast feedback on business data
PPTX
Andrew May - Getting Certified for Fun and Profit
PDF
What is Amazon Web Services & How to Start to deploy your apps ?
PDF
Log Analytics with AWS
PDF
Learn about AWS Certifications - Andrew May, Columbus
PDF
Big data and Analytics on AWS
PDF
AWS Training.pdf
PDF
AWS Training.pdf
PDF
Introduction to Amazon Web Services
PDF
Dean Bryen: Scaling The Platform For Your Startup
Better Search and Business Analytics at Southern Glazer’s Wine & Spirits
Bandwidth: Use Cases for Elastic Cloud on Kubernetes
Five Years of EC2 Distilled
Achieve big data analytic platform with lambda architecture on cloud
Observability at scale: Hear from the Elastic Cloud SRE team
Building data intensive applications
Taking Care of Business at Office Depot with Elastic Cloud Enterprise
Migrating enterprise workloads to AWS
How eStruxture Data Centers is Using ECE to Rapidly Scale Their Business
analytic engine - a common big data computation service on the aws
Using AWS Elasticsearch for fast feedback on business data
Andrew May - Getting Certified for Fun and Profit
What is Amazon Web Services & How to Start to deploy your apps ?
Log Analytics with AWS
Learn about AWS Certifications - Andrew May, Columbus
Big data and Analytics on AWS
AWS Training.pdf
AWS Training.pdf
Introduction to Amazon Web Services
Dean Bryen: Scaling The Platform For Your Startup
Ad

More from Elasticsearch (20)

PDF
An introduction to Elasticsearch's advanced relevance ranking toolbox
PDF
From MSP to MSSP using Elastic
PDF
Cómo crear excelentes experiencias de búsqueda en sitios web
PDF
Te damos la bienvenida a una nueva forma de realizar búsquedas
PDF
Tirez pleinement parti d'Elastic grâce à Elastic Cloud
PDF
Comment transformer vos données en informations exploitables
PDF
Plongez au cœur de la recherche dans tous ses états.
PDF
Modernising One Legal Se@rch with Elastic Enterprise Search [Customer Story]
PDF
An introduction to Elasticsearch's advanced relevance ranking toolbox
PDF
Welcome to a new state of find
PDF
Building great website search experiences
PDF
Keynote: Harnessing the power of Elasticsearch for simplified search
PDF
Cómo transformar los datos en análisis con los que tomar decisiones
PDF
Explore relève les défis Big Data avec Elastic Cloud
PDF
Comment transformer vos données en informations exploitables
PDF
Transforming data into actionable insights
PDF
Opening Keynote: Why Elastic?
PDF
Empowering agencies using Elastic as a Service inside Government
PDF
The opportunities and challenges of data for public good
PDF
Enterprise search and unstructured data with CGI and Elastic
An introduction to Elasticsearch's advanced relevance ranking toolbox
From MSP to MSSP using Elastic
Cómo crear excelentes experiencias de búsqueda en sitios web
Te damos la bienvenida a una nueva forma de realizar búsquedas
Tirez pleinement parti d'Elastic grâce à Elastic Cloud
Comment transformer vos données en informations exploitables
Plongez au cœur de la recherche dans tous ses états.
Modernising One Legal Se@rch with Elastic Enterprise Search [Customer Story]
An introduction to Elasticsearch's advanced relevance ranking toolbox
Welcome to a new state of find
Building great website search experiences
Keynote: Harnessing the power of Elasticsearch for simplified search
Cómo transformar los datos en análisis con los que tomar decisiones
Explore relève les défis Big Data avec Elastic Cloud
Comment transformer vos données en informations exploitables
Transforming data into actionable insights
Opening Keynote: Why Elastic?
Empowering agencies using Elastic as a Service inside Government
The opportunities and challenges of data for public good
Enterprise search and unstructured data with CGI and Elastic

Recently uploaded (20)

PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Machine learning based COVID-19 study performance prediction
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Cloud computing and distributed systems.
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
KodekX | Application Modernization Development
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPT
Teaching material agriculture food technology
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Network Security Unit 5.pdf for BCA BBA.
Review of recent advances in non-invasive hemoglobin estimation
Machine learning based COVID-19 study performance prediction
Chapter 3 Spatial Domain Image Processing.pdf
Cloud computing and distributed systems.
GamePlan Trading System Review: Professional Trader's Honest Take
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Spectral efficient network and resource selection model in 5G networks
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Reach Out and Touch Someone: Haptics and Empathic Computing
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
KodekX | Application Modernization Development
MYSQL Presentation for SQL database connectivity
Mobile App Security Testing_ A Comprehensive Guide.pdf
Teaching material agriculture food technology
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Grab: Building a Healthy Elasticsearch Ecosystem

  • 1. Building a Healthy Elasticsearch Ecosystem —— How Grab use ES and cloud technologies among teams with different use cases 1
  • 2. • About Grab • Beyond Logs • Among Tech Family • Inside Cloud
  • 5. Beyond Logs —— Solving challenging use cases with ES
  • 6. Confidential - For Internal Use Only -Confidential - For Internal Use Only - > 15 Online Service ~ 10 Tech Family Multiple challenging use cases Use case types Search Logs Monitoring Support > 24 Prod Cluster
  • 7. 7 Solving interesting real life problems with Elasticsearch
  • 8. 8 Our Challenges CBD area there are a lot of one-way lanes Pickup / dropoff is usually single lane Point of Interest Problem
  • 9. 9 Our Challenges Limited time for waiting and high cost if we miss it Point of Interest Problem CBD area there are a lot of one-way lanes Pickup / dropoff is usually single lane
  • 10. 10 Point of Interest Problem ES Solution Flatten ES data as much as possible. Avoid nested query Store hierarchy in relational database Adding cache to decrease latency for search Our Challenges Limited time for waiting and high cost if we miss it CBD area there are a lot of one-way lanes Pickup / dropoff is usually single lane
  • 11. 11 Food Merchant Operational Hours Challenge Merchant has different operational hours Our Challenges
  • 12. 12 Food Merchant Operation Time Challenge Multiple ops-hours for different meal times Ops-hours and service might be different everyday Our Challenges Merchant has different operational hours (ops-hours)
  • 13. 13 Food Merchant Operation Time Challenge Time Fragment, search fragment with term query Range data type and use range query. ES Solution Multiple ops-hours for different meal times Ops-hours and service might be different everyday Our Challenges Merchant has different operational hours (ops-hours)
  • 14. 14 Food Merchant Operation Time Challenge Delivery radius varies for different merchants Jakarta Terrain impact the delivery difficulty Sagaing Geoshapes datatype,geoshape search Time Fragment, search fragment with term query Range data type and use range query. ES Solution Multiple ops-hours for different meal times Ops-hours and service might be different everyday Our Challenges Merchant has different operational hours (ops-hours)
  • 15. Among Tech Family —— Connect different teams with ES
  • 16. Confidential - For Internal Use Only -Confidential - For Internal Use Only - > 15 Services ~ 10 Tech Families User Group Demographic Engineer DA/DS Ops CE > 6 User Groups Solving search problem in different case Different production requirement Customised solution for different services Access control between department Deal with different user pain point. Consider different learning curve Provide relational data for DA / DS Automation for Ops team Grab ES Summary
  • 17. 17 Fraud Engineer —— Online fraud detection —— They have huge amount of data —— They focus on searching in a large portion of data —— They would require scalability from ES. ENGINEER Service Engineer —— They mainly handle online service. —— Usually they have strong technical background —— The focus on stability, observability, log management for troubleshoot. —— The service QPS always grow with business. Our Customers
  • 18. 18 Our Customers CE Customer Service —— Real-time user history inquiry —— Easy and user friendly tools —— Low QPS but latency sensitive
  • 19. 19 OPS DBA —— Troubleshooting by searching logs —— Alert / Monitoring other database —— Internal tool usage SRE —— Deploy timeline to track last changes —— Storing application log Our Customers
  • 20. 20 DATA ANALYST Product —— Historical data searching Data analyst —— Daily report from data warehouse —— Very complex aggregation and query Our Customers
  • 21. 21 INFO SEC User activities —— Audit internal user activity —— Audit user DB access —— Audit query executed on the database Our Customers
  • 22. 22 DATA ENGINEER Production -> Data Warehouse —— Data transform and clean up —— Deal with complex SQL from DA, PA. —— Daily report generated from data lake. Our Customers
  • 23. Cluster Health? How to protect from bad read —— Reads will affect each other. —— Bad query is not controllable and can not be throttled How to manage read —— Separate critical / non-critical read. Access Control? Security concern —— User should only access their cluster. —— Separate service user and individual user management. What’s the concern? Manage user group —— Different users should see different content. —— Access will be defined based on user group
  • 24. Data stratification Hot-cold layer (Read Separate) —— Cold layer -> offline query / aggregation. —— Downstream pipeline from hot to cold layer. API service (Read Priority) —— Micro service help other engineer focus on business logic. Our Solution
  • 25. —— Role base access control Our Solution Access Category Issue different VPNs —— Control the amount of people to access the subnet. Jumpcloud —— Integrate individual user access with company user group. —— Automate the user access application process
  • 26. 26 Service How do we solve all these problem step by step?
  • 27. 27 Service Stability: —— Load balancer to keep up time. —— ECE container service to keep node up 24hrs. Monitoring: —— Shared Monitoring: Cloudwatch, adjacent monitoring tools for L1 troubleshoot. Divide and conquer
  • 28. 28 Eng & CE Security Control: —— Accessing using engineer vpn. —— Apply access by User group. —— Kibana provide easy query UI which reduce difficulty to use. Doorman: —— Provide temporary cluster access —— Engineer own the access management Divide and conquer
  • 29. 29 Ops & SRE Monitoring: —— ECE cluster metric monitoring. —— Ingest to ELK to store service logs Management: —— Automation tools: Protal, deal with cluster creation, index/ mapping management. Divide and conquer
  • 30. 30 Analyst Data Warehouse: —— Build down stream data pipeline. —— Data transform on ES data. —— Provide SQL to access data Load Testing: —— Get production data from warehouse for load testing. —— Require InfoSec approval Divide and conquer
  • 31. Inside Cloud —— Leveraging cloud technology
  • 33. How we evolve 2017 Open Source stage Heavy operation —— AWS EC2, apply for weeks. —— IP world, login and start the service —— Oncall nightmare, login to recover nodes. Limited cluster —— 1 critical production cluster —— 1 none-critical cluster. —— Several playground clusters.
  • 34. How we evolve 2017 Exploring stage 1 Month 1 cluster —— More team are exploring ES. —— Need to explore a manageable solution. AWS ES —— A manageable solution. —— Save oncall’s life with managed infrastructure. —— Several playground clusters. 2018Q1
  • 35. How we evolve 2017 ECE small deployment ECE as our next solution —— Manageable private infrastructure —— Use Elasticsearch as what we learn on the official website. —— Learning how to use ECE. Mgrating to ECE —— Data transfer script —— Lots of communication with service team on migration steps. 2018Q1 2018Q3
  • 36. How we evolve 2017 ECE Median deployment From 3 clusters to 30 clusters —— Scale ECE to get stable performance —— Upgrade ECE to get newest features. —— Almost 0 downtime for 1 year. —— We learn a lot from Elasticsearch support. Integrate ECE with Grab —— Automate ECE operation —— Integrate ECE with our Portal —— Integrate ECE with Cloud service 2018Q1 2018Q3 2019Q3
  • 37. 37 Coupling Useful Cloud Service with ES Enhance —— ES provide invert index, which facilitate adding index to dataset —— Export RDS logs to troubleshoot —— Add cache for ES search layer —— Build ES down stream —— Indexing —— Logs —— Search cache —— Format —— Analysis —— Pipeline Health Data Flow DB cooperation Multi-layer Monitoring
  • 38. 38 RDS Database generate slowlog, error logs. Find the problem by seeing this with Cloudwath. However, it’s lack of searching capability and visualization. Relation DB CW Cloud monitoring can collect logs from multiple services and store in one monitoring service. Still, the UI is not good for searching and lack of customized alerting. Cloud Monitoring Lambda Every log will trigger a piece of code, which handle the RDS log to ES or slack. Its depends on event types, then it will decide the way to deal with logs. Serverless ES Pipeline send logs / events to ES. If it’s indexed properly, users should be able to use Kibana to search the logs. Elasticsearch Alerting Watcher alerting did a lot of work to integrate with those popular alerts tools. We can config the actions to send the alerts properly. Alerts Coupling Useful Cloud Service with ES - Example
  • 39. Self-service Platform —— Next stage of providing search in Grab
  • 40. Thank You 谢谢 Arkun Terima Kasih Khob Chai ขอขอบคุณ Cảm ơn bạn eက#$ဇ&$တင)ပ+တယ)