Powering Interactive Analytics with Alluxio and Presto

Dmytro Dermanskyi
DATA ORCHESTRATION SUMMIT 2020

WalkMe: Digital Adoption Platform

Insights: WalkMe’s web analytics solution

Funnels analysis

Funnel example: “Submit support ticket” journey
User must perform these steps to submit a support ticket:
1. Click the “Contact support” link.
2. Visit the “Submit a ticket” page.
3. Fill in the details and open the ticket.
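
A minimal Python sketch of how such a funnel can be counted from raw events: each user's events are walked in time order, and a step only counts once all earlier steps have happened. The event names and the (user_id, timestamp, event_name) layout are hypothetical. This is an illustration of the idea, not the SQL that Insights actually runs on Presto.

```python
from collections import defaultdict

# Hypothetical event names standing in for the three steps of the journey.
FUNNEL_STEPS = ["click_contact_support", "visit_submit_ticket", "open_ticket"]

def funnel_counts(events, steps=FUNNEL_STEPS):
    """Count users who reached each step, in order.

    events: iterable of (user_id, timestamp, event_name) tuples.
    Returns a list where item i is the number of users who completed
    steps[0..i] in that order.
    """
    by_user = defaultdict(list)
    for user_id, ts, name in events:
        by_user[user_id].append((ts, name))

    counts = [0] * len(steps)
    for user_events in by_user.values():
        next_step = 0
        for _, name in sorted(user_events):
            if next_step < len(steps) and name == steps[next_step]:
                counts[next_step] += 1
                next_step += 1
    return counts

events = [
    ("u1", 1, "click_contact_support"),
    ("u1", 2, "visit_submit_ticket"),
    ("u1", 3, "open_ticket"),
    ("u2", 1, "click_contact_support"),
    ("u2", 2, "visit_submit_ticket"),
    ("u3", 1, "visit_submit_ticket"),   # page visit before the click: not step 2
    ("u3", 2, "click_contact_support"),
]
print(funnel_counts(events))
```

With the sample data, three users reach step 1, two reach step 2, and one completes the journey.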

Funnel example: UI

Funnel example: SQL behind the scenes

Initial architecture

Challenges on the way to Alluxio
1. Glue table data location paths are fixed to the s3:// “filesystem”.
2. Alluxio metadata needs to be refreshed periodically to pick up changes in
S3, because some S3 files are not written through Alluxio.
3. In a collocated deployment we need to carefully divide memory
between the Alluxio and Presto processes. By default EMR gives
Presto JVMs most of the available RAM.

Problem 1: Pointing Glue tables to Alluxio
Solution:
1. Export the Glue catalog into a classic Hive metastore.
2. During the export, rewrite table location paths to use alluxio:// instead of
s3://.
3. Connect EMR Presto to the Hive metastore instead of the Glue catalog.
4. Repeat steps 1 and 2 periodically to sync changes in Glue (e.g. new
partitions, new tables) to the Hive metastore.
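
The path rewrite in step 2 can be sketched in Python as below. The Alluxio master address and the assumption that each S3 bucket is mounted in Alluxio at /<bucket> are hypothetical. The real export job also has to read from Glue and write to the Hive metastore, which is not shown here.

```python
from urllib.parse import urlparse

# Hypothetical Alluxio master address; 19998 is Alluxio's default RPC port.
ALLUXIO_AUTHORITY = "alluxio-master:19998"

def to_alluxio_location(location, authority=ALLUXIO_AUTHORITY):
    """Rewrite an s3:// table location to alluxio://.

    Assumes each bucket is mounted in Alluxio at /<bucket>; adjust the
    path mapping to match the real mount points.
    Non-S3 locations are returned unchanged.
    """
    parsed = urlparse(location)
    if parsed.scheme not in ("s3", "s3a", "s3n"):
        return location
    return f"alluxio://{authority}/{parsed.netloc}{parsed.path}"

print(to_alluxio_location("s3://my-bucket/warehouse/events/dt=2020-01-01"))
```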

Disadvantages of the solution:
● The Glue-to-Hive export job is complex.
● Two table catalogs must be maintained: Glue and the Hive metastore.
Resolving and syncing schema changes is difficult.
● The export job depends on the proprietary Glue API, which can change over
time and break the job.

Possible alternative solution: Alluxio Catalog Service
Just delegate the problem to Alluxio
https://docs.alluxio.io/os/user/stable/en/core-services/Catalog.html

Problem 2: Alluxio metadata sync for S3 changes
Solution:
alluxio fs ls [-R|-f]
Can be executed:
● Upon file change events from S3
● Periodically with crontab
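
A minimal Python wrapper around that command, which could be scheduled from crontab or called from an S3 event handler. It assumes the `alluxio` CLI is on the PATH of the node where it runs. The table path in the usage note is hypothetical.

```python
import subprocess

def build_sync_command(path, recursive=True, force=True):
    """Build the `alluxio fs ls` command used to reload metadata for a path."""
    cmd = ["alluxio", "fs", "ls"]
    if recursive:
        cmd.append("-R")
    if force:
        cmd.append("-f")
    cmd.append(path)
    return cmd

def sync_metadata(path):
    """Run the listing for its side effect; requires the `alluxio` CLI."""
    # The listing output itself is not needed, so it is discarded.
    subprocess.run(build_sync_command(path), check=True,
                   stdout=subprocess.DEVNULL)
```

For example, `sync_metadata("/my-bucket/warehouse/events")` would refresh one table directory. Limiting the call to the paths that actually changed keeps the listing cheap.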

Problem 3: Presto vs Alluxio memory contention
By default EMR gives Presto JVM processes most of the available RAM:
EC2 node type    RAM      EMR Presto -Xmx
c5.2xlarge       16GB     12GB
r5.4xlarge       128GB    100GB
c5.4xlarge       32GB     24.5GB
c4.8xlarge       60GB     47GB
c5.9xlarge       72GB     55GB
The remaining memory may not be enough for Alluxio processes,
especially if you want Alluxio to cache data in RAM.
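
The memory left outside the Presto heap follows directly from the table above. A small Python sketch of the arithmetic, using only the numbers listed there:

```python
# (RAM in GB, default EMR Presto -Xmx in GB), as listed in the table above.
NODE_TYPES = {
    "c5.2xlarge": (16, 12),
    "r5.4xlarge": (128, 100),
    "c5.4xlarge": (32, 24.5),
    "c4.8xlarge": (60, 47),
    "c5.9xlarge": (72, 55),
}

def remaining_gb(node_type):
    """RAM not claimed by the default Presto heap on this node type."""
    ram, xmx = NODE_TYPES[node_type]
    return ram - xmx

for name in NODE_TYPES:
    print(f"{name}: {remaining_gb(name)} GB left outside the Presto heap")
```

That remainder has to cover the operating system, the Alluxio worker JVM and any RAM cache, so on a c5.2xlarge only 4 GB is available for all of them.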

EMR doesn’t provide a means to adjust the default Presto JVM config, but we need such
control to divide memory appropriately between the processes.

Solution 1:
Don’t use EMR Presto; deploy our own Presto distribution instead:
● This brings additional complexity and more work.
● It also gives more flexibility in the choice of Presto version and
configuration.

Solution 2:
Use an EMR bootstrap script to set up a cron job that fixes the EMR Presto JVM
config as soon as it is provisioned by EMR.
NOTE: EMR runs bootstrap scripts before installing Presto, so the config cannot
be patched directly in the bootstrap script.
● Allows reusing the EMR Presto distribution (no need to deploy our own Presto).
● “Hacky” solution.
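
The core of such a cron job is a rewrite of the -Xmx line in Presto's jvm.config. A minimal Python sketch of that step is shown below. The location of jvm.config on the node and the Presto restart that must follow are left out because they depend on the EMR release. The heap size in the example is hypothetical.

```python
import re

def set_max_heap(jvm_config_text, new_xmx):
    """Return jvm.config content with the -Xmx line replaced by new_xmx.

    new_xmx is a JVM size string such as "8G". If no -Xmx line exists,
    one is appended.
    """
    lines = jvm_config_text.splitlines()
    pattern = re.compile(r"^-Xmx\S+$")
    replaced = False
    for i, line in enumerate(lines):
        if pattern.match(line.strip()):
            lines[i] = f"-Xmx{new_xmx}"
            replaced = True
    if not replaced:
        lines.append(f"-Xmx{new_xmx}")
    return "\n".join(lines) + "\n"

print(set_max_heap("-server\n-Xmx12G\n-XX:+UseG1GC\n", "8G"))
```

The cron job would do nothing until the file exists, apply the rewrite once, restart Presto, and then remove itself from the crontab.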

Alluxio-based architecture

Performance improvement
Average query execution time dropped from ~20 sec to ~1 sec.

Lessons learned: Memory cannot be overcommitted
● Do not use the default EMR Presto JVM config: it may leave too little
memory for Alluxio processes, which leads to OOM
crashes.
● Choose an appropriate caching medium for Alluxio:
○ To cache in RAM, use memory-optimized instances.
○ Consider caching on SSD. Both NVMe and EBS gp2 disks can give
a significant performance boost.
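
As an illustration of the SSD option, the Python sketch below renders alluxio-site.properties lines for a single-tier worker cache. The property names are the tiered-storage settings documented for Alluxio 2.x and should be checked against the version in use. The mount path and quota are hypothetical.

```python
def tiered_store_properties(alias, path, quota):
    """Render alluxio-site.properties lines for a single-tier worker cache."""
    prefix = "alluxio.worker.tieredstore"
    return "\n".join([
        f"{prefix}.levels=1",
        f"{prefix}.level0.alias={alias}",
        f"{prefix}.level0.dirs.path={path}",
        f"{prefix}.level0.dirs.quota={quota}",
    ])

# Hypothetical NVMe mount point and cache size.
print(tiered_store_properties("SSD", "/mnt/nvme0", "200GB"))
```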

Lessons learned: Data locality matters
We tried deploying Alluxio as a cluster separate from Presto:
+ This design decouples storage (S3), the caching layer (Alluxio) and
compute (Presto). You can give Alluxio as many resources (RAM/SSD) as
necessary.
- Query performance is at least 2-3 times worse compared to the collocated
deployment.

Lessons learned: Alluxio metastore can grow large
When the number of files in the UFS becomes huge (e.g. hundreds of
thousands), consider the following:
● The Alluxio journal will grow to gigabytes in size. Make sure the disk where it’s
kept is large enough, or use the UFS for it (though that is not so simple).
● It may be worth switching from the HEAP to the RocksDB-based metastore. This
way gigabytes of master RAM taken by the metastore can be freed up.

Lessons learned: Monitor everything!
What can go wrong:
● Presto workers can crash
● Alluxio workers can crash
● Presto/Alluxio masters can crash
● EC2 instances can run out of disk space
● Queries can start to fail for many different reasons
● Queries can start to hang and time out
● You name it ...
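
A minimal Python sketch of two of these checks: whether a process still accepts TCP connections, and whether the local disk is filling up. The ports in the usage note are the Alluxio defaults and the usual Presto port on EMR, which may differ in a given cluster. A real setup would feed the results into an alerting system rather than inspect them by hand.

```python
import shutil
import socket

def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

def disk_free_fraction(path="/"):
    """Return the fraction of free space on the local filesystem holding path."""
    usage = shutil.disk_usage(path)
    return usage.free / usage.total

def check_node(host, ports, min_free=0.10, path="/"):
    """Return a list of human-readable problems; the disk check is local."""
    problems = [f"{name} not reachable on {host}:{port}"
                for name, port in ports.items()
                if not port_is_open(host, port)]
    if disk_free_fraction(path) < min_free:
        problems.append(f"less than {min_free:.0%} disk space free on {path}")
    return problems
```

For example, `check_node("localhost", {"alluxio-master": 19998, "alluxio-worker": 29999, "presto": 8889})` returns an empty list when everything is reachable and the disk has room. Hanging or failing queries need a separate check that runs a small test query with a timeout.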
