SlideShare a Scribd company logo
Deploy a Spark Application
on a Spark Cluster @
Elastic Map Reduce AWS
Dr. Rim Moussa
University of Carthage
Amazon S3 -Amazon Simple Storage Service
 Upload
– S3 bucket for Spark code: .jar
– S3 bucket for Data
 Uploads to S3 might be done via Terminal for big data sets
laumch EC2 instance to upload data into S3 bucket
curl ftp://ftp.ais.dk/ais_data/dk_csv_jun2018.zip | aws s3 cp -
s3://aisdma
 Manipulation of S3 buckets and files can be done via
Terminal
aws s3 ls s3://data.info
aws s3 cp s3://spark.jars/rm-1.0-veracity.jar .
aws s3 rm s3://data.info
2
Open Datasets on Amazon
 Amazon has a repository of big datasets
https://registry.opendata.aws/
 Amazon implements a program AWS Public Dataset Program
in order to democratize access to data and encourage the
development of communities that benefit from access to
shared datasets.
https://aws.amazon.com/opendata/public-datasets/
3
4
5
Amazon Elastic MapReduce
6
Amazon Elastic MapReduce
7
Amazon Elastic MapReduce
8
Amazon Elastic MapReduce
9
Hardware Tab
10
Cluster ready
11
ssh Master
12
→ ssh Master
13
Click on master
14
Click on EC2 instance of master
15
Review Master Security Groups
16
Check “allow pinging and ssh”
17
→ ssh master
18
Steps towards submitting a workflow
19
Create S3 bucket for Workflow output
20
Submit job to Spark master
21
Job result
22
Re-check data.info S3 bucket
23
Download results
24
Terminate cluster
25
Terminate cluster
26

More Related Content

PDF
AWSのDatabase・Analytics系サービス 概要と使いどころをざくっとおさらい
PDF
Big data, Cloud, and the NOAA CRADA at The Climate Corporation
PDF
Quantitative Precipitation Estimation at The Climate Corporation
PDF
Building real apps on serverless
PDF
AWS Certified Solutions Architect - Associate SAA-C03 Dumps
PDF
SAA-C03 Practice Questions – Prepare Like a Pro for the AWS Exam
PDF
SAA-C03 Exam Dumps for 2025 – Pass Your AWS Associate Exam on First Attempt
PDF
Updated 2025 SAA-C03 Exam Guide – Pass AWS Solutions Architect Associate with...
AWSのDatabase・Analytics系サービス 概要と使いどころをざくっとおさらい
Big data, Cloud, and the NOAA CRADA at The Climate Corporation
Quantitative Precipitation Estimation at The Climate Corporation
Building real apps on serverless
AWS Certified Solutions Architect - Associate SAA-C03 Dumps
SAA-C03 Practice Questions – Prepare Like a Pro for the AWS Exam
SAA-C03 Exam Dumps for 2025 – Pass Your AWS Associate Exam on First Attempt
Updated 2025 SAA-C03 Exam Guide – Pass AWS Solutions Architect Associate with...

Similar to Spark EMR AWS (15)

PPTX
AWS Certified Solutions Architect Professional Course S15-S18
PDF
AWS tutorial-Part5 to 10(Combined):Overview of various AWS services and offer...
PPTX
Best AWS Services List 2022
PDF
AWS Certified SysOps Administrator Associate SOA-C02 pdf
PDF
Introduction to Amazon Web Services
DOCX
Updated SAA-C03 Dumps for 2024 Secure Your AWS Certification
PPTX
Amazon web services session 4
PPTX
Aws primer Amazon Web Services
PDF
Aws storage services whitepaper v9
PDF
Aws storage services whitepaper v9
PDF
AWS Big Data Landscape
PPTX
Building a Modern Data Platform on AWS. Public Sector Summit Brussels 2019
PDF
Top 30+ Latest AWS Certification Interview Questions on AWS BI & Data Visuali...
PPTX
PDF
Automated security analysis of aws clouds v1.0
AWS Certified Solutions Architect Professional Course S15-S18
AWS tutorial-Part5 to 10(Combined):Overview of various AWS services and offer...
Best AWS Services List 2022
AWS Certified SysOps Administrator Associate SOA-C02 pdf
Introduction to Amazon Web Services
Updated SAA-C03 Dumps for 2024 Secure Your AWS Certification
Amazon web services session 4
Aws primer Amazon Web Services
Aws storage services whitepaper v9
Aws storage services whitepaper v9
AWS Big Data Landscape
Building a Modern Data Platform on AWS. Public Sector Summit Brussels 2019
Top 30+ Latest AWS Certification Interview Questions on AWS BI & Data Visuali...
Automated security analysis of aws clouds v1.0
Ad

More from Rim Moussa (6)

PDF
Keynote27nov
PDF
Smartnets2018
PDF
Isncc2020
PDF
Compsac 2018
PDF
Bicod2017
PDF
Teaching big data
Keynote27nov
Smartnets2018
Isncc2020
Compsac 2018
Bicod2017
Teaching big data
Ad

Recently uploaded (20)

PPTX
Cell Structure & Organelles in detailed.
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
Presentation on HIE in infants and its manifestations
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
Lesson notes of climatology university.
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
RMMM.pdf make it easy to upload and study
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Complications of Minimal Access Surgery at WLH
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
Cell Structure & Organelles in detailed.
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Abdominal Access Techniques with Prof. Dr. R K Mishra
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Presentation on HIE in infants and its manifestations
Final Presentation General Medicine 03-08-2024.pptx
Final Presentation General Medicine 03-08-2024.pptx
Module 4: Burden of Disease Tutorial Slides S2 2025
Lesson notes of climatology university.
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
RMMM.pdf make it easy to upload and study
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Supply Chain Operations Speaking Notes -ICLT Program
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Complications of Minimal Access Surgery at WLH
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Microbial diseases, their pathogenesis and prophylaxis
Microbial disease of the cardiovascular and lymphatic systems
STATICS OF THE RIGID BODIES Hibbelers.pdf

Spark EMR AWS