3 ways to efficiently
migrate your big data
to AWS cloud
AWS Services useful
in the migration
process
Amazon EMR is a service for cost-effective and
fast processing of large amounts of data. It runs
the Hadoop and Spark frameworks on top of Amazon EC2
and Amazon S3, and is suited to workloads such as
indexing, data mining, machine learning or financial
analysis.
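As a rough illustration, a cluster running Hadoop and Spark could be requested through the boto3 `run_job_flow` call. The sketch below only builds the request parameters and does not contact AWS. The cluster name, release label, instance types, instance counts and log bucket are assumptions made for illustration, not values from this presentation.

```python
def build_emr_request(name, log_bucket, worker_count=2):
    """Build keyword arguments for boto3's EMR run_job_flow call.

    All concrete values (release label, instance types, role names)
    are illustrative assumptions; adjust them to your own account.
    """
    return {
        "Name": name,
        "ReleaseLabel": "emr-6.15.0",  # assumed release; pick a current one
        "LogUri": f"s3://{log_bucket}/emr-logs/",
        "Applications": [{"Name": "Hadoop"}, {"Name": "Spark"}],
        "Instances": {
            "InstanceGroups": [
                {
                    "Name": "Primary",
                    "InstanceRole": "MASTER",
                    "InstanceType": "m5.xlarge",
                    "InstanceCount": 1,
                },
                {
                    "Name": "Workers",
                    "InstanceRole": "CORE",
                    "InstanceType": "m5.xlarge",
                    "InstanceCount": worker_count,
                },
            ],
            # Terminate the cluster once its steps finish, to limit cost.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",
        "ServiceRole": "EMR_DefaultRole",
    }


# Hypothetical cluster name and bucket, for illustration only.
request = build_emr_request("migration-prototype", "example-bucket")

# To actually launch the cluster (requires boto3 and AWS credentials):
#   import boto3
#   boto3.client("emr").run_job_flow(**request)
```

Keeping the parameters in a plain dictionary makes it easy to review and version the cluster definition before anything is launched.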
Amazon S3 (Simple Storage Service) is an
object storage service that stores data as
objects in buckets. In a big data migration it
typically serves as the durable storage layer that services
such as Amazon EMR and AWS Glue read from and write
to.
AWS Glue is a fully managed
extract, transform and load (ETL)
service that makes it easier for
customers to prepare and load data for
analysis. It also lets you configure,
coordinate and monitor complex data flows.
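The three ETL stages can be illustrated with a minimal plain-Python sketch. This is not the AWS Glue API; the sample data, the function names and the list standing in for a data store are all hypothetical.

```python
import csv
import io

# Hypothetical sample input; the second row has a missing amount.
RAW = "id,amount\n1,10.5\n2,\n3,4.5\n"


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing amount and convert types."""
    return [
        {"id": int(row["id"]), "amount": float(row["amount"])}
        for row in rows
        if row["amount"]
    ]


def load(rows, target):
    """Load: append cleaned rows to the target (a list standing in for a data store)."""
    target.extend(rows)
    return len(rows)


warehouse = []
loaded = load(transform(extract(RAW)), warehouse)
```

A managed service takes over scheduling, scaling and monitoring of these stages, but the extract, transform and load steps themselves keep this shape.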
Open source software
supporting big data
Apache Hadoop is software for distributed
storage and processing of large data sets on
computer clusters.
Apache Spark is an engine and
programming platform for distributed
computing.
▪ Hadoop is designed for efficient batch processing, while Spark is
also designed to handle data in near real time.
▪ Hadoop MapReduce is a high-latency computing framework with no interactive
mode, while Spark offers low-latency computing and can process data interactively.
▪ Apache Spark is also a component of the Hadoop ecosystem. Spark’s main
idea is in-memory processing.
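The batch model behind Hadoop can be sketched as a toy map, shuffle and reduce pipeline in plain Python. This uses neither the Hadoop nor the Spark API, and the input lines are hypothetical.

```python
from collections import defaultdict


def map_phase(lines):
    """Map: emit a (word, 1) pair for every word."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle(pairs):
    """Shuffle: group the emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Reduce: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


lines = ["big data on AWS", "big data migration"]  # hypothetical input
counts = reduce_phase(shuffle(map_phase(lines)))

# Keeping `counts` in memory lets follow-up queries reuse it without
# re-reading the input, which is the idea behind Spark's in-memory model.
top_word = max(counts, key=counts.get)
```

In a real cluster each phase runs across many machines. The difference lies in where intermediate results are kept: MapReduce writes them to disk between jobs, while Spark can hold them in memory.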
3 approaches to the
migration process
There are a few approaches to cloud migration, but
these 3 allow you to make conscious decisions
about your architecture.
Re-architecting
This approach redesigns the existing
infrastructure to make full use of
cloud computing. It relies on
analysing the existing architecture
and the way it is designed, which can
bring benefits such as lower memory
and hardware costs and increased
operational flexibility, and these in turn
translate into business benefits.
Lift and shift
This is an ideal solution when we need
a more efficient infrastructure. By
moving the workloads of the existing
environment as they are, we avoid
most of the changes that
re-architecting involves. Fewer
changes also reduce the risk of
unexpected work, so your solution can
return to operation or enter the
market sooner.
Hybrid
This is a combination of the two previous
approaches. Lift and shift handles the
parts that need to migrate fast, while
re-architecting is applied where
solutions need to be redesigned. This
approach offers a great deal of
flexibility: you can experiment with
cloud solutions and gain the necessary
experience before you decide to move
to the cloud permanently.
Prototyping in
the spirit of
best practices
Knowing the possible approaches to cloud migration,
let’s move on to prototyping. Adopting new
solutions always involves a learning stage, and
practice is the best way to learn. Prototyping
should therefore be a key step when implementing new
services and products. It is cheaper to check the
application at the prototyping stage than to fix problems later, and
the same applies to the choice of instance types. The worst
assumption is that an application running in the on-premises
environment will work the same way in the cloud environment, because
many factors affect this. It’s worth running applications in a test
environment with loads that can occur in the real world.
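A prototype load test can be as simple as timing the workload under representative payloads and summarizing the latencies. The sketch below uses only the Python standard library. The `sample_workload` function and the payload sizes are hypothetical stand-ins for the real application and its expected traffic.

```python
import statistics
import time


def measure(workload, payloads):
    """Run the workload once per payload and return latencies in seconds."""
    latencies = []
    for payload in payloads:
        start = time.perf_counter()
        workload(payload)
        latencies.append(time.perf_counter() - start)
    return latencies


def summarize(latencies):
    """Summarize latencies with the median and the 95th percentile."""
    cut_points = statistics.quantiles(latencies, n=20)  # 19 cut points
    return {"p50": statistics.median(latencies), "p95": cut_points[18]}


def sample_workload(n):
    """Stand-in for the real application; replace with your own call."""
    return sum(i * i for i in range(n))


# Hypothetical load: payload sizes that mimic expected production traffic.
report = summarize(measure(sample_workload, [1_000, 5_000, 20_000] * 10))
```

Running the same script on-premises and on each candidate instance type gives comparable numbers instead of assumptions.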
Best practices
in prototyping
1. Make a list of all assumptions and uncertainties,
noting which may have the greatest impact
on the environment.
2. Select and implement the riskiest aspects of the
migration first.
3. Set your goals in advance and don’t be afraid to ask
questions. The answers will help verify the project or
explain how a given solution works.
4. Always prototype under conditions similar to those in
which you want to operate. You can start with a smaller
environment or set of features and then scale up.
5. Use iteration and continuous integration as the basis for
implementation tests. With an automated environment and
scripts, you can run the same test in several environments.
6. Ask an expert to review the test configuration and
environment. This helps eliminate errors and confirm that
the results are not distorted.
7. Running the tests correctly lets you rule out variables
that may be caused by dependencies.
8. Document the test results and ask for verification to ensure
they are reliable.
9. Don’t take any assumption for granted! In the big data
area, many factors affect performance, functionality
and cost.
10. Prototyping aims to verify the assumptions of the project
with a fairly high degree of certainty. In general, the more
effort you put into the prototype and the more factors it
takes into account, the greater the confidence that the
project will work in a production environment.
11. Above all, don’t be afraid to seek help from AWS
Authorized Partners, AWS Support and the documentation.
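The first two practices, listing assumptions and tackling the riskiest ones first, can be kept in a small register. This is a minimal sketch: the scoring scale, the impact-times-uncertainty formula and the sample entries are assumptions for illustration, not part of the original recommendations.

```python
from dataclasses import dataclass


@dataclass
class Assumption:
    description: str
    impact: int       # 1 (low) to 5 (high), a hypothetical scale
    uncertainty: int  # 1 (well understood) to 5 (unknown)

    @property
    def risk(self):
        """Simple risk score: impact multiplied by uncertainty."""
        return self.impact * self.uncertainty


def prioritize(assumptions):
    """Return assumptions ordered from the riskiest to the least risky."""
    return sorted(assumptions, key=lambda a: a.risk, reverse=True)


# Hypothetical entries for illustration only.
register = [
    Assumption("Chosen instance type handles peak load", impact=5, uncertainty=4),
    Assumption("Job scripts run unchanged on the new cluster", impact=3, uncertainty=2),
    Assumption("Data transfer finishes within the migration window", impact=4, uncertainty=4),
]
ordered = prioritize(register)
```

The top entries of `ordered` are the candidates to prototype first. Storing the register next to the test results also supports the documentation step.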
Any questions?
We can help you!
Feel free to contact us
kontakt@lcloud.pl
www.lcloud.pl
Thank you for your time!
All source materials in the presentation have been appropriately marked.
