MODERN BIG DATA ANALYSIS
WITH SQL SPECIALIZATION
LOVELY PROFESSIONAL UNIVERSITY
CSE443 - SEMINAR ON SUMMER TRAINING
QAZI MAAZ ARSHAD
B.TECH CSE
11906424
TABLE OF CONTENTS
01 My Course
I will discuss my summer training MOOC: the tasks, assignments, timeline, etc.
02 Project
I will explain my project in detail: the aim, problems solved, approach, tech stack, etc.
03 Learning Outcomes
I will talk about the experience and skills I have gained from the course and project.
04 Conclusion
I will give an overview of my training and discuss the future scope of big data.
OVERVIEW OF COURSE
Modern Big Data Analysis with SQL Specialization
“Modern Big Data Analysis with SQL Specialization” is an online specialization offered by Cloudera. It consists of three courses: “Foundations for Big Data Analysis with SQL”, “Analyzing Big Data with SQL”, and “Managing Big Data in Clusters and Cloud Storage”. The specialization teaches the essential skills for working with large-scale data using SQL.
Foundations for Big Data Analysis with SQL
Analyzing Big Data with SQL
Managing Big Data in Clusters and Cloud Storage
TIMELINE
13 May 2021: Enrolled in MOOC
12 July 2021: Completed the Specialization
Completed the Summer Training before the deadline of 10th August 2021.
Each course comprised multiple quizzes and assignments. A minimum of 80% marks was required in most quizzes and assignments to pass the course.
19 July 2021: Commenced Project
25 July 2021: Finished Project Work
It took approximately 1 week to complete the project. I will discuss the project later.
THE PROJECT
What is my project all about
MOVIES DATA ANALYSIS
OVERVIEW
Data analysis of a very large films database. I created a database from 4 different data sets and calculated significant results using SQL queries.
TECH STACK INVOLVED
I created this project using SQL and MySQL Workbench. Results are calculated from the database using SQL queries, and MySQL Workbench is used to design and manage the database.
APPLICATION
This project is useful for calculating results from past movies to analyze trends in movie industries across the globe. It can also be used as a demo for managing other data sets.
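As a rough illustration of the overview above, the sketch below loads two small tables and combines them with a JOIN and GROUP BY. It is not the project's actual schema or data: the table names (movies, ratings), the column names, and the sample rows are hypothetical, and Python's built-in sqlite3 module stands in for MySQL so the example is self-contained.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with two hypothetical tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE movies (
            movie_id INTEGER PRIMARY KEY,
            title    TEXT,
            year     INTEGER,
            country  TEXT
        );
        CREATE TABLE ratings (
            movie_id INTEGER,
            avg_vote REAL
        );
    """)
    # Made-up sample rows, standing in for rows loaded from data sets.
    conn.executemany("INSERT INTO movies VALUES (?, ?, ?, ?)", [
        (1, "Film A", 2001, "India"),
        (2, "Film B", 2001, "USA"),
        (3, "Film C", 2010, "India"),
    ])
    conn.executemany("INSERT INTO ratings VALUES (?, ?)", [
        (1, 7.0), (2, 5.0), (3, 8.0),
    ])
    return conn


def avg_vote_by_country(conn):
    """Join the two tables and aggregate the votes per country."""
    return conn.execute("""
        SELECT m.country, COUNT(*) AS n_movies, AVG(r.avg_vote) AS mean_vote
        FROM movies AS m
        JOIN ratings AS r ON r.movie_id = m.movie_id
        GROUP BY m.country
        ORDER BY m.country
    """).fetchall()
```

Calling avg_vote_by_country(build_demo_db()) returns one row per country with the movie count and the mean vote. The same SELECT statement should also run in MySQL Workbench against tables with these columns.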
WHY THIS PROJECT
The main aim was to implement all the knowledge and skills gained in summer training on a real project.
1 PRACTICAL IMPLEMENTATION
I did this project to get practical exposure to managing a large data set.
2 LEARNING
I learned a lot from this project and revised everything I learned in my summer training.
3 EXPLORE
I got the opportunity to explore more about this domain, learn new things, and enhance existing skills.
4 SOLVE ISSUE
I learned how to manage large datasets. This project calculates important results from unstructured data.
WHICH PROBLEMS THIS PROJECT SOLVES
ANALYZES UNSTRUCTURED DATA
DEDUCES IMPORTANT FACTS
GIVES INSIGHTS ABOUT PAST TRENDS IN THE FILM INDUSTRY
The movies database contains information such as the names of the movies, the year they were released, country of origin, duration, language, budget, etc.
This project will also serve as a prototype to design, manage, and study other datasets in the future.
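To make the idea of "insights about past trends" concrete, here is a minimal sketch of a trend query over the kinds of columns listed on this slide. The table name, column names, and sample rows are assumptions for illustration, and SQLite (via Python's sqlite3 module) is used in place of MySQL.

```python
import sqlite3


def movies_per_decade(rows):
    """Count movies and average their duration for each release decade.

    `rows` is a list of (title, year, country, duration, language, budget)
    tuples; this column layout is hypothetical.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute("""
        CREATE TABLE movies (
            title TEXT, year INTEGER, country TEXT,
            duration INTEGER, language TEXT, budget REAL
        )
    """)
    conn.executemany("INSERT INTO movies VALUES (?, ?, ?, ?, ?, ?)", rows)
    # In SQLite, dividing two integers truncates, so 1994 / 10 * 10 = 1990.
    result = conn.execute("""
        SELECT (year / 10) * 10 AS decade,
               COUNT(*)        AS n_movies,
               AVG(duration)   AS mean_duration
        FROM movies
        GROUP BY decade
        ORDER BY decade
    """).fetchall()
    conn.close()
    return result
```

In MySQL the `/` operator does not truncate, so the decade expression would need the integer-division operator instead, for example `(year DIV 10) * 10`.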
PROJECT DESIGN
ER DIAGRAM
This ER diagram was created using Creately.
An Entity Relationship (ER) diagram is a type of flowchart that illustrates how “entities” such as people, objects, or concepts relate to each other within a system.
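The project's actual ER diagram is not reproduced in this transcript, so the following is only a sketch of how one relationship from such a diagram is typically turned into tables. The two entities (directors and movies) and their columns are hypothetical. The relationship "a movie has one director" becomes a foreign key, and SQLite (used here through Python's sqlite3 module instead of MySQL) rejects rows that break it.

```python
import sqlite3


def make_schema():
    """Create two related tables; entity and column names are hypothetical."""
    conn = sqlite3.connect(":memory:")
    # SQLite only enforces foreign keys when this pragma is switched on.
    conn.execute("PRAGMA foreign_keys = ON")
    conn.executescript("""
        CREATE TABLE directors (
            director_id INTEGER PRIMARY KEY,
            name        TEXT NOT NULL
        );
        CREATE TABLE movies (
            movie_id    INTEGER PRIMARY KEY,
            title       TEXT NOT NULL,
            director_id INTEGER NOT NULL REFERENCES directors(director_id)
        );
    """)
    return conn


def try_insert_movie(conn, movie_id, title, director_id):
    """Return True if the row is accepted, False if a constraint rejects it."""
    try:
        conn.execute(
            "INSERT INTO movies VALUES (?, ?, ?)",
            (movie_id, title, director_id),
        )
        return True
    except sqlite3.IntegrityError:
        return False
```

With one director row inserted (id 1), a movie that points at director 1 is accepted, while a movie that points at a director id that does not exist is rejected.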
SCREENSHOTS
These are a few screenshots of the data sets, code, and query results from MySQL Workbench and the GitHub repository.
SWOT ANALYSIS
The SWOT analysis of my project (and of SQL in general) assesses four important aspects of the project.
S - Strength
Simple and easy handling of large data. Data analysis can be performed easily using basic commands.
W - Weakness
SQL is good for fetching data, but it cannot be used alone for visualizing data.
O - Opportunity
The project can be improved by exporting results to other tools and representing them as graphs.
T - Threat
Python and tools like Tableau will be preferred over SQL when the requirement is to visualize the data as figures.
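The opportunity noted in this SWOT analysis, exporting query results so that other tools can chart them, might look like the sketch below. It writes any query result as CSV text, which spreadsheet and visualization tools can generally import. The helper name query_to_csv, the table, and the sample rows are hypothetical, and SQLite (Python's sqlite3 module) stands in for the project's MySQL database.

```python
import csv
import io
import sqlite3


def query_to_csv(conn, sql, out):
    """Run `sql` and write a header row plus all result rows to `out` as CSV."""
    cur = conn.execute(sql)
    writer = csv.writer(out, lineterminator="\n")
    # cursor.description holds one tuple per result column; item 0 is its name.
    writer.writerow([col[0] for col in cur.description])
    writer.writerows(cur)


# Usage with made-up data: count movies per country and capture the CSV text.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE movies (title TEXT, country TEXT)")
conn.executemany("INSERT INTO movies VALUES (?, ?)", [
    ("Film A", "India"), ("Film B", "USA"), ("Film C", "India"),
])
buffer = io.StringIO()
query_to_csv(
    conn,
    "SELECT country, COUNT(*) AS n_movies FROM movies "
    "GROUP BY country ORDER BY country",
    buffer,
)
csv_text = buffer.getvalue()
```

Passing an open file instead of the in-memory buffer would write the same CSV to disk, ready to be loaded into a charting tool.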
LEARNING OUTCOMES
Skills and technologies learned from the summer training and experience gained from working on real projects.
SQL
Data Analysis
MySQL Workbench
DBMS
FUTURE OF SQL DATABASES
The future of SQL Server will depend on the future of the use of SQL as a query language.
SQL in the Past
SQL started as an IBM project in 1974 to implement a “relational model of data”.
SQL in the Present
SQL, being the ANSI and ISO standard for relational databases, is adapting as the world of data transforms into big data.
SQL in the Future
Data is increasing exponentially and is not going to shrink, so there will always be a need to manage data.
THANK YOU
