SlideShare a Scribd company logo
Introduction to Data
Science
Prepared by :
Sulav Acharya
Gangadhar Shah
Objective
• Foster interdisciplinary research/education
• Promote academia and industry partnership and outreach
• Apprehend the
field of Data Science impact and important to society
• Reflect on its applications, important and advantage
Contents
▪ Getting started with Data Science
▪ Components of Data Science
▪ Understanding concept of Data Science
▪ Application of Data Science
Data All Around
Lots of data is collected and warehoused
• Web data
• E-Commerce
• Financial transactions
• Bank/Credit transaction
• Online trading and purchasing
• Social Network
Getting Started with Data Science
▪ Data Science is the study of data to
extract meaningful insights for business.
▪ It all combines math and statistics, specialized
programming, advance analytics, artificial intelligence(AI)
and machine learning with specific subject matter.
▪ It is an area that manage, manipulate, extracts and
interprets knowledge from tremendous amount of data
▪ Data science principles apply to all big and small data
Data Science Real Life
Applications
Fraud detection
▪ Investigate fraud pattern in
past data
▪ Early detection is important
▪ Precision is important
▪ Real-time analytics
Recommender System
▪ The ability to
offer unique personalize service
Netflix recommender system valued
at $1B per year
Amazon recommender system drives a
20-30% lift in sales annually
Understanding the concept of Data
Science
Statistical Analysis
 Statistics provide the information to educate how things work
 It is a science of learning from data and of measuring, controlling
and communicating uncertainty
 It is used to design experiments, analyze data, and make informed
decision
 Data science involve the collection, organization, analysis
and visualization of large amounts of data
 Meanwhile, using mathematical model to quantify relationships
between variable and outcomes and make predictions based
on those relationship
Microsoft Edge
PDF Document
Big Data Analytics
Big data is any data that is expensive to manage and hard to
extract value from
• Volume
The size of the data
• Velocity
The latency of data processing relative to growing demand
• Variety and Complexity
The diversity of sources, format, quality, structure
Data Mining
▪ Subset of Data Science that involves analyzing large
data sets to find patterns and other useful information
▪ It is process of extracting knowledge or insight from large
amounts of data using various statistical and computational
techniques.
▪ The data can be structured, semi-structured or unstructured
and can be stored in various forms such as database and data
warehouses.
▪ The primary goal of data mining is to discover hidden patterns
and relationships in data that can be used to make
informed decisions or predictions.
Visualization
▪ Data visualization is the process of
generating graphical representations like graphs, charts or maps
to represent data of information.
▪ Its benefits include communicating your result or findings
and monitoring the model's performance.
Artificial Intelligence
▪ Artificial Intelligence works by simulating human intelligence
through the use of algorithms, data and computational power.
▪ AI systems often utilize machine learning algorithms commonly use
in data science for prediction modeling and pattern recognition.
Machine Learning
Machine Learning (ML) is a branch of Artificial Intelligence(AI) that use
algorithms to automate data analysis, build model that enable computer to
learn from data, identify pattern and make decision in minimal human
presence.
Learning
Computer can learn from data sets to perform tasks like analyzing data,
categorizing images or predicting image or price fluctuations.
Identifying Pattern
Computer can identify pattern in large amount of data collection
Making Decision
Computer can make data-driven recommendations and decision based on
input data
Model Deployment
▪ Model Deployment in data science is the process of integrating a
machine learning model into a productive environment so that it
take input and produce output.
Data Governance and Ethics
▪ Data governance is everything you do to ensure data is
secure, private , accurate, available and usable.
▪ It is rules, standards and processes defining how data is
handled within the company.
Sample project
Microsoft Edge
PDF Document

More Related Content

PPTX
Data science and business analytics
PPTX
Data Science Training in Chandigarh h
PPTX
Data Science Mastery Course in Pitampura
PDF
Data Analytics and Big Data on IoT
PDF
Introduction to Data Science
PPTX
data science process in data analytics.pptx
PPTX
Data Science comparison with AI, ML, BI, and data warehousing, data mining.
PPTX
DATA SCIENCE PPT BY TEACHERDADAPLUS.pptx
Data science and business analytics
Data Science Training in Chandigarh h
Data Science Mastery Course in Pitampura
Data Analytics and Big Data on IoT
Introduction to Data Science
data science process in data analytics.pptx
Data Science comparison with AI, ML, BI, and data warehousing, data mining.
DATA SCIENCE PPT BY TEACHERDADAPLUS.pptx

Similar to Intoduction to Data Science By Sulav Acharya (20)

PDF
Applications & Research Topics in Machine Learning
PPTX
Data science
PPTX
Big Data & Data Science Pengantar Imu Komputer_C5.pptx
PPTX
Big data Analytics Unit - CCS334 Syllabus
DOCX
Understanding Data Mining: Benefits, Challenges, and How AI & ML Help
PPTX
Data science training presentation for high-quality education and training in...
PPTX
Data science in business Administration Nagarajan.pptx
PPTX
DataScienceandVisualization_Mod_1_ppt.pptx
PPTX
DATA SCIENCE PPT1.pptx
PPTX
DATA SCIENCE PPT.pptx
PDF
DataSciencePowerPointPresentationFull.pdf
PPTX
Introduction To Data Mining and Data Mining Techniques.pptx
PDF
CS3352-Foundations of Data Science Notes.pdf
PPTX
Data science applications and usecases
PDF
Best Data Science training institute in Hyderabad
PPTX
This is abouts are you doing the same time who is the best person to be safe and
PDF
Unlock the power of information: Data Science Course In Kerala
PPTX
Data Mining & Applications
PPTX
Data mining
PPTX
Data Science topic and introduction to basic concepts involving data manageme...
Applications & Research Topics in Machine Learning
Data science
Big Data & Data Science Pengantar Imu Komputer_C5.pptx
Big data Analytics Unit - CCS334 Syllabus
Understanding Data Mining: Benefits, Challenges, and How AI & ML Help
Data science training presentation for high-quality education and training in...
Data science in business Administration Nagarajan.pptx
DataScienceandVisualization_Mod_1_ppt.pptx
DATA SCIENCE PPT1.pptx
DATA SCIENCE PPT.pptx
DataSciencePowerPointPresentationFull.pdf
Introduction To Data Mining and Data Mining Techniques.pptx
CS3352-Foundations of Data Science Notes.pdf
Data science applications and usecases
Best Data Science training institute in Hyderabad
This is abouts are you doing the same time who is the best person to be safe and
Unlock the power of information: Data Science Course In Kerala
Data Mining & Applications
Data mining
Data Science topic and introduction to basic concepts involving data manageme...
Ad

Recently uploaded (20)

PDF
STKI Israel Market Study 2025 version august
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
Web App vs Mobile App What Should You Build First.pdf
PPTX
Modernising the Digital Integration Hub
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
PDF
WOOl fibre morphology and structure.pdf for textiles
STKI Israel Market Study 2025 version august
Group 1 Presentation -Planning and Decision Making .pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Assigned Numbers - 2025 - Bluetooth® Document
gpt5_lecture_notes_comprehensive_20250812015547.pdf
NewMind AI Weekly Chronicles – August ’25 Week III
OMC Textile Division Presentation 2021.pptx
O2C Customer Invoices to Receipt V15A.pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
Web App vs Mobile App What Should You Build First.pdf
Modernising the Digital Integration Hub
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
observCloud-Native Containerability and monitoring.pptx
1 - Historical Antecedents, Social Consideration.pdf
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
A novel scalable deep ensemble learning framework for big data classification...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
DP Operators-handbook-extract for the Mautical Institute
2021 HotChips TSMC Packaging Technologies for Chiplets and 3D_0819 publish_pu...
WOOl fibre morphology and structure.pdf for textiles
Ad

Intoduction to Data Science By Sulav Acharya

  • 1. Introduction to Data Science Prepared by : Sulav Acharya Gangadhar Shah
  • 2. Objective • Foster interdisciplinary research/education • Promote academia and industry partnership and outreach • Apprehend the field of Data Science impact and important to society • Reflect on its applications, important and advantage
  • 3. Contents ▪ Getting started with Data Science ▪ Components of Data Science ▪ Understanding concept of Data Science ▪ Application of Data Science
  • 4. Data All Around Lots of data is collected and warehoused • Web data • E-Commerce • Financial transactions • Bank/Credit transaction • Online trading and purchasing • Social Network
  • 5. Getting Started with Data Science ▪ Data Science is the study of data to extract meaningful insights for business. ▪ It all combines math and statistics, specialized programming, advance analytics, artificial intelligence(AI) and machine learning with specific subject matter. ▪ It is an area that manage, manipulate, extracts and interprets knowledge from tremendous amount of data ▪ Data science principles apply to all big and small data
  • 6. Data Science Real Life Applications Fraud detection ▪ Investigate fraud pattern in past data ▪ Early detection is important ▪ Precision is important ▪ Real-time analytics Recommender System ▪ The ability to offer unique personalize service Netflix recommender system valued at $1B per year Amazon recommender system drives a 20-30% lift in sales annually
  • 7. Understanding the concept of Data Science Statistical Analysis  Statistics provide the information to educate how things work  It is a science of learning from data and of measuring, controlling and communicating uncertainty  It is used to design experiments, analyze data, and make informed decision  Data science involve the collection, organization, analysis and visualization of large amounts of data  Meanwhile, using mathematical model to quantify relationships between variable and outcomes and make predictions based on those relationship Microsoft Edge PDF Document
  • 8. Big Data Analytics Big data is any data that is expensive to manage and hard to extract value from • Volume The size of the data • Velocity The latency of data processing relative to growing demand • Variety and Complexity The diversity of sources, format, quality, structure
  • 9. Data Mining ▪ Subset of Data Science that involves analyzing large data sets to find patterns and other useful information ▪ It is process of extracting knowledge or insight from large amounts of data using various statistical and computational techniques. ▪ The data can be structured, semi-structured or unstructured and can be stored in various forms such as database and data warehouses. ▪ The primary goal of data mining is to discover hidden patterns and relationships in data that can be used to make informed decisions or predictions.
  • 10. Visualization ▪ Data visualization is the process of generating graphical representations like graphs, charts or maps to represent data of information. ▪ Its benefits include communicating your result or findings and monitoring the model's performance.
  • 11. Artificial Intelligence ▪ Artificial Intelligence works by simulating human intelligence through the use of algorithms, data and computational power. ▪ AI systems often utilize machine learning algorithms commonly use in data science for prediction modeling and pattern recognition.
  • 12. Machine Learning Machine Learning (ML) is a branch of Artificial Intelligence(AI) that use algorithms to automate data analysis, build model that enable computer to learn from data, identify pattern and make decision in minimal human presence. Learning Computer can learn from data sets to perform tasks like analyzing data, categorizing images or predicting image or price fluctuations. Identifying Pattern Computer can identify pattern in large amount of data collection Making Decision Computer can make data-driven recommendations and decision based on input data
  • 13. Model Deployment ▪ Model Deployment in data science is the process of integrating a machine learning model into a productive environment so that it take input and produce output.
  • 14. Data Governance and Ethics ▪ Data governance is everything you do to ensure data is secure, private , accurate, available and usable. ▪ It is rules, standards and processes defining how data is handled within the company.