Mtech First_Year Data Analytics in Industry with power bI
Syllabus
Unit 1: Introduction to Data Analytics
• Introduction to Business Intelligence
• History on Methodologies for Data Analytics
• Types of Data Analytics
• Big Data and Data Science
• State of the Practice in Analytics
• Characteristics of Data Analysis
• Applications of Data Analysis
• Difference between Data Analytics, Data Analysis, Data Mining, Data Science, Machine Learning, and Big Data
Definition of Business Intelligence (BI):
• Business Intelligence (BI) refers to the technology-driven process for analyzing data and presenting actionable information to help executives, managers, and other corporate end users make informed business decisions.
• BI encompasses a variety of tools, applications, and methodologies that enable organizations to collect data from internal and external sources, prepare it for analysis, develop and run queries against the data, and create reports, dashboards, and data visualizations.
Key Concepts of BI:
• Data Warehousing: Centralized storage of large amounts of data collected from various sources for easy access and analysis.
• Data Visualization: The graphical representation of data to help users quickly understand trends, patterns, and insights.
• Reporting: The process of organizing data into summaries and distributing them to decision-makers for use in strategy formulation.
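As a rough illustration of the reporting concept, the sketch below groups a few records by region and renders a short summary. The sales records, field names, and the functions `summarize_by_region` and `format_report` are all hypothetical; a real BI tool would read from a data warehouse rather than an in-memory list.

```python
from collections import defaultdict

# Hypothetical sales records (made up for illustration only).
SALES = [
    {"region": "North", "amount": 1200.0},
    {"region": "South", "amount": 800.0},
    {"region": "North", "amount": 300.0},
    {"region": "East", "amount": 500.0},
]


def summarize_by_region(records):
    """Return total sales per region as a plain dict."""
    totals = defaultdict(float)
    for row in records:
        totals[row["region"]] += row["amount"]
    return dict(totals)


def format_report(totals):
    """Render the totals as report lines, largest region first."""
    ranked = sorted(totals.items(), key=lambda item: item[1], reverse=True)
    return [f"{region}: {amount:.2f}" for region, amount in ranked]


if __name__ == "__main__":
    for line in format_report(summarize_by_region(SALES)):
        print(line)
```

The same grouping step is what a dashboard tile or a scheduled report performs before the result is charted or distributed.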
Importance of BI in Organizations:
• Improved Decision Making:
1. BI tools provide real-time data, allowing organizations to make more informed decisions based on evidence rather than intuition or guesswork.
2. It offers insights into customer behavior, market trends, and internal performance metrics.
• Operational Efficiency:
1. BI helps to streamline operations by identifying inefficiencies, tracking performance, and monitoring progress in real-time.
2. Automated reporting and alerts can help businesses save time and reduce manual effort.
• Competitive Advantage:
1. By using BI to analyze market trends, competitors, and customer needs, organizations can identify opportunities to differentiate themselves from their competitors.
2. Predictive analytics through BI can help anticipate changes and position the organization strategically.
• Cost Management:
1. BI enables organizations to track spending, evaluate return on investment (ROI), and identify areas where they can cut costs or improve margins.
• Customer Insights and Personalization:
1. BI tools help in segmenting customers based on behavior, preferences, and demographics, which leads to more personalized services.
2. This can enhance customer satisfaction and retention rates.
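Two of the points above, evaluating ROI and segmenting customers, can be sketched in a few lines. The slides do not give a formula, so this assumes the common definition of ROI as net gain divided by cost. The function names and the spend thresholds are hypothetical and chosen only for illustration.

```python
def roi(gain, cost):
    """Return on investment as a fraction: (gain - cost) / cost."""
    if cost <= 0:
        raise ValueError("cost must be positive")
    return (gain - cost) / cost


def segment_customer(annual_spend, high=1000.0, low=200.0):
    """Assign a customer to a spend-based segment.

    The thresholds are illustrative assumptions, not values from the slides.
    """
    if annual_spend >= high:
        return "high-value"
    if annual_spend >= low:
        return "regular"
    return "occasional"


if __name__ == "__main__":
    print(f"ROI: {roi(gain=1500.0, cost=1000.0):.0%}")
    for spend in (1200.0, 450.0, 50.0):
        print(spend, segment_customer(spend))
```

Rule-based segments like these are the simplest case; BI tools typically let analysts define such rules interactively instead of in code.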
Examples of BI Tools:
• Power BI (Microsoft)
• Tableau
• Looker (Google Cloud)
History on Methodologies for Data Analytics
Date/Year: Milestone
1950s-1960s: Emergence of Data Processing
Mainframe computers are used to process large volumes of data (e.g., the US Census Bureau's automation of census tabulation).
1970s-1980s: Rise of Business Intelligence (BI)
Development of decision support systems (DSS) and relational database management systems (RDBMS). E. F. Codd of IBM proposes the relational model in 1970, and IBM's System R prototype follows.
1990s: Era of Data Warehousing and OLAP
Widespread adoption of data warehouses and Online Analytical Processing (OLAP). Ralph Kimball's "The Data Warehouse Toolkit" is published in 1996.
2000s: Big Data and Advanced Analytics
Advent of big data technologies. Google's 2004 MapReduce paper leads to frameworks like Apache Hadoop for distributed data processing.
2010s: Democratization of Data Analytics
Cloud computing, open-source tools, and self-service analytics platforms (e.g., Tableau) make data analytics more accessible.
2020s: Integration of AI and Machine Learning
AI-driven analytics, automation, and innovations such as GPT-3 expand possibilities for text analytics and predictive modeling.
What is Big Data?
Big Data refers to the vast volumes of data generated at high velocity from a variety of sources. It is characterized by the three V's: Volume, Velocity, and Variety.
1. Volume: Big Data involves datasets that are too large or complex for traditional data processing tools to handle. These datasets can range from terabytes to petabytes of information.
2. Velocity: Big Data is generated in real time or near real time, requiring fast processing to extract meaningful insights.
3. Variety: The data comes in multiple forms, including structured data (like database tables), semi-structured data (like XML files), and unstructured data (like text, images, and videos).
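The Variety point can be made concrete with a small sketch that reads one sample of each kind of data using only the Python standard library. The sample strings, field names, and the three `read_*` functions are hypothetical; they only show how the amount of built-in structure changes the parsing step.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical samples of each data form (made up for illustration).
CSV_TEXT = "id,amount\n1,250\n2,400\n"
XML_TEXT = "<order id='3'><amount>150</amount></order>"
REVIEW_TEXT = "Fast delivery, great price"


def read_structured(text):
    """Parse CSV rows into dicts; every row shares the same columns."""
    return list(csv.DictReader(io.StringIO(text)))


def read_semi_structured(text):
    """Parse one XML order; tags label the fields, but layout may vary."""
    root = ET.fromstring(text)
    return {"id": root.get("id"), "amount": root.findtext("amount")}


def read_unstructured(text):
    """Free text has no schema; splitting into lowercase words is a first step."""
    return [word.strip(".,!?").lower() for word in text.split()]


if __name__ == "__main__":
    print(read_structured(CSV_TEXT))
    print(read_semi_structured(XML_TEXT))
    print(read_unstructured(REVIEW_TEXT))
```

Note that all parsed values come back as strings; converting them to numbers or dates is part of the data preparation that follows.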
What is Data Science?
Data Science is an interdisciplinary field that uses scientific methods, algorithms, and systems to extract knowledge and insights from structured and unstructured data. It draws on techniques from statistics, machine learning, data mining, and big data analytics.
Data Scientists use their expertise to:
1. Analyze: They examine complex datasets to identify patterns, trends, and correlations.
2. Model: Using statistical models and machine learning algorithms, they create predictive models that can forecast future trends or behaviors.
3. Interpret: They translate data findings into actionable business strategies and decisions.
Data Science involves a broad skill set, including proficiency in programming languages like Python and R, knowledge of databases, and expertise in machine learning frameworks.
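The Analyze, Model, and Interpret steps can be sketched with a least-squares straight-line fit, one of the simplest predictive models. The monthly sales figures and the function names `fit_line` and `forecast` are hypothetical, and a real project would more likely use a statistics or machine learning library than hand-written formulas.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def forecast(slope, intercept, x):
    """Predict y for a new x using the fitted line."""
    return slope * x + intercept


if __name__ == "__main__":
    # Hypothetical monthly sales (made up for illustration).
    months = [1, 2, 3, 4]
    sales = [10.0, 12.0, 14.0, 16.0]

    slope, intercept = fit_line(months, sales)          # Model
    next_month = forecast(slope, intercept, 5)          # Forecast
    print(f"Sales grow by about {slope:.1f} per month")  # Interpret
    print(f"Forecast for month 5: {next_month:.1f}")
```

The slope is the part a business reader cares about: it turns the fitted model into a statement about how fast sales are changing.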
Data Analysis vs Data Analytics
Columns: Term, Definition, Key Techniques/Tools, Focus/Objective, Example of Usage
Term: Data Analytics
Definition: The process of inspecting, cleaning, transforming, and modeling data to discover useful information for decision-making.
Key Techniques/Tools: Descriptive statistics, data visualization, BI tools (Tableau, Power BI)
Focus/Objective: Provide actionable business insights and support decision-making.
Example of Usage: Analyzing sales data to identify market trends.
Term: Data Analysis
Definition: The process of systematically applying statistical and logical techniques to describe and evaluate data.
Key Techniques/Tools: Excel, SQL, R, Python (Pandas, NumPy), statistical tests
Focus/Objective: Understand specific questions and extract relevant information from raw data.
Example of Usage: Finding correlations between different sales factors.
Term: Data Mining
Definition: The process of discovering patterns, correlations, and anomalies in large datasets, using techniques at the intersection of machine learning, statistics, and database systems.
Key Techniques/Tools: Algorithms (Apriori, Decision Trees, K-Means), Python, R
Focus/Objective: Extract patterns and knowledge from large datasets to predict future trends.
Example of Usage: Analyzing customer purchase behavior for segmentation.
Term: Data Science
Definition: A multidisciplinary field that uses scientific methods, processes, and systems to extract insights from structured and unstructured data.
Key Techniques/Tools: Machine Learning (scikit-learn, TensorFlow), Data Engineering (SQL, Hadoop), Statistics
Focus/Objective: Derive insights, create predictive models, and guide strategic decisions using data.
Example of Usage: Building a recommendation engine for an e-commerce platform.
Term: Machine Learning
Definition: A subset of AI, involving systems that learn from data and improve performance over time without being explicitly programmed for every task.
Key Techniques/Tools: Supervised Learning (Regression, Classification), Unsupervised Learning (Clustering, PCA), Deep Learning
Focus/Objective: Automate decision-making, build predictive models, and enable AI.
Example of Usage: Predicting loan defaults based on customer profiles.
Term: Big Data
Definition: Refers to extremely large and complex datasets that cannot be processed with traditional data-processing techniques.
Key Techniques/Tools: Hadoop, Spark, NoSQL databases, distributed computing
Focus/Objective: Manage, process, and analyze massive amounts of data efficiently.
Example of Usage: Analyzing social media data for sentiment analysis.
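The Data Analysis usage example in the comparison, finding correlations between sales factors, can be sketched with the Pearson correlation coefficient. The advertising and sales figures and the function name `pearson` are hypothetical. The coefficient ranges from -1 to 1, and a value near either end indicates a strong linear relationship, not a causal one.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two series of equal length, at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical weekly figures (made up for illustration).
    ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
    units_sold = [12.0, 15.0, 19.0, 24.0, 26.0]
    discount = [5.0, 4.0, 4.0, 2.0, 1.0]

    print(f"ad spend vs units sold: {pearson(ad_spend, units_sold):.2f}")
    print(f"discount vs units sold: {pearson(discount, units_sold):.2f}")
```

In practice the same calculation is usually done with a library call across many columns at once; the hand-written version only shows what is being computed.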
