SlideShare a Scribd company logo
Big Data and
Business
Intelligence
Dinesh Kumar P
Mallieswari D
“Without data you are just another
person with opinion”
- W. Edwards Deming.
https://tinyurl.com/svnvyyr
Register yourself
Pre-requisites
for workshop
Visual Studio 2017 15.9.12
or later - For ML.Net Auto
ML with Model Builder
Bold BI
Setup to Bold BI – www.boldbi.com
Create your own tenant.
• Option -1 Free Single User plan.
• Option -2 Choose any of the paid
plans. (Free trail – 15 days).
Just work for 1 day in my tenant.
• I will add you as a user via your
email.
• Will be available only for today.
For what basic skill,
different people get
different salaries?
$10,000
$5000
$1000
$500
$50
Decision Making
Big Data Hype
• Companies were selling Big Data solutions as a
silver bullet solution to all the enterprises’
problems.
• For a use case using Hive+Oozie didn’t provided
expected result. Butworked better with
MySQL+Cron.
• There was no ideal approach like below;
• Client - “I want to implement a Big Data solution”
• Consultant- “That could be the solution, but let us
discuss the problem first”.
Kafka and
Spark
• Streaming platform’s necessity.
• Hadoop made fast.
Big Data, Machine Learning till Business Intelligence
Big Data, Machine Learning till Business Intelligence
Big Data, Machine Learning till Business Intelligence
Cloud Burst
Allowed to be conservative in
investments.
Crawl before you walk and walk
before you run.
Elasticity both for Storage and
Processing.
Cloud services
Storage – HDFS –> Azure Data Lake...
Processing – Map Reduce –> Azure
Kubernetes cluster, Azure Synapse
Analytics, Azure Databricks...
Resources elasticity -> Create as you go via
Automation scripts (Powershell, Ansible).
That too Synapse take care of resource
computation by itself “server as service”.
Review the
customer
feedbacks
with 3 rating
out of 5
Requirement
• Build a cost-effective data pipeline system that
analyzes customer feedbacks provided in a
shopping website.
• Prepare a Machine Learning model to know the
sentiment of comments mentioned in
feedbacks.
• Also prepare a visualization that helps stake
holders for decision-making.
Big Data, Machine Learning till Business Intelligence
Big Data, Machine Learning till Business Intelligence
Azure Synapse
Analytics
• Simplifies your data lake and data warehousing
solutions – Datalakehouse.
• Reduces project development time for machine
learning, BI, and AI.
Big Data, Machine Learning till Business Intelligence
Sentiment
Analysis
• What drive’s business? – Its “customer
satisfaction”.
• Online Surveys, Social media posts-tweets-
comments, Support requests,…
• Sentiment Analysis - the quickest way for
decision makers to ask a team to drill down
further about feedbacks based on the ratio they
get for more data or over a period of time
(more bad / more good).
• Almost in any domain – Healthcare, Supply
Chain apart from just Retail.
Importance of
Data Analytics
and BI
• Analytics everywhere on few clicks - Excel – Ideas,
Outlook – Insights, PowerPoint – Design Ideas, Word – CV
Assistant, Read aloud, Spelling & Grammar.
PowerPoint Outlook Excel
Borrowed from - https://www.youtube.com/watch?v=4QoCbhsQeyE
What is
Machine
Learning ?
Looking for patterns in Data
Generate Code
Recognize those pattern in new
Data
Enter Text
What Machine Learning does ?
➢Predicting future
➢Organizing or Grouping
Enter Text
Features vs Labels
Features
Input
Labels
Output
Name, Amount, Where Issued,
Age of Cardholder
Genuine/Fraud
Enter Text
Types of algorithms
Enter Text
Five Questions
1. Is this A or
B ?
2. Is this
Weird ?
3. How
much or
many ?
4. How this
is organized
?
5. What
should I do
next ?
Enter Text
Role of Data Scientist
➢Which raw data to use?
➢How should that data be processed to
create prepared data?
➢Identify Combinations of prepared data
and machine learning algorithms should
you use to create the best model?
ML.Net
Open source cross-platform
machine learning framework.
Custom ML made like a play
with AutoML – GUI.
Not required much expertise in
ML.
ML.Net - cmd tool
https://visualstudiomagazine.com/articles/2019/09/30/automl-mlnet.aspx
Interpreting results
Bold BI
Business Intelligence tool.
80+ connectors. (SQL, NoSQL, File, Web)
Easy drag and drop designer.
Dashboards renderable in Mobile, TV, Large screen
displays.
Available as SaaS, On-premise and Embedded forms.
Drilled down maps, Forecast feature.
Prototype using
Azure Cognitive
Dashboard for
insights
Create own
model
Standardize the
pipeline
https://tinyurl.com/rbzna6f
Workshop walkthrough document
Big Data, Machine Learning till Business Intelligence
Resources
Free e-Book
http://bit.ly/a4r-mlbook
https://www.boldbi.com/whitepapers/key-
performance-indicators
Enter Text
Books & Movies
➢Cartoon Introduction to Statistics
➢Thinking Fast & Slow by Daniel Kaheman
➢Art of thinking clearly – Rolf Dobelli
➢I-Robot
➢Terminator 2

More Related Content

PDF
NUS-ISS Learning Day 2019-Leading digital product team with business team model
PDF
IBM Watson - Cognitive Robots
PPTX
3P Consulting – Nearshoring Intro Presentation
PDF
Artificial Intelligence - Building Teams & Products
PDF
Ibm watson for retail 2017
PDF
Be a great product leader by Adam Nash, VP Product, Dropbox
PPTX
“How to Develop a Content Strategy that Works”
PPTX
Diversification, Discovery, and Data: 13 Insights from 13 Years of Safari, pr...
NUS-ISS Learning Day 2019-Leading digital product team with business team model
IBM Watson - Cognitive Robots
3P Consulting – Nearshoring Intro Presentation
Artificial Intelligence - Building Teams & Products
Ibm watson for retail 2017
Be a great product leader by Adam Nash, VP Product, Dropbox
“How to Develop a Content Strategy that Works”
Diversification, Discovery, and Data: 13 Insights from 13 Years of Safari, pr...

What's hot (19)

PDF
GTD using Microsoft Outlook 2016
PDF
NUS-ISS Learning Day 2019-Understanding business context to drive analytics
PDF
Product Ideation and Customer Development
PDF
Get going - Boston
PPTX
The Home Depot by Caleb Roberts
PDF
Managing Machines: The New AI Dev Stack
PPTX
Bridging the AI Gap: Building Stakeholder Support
PPTX
Data Science at LinkedIn - Data-Driven Products & Insights
PPT
Shorten Your Sales Cycle In 7 Words Or Less
PDF
11 Things We've Learned Accelerating Startups
PPT
10 Questions About Information Architecture
PDF
Building a Data Driven Company
KEY
Just Push the Button
PPT
Conducting a Market Study & Developing the Business model- delivered at IIT R...
PPSX
Snowforce how to drive rapid business value with ai
PDF
Product Management for AI
PDF
Growth engine istart-june-2016
PDF
iLaunch: Get going
PDF
Daring To Dream Big
GTD using Microsoft Outlook 2016
NUS-ISS Learning Day 2019-Understanding business context to drive analytics
Product Ideation and Customer Development
Get going - Boston
The Home Depot by Caleb Roberts
Managing Machines: The New AI Dev Stack
Bridging the AI Gap: Building Stakeholder Support
Data Science at LinkedIn - Data-Driven Products & Insights
Shorten Your Sales Cycle In 7 Words Or Less
11 Things We've Learned Accelerating Startups
10 Questions About Information Architecture
Building a Data Driven Company
Just Push the Button
Conducting a Market Study & Developing the Business model- delivered at IIT R...
Snowforce how to drive rapid business value with ai
Product Management for AI
Growth engine istart-june-2016
iLaunch: Get going
Daring To Dream Big
Ad

Similar to Big Data, Machine Learning till Business Intelligence (20)

PDF
Website Personalization 101
PDF
Accelerate business transformation with AI Builder
PPT
Big Data and Social CRM
PPTX
Mobile Content Design
PDF
All you must know about Power BI!.
PDF
All you must know about Power BI!
PPTX
SalesStash Berkeley 2016
PDF
Why Your Dashboard Sucks: Applications of Design Thinking in Enterprise Busin...
PDF
Harnessing the power of content marketing final
PDF
Predicitve analytics for marketing 05 21-2014 Shree Dandekar
PDF
Мастер-класс Стартап Академии Сколково 16/02 на Get2Know
PPTX
Lean startup, customer development, and the business model canvas
PPTX
SFIMA SEO Agency Management Presentation May 2013
PPTX
Lean Analytics & Analytics Dashboards
PPTX
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
PDF
InfoVision_PM101_RPadaki
PDF
Data and data scientists are not equal to money david hoyle
PPTX
Build it…will they come by Shawn Trainer
PPTX
DEEP - Developing a Digital Buisness
PPTX
Insider's Guide to Marketing & Selling with WordPress [#WCPHX]
Website Personalization 101
Accelerate business transformation with AI Builder
Big Data and Social CRM
Mobile Content Design
All you must know about Power BI!.
All you must know about Power BI!
SalesStash Berkeley 2016
Why Your Dashboard Sucks: Applications of Design Thinking in Enterprise Busin...
Harnessing the power of content marketing final
Predicitve analytics for marketing 05 21-2014 Shree Dandekar
Мастер-класс Стартап Академии Сколково 16/02 на Get2Know
Lean startup, customer development, and the business model canvas
SFIMA SEO Agency Management Presentation May 2013
Lean Analytics & Analytics Dashboards
Bob Selfridge - Identify, Collect, and Act Upon Customer Interactions; Rinse,...
InfoVision_PM101_RPadaki
Data and data scientists are not equal to money david hoyle
Build it…will they come by Shawn Trainer
DEEP - Developing a Digital Buisness
Insider's Guide to Marketing & Selling with WordPress [#WCPHX]
Ad

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
KodekX | Application Modernization Development
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Encapsulation theory and applications.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPT
Teaching material agriculture food technology
PDF
Unlocking AI with Model Context Protocol (MCP)
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Approach and Philosophy of On baking technology
cuic standard and advanced reporting.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Chapter 3 Spatial Domain Image Processing.pdf
NewMind AI Monthly Chronicles - July 2025
Network Security Unit 5.pdf for BCA BBA.
20250228 LYD VKU AI Blended-Learning.pptx
KodekX | Application Modernization Development
MYSQL Presentation for SQL database connectivity
Encapsulation theory and applications.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Diabetes mellitus diagnosis method based random forest with bat algorithm
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Teaching material agriculture food technology
Unlocking AI with Model Context Protocol (MCP)
“AI and Expert System Decision Support & Business Intelligence Systems”
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Approach and Philosophy of On baking technology

Big Data, Machine Learning till Business Intelligence

Editor's Notes

  • #3: Please register yourself. In the meantime, can we get introduced? Just let us know NAME, WHERE ARE WE FROM and importantly WHAT YOU EXPECT OUT OF TODAY’s EVENT.
  • #4: (or) Visual Studio or VS Code for Sentiment Analysis (or) ML.Net CLI
  • #6: Let me ask a simple question. For what basic skill, different people get different salary? Perhaps that is the same basic thing which all the ML or AI projects are solving and make our life easier.
  • #7: Its DECISION MAKING. And that’s what AI is doing for Humans right? I can say many real time examples for this. For now, lets take - the Google Maps. Before 5 years, if you are riding to a destination for which you don’t know the route, you will constantly ask the persons who are on your path way, think over the route and go on. But now, Google Maps in our mobile phone, decides and tells this is the right route. And we are travelling just by looking into that Map path. Notice that here decision making work is partially taken over by the software comparing to the same situation before 5 years. Software are no more just software. As engineers, we are making them as intelligent agents and make them evolve as more human in nature. Simply say, future machines with software will slowly rise to the conscious level of humans even!
  • #8: My opinion - Ecosystem projects are not growing to be compatible with Hadoop. E.g. Pig, Sqoop, Oozie,…
  • #12: https://trends.google.com/trends/explore?date=2012-01-14%202020-02-14&q=Apache%20Hadoop,Apache%20Spark,Apache%20Kafka
  • #15: Details - An online shopping site store their feedback data in Azure SQL Data Warehouse. The data, in short, is about the rating(1 to 5) and comments by the buyer for each delivered order. The management would like to extract insights about this feedbacks and improve its business. Analysis of bad feedbacks with 1,2 rating and good feedbacks that are with 4,5 rating is straight forward. Feedbacks with 3-rating expresses the biased feeling of customers with both good and bad comments. So apart from the overall rating, the comments in 3-rating should be segregated as good and bad comments using Machine Learning. Finally need to prepare a business dashboard to showcase the feedbacks for the sake of decision making.
  • #19: Synapse Analytics combines Big Data and Relational Data to combine as one.
  • #22: AI is everywhere. As you see in the slide, ‘Design Ideas’ in PowerPoint suggests you good designs and icon sets for the content, in a fraction of second than a human can think. ‘Insights’ in outlook lets you know, if anything you have promised is getting missed out without addressing it. ‘Ideas’ in Excel, automatically decides the suitable chart based on the data you have and populates it right inside of the Excel app automatically.
  • #23: Nowadays, a bank itself is software and takes the decision. Same case earlier humans took decisions using that software. A perfect example would be of sanctioning a personal loan. Things like the CIBIL score, a person’s authenticity available as up-to-date data has provided the power to decide to sanction a loan to machines themselves. Earlier, 1 or more bank employees should do the entire work of analyzing all these. Finally I would like to end with this quote from Confluent.IO. “Every Company is Becoming software, which were previously just Using the software.”
  • #27: Error from sensitivity to small fluctuations(Including anomaly) . Overfitting
  • #29: data scientist is a specialist in solving problems like the ones that arise in machine learning. People in this role typically have a wide range of skills. They’re comfortable working with complex machine learning algorithms, for example, and they also have a strong sense of which of these algorithms are likely to work best in different situations. A data scientist might also need to have software development skills, as they’re often called upon to write code.
  • #32: Dataset of 200 items had 180 low-satisfaction items, 10 medium-satisfaction items and 10 high-satisfaction items. A model could just predict low-satisfaction for all items and score 180 / 200 = 0.9000 accuracy for the MicroAccuracy metric. But the MacroAccurcy would be (0.9000 + 0.0500 + 0.0500) / 3 = 0.3333.
  • #38: Good book -- and free! Another recommended book on Azure Machine is Learning is "Predictive Analytics with Microsoft Azure Machine Learning " (https://www.amazon.com/Predictive-Analytics-Microsoft-Machine-Learning/dp/1484212010).
  • #40: I-Robot Terminator 2