SlideShare a Scribd company logo
CCW331-BUSINESS ANALYTICS
1
Data Warehouses and Data Mart
 Business intelligence (BI) is all about turning an organization’s
data into insights that can be used to inform business decisions.
 BI analysts will use BI tools, software or services to access and
analyze datasets and translate their findings into reports,
summaries, dashboards, graphs, charts or maps.
 In recent years, the advent of modern data visualization and
reporting tools has transformed the discipline, empowering
businesses to use big data insights to identify, develop and create
new business opportunities.
2
Business Intelligence
3
 “a set of methodologies, processes, architectures and technologies
that transform raw data into meaningful and useful information”.
 This can then be “used to enable more effective strategic, tactical
and operational insights and decision-making”.
 This definition acknowledges that data cannot be effectively
analyzed or used to generate meaningful insights if it is poor quality.
 BI should not be confused with‘business analytics’.
 Business intelligence is descriptive and uses metrics to generate
clear snapshots of business performance.
 Meanwhile, business analytics is predictive, and describes what
organizations should do in future to generate better outcomes.
Data Warehouse
4
 A DataWarehouse (DW) is a relational database that is designed
for query and analysis rather than transaction processing.
 It includes historical data derived from transaction data from
single and multiple sources.
 A Data Warehouse provides integrated, enterprise-wide,
historical data and focuses on providing support for decision-
makers for data modeling and analysis.
 A Data Warehouse is a group of data specific to the entire
organization, not only to a particular group of users.
 It is not used for daily operations and transaction processing but
used for making decisions.
5
 A Data Warehouse can be viewed as a data system with the
following attributes:
 It is a database designed for investigative tasks, using data
from various applications.
 It supports a relatively small number of clients with relatively
long interactions.
6
 It includes current and historical data to provide a historical
perspective of information.
 Its usage is read-intensive.
 It contains a few large tables.
 "Data Warehouse is a subject-oriented, integrated, and time-
variant store of information in support of management's
decisions."
7
Subject-Oriented
8
 A data warehouse target on the modeling and analysis of data
for decision-makers.
 Therefore, data warehouses typically provide a concise and
straightforward view around a particular subject, such as
customer, product, or sales, instead of the global
organization's ongoing operations.
 This is done by excluding data that are not useful concerning
the subject and including all data needed by the users to
understand the subject.
Integrated
9
 A data warehouse integrates various heterogeneous data
sources like RDBMS, flat files, and online transaction
records.
 It requires performing data cleaning and integration during
data warehousing to ensure consistency in naming
conventions, attributes types, etc., among different data
sources.
10
Time-Variant
11
 Historical information is kept in a data warehouse. For
example, one can retrieve files from 3 months, 6 months, 12
months, or even previous data from a data warehouse.
 These variations with a transactions system, where often only
the most current file is kept.
Non-Volatile
12
 The data warehouse is a physically separate data storage, which is
transformed from the source operational RDBMS.
 The operational updates of data do not occur in the data warehouse,
i.e., update, insert, and delete operations are not performed.
 It usually requires only two procedures in data accessing: Initial
loading of data and access to data.
 Therefore, the DW does not require transaction processing,
recovery, and concurrency capabilities, which allows for substantial
speedup of data retrieval.
 Non-Volatile defines that once entered into the warehouse, and data
should not change.
13
Goals of Data Warehousing
14
 To help reporting as well as analysis
 Maintain the organization's historical information
 Be the foundation for decision making.
15
Data Mart
16
 A Data Mart is a subset of a directorial information store,
generally oriented to a specific purpose or primary data
subject which may be distributed to provide business needs.
 Data Marts are analytical record stores designed to focus on
particular business functions for a specific community within
an organization.
 Data marts are derived from subsets of data in a data
warehouse, though in the bottom- up data warehouse design
methodology, the data warehouse is created from the union
of organizational data marts.
17
 The fundamental use of a data mart is Business
Intelligence (BI) applications.
 BI is used to gather, store, access, and analyze record.
 It can be used by smaller businesses to utilize the data they
have accumulated since it is less expensive than implementing
a data warehouse.
Reasons for creating a data mart
18
 Creates collective data by a group of users
 Easy access to frequently needed data
 Ease of creation
 Improves end-user response time
 Lower cost than implementing a complete data warehouses
 Potential clients are more clearly defined than in a
comprehensive data warehouse
 It contains only essential business data and is less cluttered.
Types of Data Marts
19
 There are mainly two approaches to designing data marts.These
approaches are
 Dependent Data Marts
 Independent Data Marts
20
21
 A dependent data mart is a logical subset of a physical subset of
a higher data warehouse.
 According to this technique, the data marts are treated as the
subsets of a data warehouse.
 In this technique, firstly a data warehouse is created from which
further various data marts can be created.
 These data mart are dependent on the data warehouse and
extract the essential record from it.
 In this technique, as the data warehouse creates the data mart;
therefore, there is no need for data mart integration. It is also
known as a top-down approach.
Independent Data Marts
22
 The second approach is Independent data marts (IDM) Here,
firstly independent data marts are created, and then a data
warehouse is designed using these independent multiple data
marts.
 In this approach, as all the data marts are designed
independently; therefore, the integration of data marts is
required.
 It is also termed as a bottom-up approach as the data marts
are integrated to develop a data warehouse.
 Other than these two categories, one more type exists that is
called "Hybrid Data Marts."
23
Hybrid Data Marts
24
 It allows us to combine input from sources other than a data
warehouse.
 This could be helpful for many situations; especially when
Adhoc integrations are needed, such as after a new group or
product is added to the organizations.
Steps in Implementing a Data Mart
25
 The significant steps in implementing a data mart are to
design the schema, construct the physical storage, populate
the data mart with data from source systems, access it to
make informed decisions and manage it over time.
Designing
26
 The design step is the first in the data mart process.This
phase covers all of the functions from initiating the
request for a data mart through gathering data about the
requirements and developing the logical and physical
design of the data mart.
 It involves the following tasks:
 Gathering the business and technical requirements
 Identifying data sources
 Selecting the appropriate subset of data
 Designing the logical and physical architecture of the data mart.
Constructing
27
 This step contains creating the physical database and
logical structures associated with the data mart to
provide fast and efficient access to the data.
 It involves the following tasks:
 Creating the physical database and logical structures such as
table spaces associated with the data mart.
 Creating the schema objects such as tables and indexes describe
in the design step.
 Determining how best to set up the tables and access
structures.
Populating
28
 This step includes all of the tasks related to the getting data from
the source, cleaning it up, modifying it to the right format and
level of detail, and moving it into the data mart.
 It involves the following tasks:
 Mapping data sources to target data sources
 Extracting data
 Cleansing and transforming the information.
 Loading data into the data mart
 Creating and storing metadata
 This step involves putting the data to use: querying the data,
analyzing it, creating reports, charts and graphs and
publishing them.
29
 It involves the following tasks:
 Set up and intermediate layer (Meta Layer) for the front-
end tool to use.This layer translates database operations
and objects names into business conditions so that the
end-clients can interact with the data mart using words
which relates to the business functions.
 Set up and manage database architectures like summarized
tables which help queries agree through the front-end tools
execute rapidly and efficiently.
30
 Managing
 This step contains managing the data mart over its lifetime. In
this step, management functions are performed as:
 Providing secure access to the data.
 Managing the growth of the data.
 Optimizing the system for better performance.
 Ensuring the availability of data even with system failures.
31
32
Knowledge Management
33
 The Connection Between Business Intelligence and
Knowledge Management
 Information simply has to be accessible – gathering, managing and
utilizing information is an inevitable part of running any modern
business.
 There are two information management technologies we
use: Business Intelligence (BI) and Knowledge
Management (KM).
 However, terms information, data and knowledge are often
used interchangeably, thus, apart from being confused about
each phrase, there is oftentimes much confusion around the
definitions of BI and KM.
34
 Business Intelligence (BI): Generally, BI is considered to be a set of
tools and techniques applied to gather data and transform it into
information that can be used in business analysis for the purposes of
business development.
 Every company gathers, collects, or to say more accurately, deals with
a large amounts of data, including various business documents,
emails, newspaper articles, web pages, reports, contracts, technical
journals and reviews, spreadsheets, graphs and charts and other relevant
sources of business data.
 BI technologies usually deal with large amounts of unstructured data via
the use of data warehousing and online analytical processing (OLAP).
35
 All these data needs to be organized and validated – prepared
for business analytics.
 Thus, “BI is about providing the right data at the
right time to the right people so that they can take
the right decisions.”
 Knowledge Management can be defined in many ways as
it spans many multi- disciplinary approaches –
content management, collaboration, the science of
organizational behavior, analyses like observation of trends
and appearance of anomalies, clustering, classification,
summarization, taxonomy building and so on.
36
 “Knowledge management is the process of
capturing, distributing, and effectively using
knowledge.”
 KM refers to a set of techniques used to capture, share, and
use the information available in order to achieve business
objectives and to aid in business decision making based on
business analytics.
 There has been immense growth in the domain of knowledge
management in the last decade and new applications and
solutions that empower knowledge sharing and knowledge
management have appeared.
37
 Knowledge is a mix of contextual information, experiences,
rules, and values.
 Richer, deeper, and more valuable.
 Consider knowing –
 What? - based upon assembling information and eventually
applying it.
 How? – applying knowledge leads to learning how to do
something.
 Why? – casual knowledge of why something occurs.
38
Tacit vs. Explicit Knowledge
39
 Tacit knowledge is personal, context-specific and hard to
formalize and communicate.
 A [knowledge] developed and internalized by the knower
over a long period of time . . . incorporates so much accrued
and embedded learning that its rules may be impossible to
separate from how an individual acts.
40
 Explicit knowledge can be easily collected, organized and
transferred through digital means.
 A theory of the world, conceived of as a set of all of the
conceptual entities describing classes of objects,
relationships, processes, and behavioral norms.
 Often referred to as knowing that’, or declarative
knowledge.
41
Why Manage Knowledge?
42
 Information and knowledge have become the fields
in which businesses compete.
 Several important factors include:
 Sharing Best Practice
 Globalization
 Rapid Change
 Downsizing
 Managing Information and Communication Overload –
Knowledge Embedded in Products
 Sustainable Competitive Advantage.
43
Sharing Best Practices
44
 Sharing best practices means leveraging the knowledge
gained by a subset of the organization.
 Increasingly important in organizations who depend on
applying their expertise such as accounting, consulting and
training firms.
 KM systems capture best practices to disseminate their
experience within the firm.
 Problems often arise from employees who may be reluctant
to share their knowledge (managers must encourage and
reward open sharing).
Globalization
45
 Historically three factors, land, labor and capital were the
key to economic success.
 Knowledge has become a fourth factor.
 Knowledge-based businesses can grow without traditional
land, labor, and capital requirements.
 Key competitive factor will be how well an organization
acquires and applies knowledge
Other factors
46
 Rapid change: firms must be nimble and adaptive to compete.
 Downsizing: sometimes the wrong people get fired when creating a
leaner organization.
 Managing Info. and Comm. Overload: data must be categorized
in some manner if it is to be useful rather than overwhelming.
 Knowledge Embedded in Products: the intangibles that add the
most value to goods and services are becoming increasingly
knowledge-based.
 Sustainable Competitive Advantage: KM is the way to do this.
Shorter innovation life cycles keep companies ahead of the
competition.
Categorizing knowledge
47
 Process knowledge – best practices, useful for increasing
efficiency.
 Factual knowledge – easy to document; basic information
about people/things.
 Catalog knowledge – know where things are; like directories
of expertise.
 Cultural knowledge – knowing how things get done
politically and culturally.
48
Process Knowledge
Definition:
Process knowledge refers to understanding the best practices,
workflows, and methodologies used to complete tasks
efficiently. It involves knowing how things should be done to
optimize productivity and quality.
Characteristics:
 It includes step-by-step procedures and workflows.
 Often developed through experience, training, and
observation.
 Can be standardized through documentation and
automation.
 Helps in continuous improvement and efficiency.
49
Examples:
 A software company follows an Agile methodology for
software development.
 Manufacturing industries use Six Sigma to improve quality
control.
 Hospitals have standard operating procedures (SOPs) for
patient care.
Importance:
 Reduces errors and inefficiencies in operations.
 Improves consistency and quality of outcomes.
 Enables new employees to learn processes quickly.
 Helps organizations adapt to changing environments through
refined workflows.
50
Factual Knowledge
Definition:
Factual knowledge consists of basic, objective, and easily verifiable information about
people, objects, events, and concepts. It is often recorded and shared in structured formats like
databases, manuals, or books.
Characteristics:
 It is static and does not change frequently.
 Can be easily documented and shared.
 Used for decision-making and reference.
Examples:
 A customer service representative needs customer details (name, address, contact information).
 Engineers refer to mathematical formulas and physical laws.
 An AI model needs a dataset containing facts about weather conditions to make predictions.
Importance:
 Forms the foundation for knowledge-based decision-making.
 Essential for training AI models and automation.
 Helps organizations store and retrieve critical information efficiently.
51
Catalog Knowledge
Definition:
Catalog knowledge refers to knowing where to find information, resources, or
expertise when needed. It acts as a directory that helps individuals or teams navigate
complex systems.
Characteristics:
 Helps in locating resources quickly.
 Often stored in knowledge management systems, directories, or databases.
 Used for networking and collaboration.
Examples:
 A university maintains a faculty directory with specializations and contact details.
 IT support teams have a knowledge base for troubleshooting guides.
 A hospital maintains a medical inventory catalog to track the availability of drugs and
equipment.
Importance:
 Reduces time spent on searching for information.
 Improves collaboration and communication.
 Enhances decision-making by connecting people with the right expertise.
52
Cultural Knowledge
Definition:
Cultural knowledge refers to understanding how things get done within an
organization, community, or society based on norms, traditions, and political
structures. It includes unwritten rules, beliefs, and power dynamics.
Characteristics:
 Often implicit and learned through experience.
 Can vary significantly between organizations and regions.
 Helps in navigating workplace relationships and decision-making.
Examples:
 In a multinational company, employees need to understand cultural differences in
communication styles.
 A new employee in a government agency learns about bureaucratic procedures
and how to get approvals efficiently.
 In academia, researchers need to understand funding priorities and grant
application processes.
Importance:
 Helps employees adapt to the workplace environment.
 Avoids conflicts due to cultural misunderstandings.
 Enhances leadership and teamwork by understanding organizational dynamics.
KM involves four main processes
53
 Generation – all activities that discover “new” knowledge.
 Capture – all continuous processes of scanning, organizing,
and packaging knowledge after it has been generated.
 Codification – the representation of knowledge in a manner
that can be easily accessed and transferred.
 Transfer – transmitting knowledge from one person or group
to another, and the absorption of that knowledge.
Knowledge Generation
54
 Concerns the intentional activities of an organization to
acquire/create new knowledge.
 Two primary ways are knowledge creation and knowledge
sharing.
 Methods include: – Research and Development – Adaptation
– Buy or Rent – Shared Problem Solving – Communities of
Practice.
55
 Knowledge capture takes into account the media to
be used in the codification process.
 The 3 main knowledge capture activities are:
 Scanning (gather “raw” information) – can be
electronic or human.
 Organizing (move it into an acceptable form) – must
be easy for all types of users to access.
 Designing knowledge maps (providing a
guide for navigating the knowledge base)
Designing Knowledge Maps
56
 A knowledge map serves as both a guide to where knowledge
exists in an organization and an inventory of the knowledge
assets available.
 A knowledge map can consist of nothing more than a list of
people, documents, and databases telling employees where to
go when they need help.
 Provides access to resources that would otherwise be difficult
or impossible to find.
57
Knowledge Codification
58
 Knowledge must be used or shared to be of value.
 Codification puts the knowledge into a form that makes it
easy to find and use.
 It is difficult to measure knowledge in discreet units (since it
changes over time).
 Knowledge has a shelf life.
Knowledge Transfer
59
 Knowledge Transfer describe four different modes of
knowledge conversion (transfer):
 Socialization: from tacit knowledge to tacit knowledge
 Externalization: from tacit knowledge to explicit knowledge
 Combination: from explicit knowledge to explicit knowledge
 Internalization: from explicit knowledge to tacit knowledge
The Knowledge Management Process
60
Similarities and Differences between BI and KM
61
 Confusion between these two technologies comes from the fact that they deal
with many similar processes.
 Both business intelligence and knowledge management capture, collect,
organize, analyze and aggregate data in order to find the best solutions
regarding business decision making processes.
 Business intelligence goes as far back as the 19th century and the beginnings of
entrepreneurship and it has been developing steadily over many years.
 BI enables organizations to integrate data across the enterprise, unlock the
information and empower knowledge worker to make better (and faster)
decisions – it focuses on explicit knowledge.
 However, KM deals with the creation of new knowledge and the dispersion of
existing knowledge throughout an organization – it encompasses both tacit and
explicit knowledge, thus, we can say that KM can influence the very nature of
business intelligence.
Types of Decisions
62
 Modern cloud-native organizations have constantly growing streams of
raw data flowing from every corner of the enterprise.
 Determining the impact this data has on business performance can be
an overwhelming task requiring teams of analysts.
 That’s where employing business intelligence (BI) can help.
 By presenting current and historical data within a business context, the
data insights supplied by BI tools enable organizations to make smarter,
more confident decisions that provide strategic direction for years to
come.
 Instead of relying on intuition and “gut feel,” companies can use BI to
find new ways to increase revenue, track performance, boost
operational efficiency, identify market trends, expose problems, and
much, much more.
Decision-making
63
 Simply put, decision-making is the process of deciding something, especially
with a group of people.
 From a business decision perspective, the aim is to achieve business objectives
to satisfy stakeholder requirements, needs, and expectations.
 For the decision to be effective, however, decision makers must forecast the
outcome of each option and determine which is best for a particular situation.
 That makes decision support systems (DSS) like decision intelligence and
business intelligence absolute essentials.
 Business intelligence refers to the technology tools and processes that enable
businesses to organize, analyse, and contextualize business data from around
the company.
 Business intelligence tools and decision-making transform raw data into
meaningful and actionable information.
The role of business intelligence
64
 Companies make big mistakes when they base business
decisions on what they think will happen instead of relying on
facts.
 Using BI and advanced analytics, organizations can extract
crucial facts from the mountain of data, transforming it into
information companies can act on to make informed strategic
decisions.
 The result: improved business processes, operational efficiency,
and business productivity.
Business intelligence decisions
65
 Business intelligence decisions typically fall into three
categories: strategic, tactical, and operational.
 An organization needs to gain a complete understanding of
these types of decisions in business intelligence to make
better-informed decisions that lead to increased customer
retention, stakeholder satisfaction, operational efficiency, and
revenue.
The relationship between business
intelligence and business analytics
66
 Business intelligence tells you what is currently happening
and what happened in the past to bring you to that state.
 On the other hand, business analytics is an umbrella term for
predictive data analysis techniques and prescriptive .
 Using business intelligence and analytics efficiently is the
difference between companies that succeed and those that fail
in the modern environment.
Three primary types of business
intelligence decisions
67
 Business intelligence supports the three types of decision-
making mentioned above:
 strategic
 tactical
 operational
Strategic decisions
68
 Strategic decisions comprise the highest level of organizational
business decisions and are usually less frequent and made by the
organization’s executives. Yet, their impact is enormous and far-
reaching.
 Some types of strategic decisions include selecting a particular
market to penetrate, a company to acquire, or whether to hire
additional staff.
 Decisions made at this level usually involve significant
expenditure.
 However, they are generally non-repetitive in nature and are taken
only after careful analysis and evaluation of many alternatives.
Tactical decisions
69
 Tactical decisions (or semi structured decisions) occur with
greater frequency (e.g., weekly or monthly) and fall into the
mid-management level. Often, they relate to the
implementation of strategic decisions.
 Examples of tactical decisions include product price changes,
work schedules,departmental reorganization, and similar
activities.
 The impact of these types of decisions is medium regarding
risk to the organization and impact on profitability.
Operational decisions
70
 Operational decisions (or structured decisions) usually
happen frequently (e.g., daily or hourly), relate to day-to-day
operations of the enterprise, and have a lesser impact on the
organization.
 Operational decisions determine the day-to-day profitability
of the business, how effectively it retains customers, or how
well it manages risk.
 Answering a sales inquiry, approving a quotation, or
calculating employee bonuses may be examples of this
decision type.
71
 Strategic: Long-term, complex, made by senior
managers
 Tactical: Medium-term, less complex, made by mid-
level managers
 Operational: Day-to-day, simple, routine, made by
junior managers
How to make the best decisions for your
business
72
 How do you make the best business decisions? Some people trust
intuition or gut feeling. Others reach out to constituents and experts
for advice.
 Still, others decision-making to information systems and automation.
 However, the smartest business decisions are made by those who look
at the numbers.
 In a competitive business landscape, where agility, flexibility, and a
real-time decision-making process are critical and timely, accurate data
analysis is more important than ever.
 In that respect, relying on the types of decisions in business intelligence
is non-negotiable.
 It is required for long-standing success and market dominance.
Decision Making Process
73
 As a business owner, you need to make decisions that have a large
impact on the success of your business.
 You need to make decisions ranging from the small, such as what
color your company logo should be, to the larger, such as whether
to expand your business.
 No matter what your decision, it should be based on facts, not
emotions.
 Decision-making is a crucial aspect of running a business.
Whenever you decide what to do next, a chain of events is
triggered, which may eventually lead to the outcome you desire.
 These events constitute the Decision-Making Process using
predictive analytics consulting.
74
 The decision-making process is a term used to describe how a
company gets to a point where it can make the best decision for its
company.
 This process is a way of thinking, a way of seeing the world, and a
method of arriving at what you believe are the best possible
decisions.
75
 From the day a company is founded, it is believed that each
executive should also be a manager in the right sense.
 It is believed that the executives should be able to take the right
action with the help of the appropriate decision-making
process.
 And the best decision-making process is one that is not only
efficient but also cost- effective.
76
 In order to make the best decisions, companies need to use tools to decipher
their data and better understand their customers.
 Business Intelligence (BI) is a set of tools used by companies to analyze data
using data analytics tools, gain valuable insights, and make better and more
informed decisions using Business Intelligence reports.
 Since its early days, business intelligence has been a tool for decision-making. It’s
a way to ensure that managers and executives make better decisions so that
businesses can run more effectively.
 These decisions can range from basic operations like deciding how much to
produce to broader strategic considerations like deciding what products to sell.
 However, BI tools vary considerably in how they are designed, how they interact with
other applications, and how they are used. Ultimately, BI is about having access to the
right information at the right time.
THE EFFECTIVE STEPS IN THE DECISION-MAKING
PROCESS THAT USES BI
77
Gathering information
 If you’re like most of us, you’re using spreadsheets and
dashboards to visualize and present data on theWeb.
 But some businesses are turning to dashboards to make smart
decisions that improve operations, enhance employee
satisfaction, process more data, and have greater visibility into
the bottom line.
Design and analyze
78
 Routinely encountering data requires a person to make
decisions on how to process the data.
 It can be viewed as a few steps involved in this process.
 First, it involves the analysis of data, followed by the
identification of the most appropriate decision- making model.
 The implementation comes after the analysis of data.
 The data analysis process is then repeated, and the model is
altered based on the latest information or discoveries, if
necessary.
Select and implement using ad hoc query,
what-if, and forecasting
79
 The importance of data in business is beyond dispute. But, the
problem is that most companies are not making the most of
the information available to them.
 What they have is not necessarily the right data and is not
being put to its best use.
 Ad hoc queries, what-if scenarios, and forecasting are
particularly effective tools for uncovering hidden information.
 It can influence risk and make decisions. Ad-hoc query,What-
If, and Forecasting are three terms used in business
intelligence to help make decisions.
Do evaluations using the vital tools
80
 Effective decision-making is critical to business success. The
good news is that business intelligence tools can help you make
better decisions.
 Using dashboards and reports can help you identify key
performance indicators, make comparisons between data sets.
 Also, identify opportunities to improve your business. If you
have any experience managing a business, you know that these
vital tools are a necessity, providing you with the information
you need to make business decisions.
 These tools can be used for assessing your performance,
motivating staff, and improving the customer experience.
Separate the components that are not
related
81
 At one point in time, every decision we make appears to have
little connection to anything else.
 This is because we are able to separate decisions into distinctly
different components, such as the sales forecast, the profit, the
hiring policy, and so on.
 The trick is to recognize that these components are inseparable.
Moreover, a decision made without the sales forecast is like an
engine without the air.
 A decision made without a profit is like an engine without fuel.
 A decision made without a hiring policy is like an engine
without the air.
Increase the coherence
82
 Business intelligence is a technology that extracts and
analyzes historical data to help organizations make better
decisions.
 It is a key asset for businesses that use it to gain value from all
their data and to make better decisions on how to spend their
resources.
83
 There is a trend among different organizations in today’s world
to adopt the use of Business Intelligence.
 The use of business intelligence in these organizations helps in
making decisions faster and more effectively.
 With the help of business intelligence, organizations can keep
track of their processes.
 Moreover, the performance of their business, and the
consumption of resources and get insights into the
organization’s current situation.
 Business intelligence solutions help in making more effective
decisions.
Decision Support Systems
84
 Broadly speaking, a decision support system (DSS) is an
analytics software program used to gather and analyze data
to inform decision making.
 There are many different types of decision support systems,
from modern business intelligence which uses AI and machine
learning to suggest insights and analyses for humans to perform,
to model-based DSS systems which use predefined criteria to
perform automated calculations and deliver best-case decisions.
 For all types, DSS is used in timely problem solving to improve
efficiency and streamline operations, planning and company
management.
Traditional vs Modern DSS
85
 Traditional DSS: Historically, DSS and BI tools relied on
preconfigured, historical data with no ability to drive real-time
decisions and action. With this approach, decisions are made
based on the past.
 Modern DSS: New tools and processes allow for “active
intelligence”, a state of continuous intelligence with an end-to-
end analytics data pipeline delivering real- time, up-to-date
information designed to trigger immediate insights and actions.
DSS Characteristics
86
 Prior to decision support systems, organizational leaders relied
heavily on a combination of their experience and
professional training, and applied those to thoughtful use of
the advanced insights generated by a data analytics platform.
 Decision support systems systematize that by taking
organizational data, analyzing it, and presenting it for use in
company decision making.
Categories of decision support systems
87
Data-driven DSS
 A data-driven DSS gives users access to a large amount of
internal and external data.
 This DSS will query a database using the web, an external
server or a company's mainframe.
 It relies on data mining to provide patterns and information
about the data being assessed.
88
 Users rely on data-driven decision support systems to make
decisions about businesses, inventories and products.
 Managers might find data-driven decision support systems most
helpful when analyzing current and historical data to report on the
conditions of a department or the business.
 CEOs, managers and staff might use a data-driven DSS.
 Software examples of a data-driven DSS include:
 Geographic Information Systems (GIS)
 File drawer systems
 Executive information systems
 Computer-based databases with query systems
Model-driven DSS
89
 A model-driven DSS allows a user to analyze and manipulate specific
models of data, such as statistics, finances or scheduling.
 These decision support systems are specific to the type of model
the user wants to interact with and typically offer less data than
other DSS types.
 They analyze scenarios and data to allow the user to manipulate a
model, such as creating a work schedule.
 They might use simple analysis tools or complex statistics,
depending on the model's purpose and the user's needs.
 Managers, staff and third parties who interact with a business might
use a model-driven DSS.
90
Software examples of a model-driven DSS include:
 Scheduling software
 Financial modeling
 Decision analysis modeling
 Optimization software
Knowledge-driven DSS
91
 With a knowledge-driven DSS, a knowledge-management system
monitors continually updated data about an organization to support decisions.
 The DSS uses diagnosis, prediction, interpretation and classification to
recommend actions consistent with the business.
 A knowledge-driven DSS can be helpful to managers because it performs tasks
faster than a human might.They can also help consumers decide which products
and services to buy.This kind of DSS often relies on a data-mining component.
 Managers, staff and external users, such as customers, might use a knowledge-
driven DSS.
 Software examples of a knowledge-driven DSS include:
 Software that identifies new or current customers who might be interested in
products
 Product selection software
Document-driven DSS
92
 A document-driven DSS retrieves unstructured information
from a variety of electronic sources. It searches web pages,
documents in databases and other information based on a user's
search terms to gather relevant information.
 A document-driven DSS might be specific to a business' private files
or as broad as a common internet search engine.
 Anyone using a database's search function or an internet search
engine is using a document-driven DSS.
 Software examples of a document-driven DSS include:
 Search engines
 Database search software
 Article databases with search functions
Communication-driven DSS
93
 A communication-driven DSS uses tools to support
communication and collaboration.
 Email is an example of a communication-driven DSS.
 This type of DSS includes share tools that allow multiple
people to work on a project at once and software that allows
for digital communication between people.
94
 It improves a shared project's efficiency and effectiveness and can help
facilitate meetings and conversations.
 Internal team members, virtual business meeting hosts and online chat and
video meeting software users can benefit from a communication-driven DSS.
 A communication-driven DSS might also be called a group DSS.
 A communication- driven DSS focuses on communication and collaboration,
while a group DSS helps groups streamline the decision-making process.
 A communication-driven DSS, for example, might help two people who work
for the same company on different shifts share documents. It might also allow
employees on opposite sides of the country to meet virtually to view a shared
file.
 Software examples of a communication-driven DSS include:
 Chat and instant messaging services
 Collaboration software, such as document sharing and editing software
Intelligent DSS
95
 Any DSS with artificial intelligence in its design is an example of intelligent
DSS (IDSS).
 Within an IDSS, AI does data mining and processing to filter through large
datasets.
 An IDSS is designed to offer similar services to a human consultant.
 They're programmed to identify patterns and trends to guide
decision-making.
 They can also resolve problems and analyze solutions. AI components add
advantages, such as fuzzy logic and machine learning, to a DSS.
 Managers, diagnosticians and other decision-makers might use an IDSS.
 Software examples of an intelligent DSS include:
 Smart manufacturing systems
 Medical diagnostic systems
Manual DSS
96
 A manual DSS relies on individuals instead of computers to
support decision- making.
 A group of experts analyses the strengths, weaknesses,
opportunities and threats of their organization or project.
 A manual DSS is much slower than a computer-based DSS, but
certain types of analysis still need a human eye at every step.
 Economists, executives and managers might use a manual DSS.
 Examples of manual DSS include:
 Cost-benefit analyses
 Decision matrixes
Hybrid DSS
97
 A hybrid DSS combines parts of multiple DSS types to create a complex outcome.
 Large issues in industries such as finance and health care might require the tools of
multiple decision support systems, such as a knowledge-driven DSS and a data-driven
DSS.
 A hybrid DSS might use additional software to help these components work together.
 Sometimes a human analyses and combines the results of each DSS.
 A hybrid DSS might also describe a system in which a human works with a DSS to
extract and manipulate data.
 Medical professionals, financial decision-makers and researchers might use a hybrid
DSS.
Software examples of a hybrid DSS include:
 Risk assessment
 Clinical DSS
 Web-based DSS
Decision Support System Examples.
98
 GPS route planning.
 Crop-planning.
 Clinical DSS
Business Intelligence
99
 Business intelligence combines business analytics, data
mining, data visualization, data tools and infrastructure,
and best practices to help organizations make more data-driven
decisions.
Techniques Used In Business Intelligence
100
1. DataVisualization
 When data is stored as a set or matrix of numbers, it is
precise but difficult to interpret.When looking at more than
one dimension of the data, this becomes even harder.
Creating charts, graphics or dashboards from the data makes
it much easier for people to understand and interpret.
2. Data Mining
 Data mining is a computer supported method to reveal
previously unknown or unnoticed relations among data
entities.
Example
101
 In retail: shopping basket analysis can examine products
consumers buy together in order to better promote other
products.
 In banking: using an automated risk assessment based on
historical data to evaluate whether a customer is likely to pay
back a loan.
 In insurance: mining behavioural and historical data to
detect fraud.
 In health: analysis of complications and common diseases
may help to reduce risk.
Reporting
102
 One area where BI tools commonly help business users is by
designing, scheduling and generating reports, for example
regular performance, sales or marketing reports.
 Reports output by BI tools efficiently gather and present
information to support the management, planning and
decision making process.
 Once the report is designed it can be automatically run at set
intervals and sent to a predefined distribution list so key
people can see regularly updated numbers.
Time-Series Analysis And Predictive
Techniques
103
 Nearly all data warehouses and all enterprise data have a time
dimension.
 For example, product sales, phone calls, patient
hospitalizations, etc.
 Time-series analysis can reveal changes in user behaviour
over time, relationships between sales of different products,
or changes in sales figures based on marketing promotions.
 Historic data can also be used to extrapolate and try to
predict future trends, outcomes or financial results.
Online Analytical Processing (OLAP)
104
 OLAP is best known for the OLAP-cubes which provide a visualization of
multidimensional data.
 OLAP cubes display dimensions on the cube edges (e.g. time, product, customer
type, customer age etc.).
 The values in the cube represent the measured facts (e.g. value of contracts, number
of sold products etc.).
 The user can navigate through OLAP cubes using drill-up, drill-down and drill-
across features.
 The drill-up functionality enables the user to easily zoom out to more coarse-grained
details.
 Conversely, drill-down displays the information with more details.
 Finally, drilling-across means that the user can navigate to another OLAP cube to see
the relations on another dimension(s).
 All the functionality is provided in real-time.
Statistical Analysis
105
 Statistical analysis uses mathematic foundations to qualify the
significance and reliability of the observed relations.
 The most interesting features are distribution analysis,
confidence intervals (for example for changes in user
behaviours, etc.)
 Statistical analysis is used for devising and analysing the
results from data mining.
The Necessary BI Skills
106
 Data Analysis: Most BI skills and intelligence analyst-related skills are about
using data to make better decisions.You need to be good at examining many
different sources of data and then making accurate conclusions about them.
 Problem-solving: BI isn’t just about analyzing data; it’s also about creating
business strategies and solving real-world business problems with that data.
For example, you could be the one to extract actionable insights from specific
retail KPIs that need to be visualized and presented during a meeting.
 Specific industry knowledge: While some of this can and will be learned
on the job, you need to have a solid grasp of the industry’s dynamics,
particularly the areas of the field that you’re looking to work in. Over time,
you’ll want to become an expert in your industry as this will increase your
ability to connect data with business problem-solving.
107
 Communication skills: In addition to acquiring intelligence analyst-related
skills, you’ll need to be able to communicate your findings effectively to the
other professionals you’ll be working with. To some extent, if you work in
back-end BI, you won’t need to communicate quite as much. However, if you
work in the front- end, you’ll be responsible for communicating technical
concepts to non-technical people. This kind of role requires excellent
communication skills.
 Data visualization: Expanding on the point above, in order to ensure good
communication you will also need to have data visualization skills.
Visualizations are the best tools to make trends and general insights
understandable. Being able to clearly see how the data changes in time is what
makes it possible to extract relevant conclusions from it. For this purpose, you
should be able to differentiate between various charts and report types as well
as understand when and how to use them to benefit the BI process.
108
 Advanced vision and attention to detail: By its very nature,
business intelligence is incredibly detail-oriented.As a BI analyst or
developer, you'll often work with the smallest fragment of
information with the objective of turning it into actionable insight.
You will need a great deal of forward-thinking vision and the
ability to pay very close attention to detail to succeed in the fast-
paced world of BI.
 Statistical analysis: Statistical knowledge is another important
skill especially if you want to become a BI analyst. Understanding
various statistical components such as mean, median, range,
variance, and others, can enable you to go deeper into the data and
extract relevant conclusions from it.
109
 Programming knowledge: On a more technical side of things,
having programming language knowledge can also be a very valuable
skill when it comes to pursuing a career in BI. Many solutions require
the use of different programming languages to perform advanced
analysis such as R, Python, Javascript, just to name a few, and knowing
them can significantly enhance your skillset.
 Technical notion: Our next BI skill is not fundamental, but it can
certainly make you a more complete and prepared professional. Business
intelligence is an industry that highly relies on technology and having a
technical notion of how to manage these technologies can be a plus.
With this, we do not mean that you need to know how to use every tool
in the market, but understanding how these technologies can work to
your advantage.
110
 Business acumen: To thrive in a business intelligence
career, you will need to possess a swift ability to understand
your company’s business model and how to tailor your
efforts to not only gain maximum value from your key
performance indicators (and the KPI management process)
but also make strategic decisions that will help your
organization succeed on a continual basis.
111
 Benefits of business intelligence
 Data clarity
 Increased efficiency
 Better customer experience
 Improved employee satisfaction
112
 How to develop a business intelligence strategy
 A BI strategy is your blueprint for success. You’ll need to
decide how data is used, gather key roles, and define
responsibilities in the initial phases. It may sound simple at a
high level; however, starting with business goals is your key
to success.
113
 How to create a BI strategy from the ground up:
 Know your business strategy and goals.
 Identify key stakeholders.
 Choose a sponsor from your key stakeholders.
 Choose your BI platform and tools.
 Create a BI team.
 Define your scope.
 Prepare your data infrastructure.
 Define your goals and roadmap.
114
 Advantages of BI include:
 Data visibility
 Accurate reports
 Streamlined processes
 Disadvantages of BI include:
 Initial cost
 User resistance
 Data skills gap
OLAP
115
 OLAP is an acronym for Online Analytical Processing.
OLAP performs multidimensional analysis of business data
and provides the capability for complex calculations, trend
analysis, and sophisticated data modeling.
116
 OLAP offers five key benefits:
 Business-focused multidimensional data
 Business-focused calculations
 Trustworthy data and calculations
 Speed-of-thought analysis
 Flexible, self-service reporting
Characteristics of OLAP
117
 It defines which the system targeted to deliver the most
feedback to the client within about five seconds, with the
elementary analysis taking no more than one second and very
few taking more than 20 seconds.
Analysis
118
 It defines which the method can cope with any business logic
and statistical analysis that is relevant for the function and the
user, keep it easy enough for the target client.Although some
pre programming may be needed we do not think it
acceptable if all Share.
 It defines which the system tools all the security requirements
for understanding and, if multiple write connection is needed,
concurrent update location at an appropriated level, not all
functions need customer to write data back, but for the
increasing number which does, the system should be able to
manage multiple updates in a timely, secure manner.
Multidimensional
119
 This is the basic requirement. OLAP system must provide a
multidimensional conceptual view of the data, including full
support for hierarchies, as this is certainly the most logical
method to analyse business and organizations.
 Information
 The system should be able to hold all the data needed by the
applications.
 Data sparsity should be handled in an efficient manner.
 OLAP Operations in the Multidimensional Data Model
Roll-Up
120
 The roll-up operation (also known as drill-up or aggregation
operation) performs aggregation on a data cube, by climbing
down concept hierarchies, i.e., dimension reduction. Roll-up
is like zooming-out on the data cubes.
 When a roll-up is performed by dimensions reduction, one
or more dimensions are removed from the cube.
121
Drill-Down
122
 The drill-down operation (also called roll-down) is the reverse
operation of roll-up.
 Drill- down is like zooming-in on the data cube. It navigates
from less detailed record to more detailed data.
 Drill-down can be performed by either stepping down a concept
hierarchy for a dimension or adding additional dimensions.
 Figure shows a drill-down operation performed on the
dimension time by stepping down a concept hierarchy which is
defined as day, month, quarter, and year.
 Drill-down appears by descending the time hierarchy from the
level of the quarter to a more detailed level of the month.
123
Slice
124
 A slice is a subset of the cubes corresponding to a single value
for one or more members of the dimension.
 For example, a slice operation is executed when the
customer wants a selection on one dimension of a three-
dimensional cube resulting in a two-dimensional site.
 So, the Slice operations perform a selection on one
dimension of the given cube, thus resulting in a sub cube.
125
Dice
126
 The dice operation describes a sub cube by operating a
selection on two or more dimension.
Pivot
127
 The pivot operation is also called a rotation. Pivot is a
visualization operations which rotates the data axes in view to
provide an alternative presentation of the data.
 It may contain swapping the rows and columns or moving
one of the row-dimensions into the column dimensions.
128
129
Difference between OLTP and OLAP
130
 OLTP (On-LineTransaction Processing) is featured by a large
number of short on-line transactions (INSERT, UPDATE, and
DELETE).
 The primary significance of OLTP operations is put on very
rapid query processing, maintaining record integrity in
multi-access environments, and effectiveness consistent by
the number of transactions per second.
131
 OLAP (On-line Analytical Processing) is represented
by a relatively low volume of transactions.
 Queries are very difficult and involve aggregations. For
OLAP operations, response time is an effectiveness measure.
 OLAP applications are generally used by Data Mining
techniques.
 In OLAP database there is aggregated, historical information,
132
Types of OLAP
133
Relational OLAP (ROLAP) Server
134
 These are intermediate servers which stand in between a relational back-end
server and user frontend tools.
 They use a relational or extended-relational DBMS to save and handle warehouse
data, and OLAP middleware to provide missing pieces.
 ROLAP servers contain optimization for each DBMS back end, implementation
of aggregation navigation logic, and additional tools and services.
 ROLAP technology tends to have higher scalability than MOLAP technology.
 ROLAP systems work primarily from the data that resides in a relational
database, where the base data and dimension tables are stored as relational tables.
This model permits the multidimensional analysis of data.
 This technique relies on manipulating the data stored in the relational database to
give the presence of traditional OLAP's slicing and dicing functionality. In
essence, each method of slicing and dicing is equivalent to adding a "WHERE"
clause in the SQL statement.
135
Multidimensional OLAP (MOLAP) Server
136
 A MOLAP system is based on a native logical model that
directly supports multidimensional data and operations.
 Data are stored physically into multidimensional arrays, and
positional techniques are used to access them.
 One of the significant distinctions of MOLAP against a
ROLAP is that data are summarized and are stored in an
optimized format in a multidimensional cube, instead of in a
relational database.
 In MOLAP model, data are structured into proprietary
formats by client's reporting.
137
Hybrid OLAP (HOLAP) Server
138
 HOLAP incorporates the best features of MOLAP and
ROLAP into a single architecture.
 HOLAP systems save more substantial quantities of detailed
data in the relational tables while the aggregations are stored
in the pre-calculated cubes.
 HOLAP also can drill through from the cube down to the
relational tables for delineated data. The Microsoft SQL
Server 2000 provides a hybrid OLAP server.
139
Relational Online Analytical Processing
(ROLAP)
140
 ROLAP servers are placed between relational backend server
and client front-end tools.
 It uses relational or extended DBMS to store and manage
warehouse data. ROLAP has basically 3 main components:
Database Server, ROLAP server, and Front-end tool.
141
 Advantages of ROLAP –
 ROLAP is used for handle the large amount of data. ROLAP
tools don’t use pre-calculated data cubes.
 Data can be stored efficiently.
 ROLAP can leverage functionalities inherent in the relational
database.
142
 Disadvantages of ROLAP –
 Performance of ROLAP can be slow.
 In ROALP, difficult to maintain aggregate tables. Limited by
SQL functionalities.
Multidimensional Online Analytical
Processing (MOLAP)
143
 MOLAP does not uses relational database to storage.
 It stores in optimized multidimensional array storage. The
storage utilization may be low With multidimensional data
stores.
 Many MOLAP server handle dense and sparse data sets by
using two levels of data storage representation. MOLAP has
3 components : Database Server, MOLAP server, and Front-
end tool.
144
 Advantages of MOLAP –
 MOLAP is basically used for complex calculations.
 MOLAP is optimal for operation such as slice and dice.
 MOLAP allows fastest indexing to the pre-computed
summarized data.
 Disadvantages of MOLAP –
 MOLAP can’t handle large amount of data.
 In MOLAP, Requires additional investment.
 Without re-aggregation, difficult to change dimension.
Hybrid Online Analytical Processing
(HOLAP)
145
 Hybrid is a combination of both ROLAP and MOLAP. It
offers functionalities of both ROLAP and as well as MOLAP
like faster computation of MOLAP and higher scalability of
ROLAP.
 The aggregations are stored separately in MOLAP store. Its
server allows storing the large data volumes of detailed
information.
146
 Advantages of HOLAP –
 HOLAP provides the functionalities of both MOLAP and
ROLAP.
 HOLAP provides fast access at all levels of aggregation.
 Disadvantages of HOLAP –
 HOLAP architecture is very complex to understand because
it supports both MOLAP and ROLAP.
147
148
 OLAP offers five key benefits:
 Business-focused multidimensional data
 Business-focused calculations
 Trustworthy data and calculations
 Speed-of-thought analysis
 Flexible, self-service reporting
149
 Advantages
 Business-centered multidimensional information.
 Business-centered figuring’s.
 Dependable information and figuring’s.
 Speed-of-thought examination.
 Adaptable, self-administration detailing.
150
 Disadvantages
 Pre-demonstrating is an absolute necessity.
 As to business information, the traditional OLAP tools don't take
into consideration quick investigation without pre-demonstrating.
 Extraordinary reliance on IT.
 Helpless calculation capacity.
 Shy Interactive examination capacity.
 Slow in responding.
 Theoretical model.
 Extraordinary, expected danger.
Analytic functions
151
 Analytical functions are one of the most popular tools among
BI/Data analysts for performing complex data analysis.
 These functions perform computations over multiple rows
and return the multiple rows as well.
152
 analytic_function_name([argument_list]) OVER (
 [PARTITION BY partition_expression,…]
 [ORDER BY sort_expression, … [ASC|DESC]])
 There are three parts to this syntax,
 namely function, partition by and order by.
153
 analytic_function_name: name of the function — like
RANK(), SUM(), FIRST(), etc
 partition_expression: column/expression on the basis of
which the partition or window frames have to be created
 sort_expression: column/expression on the basis of which
the rows in the partition will be sorted
154
155
 Trendline definition is a function that fits a linear or
exponential model and returns the fitted values or model.
 The numeric_expr represents the Y value for the trend.The
series (time columns) represent the X value.
156
 Syntax:
 TRENDLINE(numeric_expr, ([series]) BY ([partitionBy]), model_type,
result_type)
 numeric_expr represents the data to the trend.This is theY-axis, usually a
measure column.
 series is the X-axis is a list of numerics or time dimension attribute columns.
 partitionBY is a list of dimension attribute columns that are in the view but
not on the X-axis.
 model_type is one of the following (‘LINEAR’,‘EXPONENTIAL’).
 result_type is one of the following (‘VALUE’,‘MODEL’).
 ‘VALUE‘ will return all the regressionY values given X in the fit. ‘MODEL’
will return all the parameters in a JSON format string.
157
 TRENDLINE((Sales),(Order Date) BY (Product Sub
Category), 'LINEAR', 'VALUE')
 numeric_expr = Sales
 series = Order Date (Time dimension)
 partitionBy = Product Sub Category
 model_type = LINEAR
 result_type =VALUE
158
 CLUSTER((dimension_expr1 , ... dimension_exprN),
(expr1, ... exprN),
 output_column_name, options, [runtime_binded_options])
 dimension_expr represents the list of dimensions, e.g.,
(productID, companyID), to be clustered.
 expr represents the list of dimension attributes or measures
to be used to
 cluster the dimension_expr.
159
 output_column_name is the output column.The valid values
are ‘clusterId’, ‘clusterName’, ‘clusterDescription’,
‘clusterSize’,‘distanceFromCenter’,‘centers’.
 options mean the string list of name=value pairs separated by
‘;’. The value can include %1 … %N, specified using
runtime_binded_options.
 runtime_binded_options is an optional comma-separated list
of run-time binded columns or literal expressions.
160
 OUTLIER((dimension_expr1 , ... dimension_exprN), (expr1, .. exprN),
output_column_name, options, [runtime_binded_options]))])
 dimension_expr indicates a list of dimensions.
 expr represents a list of dimension attributes or measures to find outlier.
 output_column_name indicates the output column name. Valid values are
'isOutlier' and 'distance'.
 options indicates a string list of name/value pairs separated by a semi-colon (;).
 The value can include %1 ... %N, which can be specified using
runtime_binded_options.
 runtime_binded_options is an option comma separated list (,) of run-time
binded columns and or literal expressions.
 REGR(y_axis_measure_expr, (x_axis_expr), (category_expr1, ...,
category_exprN), output_column_name, options, [runtime_binded_options])

More Related Content

PPTX
Data warehouse
DOCX
Data Mining and Warehousing -Unit III & Unit IV Notes
PPTX
CHAPTER 1 - Introdution to Datawarehousing.pptx
PPTX
MIS and Business Functions, TPS/DSS/ESS, MIS and Business Processes, Impact o...
DOC
Data Mining
DOCX
Unit 1
PPTX
Manish tripathi-ea-dw-bi
 
PPT
Dw & etl concepts
Data warehouse
Data Mining and Warehousing -Unit III & Unit IV Notes
CHAPTER 1 - Introdution to Datawarehousing.pptx
MIS and Business Functions, TPS/DSS/ESS, MIS and Business Processes, Impact o...
Data Mining
Unit 1
Manish tripathi-ea-dw-bi
 
Dw & etl concepts

Similar to Business analysis of business of current (20)

PPTX
Data Warehousing about data ware house.pptx
PPTX
DATA WAREHOUSING
PPTX
Data_Warehousing[1][1] .pptx
PPTX
Data Warehousing .pptx
PPTX
Data warehousing Concepts and Design.pptx
PPTX
introduction & conceptsdatawarehousing.pptx
PPT
Unit 5
PPTX
Data warehouse architecture
PDF
Business Intelligence Solution on Windows Azure
PPTX
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
PPT
Data Warehouse
PDF
Data warehousing
PPTX
ETL processes , Datawarehouse and Datamarts.pptx
PPT
DataWarehousingandAbInitioConcepts.ppt
DOCX
Concept of Data Warehouse Architecture.docx
PDF
Information & Data Architecture
PDF
Data warehousing
PDF
Implementation of Data Marts in Data ware house
PDF
DMDW 1st module.pdf
PPT
Chapter 2
Data Warehousing about data ware house.pptx
DATA WAREHOUSING
Data_Warehousing[1][1] .pptx
Data Warehousing .pptx
Data warehousing Concepts and Design.pptx
introduction & conceptsdatawarehousing.pptx
Unit 5
Data warehouse architecture
Business Intelligence Solution on Windows Azure
UNIT 2 DATA WAREHOUSING AND DATA MINING PRESENTATION.pptx
Data Warehouse
Data warehousing
ETL processes , Datawarehouse and Datamarts.pptx
DataWarehousingandAbInitioConcepts.ppt
Concept of Data Warehouse Architecture.docx
Information & Data Architecture
Data warehousing
Implementation of Data Marts in Data ware house
DMDW 1st module.pdf
Chapter 2
Ad

Recently uploaded (20)

PDF
AI Cloud Sprawl Is Real—Here’s How CXOs Can Regain Control Before It Costs Mi...
PDF
Meme Coin Empire- Launch, Scale & Earn $500K-Month_3.pdf
PDF
Why Has Vertical Farming Recently Become More Economical.pdf
PDF
Decision trees for high uncertainty decisions
PPTX
_From Idea to Revenue How First-Time Founders Are Monetizing Faster in 2025.pptx
PPTX
The Evolution of Search- Behaviour.pptx
PDF
initiate-entrepreneurship-in-healthcare-service-management-in-sierra-leone.pdf
PPTX
ELS-07 Lifeskills ToT PPt-Adama (ABE).pptx
PDF
Chapter 1 - Introduction to management.pdf
PDF
Budora Case Study: Building Trust in Canada’s Online Cannabis Market
PDF
Pollitrace pitch deck- Ai powered multiple species
PPT
Chap8. Product & Service Strategy and branding
PPTX
Process-and-Ethics-in-Research-1.potatoi
PPTX
ENTREPRENEURSHIP..PPT.pptx..1234567891011
PPTX
Peerless Plumbing Company-Fort Worth.pptx
PDF
Chapter 3 - Business environment - Final.pdf
PPT
chap9.New Product Development product lifecycle.ppt
PPTX
Daily stand up meeting on the various business
PDF
Business Risk Assessment and Due Diligence Report: Zacharia Ali and Associate...
PDF
Investment Risk Assessment Brief: Zacharia Ali and Associated Entities
AI Cloud Sprawl Is Real—Here’s How CXOs Can Regain Control Before It Costs Mi...
Meme Coin Empire- Launch, Scale & Earn $500K-Month_3.pdf
Why Has Vertical Farming Recently Become More Economical.pdf
Decision trees for high uncertainty decisions
_From Idea to Revenue How First-Time Founders Are Monetizing Faster in 2025.pptx
The Evolution of Search- Behaviour.pptx
initiate-entrepreneurship-in-healthcare-service-management-in-sierra-leone.pdf
ELS-07 Lifeskills ToT PPt-Adama (ABE).pptx
Chapter 1 - Introduction to management.pdf
Budora Case Study: Building Trust in Canada’s Online Cannabis Market
Pollitrace pitch deck- Ai powered multiple species
Chap8. Product & Service Strategy and branding
Process-and-Ethics-in-Research-1.potatoi
ENTREPRENEURSHIP..PPT.pptx..1234567891011
Peerless Plumbing Company-Fort Worth.pptx
Chapter 3 - Business environment - Final.pdf
chap9.New Product Development product lifecycle.ppt
Daily stand up meeting on the various business
Business Risk Assessment and Due Diligence Report: Zacharia Ali and Associate...
Investment Risk Assessment Brief: Zacharia Ali and Associated Entities
Ad

Business analysis of business of current

  • 2. Data Warehouses and Data Mart  Business intelligence (BI) is all about turning an organization’s data into insights that can be used to inform business decisions.  BI analysts will use BI tools, software or services to access and analyze datasets and translate their findings into reports, summaries, dashboards, graphs, charts or maps.  In recent years, the advent of modern data visualization and reporting tools has transformed the discipline, empowering businesses to use big data insights to identify, develop and create new business opportunities. 2
  • 3. Business Intelligence 3  “a set of methodologies, processes, architectures and technologies that transform raw data into meaningful and useful information”.  This can then be “used to enable more effective strategic, tactical and operational insights and decision-making”.  This definition acknowledges that data cannot be effectively analyzed or used to generate meaningful insights if it is poor quality.  BI should not be confused with‘business analytics’.  Business intelligence is descriptive and uses metrics to generate clear snapshots of business performance.  Meanwhile, business analytics is predictive, and describes what organizations should do in future to generate better outcomes.
  • 4. Data Warehouse 4  A DataWarehouse (DW) is a relational database that is designed for query and analysis rather than transaction processing.  It includes historical data derived from transaction data from single and multiple sources.  A Data Warehouse provides integrated, enterprise-wide, historical data and focuses on providing support for decision- makers for data modeling and analysis.  A Data Warehouse is a group of data specific to the entire organization, not only to a particular group of users.  It is not used for daily operations and transaction processing but used for making decisions.
  • 5. 5  A Data Warehouse can be viewed as a data system with the following attributes:  It is a database designed for investigative tasks, using data from various applications.  It supports a relatively small number of clients with relatively long interactions.
  • 6. 6  It includes current and historical data to provide a historical perspective of information.  Its usage is read-intensive.  It contains a few large tables.  "Data Warehouse is a subject-oriented, integrated, and time- variant store of information in support of management's decisions."
  • 7. 7
  • 8. Subject-Oriented 8  A data warehouse target on the modeling and analysis of data for decision-makers.  Therefore, data warehouses typically provide a concise and straightforward view around a particular subject, such as customer, product, or sales, instead of the global organization's ongoing operations.  This is done by excluding data that are not useful concerning the subject and including all data needed by the users to understand the subject.
  • 9. Integrated 9  A data warehouse integrates various heterogeneous data sources like RDBMS, flat files, and online transaction records.  It requires performing data cleaning and integration during data warehousing to ensure consistency in naming conventions, attributes types, etc., among different data sources.
  • 10. 10
  • 11. Time-Variant 11  Historical information is kept in a data warehouse. For example, one can retrieve files from 3 months, 6 months, 12 months, or even previous data from a data warehouse.  These variations with a transactions system, where often only the most current file is kept.
  • 12. Non-Volatile 12  The data warehouse is a physically separate data storage, which is transformed from the source operational RDBMS.  The operational updates of data do not occur in the data warehouse, i.e., update, insert, and delete operations are not performed.  It usually requires only two procedures in data accessing: Initial loading of data and access to data.  Therefore, the DW does not require transaction processing, recovery, and concurrency capabilities, which allows for substantial speedup of data retrieval.  Non-Volatile defines that once entered into the warehouse, and data should not change.
  • 13. 13
  • 14. Goals of Data Warehousing 14  To help reporting as well as analysis  Maintain the organization's historical information  Be the foundation for decision making.
  • 15. 15
  • 16. Data Mart 16  A Data Mart is a subset of a directorial information store, generally oriented to a specific purpose or primary data subject which may be distributed to provide business needs.  Data Marts are analytical record stores designed to focus on particular business functions for a specific community within an organization.  Data marts are derived from subsets of data in a data warehouse, though in the bottom- up data warehouse design methodology, the data warehouse is created from the union of organizational data marts.
  • 17. 17  The fundamental use of a data mart is Business Intelligence (BI) applications.  BI is used to gather, store, access, and analyze record.  It can be used by smaller businesses to utilize the data they have accumulated since it is less expensive than implementing a data warehouse.
  • 18. Reasons for creating a data mart 18  Creates collective data by a group of users  Easy access to frequently needed data  Ease of creation  Improves end-user response time  Lower cost than implementing a complete data warehouses  Potential clients are more clearly defined than in a comprehensive data warehouse  It contains only essential business data and is less cluttered.
  • 19. Types of Data Marts 19  There are mainly two approaches to designing data marts.These approaches are  Dependent Data Marts  Independent Data Marts
  • 20. 20
  • 21. 21  A dependent data mart is a logical subset of a physical subset of a higher data warehouse.  According to this technique, the data marts are treated as the subsets of a data warehouse.  In this technique, firstly a data warehouse is created from which further various data marts can be created.  These data mart are dependent on the data warehouse and extract the essential record from it.  In this technique, as the data warehouse creates the data mart; therefore, there is no need for data mart integration. It is also known as a top-down approach.
  • 22. Independent Data Marts 22  The second approach is Independent data marts (IDM) Here, firstly independent data marts are created, and then a data warehouse is designed using these independent multiple data marts.  In this approach, as all the data marts are designed independently; therefore, the integration of data marts is required.  It is also termed as a bottom-up approach as the data marts are integrated to develop a data warehouse.  Other than these two categories, one more type exists that is called "Hybrid Data Marts."
  • 23. 23
  • 24. Hybrid Data Marts 24  It allows us to combine input from sources other than a data warehouse.  This could be helpful for many situations; especially when Adhoc integrations are needed, such as after a new group or product is added to the organizations.
  • 25. Steps in Implementing a Data Mart 25  The significant steps in implementing a data mart are to design the schema, construct the physical storage, populate the data mart with data from source systems, access it to make informed decisions and manage it over time.
  • 26. Designing 26  The design step is the first in the data mart process.This phase covers all of the functions from initiating the request for a data mart through gathering data about the requirements and developing the logical and physical design of the data mart.  It involves the following tasks:  Gathering the business and technical requirements  Identifying data sources  Selecting the appropriate subset of data  Designing the logical and physical architecture of the data mart.
  • 27. Constructing 27  This step contains creating the physical database and logical structures associated with the data mart to provide fast and efficient access to the data.  It involves the following tasks:  Creating the physical database and logical structures such as table spaces associated with the data mart.  Creating the schema objects such as tables and indexes describe in the design step.  Determining how best to set up the tables and access structures.
  • 28. Populating 28  This step includes all of the tasks related to the getting data from the source, cleaning it up, modifying it to the right format and level of detail, and moving it into the data mart.  It involves the following tasks:  Mapping data sources to target data sources  Extracting data  Cleansing and transforming the information.  Loading data into the data mart  Creating and storing metadata  This step involves putting the data to use: querying the data, analyzing it, creating reports, charts and graphs and publishing them.
  • 29. 29  It involves the following tasks:  Set up and intermediate layer (Meta Layer) for the front- end tool to use.This layer translates database operations and objects names into business conditions so that the end-clients can interact with the data mart using words which relates to the business functions.  Set up and manage database architectures like summarized tables which help queries agree through the front-end tools execute rapidly and efficiently.
  • 30. 30  Managing  This step contains managing the data mart over its lifetime. In this step, management functions are performed as:  Providing secure access to the data.  Managing the growth of the data.  Optimizing the system for better performance.  Ensuring the availability of data even with system failures.
  • 31. 31
  • 32. 32
  • 33. Knowledge Management 33  The Connection Between Business Intelligence and Knowledge Management  Information simply has to be accessible – gathering, managing and utilizing information is an inevitable part of running any modern business.  There are two information management technologies we use: Business Intelligence (BI) and Knowledge Management (KM).  However, terms information, data and knowledge are often used interchangeably, thus, apart from being confused about each phrase, there is oftentimes much confusion around the definitions of BI and KM.
  • 34. 34  Business Intelligence (BI): Generally, BI is considered to be a set of tools and techniques applied to gather data and transform it into information that can be used in business analysis for the purposes of business development.  Every company gathers, collects, or to say more accurately, deals with a large amounts of data, including various business documents, emails, newspaper articles, web pages, reports, contracts, technical journals and reviews, spreadsheets, graphs and charts and other relevant sources of business data.  BI technologies usually deal with large amounts of unstructured data via the use of data warehousing and online analytical processing (OLAP).
  • 35. 35  All these data needs to be organized and validated – prepared for business analytics.  Thus, “BI is about providing the right data at the right time to the right people so that they can take the right decisions.”  Knowledge Management can be defined in many ways as it spans many multi- disciplinary approaches – content management, collaboration, the science of organizational behavior, analyses like observation of trends and appearance of anomalies, clustering, classification, summarization, taxonomy building and so on.
  • 36. 36  “Knowledge management is the process of capturing, distributing, and effectively using knowledge.”  KM refers to a set of techniques used to capture, share, and use the information available in order to achieve business objectives and to aid in business decision making based on business analytics.  There has been immense growth in the domain of knowledge management in the last decade and new applications and solutions that empower knowledge sharing and knowledge management have appeared.
  • 37. 37  Knowledge is a mix of contextual information, experiences, rules, and values.  Richer, deeper, and more valuable.  Consider knowing –  What? - based upon assembling information and eventually applying it.  How? – applying knowledge leads to learning how to do something.  Why? – casual knowledge of why something occurs.
  • 38. 38
  • 39. Tacit vs. Explicit Knowledge 39  Tacit knowledge is personal, context-specific and hard to formalize and communicate.  A [knowledge] developed and internalized by the knower over a long period of time . . . incorporates so much accrued and embedded learning that its rules may be impossible to separate from how an individual acts.
  • 40. 40  Explicit knowledge can be easily collected, organized and transferred through digital means.  A theory of the world, conceived of as a set of all of the conceptual entities describing classes of objects, relationships, processes, and behavioral norms.  Often referred to as knowing that’, or declarative knowledge.
  • 41. 41
  • 42. Why Manage Knowledge? 42  Information and knowledge have become the fields in which businesses compete.  Several important factors include:  Sharing Best Practice  Globalization  Rapid Change  Downsizing  Managing Information and Communication Overload – Knowledge Embedded in Products  Sustainable Competitive Advantage.
  • 43. 43
  • 44. Sharing Best Practices 44  Sharing best practices means leveraging the knowledge gained by a subset of the organization.  Increasingly important in organizations who depend on applying their expertise such as accounting, consulting and training firms.  KM systems capture best practices to disseminate their experience within the firm.  Problems often arise from employees who may be reluctant to share their knowledge (managers must encourage and reward open sharing).
  • 45. Globalization 45  Historically three factors, land, labor and capital were the key to economic success.  Knowledge has become a fourth factor.  Knowledge-based businesses can grow without traditional land, labor, and capital requirements.  Key competitive factor will be how well an organization acquires and applies knowledge
  • 46. Other factors 46  Rapid change: firms must be nimble and adaptive to compete.  Downsizing: sometimes the wrong people get fired when creating a leaner organization.  Managing Info. and Comm. Overload: data must be categorized in some manner if it is to be useful rather than overwhelming.  Knowledge Embedded in Products: the intangibles that add the most value to goods and services are becoming increasingly knowledge-based.  Sustainable Competitive Advantage: KM is the way to do this. Shorter innovation life cycles keep companies ahead of the competition.
  • 47. Categorizing knowledge 47  Process knowledge – best practices, useful for increasing efficiency.  Factual knowledge – easy to document; basic information about people/things.  Catalog knowledge – know where things are; like directories of expertise.  Cultural knowledge – knowing how things get done politically and culturally.
  • 48. 48 Process Knowledge Definition: Process knowledge refers to understanding the best practices, workflows, and methodologies used to complete tasks efficiently. It involves knowing how things should be done to optimize productivity and quality. Characteristics:  It includes step-by-step procedures and workflows.  Often developed through experience, training, and observation.  Can be standardized through documentation and automation.  Helps in continuous improvement and efficiency.
  • 49. 49 Examples:  A software company follows an Agile methodology for software development.  Manufacturing industries use Six Sigma to improve quality control.  Hospitals have standard operating procedures (SOPs) for patient care. Importance:  Reduces errors and inefficiencies in operations.  Improves consistency and quality of outcomes.  Enables new employees to learn processes quickly.  Helps organizations adapt to changing environments through refined workflows.
  • 50. 50 Factual Knowledge Definition: Factual knowledge consists of basic, objective, and easily verifiable information about people, objects, events, and concepts. It is often recorded and shared in structured formats like databases, manuals, or books. Characteristics:  It is static and does not change frequently.  Can be easily documented and shared.  Used for decision-making and reference. Examples:  A customer service representative needs customer details (name, address, contact information).  Engineers refer to mathematical formulas and physical laws.  An AI model needs a dataset containing facts about weather conditions to make predictions. Importance:  Forms the foundation for knowledge-based decision-making.  Essential for training AI models and automation.  Helps organizations store and retrieve critical information efficiently.
  • 51. 51 Catalog Knowledge Definition: Catalog knowledge refers to knowing where to find information, resources, or expertise when needed. It acts as a directory that helps individuals or teams navigate complex systems. Characteristics:  Helps in locating resources quickly.  Often stored in knowledge management systems, directories, or databases.  Used for networking and collaboration. Examples:  A university maintains a faculty directory with specializations and contact details.  IT support teams have a knowledge base for troubleshooting guides.  A hospital maintains a medical inventory catalog to track the availability of drugs and equipment. Importance:  Reduces time spent on searching for information.  Improves collaboration and communication.  Enhances decision-making by connecting people with the right expertise.
  • 52. 52 Cultural Knowledge Definition: Cultural knowledge refers to understanding how things get done within an organization, community, or society based on norms, traditions, and political structures. It includes unwritten rules, beliefs, and power dynamics. Characteristics:  Often implicit and learned through experience.  Can vary significantly between organizations and regions.  Helps in navigating workplace relationships and decision-making. Examples:  In a multinational company, employees need to understand cultural differences in communication styles.  A new employee in a government agency learns about bureaucratic procedures and how to get approvals efficiently.  In academia, researchers need to understand funding priorities and grant application processes. Importance:  Helps employees adapt to the workplace environment.  Avoids conflicts due to cultural misunderstandings.  Enhances leadership and teamwork by understanding organizational dynamics.
  • 53. KM involves four main processes 53  Generation – all activities that discover “new” knowledge.  Capture – all continuous processes of scanning, organizing, and packaging knowledge after it has been generated.  Codification – the representation of knowledge in a manner that can be easily accessed and transferred.  Transfer – transmitting knowledge from one person or group to another, and the absorption of that knowledge.
  • 54. Knowledge Generation 54  Concerns the intentional activities of an organization to acquire/create new knowledge.  Two primary ways are knowledge creation and knowledge sharing.  Methods include: – Research and Development – Adaptation – Buy or Rent – Shared Problem Solving – Communities of Practice.
  • 55. 55  Knowledge capture takes into account the media to be used in the codification process.  The 3 main knowledge capture activities are:  Scanning (gather “raw” information) – can be electronic or human.  Organizing (move it into an acceptable form) – must be easy for all types of users to access.  Designing knowledge maps (providing a guide for navigating the knowledge base)
  • 56. Designing Knowledge Maps 56  A knowledge map serves as both a guide to where knowledge exists in an organization and an inventory of the knowledge assets available.  A knowledge map can consist of nothing more than a list of people, documents, and databases telling employees where to go when they need help.  Provides access to resources that would otherwise be difficult or impossible to find.
  • 57. 57
  • 58. Knowledge Codification 58  Knowledge must be used or shared to be of value.  Codification puts the knowledge into a form that makes it easy to find and use.  It is difficult to measure knowledge in discreet units (since it changes over time).  Knowledge has a shelf life.
  • 59. Knowledge Transfer 59  Knowledge Transfer describe four different modes of knowledge conversion (transfer):  Socialization: from tacit knowledge to tacit knowledge  Externalization: from tacit knowledge to explicit knowledge  Combination: from explicit knowledge to explicit knowledge  Internalization: from explicit knowledge to tacit knowledge
  • 61. Similarities and Differences between BI and KM 61  Confusion between these two technologies comes from the fact that they deal with many similar processes.  Both business intelligence and knowledge management capture, collect, organize, analyze and aggregate data in order to find the best solutions regarding business decision making processes.  Business intelligence goes as far back as the 19th century and the beginnings of entrepreneurship and it has been developing steadily over many years.  BI enables organizations to integrate data across the enterprise, unlock the information and empower knowledge worker to make better (and faster) decisions – it focuses on explicit knowledge.  However, KM deals with the creation of new knowledge and the dispersion of existing knowledge throughout an organization – it encompasses both tacit and explicit knowledge, thus, we can say that KM can influence the very nature of business intelligence.
  • 62. Types of Decisions 62  Modern cloud-native organizations have constantly growing streams of raw data flowing from every corner of the enterprise.  Determining the impact this data has on business performance can be an overwhelming task requiring teams of analysts.  That’s where employing business intelligence (BI) can help.  By presenting current and historical data within a business context, the data insights supplied by BI tools enable organizations to make smarter, more confident decisions that provide strategic direction for years to come.  Instead of relying on intuition and “gut feel,” companies can use BI to find new ways to increase revenue, track performance, boost operational efficiency, identify market trends, expose problems, and much, much more.
  • 63. Decision-making 63  Simply put, decision-making is the process of deciding something, especially with a group of people.  From a business decision perspective, the aim is to achieve business objectives to satisfy stakeholder requirements, needs, and expectations.  For the decision to be effective, however, decision makers must forecast the outcome of each option and determine which is best for a particular situation.  That makes decision support systems (DSS) like decision intelligence and business intelligence absolute essentials.  Business intelligence refers to the technology tools and processes that enable businesses to organize, analyse, and contextualize business data from around the company.  Business intelligence tools and decision-making transform raw data into meaningful and actionable information.
  • 64. The role of business intelligence 64  Companies make big mistakes when they base business decisions on what they think will happen instead of relying on facts.  Using BI and advanced analytics, organizations can extract crucial facts from the mountain of data, transforming it into information companies can act on to make informed strategic decisions.  The result: improved business processes, operational efficiency, and business productivity.
  • 65. Business intelligence decisions 65  Business intelligence decisions typically fall into three categories: strategic, tactical, and operational.  An organization needs to gain a complete understanding of these types of decisions in business intelligence to make better-informed decisions that lead to increased customer retention, stakeholder satisfaction, operational efficiency, and revenue.
  • 66. The relationship between business intelligence and business analytics 66  Business intelligence tells you what is currently happening and what happened in the past to bring you to that state.  On the other hand, business analytics is an umbrella term for predictive data analysis techniques and prescriptive .  Using business intelligence and analytics efficiently is the difference between companies that succeed and those that fail in the modern environment.
  • 67. Three primary types of business intelligence decisions 67  Business intelligence supports the three types of decision- making mentioned above:  strategic  tactical  operational
  • 68. Strategic decisions 68  Strategic decisions comprise the highest level of organizational business decisions and are usually less frequent and made by the organization’s executives. Yet, their impact is enormous and far- reaching.  Some types of strategic decisions include selecting a particular market to penetrate, a company to acquire, or whether to hire additional staff.  Decisions made at this level usually involve significant expenditure.  However, they are generally non-repetitive in nature and are taken only after careful analysis and evaluation of many alternatives.
  • 69. Tactical decisions 69  Tactical decisions (or semi structured decisions) occur with greater frequency (e.g., weekly or monthly) and fall into the mid-management level. Often, they relate to the implementation of strategic decisions.  Examples of tactical decisions include product price changes, work schedules,departmental reorganization, and similar activities.  The impact of these types of decisions is medium regarding risk to the organization and impact on profitability.
  • 70. Operational decisions 70  Operational decisions (or structured decisions) usually happen frequently (e.g., daily or hourly), relate to day-to-day operations of the enterprise, and have a lesser impact on the organization.  Operational decisions determine the day-to-day profitability of the business, how effectively it retains customers, or how well it manages risk.  Answering a sales inquiry, approving a quotation, or calculating employee bonuses may be examples of this decision type.
  • 71. 71  Strategic: Long-term, complex, made by senior managers  Tactical: Medium-term, less complex, made by mid- level managers  Operational: Day-to-day, simple, routine, made by junior managers
  • 72. How to make the best decisions for your business 72  How do you make the best business decisions? Some people trust intuition or gut feeling. Others reach out to constituents and experts for advice.  Still, others decision-making to information systems and automation.  However, the smartest business decisions are made by those who look at the numbers.  In a competitive business landscape, where agility, flexibility, and a real-time decision-making process are critical and timely, accurate data analysis is more important than ever.  In that respect, relying on the types of decisions in business intelligence is non-negotiable.  It is required for long-standing success and market dominance.
  • 73. Decision Making Process 73  As a business owner, you need to make decisions that have a large impact on the success of your business.  You need to make decisions ranging from the small, such as what color your company logo should be, to the larger, such as whether to expand your business.  No matter what your decision, it should be based on facts, not emotions.  Decision-making is a crucial aspect of running a business. Whenever you decide what to do next, a chain of events is triggered, which may eventually lead to the outcome you desire.  These events constitute the Decision-Making Process using predictive analytics consulting.
  • 74. 74  The decision-making process is a term used to describe how a company gets to a point where it can make the best decision for its company.  This process is a way of thinking, a way of seeing the world, and a method of arriving at what you believe are the best possible decisions.
  • 75. 75  From the day a company is founded, it is believed that each executive should also be a manager in the right sense.  It is believed that the executives should be able to take the right action with the help of the appropriate decision-making process.  And the best decision-making process is one that is not only efficient but also cost- effective.
  • 76. 76  In order to make the best decisions, companies need to use tools to decipher their data and better understand their customers.  Business Intelligence (BI) is a set of tools used by companies to analyze data using data analytics tools, gain valuable insights, and make better and more informed decisions using Business Intelligence reports.  Since its early days, business intelligence has been a tool for decision-making. It’s a way to ensure that managers and executives make better decisions so that businesses can run more effectively.  These decisions can range from basic operations like deciding how much to produce to broader strategic considerations like deciding what products to sell.  However, BI tools vary considerably in how they are designed, how they interact with other applications, and how they are used. Ultimately, BI is about having access to the right information at the right time.
  • 77. THE EFFECTIVE STEPS IN THE DECISION-MAKING PROCESS THAT USES BI 77 Gathering information  If you’re like most of us, you’re using spreadsheets and dashboards to visualize and present data on theWeb.  But some businesses are turning to dashboards to make smart decisions that improve operations, enhance employee satisfaction, process more data, and have greater visibility into the bottom line.
  • 78. Design and analyze 78  Routinely encountering data requires a person to make decisions on how to process the data.  It can be viewed as a few steps involved in this process.  First, it involves the analysis of data, followed by the identification of the most appropriate decision- making model.  The implementation comes after the analysis of data.  The data analysis process is then repeated, and the model is altered based on the latest information or discoveries, if necessary.
  • 79. Select and implement using ad hoc query, what-if, and forecasting 79  The importance of data in business is beyond dispute. But, the problem is that most companies are not making the most of the information available to them.  What they have is not necessarily the right data and is not being put to its best use.  Ad hoc queries, what-if scenarios, and forecasting are particularly effective tools for uncovering hidden information.  It can influence risk and make decisions. Ad-hoc query,What- If, and Forecasting are three terms used in business intelligence to help make decisions.
  • 80. Do evaluations using the vital tools 80  Effective decision-making is critical to business success. The good news is that business intelligence tools can help you make better decisions.  Using dashboards and reports can help you identify key performance indicators, make comparisons between data sets.  Also, identify opportunities to improve your business. If you have any experience managing a business, you know that these vital tools are a necessity, providing you with the information you need to make business decisions.  These tools can be used for assessing your performance, motivating staff, and improving the customer experience.
  • 81. Separate the components that are not related 81  At one point in time, every decision we make appears to have little connection to anything else.  This is because we are able to separate decisions into distinctly different components, such as the sales forecast, the profit, the hiring policy, and so on.  The trick is to recognize that these components are inseparable. Moreover, a decision made without the sales forecast is like an engine without the air.  A decision made without a profit is like an engine without fuel.  A decision made without a hiring policy is like an engine without the air.
  • 82. Increase the coherence 82  Business intelligence is a technology that extracts and analyzes historical data to help organizations make better decisions.  It is a key asset for businesses that use it to gain value from all their data and to make better decisions on how to spend their resources.
  • 83. 83  There is a trend among different organizations in today’s world to adopt the use of Business Intelligence.  The use of business intelligence in these organizations helps in making decisions faster and more effectively.  With the help of business intelligence, organizations can keep track of their processes.  Moreover, the performance of their business, and the consumption of resources and get insights into the organization’s current situation.  Business intelligence solutions help in making more effective decisions.
  • 84. Decision Support Systems 84  Broadly speaking, a decision support system (DSS) is an analytics software program used to gather and analyze data to inform decision making.  There are many different types of decision support systems, from modern business intelligence which uses AI and machine learning to suggest insights and analyses for humans to perform, to model-based DSS systems which use predefined criteria to perform automated calculations and deliver best-case decisions.  For all types, DSS is used in timely problem solving to improve efficiency and streamline operations, planning and company management.
  • 85. Traditional vs Modern DSS 85  Traditional DSS: Historically, DSS and BI tools relied on preconfigured, historical data with no ability to drive real-time decisions and action. With this approach, decisions are made based on the past.  Modern DSS: New tools and processes allow for “active intelligence”, a state of continuous intelligence with an end-to- end analytics data pipeline delivering real- time, up-to-date information designed to trigger immediate insights and actions.
  • 86. DSS Characteristics 86  Prior to decision support systems, organizational leaders relied heavily on a combination of their experience and professional training, and applied those to thoughtful use of the advanced insights generated by a data analytics platform.  Decision support systems systematize that by taking organizational data, analyzing it, and presenting it for use in company decision making.
  • 87. Categories of decision support systems 87 Data-driven DSS  A data-driven DSS gives users access to a large amount of internal and external data.  This DSS will query a database using the web, an external server or a company's mainframe.  It relies on data mining to provide patterns and information about the data being assessed.
  • 88. 88  Users rely on data-driven decision support systems to make decisions about businesses, inventories and products.  Managers might find data-driven decision support systems most helpful when analyzing current and historical data to report on the conditions of a department or the business.  CEOs, managers and staff might use a data-driven DSS.  Software examples of a data-driven DSS include:  Geographic Information Systems (GIS)  File drawer systems  Executive information systems  Computer-based databases with query systems
  • 89. Model-driven DSS 89  A model-driven DSS allows a user to analyze and manipulate specific models of data, such as statistics, finances or scheduling.  These decision support systems are specific to the type of model the user wants to interact with and typically offer less data than other DSS types.  They analyze scenarios and data to allow the user to manipulate a model, such as creating a work schedule.  They might use simple analysis tools or complex statistics, depending on the model's purpose and the user's needs.  Managers, staff and third parties who interact with a business might use a model-driven DSS.
  • 90. 90 Software examples of a model-driven DSS include:  Scheduling software  Financial modeling  Decision analysis modeling  Optimization software
  • 91. Knowledge-driven DSS 91  With a knowledge-driven DSS, a knowledge-management system monitors continually updated data about an organization to support decisions.  The DSS uses diagnosis, prediction, interpretation and classification to recommend actions consistent with the business.  A knowledge-driven DSS can be helpful to managers because it performs tasks faster than a human might.They can also help consumers decide which products and services to buy.This kind of DSS often relies on a data-mining component.  Managers, staff and external users, such as customers, might use a knowledge- driven DSS.  Software examples of a knowledge-driven DSS include:  Software that identifies new or current customers who might be interested in products  Product selection software
  • 92. Document-driven DSS 92  A document-driven DSS retrieves unstructured information from a variety of electronic sources. It searches web pages, documents in databases and other information based on a user's search terms to gather relevant information.  A document-driven DSS might be specific to a business' private files or as broad as a common internet search engine.  Anyone using a database's search function or an internet search engine is using a document-driven DSS.  Software examples of a document-driven DSS include:  Search engines  Database search software  Article databases with search functions
  • 93. Communication-driven DSS 93  A communication-driven DSS uses tools to support communication and collaboration.  Email is an example of a communication-driven DSS.  This type of DSS includes share tools that allow multiple people to work on a project at once and software that allows for digital communication between people.
  • 94. 94  It improves a shared project's efficiency and effectiveness and can help facilitate meetings and conversations.  Internal team members, virtual business meeting hosts and online chat and video meeting software users can benefit from a communication-driven DSS.  A communication-driven DSS might also be called a group DSS.  A communication- driven DSS focuses on communication and collaboration, while a group DSS helps groups streamline the decision-making process.  A communication-driven DSS, for example, might help two people who work for the same company on different shifts share documents. It might also allow employees on opposite sides of the country to meet virtually to view a shared file.  Software examples of a communication-driven DSS include:  Chat and instant messaging services  Collaboration software, such as document sharing and editing software
  • 95. Intelligent DSS 95  Any DSS with artificial intelligence in its design is an example of intelligent DSS (IDSS).  Within an IDSS, AI does data mining and processing to filter through large datasets.  An IDSS is designed to offer similar services to a human consultant.  They're programmed to identify patterns and trends to guide decision-making.  They can also resolve problems and analyze solutions. AI components add advantages, such as fuzzy logic and machine learning, to a DSS.  Managers, diagnosticians and other decision-makers might use an IDSS.  Software examples of an intelligent DSS include:  Smart manufacturing systems  Medical diagnostic systems
  • 96. Manual DSS 96  A manual DSS relies on individuals instead of computers to support decision- making.  A group of experts analyses the strengths, weaknesses, opportunities and threats of their organization or project.  A manual DSS is much slower than a computer-based DSS, but certain types of analysis still need a human eye at every step.  Economists, executives and managers might use a manual DSS.  Examples of manual DSS include:  Cost-benefit analyses  Decision matrixes
  • 97. Hybrid DSS 97  A hybrid DSS combines parts of multiple DSS types to create a complex outcome.  Large issues in industries such as finance and health care might require the tools of multiple decision support systems, such as a knowledge-driven DSS and a data-driven DSS.  A hybrid DSS might use additional software to help these components work together.  Sometimes a human analyses and combines the results of each DSS.  A hybrid DSS might also describe a system in which a human works with a DSS to extract and manipulate data.  Medical professionals, financial decision-makers and researchers might use a hybrid DSS. Software examples of a hybrid DSS include:  Risk assessment  Clinical DSS  Web-based DSS
  • 98. Decision Support System Examples. 98  GPS route planning.  Crop-planning.  Clinical DSS
  • 99. Business Intelligence 99  Business intelligence combines business analytics, data mining, data visualization, data tools and infrastructure, and best practices to help organizations make more data-driven decisions.
  • 100. Techniques Used In Business Intelligence 100 1. DataVisualization  When data is stored as a set or matrix of numbers, it is precise but difficult to interpret.When looking at more than one dimension of the data, this becomes even harder. Creating charts, graphics or dashboards from the data makes it much easier for people to understand and interpret. 2. Data Mining  Data mining is a computer supported method to reveal previously unknown or unnoticed relations among data entities.
  • 101. Example 101  In retail: shopping basket analysis can examine products consumers buy together in order to better promote other products.  In banking: using an automated risk assessment based on historical data to evaluate whether a customer is likely to pay back a loan.  In insurance: mining behavioural and historical data to detect fraud.  In health: analysis of complications and common diseases may help to reduce risk.
  • 102. Reporting 102  One area where BI tools commonly help business users is by designing, scheduling and generating reports, for example regular performance, sales or marketing reports.  Reports output by BI tools efficiently gather and present information to support the management, planning and decision making process.  Once the report is designed it can be automatically run at set intervals and sent to a predefined distribution list so key people can see regularly updated numbers.
  • 103. Time-Series Analysis And Predictive Techniques 103  Nearly all data warehouses and all enterprise data have a time dimension.  For example, product sales, phone calls, patient hospitalizations, etc.  Time-series analysis can reveal changes in user behaviour over time, relationships between sales of different products, or changes in sales figures based on marketing promotions.  Historic data can also be used to extrapolate and try to predict future trends, outcomes or financial results.
  • 104. Online Analytical Processing (OLAP) 104  OLAP is best known for the OLAP-cubes which provide a visualization of multidimensional data.  OLAP cubes display dimensions on the cube edges (e.g. time, product, customer type, customer age etc.).  The values in the cube represent the measured facts (e.g. value of contracts, number of sold products etc.).  The user can navigate through OLAP cubes using drill-up, drill-down and drill- across features.  The drill-up functionality enables the user to easily zoom out to more coarse-grained details.  Conversely, drill-down displays the information with more details.  Finally, drilling-across means that the user can navigate to another OLAP cube to see the relations on another dimension(s).  All the functionality is provided in real-time.
  • 105. Statistical Analysis 105  Statistical analysis uses mathematic foundations to qualify the significance and reliability of the observed relations.  The most interesting features are distribution analysis, confidence intervals (for example for changes in user behaviours, etc.)  Statistical analysis is used for devising and analysing the results from data mining.
  • 106. The Necessary BI Skills 106  Data Analysis: Most BI skills and intelligence analyst-related skills are about using data to make better decisions.You need to be good at examining many different sources of data and then making accurate conclusions about them.  Problem-solving: BI isn’t just about analyzing data; it’s also about creating business strategies and solving real-world business problems with that data. For example, you could be the one to extract actionable insights from specific retail KPIs that need to be visualized and presented during a meeting.  Specific industry knowledge: While some of this can and will be learned on the job, you need to have a solid grasp of the industry’s dynamics, particularly the areas of the field that you’re looking to work in. Over time, you’ll want to become an expert in your industry as this will increase your ability to connect data with business problem-solving.
  • 107. 107  Communication skills: In addition to acquiring intelligence analyst-related skills, you’ll need to be able to communicate your findings effectively to the other professionals you’ll be working with. To some extent, if you work in back-end BI, you won’t need to communicate quite as much. However, if you work in the front- end, you’ll be responsible for communicating technical concepts to non-technical people. This kind of role requires excellent communication skills.  Data visualization: Expanding on the point above, in order to ensure good communication you will also need to have data visualization skills. Visualizations are the best tools to make trends and general insights understandable. Being able to clearly see how the data changes in time is what makes it possible to extract relevant conclusions from it. For this purpose, you should be able to differentiate between various charts and report types as well as understand when and how to use them to benefit the BI process.
  • 108. 108  Advanced vision and attention to detail: By its very nature, business intelligence is incredibly detail-oriented.As a BI analyst or developer, you'll often work with the smallest fragment of information with the objective of turning it into actionable insight. You will need a great deal of forward-thinking vision and the ability to pay very close attention to detail to succeed in the fast- paced world of BI.  Statistical analysis: Statistical knowledge is another important skill especially if you want to become a BI analyst. Understanding various statistical components such as mean, median, range, variance, and others, can enable you to go deeper into the data and extract relevant conclusions from it.
  • 109. 109  Programming knowledge: On a more technical side of things, having programming language knowledge can also be a very valuable skill when it comes to pursuing a career in BI. Many solutions require the use of different programming languages to perform advanced analysis such as R, Python, Javascript, just to name a few, and knowing them can significantly enhance your skillset.  Technical notion: Our next BI skill is not fundamental, but it can certainly make you a more complete and prepared professional. Business intelligence is an industry that highly relies on technology and having a technical notion of how to manage these technologies can be a plus. With this, we do not mean that you need to know how to use every tool in the market, but understanding how these technologies can work to your advantage.
  • 110. 110  Business acumen: To thrive in a business intelligence career, you will need to possess a swift ability to understand your company’s business model and how to tailor your efforts to not only gain maximum value from your key performance indicators (and the KPI management process) but also make strategic decisions that will help your organization succeed on a continual basis.
  • 111. 111  Benefits of business intelligence  Data clarity  Increased efficiency  Better customer experience  Improved employee satisfaction
  • 112. 112  How to develop a business intelligence strategy  A BI strategy is your blueprint for success. You’ll need to decide how data is used, gather key roles, and define responsibilities in the initial phases. It may sound simple at a high level; however, starting with business goals is your key to success.
  • 113. 113  How to create a BI strategy from the ground up:  Know your business strategy and goals.  Identify key stakeholders.  Choose a sponsor from your key stakeholders.  Choose your BI platform and tools.  Create a BI team.  Define your scope.  Prepare your data infrastructure.  Define your goals and roadmap.
  • 114. 114  Advantages of BI include:  Data visibility  Accurate reports  Streamlined processes  Disadvantages of BI include:  Initial cost  User resistance  Data skills gap
  • 115. OLAP 115  OLAP is an acronym for Online Analytical Processing. OLAP performs multidimensional analysis of business data and provides the capability for complex calculations, trend analysis, and sophisticated data modeling.
  • 116. 116  OLAP offers five key benefits:  Business-focused multidimensional data  Business-focused calculations  Trustworthy data and calculations  Speed-of-thought analysis  Flexible, self-service reporting
  • 117. Characteristics of OLAP 117  It defines which the system targeted to deliver the most feedback to the client within about five seconds, with the elementary analysis taking no more than one second and very few taking more than 20 seconds.
  • 118. Analysis 118  It defines which the method can cope with any business logic and statistical analysis that is relevant for the function and the user, keep it easy enough for the target client.Although some pre programming may be needed we do not think it acceptable if all Share.  It defines which the system tools all the security requirements for understanding and, if multiple write connection is needed, concurrent update location at an appropriated level, not all functions need customer to write data back, but for the increasing number which does, the system should be able to manage multiple updates in a timely, secure manner.
  • 119. Multidimensional 119  This is the basic requirement. OLAP system must provide a multidimensional conceptual view of the data, including full support for hierarchies, as this is certainly the most logical method to analyse business and organizations.  Information  The system should be able to hold all the data needed by the applications.  Data sparsity should be handled in an efficient manner.  OLAP Operations in the Multidimensional Data Model
  • 120. Roll-Up 120  The roll-up operation (also known as drill-up or aggregation operation) performs aggregation on a data cube, by climbing down concept hierarchies, i.e., dimension reduction. Roll-up is like zooming-out on the data cubes.  When a roll-up is performed by dimensions reduction, one or more dimensions are removed from the cube.
  • 121. 121
  • 122. Drill-Down 122  The drill-down operation (also called roll-down) is the reverse operation of roll-up.  Drill- down is like zooming-in on the data cube. It navigates from less detailed record to more detailed data.  Drill-down can be performed by either stepping down a concept hierarchy for a dimension or adding additional dimensions.  Figure shows a drill-down operation performed on the dimension time by stepping down a concept hierarchy which is defined as day, month, quarter, and year.  Drill-down appears by descending the time hierarchy from the level of the quarter to a more detailed level of the month.
  • 123. 123
  • 124. Slice 124  A slice is a subset of the cubes corresponding to a single value for one or more members of the dimension.  For example, a slice operation is executed when the customer wants a selection on one dimension of a three- dimensional cube resulting in a two-dimensional site.  So, the Slice operations perform a selection on one dimension of the given cube, thus resulting in a sub cube.
  • 125. 125
  • 126. Dice 126  The dice operation describes a sub cube by operating a selection on two or more dimension.
  • 127. Pivot 127  The pivot operation is also called a rotation. Pivot is a visualization operations which rotates the data axes in view to provide an alternative presentation of the data.  It may contain swapping the rows and columns or moving one of the row-dimensions into the column dimensions.
  • 128. 128
  • 129. 129
  • 130. Difference between OLTP and OLAP 130  OLTP (On-LineTransaction Processing) is featured by a large number of short on-line transactions (INSERT, UPDATE, and DELETE).  The primary significance of OLTP operations is put on very rapid query processing, maintaining record integrity in multi-access environments, and effectiveness consistent by the number of transactions per second.
  • 131. 131  OLAP (On-line Analytical Processing) is represented by a relatively low volume of transactions.  Queries are very difficult and involve aggregations. For OLAP operations, response time is an effectiveness measure.  OLAP applications are generally used by Data Mining techniques.  In OLAP database there is aggregated, historical information,
  • 132. 132
  • 134. Relational OLAP (ROLAP) Server 134  These are intermediate servers which stand in between a relational back-end server and user frontend tools.  They use a relational or extended-relational DBMS to save and handle warehouse data, and OLAP middleware to provide missing pieces.  ROLAP servers contain optimization for each DBMS back end, implementation of aggregation navigation logic, and additional tools and services.  ROLAP technology tends to have higher scalability than MOLAP technology.  ROLAP systems work primarily from the data that resides in a relational database, where the base data and dimension tables are stored as relational tables. This model permits the multidimensional analysis of data.  This technique relies on manipulating the data stored in the relational database to give the presence of traditional OLAP's slicing and dicing functionality. In essence, each method of slicing and dicing is equivalent to adding a "WHERE" clause in the SQL statement.
  • 135. 135
  • 136. Multidimensional OLAP (MOLAP) Server 136  A MOLAP system is based on a native logical model that directly supports multidimensional data and operations.  Data are stored physically into multidimensional arrays, and positional techniques are used to access them.  One of the significant distinctions of MOLAP against a ROLAP is that data are summarized and are stored in an optimized format in a multidimensional cube, instead of in a relational database.  In MOLAP model, data are structured into proprietary formats by client's reporting.
  • 137. 137
  • 138. Hybrid OLAP (HOLAP) Server 138  HOLAP incorporates the best features of MOLAP and ROLAP into a single architecture.  HOLAP systems save more substantial quantities of detailed data in the relational tables while the aggregations are stored in the pre-calculated cubes.  HOLAP also can drill through from the cube down to the relational tables for delineated data. The Microsoft SQL Server 2000 provides a hybrid OLAP server.
  • 139. 139
  • 140. Relational Online Analytical Processing (ROLAP) 140  ROLAP servers are placed between relational backend server and client front-end tools.  It uses relational or extended DBMS to store and manage warehouse data. ROLAP has basically 3 main components: Database Server, ROLAP server, and Front-end tool.
  • 141. 141  Advantages of ROLAP –  ROLAP is used for handle the large amount of data. ROLAP tools don’t use pre-calculated data cubes.  Data can be stored efficiently.  ROLAP can leverage functionalities inherent in the relational database.
  • 142. 142  Disadvantages of ROLAP –  Performance of ROLAP can be slow.  In ROALP, difficult to maintain aggregate tables. Limited by SQL functionalities.
  • 143. Multidimensional Online Analytical Processing (MOLAP) 143  MOLAP does not uses relational database to storage.  It stores in optimized multidimensional array storage. The storage utilization may be low With multidimensional data stores.  Many MOLAP server handle dense and sparse data sets by using two levels of data storage representation. MOLAP has 3 components : Database Server, MOLAP server, and Front- end tool.
  • 144. 144  Advantages of MOLAP –  MOLAP is basically used for complex calculations.  MOLAP is optimal for operation such as slice and dice.  MOLAP allows fastest indexing to the pre-computed summarized data.  Disadvantages of MOLAP –  MOLAP can’t handle large amount of data.  In MOLAP, Requires additional investment.  Without re-aggregation, difficult to change dimension.
  • 145. Hybrid Online Analytical Processing (HOLAP) 145  Hybrid is a combination of both ROLAP and MOLAP. It offers functionalities of both ROLAP and as well as MOLAP like faster computation of MOLAP and higher scalability of ROLAP.  The aggregations are stored separately in MOLAP store. Its server allows storing the large data volumes of detailed information.
  • 146. 146  Advantages of HOLAP –  HOLAP provides the functionalities of both MOLAP and ROLAP.  HOLAP provides fast access at all levels of aggregation.  Disadvantages of HOLAP –  HOLAP architecture is very complex to understand because it supports both MOLAP and ROLAP.
  • 147. 147
  • 148. 148  OLAP offers five key benefits:  Business-focused multidimensional data  Business-focused calculations  Trustworthy data and calculations  Speed-of-thought analysis  Flexible, self-service reporting
  • 149. 149  Advantages  Business-centered multidimensional information.  Business-centered figuring’s.  Dependable information and figuring’s.  Speed-of-thought examination.  Adaptable, self-administration detailing.
  • 150. 150  Disadvantages  Pre-demonstrating is an absolute necessity.  As to business information, the traditional OLAP tools don't take into consideration quick investigation without pre-demonstrating.  Extraordinary reliance on IT.  Helpless calculation capacity.  Shy Interactive examination capacity.  Slow in responding.  Theoretical model.  Extraordinary, expected danger.
  • 151. Analytic functions 151  Analytical functions are one of the most popular tools among BI/Data analysts for performing complex data analysis.  These functions perform computations over multiple rows and return the multiple rows as well.
  • 152. 152  analytic_function_name([argument_list]) OVER (  [PARTITION BY partition_expression,…]  [ORDER BY sort_expression, … [ASC|DESC]])  There are three parts to this syntax,  namely function, partition by and order by.
  • 153. 153  analytic_function_name: name of the function — like RANK(), SUM(), FIRST(), etc  partition_expression: column/expression on the basis of which the partition or window frames have to be created  sort_expression: column/expression on the basis of which the rows in the partition will be sorted
  • 154. 154
  • 155. 155  Trendline definition is a function that fits a linear or exponential model and returns the fitted values or model.  The numeric_expr represents the Y value for the trend.The series (time columns) represent the X value.
  • 156. 156  Syntax:  TRENDLINE(numeric_expr, ([series]) BY ([partitionBy]), model_type, result_type)  numeric_expr represents the data to the trend.This is theY-axis, usually a measure column.  series is the X-axis is a list of numerics or time dimension attribute columns.  partitionBY is a list of dimension attribute columns that are in the view but not on the X-axis.  model_type is one of the following (‘LINEAR’,‘EXPONENTIAL’).  result_type is one of the following (‘VALUE’,‘MODEL’).  ‘VALUE‘ will return all the regressionY values given X in the fit. ‘MODEL’ will return all the parameters in a JSON format string.
  • 157. 157  TRENDLINE((Sales),(Order Date) BY (Product Sub Category), 'LINEAR', 'VALUE')  numeric_expr = Sales  series = Order Date (Time dimension)  partitionBy = Product Sub Category  model_type = LINEAR  result_type =VALUE
  • 158. 158  CLUSTER((dimension_expr1 , ... dimension_exprN), (expr1, ... exprN),  output_column_name, options, [runtime_binded_options])  dimension_expr represents the list of dimensions, e.g., (productID, companyID), to be clustered.  expr represents the list of dimension attributes or measures to be used to  cluster the dimension_expr.
  • 159. 159  output_column_name is the output column.The valid values are ‘clusterId’, ‘clusterName’, ‘clusterDescription’, ‘clusterSize’,‘distanceFromCenter’,‘centers’.  options mean the string list of name=value pairs separated by ‘;’. The value can include %1 … %N, specified using runtime_binded_options.  runtime_binded_options is an optional comma-separated list of run-time binded columns or literal expressions.
  • 160. 160  OUTLIER((dimension_expr1 , ... dimension_exprN), (expr1, .. exprN), output_column_name, options, [runtime_binded_options]))])  dimension_expr indicates a list of dimensions.  expr represents a list of dimension attributes or measures to find outlier.  output_column_name indicates the output column name. Valid values are 'isOutlier' and 'distance'.  options indicates a string list of name/value pairs separated by a semi-colon (;).  The value can include %1 ... %N, which can be specified using runtime_binded_options.  runtime_binded_options is an option comma separated list (,) of run-time binded columns and or literal expressions.  REGR(y_axis_measure_expr, (x_axis_expr), (category_expr1, ..., category_exprN), output_column_name, options, [runtime_binded_options])