SlideShare a Scribd company logo
1
Ertek G., Sevinç, M., Ulus, F., Köse, Ö., Şahin, G. (2014) “Industrial Benchmarking
through Information Visualization and Data Envelopment Analysis: A New
Framework”, in I. Osman, A.L. Anouze and A. Emrouznejad (eds.) Strategic
Performance Management and Measurement Using Data Envelopment Analysis,
DOI: 10.4018/978-1-4666-4474-8, ISBN13: 9781466644748, IGI_Global (In Press).
Note: This is the final draft version of this paper. Please cite this paper (or this final
draft) as above. You can download this final draft from the following websites:
http://research.sabanciuniv.edu
http://ertekprojects.com/gurdal-ertek-publications/
Industrial Benchmarking through Information Visualization
and Data Envelopment Analysis:
A New Framework
Gürdal Ertek, Mete Sevinç, Firdevs Ulus, Özlem Köse, Güvenç Şahin
Sabanci University, Manufacturing Systems/Industrial Engineering Program
Orhanli, Tuzla, Istanbul, 34956, Turkey
ABSTRACT
We present a benchmarking study on the companies in the Turkish food industry based on
their financial data. Our aim is to develop a comprehensive benchmarking framework using
Data Envelopment Analysis (DEA) and information visualization. Besides DEA, a traditional
tool for financial benchmarking based on financial ratios is also incorporated. The
consistency/inconsistency between the two methodologies is investigated using information
visualization tools. In addition, k-means clustering, a fundamental method from machine
learning, is applied to understand the relationship between k-means clustering and DEA.
2
Finally, other relevant data, apart from the financial data, is introduced to the analysis
through information visualization to discover new insights into DEA results. This study uses
information visualization to both explore and reveal the relationships between the different
methodologies of financial benchmarking and gain practical insights on the Turkish food
industry. The results show that the framework developed is a comprehensive and effective
strategy for benchmarking; it can be applied in other industries as well. As a result, our study
contributes to the DEA benchmarking literature with a novel methodology that integrates the
various benchmarking methods from the fields of operations research, machine learning, and
financial analysis.
Keywords: Financial Benchmarking; Data Envelopment Analysis (DEA); Information
Visualization; Financial Ratios; K-Means Clustering; Tile Visualization; Graph Visualization;
Turkish Food Industry; Istanbul Chamber of Commerce (ISO).
INTRODUCTION
Benchmarking enables companies to see their positions relative to their competitors in order
to explore the opportunities to improve their market position. The response to how
benchmarking deals with this problem is disguised in its definition: “Benchmarking is the
process of continuously measuring and comparing one’s business processes against
comparable processes in leading organizations to obtain information that will help the
organization identify and implement improvements” (Andersen & Jordan, 1998). Various
fields of science including computer science, management and operations research, apply
alternative methods of benchmarking, such as information visualization (e.g. Self Organizing
Maps, SOM), financial ratios (provided that the data is appropriate for financial
benchmarking), and analytical methods (e.g. Data Envelopment Analysis, DEA).
This study proposes a framework that integrates a multitude of methods for the purpose of
comprehensive benchmarking. As an analytical approach, DEA is employed due to its
eligibility among all other methods for its advantage of being a nonparametric technique
requiring fewer assumptions (Weill, 2004). Weill (2004) shows that DEA is consistent with
standard measures of performance such as the Stochastic Frontier Approach (SFA) and the
Distribution-Free Approach (DFA). In addition to its consistency, DEA can be employed to
investigate the reasons for a company’s inefficiency while showing how much change is
needed to achieve efficiency (Galagedera & Silvapulle, 2002).
We combine and compare the DEA results with the insights obtained by other
methodologies, including financial ratios, and the results of k-means clustering, as well as
other relevant data. Information visualization schemes are extensively and primarily used in
3
a second phase of the analysis in order to bring together the outcomes from all mentioned
methods and make them more comprehensive. The steps of the analysis are formalized
within an integrated framework. Via this framework, a researcher can combine different
benchmarking approaches and data mining techniques to obtain useful insights, as well as to
compare the results of the different approaches against each other.
The benchmarking data used in this study is derived from the 2010 Istanbul Chamber of
Industry (ISO) List that ranks the top 500 Turkish companies of different industries, and
provides financial and other relevant data regarding these companies. In this study, the food
industry is selected because it is one of the most developed industries in Turkey and a
significant number of companies from this industry are listed in the ISO 500 list.
The complete ISO 500 dataset (for 2000), that includes all the industries, has been subject of
earlier research (Ulucan, 2002), and we have selected our inputs and outputs in coherence
with this earlier study. While we have not been able to find the efficiency analysis of food
industry in Turkey, there have been studies focusing on other industries, such as insurance
(Çiftçi, 2004), cement (Karsak and Iscan, 2000), and automative (Bakırcı, 2006).
Meanwhile, there exists literature that uses DEA for the benchmarking of food industry or its
subsectors in other countries such as agriculture industry in Scotland (Barnes, 2006), food
manufacturing plants in the USA (Jayanthi, 1999), productivity growth of Indian food
industry (Kumar and Basu, 2008), swine industry in Hawaii (Sharma, Leung and Zaleski,
1997), meat products industry in Greece (Keramidou, Mimis and Pappa, 2011), strawberry
greenhouses in Iran (Banaeian, Omid and Ahmadi, 2011), and food industrial companies in
Taiwan (Wongchai, Tai and Peng, 2011). An interesting related study by Dadura and Lee
(2011) uses DEA for benchmarking the innovativeness of Taiwanese food companies.
Turkish Food Industry Dataset
Sixty-three companies in the Turkish food industry from the 2010 ISO-500 list are included
in the study. Each company is presented by its symbol in the Istanbul Stock Exchange (ISE);
for the unlisted companies, symbols, which evoke the name of the company, are given. Based
on the main food groups, the information regarding the subsector(s) that the companies are
involved are appended to the original dataset. These subsectors are organized as Dairy, Flour,
Beverage, Sugar, Sea Products, Meat, Poultry, Snack, Oil, Hazelnut, Canned Products, Animal
Feed, and other. Data for companies such as Unilever Turkey, a large player in the food
industry, was omitted due to missing data. Similarly, Arbel, a leading agricultural producer,
was dropped out of calculations due to negative value for one of the inputs.
The financial data used in DEA are the Equity, Net Asset, Gross Added Revenue, Profit (from
operations) and Export values of each company. Apart from the financial ones, some other
4
useful data such as (number of) Employees and Chamber of Commerce of each Company, are
included in our visual analysis.
These variables can be used for the evaluation of companies in any industry and this is
exactly what we intend. The main contribution of the paper is not the analysis of the food
industry in Turkey, but the development and the demonstration of a new DEA based analysis
framework. Thus, we intended our inputs and outputs to be such that the framework can be
applied or adapted to any industry, including food industry, as well as others, such as
automobile, machinery, logistics, construction, and many others. The selected inputs and
outputs are the drivers and enablers, as well as the basic constructs of the framework. The
selected inputs and outputs are among the most basic and popular financial and operational
attributes used for benchmarking in any industry, and this is precisely the reason why the
Istanbul Chamber of Commerce (ISO) selects and publishes the values for these attributes
from among many others. Thus, the selection of the attributes by the Istanbul Chamber of
Commerce is a reliable validation for our selection. Furthermore, these attributes are also
crucial for the calculation of the financial ratios, which are essential to the multi-perspective
nature of our framework.
From among the selected attributes, Employees, Equity and Net Assets have been selected as
inputs since these are operational and financial resources a company uses in conducting its
business. On the other hand, Gross Added Revenue and Profit have been selected as outputs,
since these are the financial performance measures for assessing the performance of a
company.
METHODOLOGY
Data Envelopment Analysis (DEA)
DEA is an approach used to measure efficiency (or productivity) of an activity or an entity,
denoted as a Decision Making Unit (DMU), (Cooper et al., 2006). Typically, a multitude of
inputs affect a multitude of outputs for a DMU. It is a challenge to assess how much weight
an individual input has with respect to the determination of outputs. DEA does not make any
assumptions to choose these weights; instead, the weights are directly derived from the data.
The DEA model itself chooses the “best” weights, which have the most adequate properties
for the DMU benchmarking. Thus, these weights are not given, but they are defined as
variables. Since the weights of the inputs and outputs are determined from the data, there is
no need for the units to be congruent. In other words, the units can be of different types (e.g.
number of workers, number of machines, dollars). The DMUs with the highest rank of
efficiency levels together determine the “efficient frontier” while the efficiencies of other
(inefficient) DMUs are calculated according to this efficient frontier. The efficient DMUs that
5
reside on the efficient frontier and have an efficiency score of 1 are either same as or superior
to the inefficient DMUs (that have efficiency less than 1) in all productivity dimensions;
however, they are inferior to each other in at least one dimension. Each inefficient DMU has
its reference set which consists of the efficient DMUs that are used to calculate the efficiency
of inefficient DMU.
The objective of the input-oriented model is to minimize the input by concurrently keeping
the output at least at the current level. The objective of the output-oriented DEA models is to
maximize the output level by making use of at most the same amount of inputs. In our
analysis, output-oriented BCC model has been applied.
The BCC model by Banker, Charnes & Cooper (1984) is a well-known DEA model; it has
almost all of the characteristics of the earlier CCR model. However, unlike the CCR Model,
the BCC Model enables the problem to be variable-returns-to-scale. This property is assured
through a convexity constraint, which makes the production frontier to be spanned by the
DMU a convex hull. As a result, we employ the BCC Model as it can accommodate variable-
returns-to-scale without restriction to constant-returns-to-scale; when all the inputs increase
by a constant, the output levels do not change.
The input and output determination is one of the challenging steps in performing a DEA
study. The dataset for this case study includes financial data, and unfortunately, a standard
scheme for the input and output selection does not exist in the financial benchmarking
literature. Therefore our choice of input and output measures are based on the idea that
achieving more of an output with less of an input is always preferred in DEA. The inputs are
selected as Employees, Equity and Net Assets; outputs are selected as Gross Added Revenue
and Profit (from operations)
In our study, DEA computations have been carried out using the Smart DEA Solver software
(Akcay et al., 2012), which generates the DEA results in a tabular structure, making it easy to
analyze the results using readily available data mining and information visualization
software.
Financial Ratios
Can we determine a company’s success by analyzing its financial data only? Although one
possible way is to examine the growing revenue or positive feedback from customers, one
needs more concrete ways to both measure performance and make comparisons within a set
of companies. One of these methods is benchmarking companies through the comparison of
financial ratios. Financial ratios are ratios calculated based on financial data extracted from
the balance sheet and/or the income statement (Bragg, 2006). Financial ratios are commonly
used in comparing companies with respect to the standardized industry level. They are also
used in evaluating a company in terms of its past and current performances.
6
In general, financial ratios can be categorized in five groups:
 Group I measures how efficient a company uses its assets to make profit. Some ratio
examples for Group 1 are Gross Margin, Efficiency Ratio, and Operating Leverage.
 Group II assesses the utilization of assets to generate profits. Some examples of ratios
in Group 2 are Return on Net Assets, and Return on Equity.
 Group III determines the company’s leverage level. Debt to equity ratio and the
interest coverage ratio fit are included in this group.
 Group IV measures the liquidity level of the company. A popular ratio used for this
purpose is the Current Ratio.
 Group V is used for stock valuation; examples include price/earnings ratio, and
price/sales ratio.
It is easy to find several case studies where financial ratios are used as means of
benchmarking. A study on Motorola Company (Collier, H.W., 2004) uses financial ratio
analysis to study the operating characteristics of Motorola by comparing the company to two
other. Another example considers 20 different financial ratios in evaluating the Critical
Access Hospitals (CAH) in Baltimore, Maryland (Pink et al., 2006), using health care
information system (Flex Monitoring Team, 2005). The authors discover that CAHs without
long-term care make more profit, have fewer liabilities, are more liquid and have higher
beneficial use of beds when compared with the long-term care CAHs.
Our study uses the turnover ratio which is the ratio of sales revenue to net assets. The
justification for this preference is two-fold. First of all, the turnover ratio helps to make
logical comparisons among companies in terms of their efficiencies. It is the measure of the
overall sales for each dollar of asset that the company has. If the turnover ratio of the
company is high, it shows that they manufacture fast and put their products out on the
market quickly. Secondly, the turnover ratio data is widely available, or easily computable
from widely available sales revenue and net assets data.
A review on the use of financial ratios for measuring efficiency yields several results. The first
group of studies uses both ratio analysis and DEA by comparing and/or combining them.
Biener and Eling (2011) use DEA in order to evaluate the performance of micro-insurance
programs, and also financial techniques such as bootstrapping and truncated regression
analysis, and they compare the two results. Feroz et al. (2003) claim that DEA can provide
additional benefits for the analyst besides ratio analysis. Hollingsworth and Smith (2003)
explain why sometimes DEA results are not sufficient while ratio analysis might yield better
results, and they also discuss why an analyst has to use BCC model in DEA when taking
financial ratios into the model as inputs. The second group of papers uses financial ratios as
inputs and/or outputs in their DEA models. Yeh (1996) also constructs a DEA model using
7
financial ratios as inputs and evaluates the performances of Taiwanese banks. Wen, Lim and
Huang (2003) evaluate a DEA model to measure the efficiency of e-commerce business
models, again using financial ratios as outputs in the DEA model.
K-Means Clustering
One of the most popular data mining techniques, k-means clustering, is an algorithm to
cluster entities into k partitions based on their attributes (Han and Kamber, 2006). This
technique starts with a set of entities and their attributes. It is initiated by defining a specified
number of clusters randomly or by using some heuristic data to find the centroid (mean
point) of each cluster. Then, the algorithm builds new clusters in which all entities belong to
the nearest centroid. The “distance” here is the aggregate distance between the cluster
centroid and the actual attribute values of entities. Subsequently, the algorithm finds the
centroids of the new clusters and repeats itself until the total number of moves of all data
objects from a cluster to another is zero. At the end of an epoch, which consists of N
iterations (where N is the number of elements in the dataset), the algorithm may end up with
fewer clusters than the number specified at the beginning.
In this study, the attribute set is selected based on our input-output measures for the DEA
study since we aim to analyze the relationship between these two approaches. The algorithm
is initially applied with 20 clusters while it finally ends up with five.
In the related literature, there are several studies that combine DEA with clustering
methodologies. Sharma and Jin (2011) cluster the DEA results in their paper using self
organizing maps. Yang (2010) clusters DEA results in order to perform cross-system
evaluation of 982 branches of a Canadian bank. Zhou, Huang and Hsu (2008) integrate the
DEA results as inputs into k-means clustering to evaluate the efficiencies of hotel services in
31 regions of China. Samoilenko and Osei-Bryson (2008) perform a clustering of DEA results
and also utilize decision trees in order to visualize the resulting clusters.
Data and Information Visualization
Data visualization is an indispensible method of data analysis, which is highly useful for
detecting outliers, finding patterns that may not be seen by some standard methods of data
mining, and accordingly, asserting new hypotheses. The biggest advantage of data
visualization is that it enables even the novice analysts to interact with and learn from data
due to the advanced cognitive capabilities of the visual brain. One of the well-known
references in data visualization literature is Chambers et al. (1983), which includes plenty of
earlier statistics-related visualization techniques, such as quantile plots, histograms, box
plots, symmetry plots, and scatter plots. Nevertheless, most of these techniques could not
8
pass beyond supporting statistical analysis; there was not effective usage of colors and
patterns, and it was very difficult to visualize larger datasets in most cases.
In 1990’s, with the developments in the field of computer science, the graphical methods of
1980’s have elevated into a new level, being referred to as information visualization. The idea
in information visualization is to identify undiscovered patterns, come up with actionable
insight and constitute maximum benefit out of data, particularly when the data is large-scale
and complex. The new era in information visualization is built on ideas from data mining,
statistics, and computer graphics, and it compensates for the shortcomings of graphical
methods of 1980’s. Well-known references on information visualization include Spence
(2001), Soukup & Davidson (2002), Keim (2002), and Hoffman and Grinstein (2002).
This study employs three types of visualization tools. The tool known as “colored scatter
plot”, developed in the 1980’s, is also referred to as “starfield display” in the literature. Even
though this tool is very simple, it can facilitate comprehension of the data to a maximum
extent. We use the Miner3D software (Miner3D) to perform the starfield display. In addition
to the starfield display, “the tile visualization” is applied to the dataset, which makes it
possible to group the data according to some key characteristics and obtain further insights
about it. The Visokio Omniscope software is used for the tile visualization (Visokio
Omniscope) tool. Finally, a “graph structure” is implemented to identify relationships
between the DMUs, according to the DEA results. Via the implementation of this
visualization tool, some additional analyses, which the study is unable to carry out by using
the other two tools, are performed. The yEd - Graph Editor (yWorks), a software drawing
graphs by using several layouts, is employed to obtain the graphs.
PROPOSED FRAMEWORK
The proposed framework is illustrated in Figure 1. The database that contains the complete
data is queried to select the data for DEA and clustering. Clustering, financial ratio analysis
and DEA are independently conducted, and the results are obtained in tabular form. Then
different visualizations (VIS 1a, 1b, 2, 3, and 4) are constructed using the various
combinations of the results and other data in the database in order to derive deeper insight
about the benchmarked DMUs and the domain.
Earlier studies in literature have extensively used information visualization for the analysis of
DEA results. Ulus et al. (2006) analyze the transportation companies traded in NYSE (New
York Stock Exchange) by visualizing the efficiency scores against various attributes of the
companies, implementing VIS1a in Figure 1. Ertek et al. (2007) analyze the apparel retail
industry in Turkey, introducing VIS1b in Figure 1 in addition to VIS1a. Ertek et al. (2012)
introduce the graph visualization of the reference sets, corresponding to VIS4 in Figure 1.
9
Osther studies that employ visualization for understanding DEA results are reviewed in Ertek
et al. (2012).
To perform the complete analyses, we have followed the instructions as outlined in the
framework, and the resulting visualizations are categorized into four main types. First three
types of visualizations are displaying the relationship between the efficiency scores of DEA
and
(i) other relevant data taken from the company database (VIS1a and VIS1b),
(ii) the turnover ratios of each company (VIS2), and
(iii) the cluster indices by k-means clustering (VIS3).
Finally, in the fourth type of visualization, reference sets of each DMU, which are found as a
result of DEA analysis, are combined with company related data and interpreted accordingly
(VIS4). We now present the results based on four different visualization methods that rely on
the results of our three benchmarking techniques in the subsequent sections.
10
Figure 1. The proposed analysis framework.
DEA Results vs. Relevant Data from ISO 500 (VIS1)
With the DEA analysis, it is usually expected that the efficiency scores should increase as the
output values increase and input values decrease. However, since DEA does not take only one
output or one input value into consideration, this expectation does not have to hold all the
time. In order to see the relationship between the input/output values and the efficiency
11
scores, we use starfield visualization. Figure 2 shows the relationship between the sales
revenue and the efficiency values.
Figure 2. Efficiency vs. SalesRevenueM; red colored circles indicate the SNACK subsector
(VIS1a).
In Figure 2 (VIS1a), the X-axis denotes to Sales Revenue value in millions, whereas the Y-axis
denotes the Efficiency scores. The circles colored in red are denoting the companies in the
Snack subsector. Figure 2 shows that the DMUs having a sales revenue figure larger than
1000 million TL are all inefficient. Companies OZYLMZ, KEREVIT, PROGIDA, NATURA,
KESKINK form a cluster on the upper left side of the graph having low Sales Revenue (lower
than 400), and the only efficient company is NATURA. As one can see in the above figure, the
companies NATURA and ULKERB in Snack industry are located on the top area and are
efficient with notably lower Sales Revenue. Another insight reveals that company ETI has the
highest Sales Revenue with 692 million TL, and it is not efficient, indeed. This shows that
having a higher sales revenue does not necessarily guarantee efficiency. On the contrary, one
can observe that those companies with relatively higher sales revenues are generally
inefficient in the Snack subsector. KENT and ULKERC are represented as two red circles on
the lower side of the chart. These two companies have high sales revenues of 464 million and
674 million, respectively, but they are the two least efficient companies among the Snack
companies from the ISO 500 list.
12
Figure 3. Efficiency vs. Employees; red colored circles indicate the DAIRY subsector
(VIS1a).
In Figure 3 (VIS1a), the X-axis denotes to the number of Employees in a company, whereas
the Y-axis denotes to the Efficiency scores. The circles colored in red denote the companies in
Dairy industry. The two efficient companies are SUTAS and NATURA. We can note that the
number of employees of a Dairy company, is not an indicator of its efficiency or inefficiency.
While SUTAS has the highest number of employees among all Dairy companies, and
NATURA has low employee value of 322, both of them are efficient companies. The two
companies with more than 3750 employees are inefficient; CAY and TRSEKER are both
state-owned companies. Following CAY and TRSEKER, the two companies with the highest
number of employees are BANVIT and SUTAS, and they are both efficient. In Figure 3, we
can also remark that SUTAS is the most efficient company with the largest number of
employees in the Dairy subsector.
13
Figure 4. Rank2010 vs. Rank2009; lighter colors denote higher efficiency values.
In Figure 4 (VIS1a), the X-axis denotes the rank of the company in the ISO 500 list in 2009,
whereas the Y-axis denotes the rank of the company in the ISO 500 list in 2010. The shades
of grey indicate efficiency values; lighter coloring denotes higher efficiency while the white
circles represent the most efficient companies. ORKA, ELITA, AKOVA, KEREVIT, TORUN,
MATLI, TURYAG, NAMET, BESLER, lined up on the right hand side of the chart were not in
the list in 2009, and they made it to the list in 2010. The only efficient company among those
is KEREVIT, represented in the graph with white.
Figure 5 (VIS1b) illustrates the tile visualization of ISO 500 Food Companies, grouped
according to Provincial Chamber of Industry. Each company is represented by a rectangle,
and the size of the rectangle denotes the net asset value of the company. This visualization
enables the analyst to focus on the patterns within a city to compare patterns of different
cities. For example, comparing the companies in Istanbul Chamber of Commerce to others,
we can observe that the total net assets is very large in Istanbul, and there is only a single
company, ULKERC, that has very low efficiency score. Within Istanbul, EFES is efficient,
while having a great amount of net assets. Ülker Company Group, the largest food group in
Turkey, operates in Snack industry with two brands, ULKERC and ULKERB, which produce
chocolate and biscuits respectively, and the efficiency gap between the two brands is
excessive.
14
Figure 5. Grouping by Chamber, color denotes efficiency whereas size denotes net assets
Analysis of DEA Results vs. Financial Ratios
In Figure 6 (VIS2), the X-axis denotes the turnover ratio, while the Y-axis denotes the
Efficiency Score. The circles colored in red are in Hazelnut (and its byproducts) subsector.
Turnover ratio is obtained by dividing the Net Assets of a company by its Sales Revenue.
Turnover ratio is a success indicator as to how efficient a business is in terms of generating
sales revenue by using its net assets. We can observe that the two outlier companies
PROGIDA and OZYLMZ are not only efficient in the Hazelnut subsector , but are also the two
efficient companies with the highest turnover ratios (87% and %34 respectively) within the
entire dataset. Financial ratios and DEA results produced consistent results for these two
companies. However, this may not always hold true, and considering the input and output
relations multi-dimensionally using DEA might yield significantly different results. For
instance, the two companies with negative turnover ratios on the right side of the chart are
KEREVIT and NATURA. This particular outcome shows that DEA results are not by
15
themselves sufficient for the analysis of efficiency, and financial ratios should also be
considered. Yet, a multi-dimensional benchmarking method such as DEA is necessary, in
addition to ratio analysis. For example, even though SEKERP has higher turnover ratio, it’s
not very efficient according to DEA results.
Figure 6. Efficiency vs. Turnover; red color denotes companies in the hazelnut subsector
(VIS 2).
Figure 7. Efficiency vs. ClusterID; lighter colors denote higher efficiency values.
16
Analysis of DEA Results vs. k-Means Clustering Results
In Figure 7 (VIS3), the X-axis denotes the ClusterID, while the Y-axis and the coloring
denotes Efficiency scores. This chart portrays the results of DEA together with the results of
k-means clustering. K-means clustering was applied with Miner3D software using 20 initial
clusters, and running the algorithm for 200 iterations. The variables used in clustering are
the five inputs and outputs used in DEA model. The clustering algorithm ended with five
clusters, three of which (clusters 4, 8, 19) consist of only a single company. Clustering
considers the magnitude of the variable values, ignoring the input-output relationships.
Therefore, it is expected that clustering results may not be consistent with DEA results. This
expectation is observed in our case study; clusters 2 and 3, which contain many companies,
have a high level of variability in efficiency values within the clusters. For example, in cluster
2, where 15 food industry companies with similar magnitudes are clustered, one can observe
very inefficient companies such as KENT and ULKERC, besides a very efficient company
such as BANVT. Similarly, cluster 3 contains the very inefficient POYRAZ, as well as the
efficient NATURA. Figure 8 clearly shows that DEA analysis brings a much better
understanding upon the clustering results. Meanwhile, clustering helps us interpret the
results of DEA even better. For example, a known property (and also a typical shortfall) of
DEA is that it might declare outlier observations as efficient, due to their extreme values in
one or two dimensions. In Figure 8, EFES, an efficient company, is seen as the single member
of cluster 19, which suggests that the efficiency of EFES might be due to its being an outlier in
magnitude in comparison to the other companies. Similarly, CAY and TRSEKER are
obviously outlier companies, which appear as the single members of two different clusters.
These two companies are inefficient, suggesting that their low scores might be due to the
extremely high values of some of its inputs.
Analysis of Reference Sets According to the DEA Results
Figure 8 (VIS4) presents the visualization of the reference sets for the Snack companies
within the food industry; each node represents a company, and the darker colors denote
larger number of employees in the company. The arcs emanating from node i to node set J
denote that J is the reference set for company i. The thickness of an arc denotes the weight
for that reference. Efficient companies are those that have arcs emanating and terminating at
themselves. One can observe two groups of companies in Figure 8. The first group has ETI,
NATURA and ULKERB as the role model, and the second group takes TAM and ORKA as the
role model. NATURA exhibits an efficient business model that should be followed by other
companies, as it manages to be efficient with relatively low number of employees. The nodes
with the same inbound and outbound arc signifies that these companies are efficient.
17
Figure 8. Visualization of Reference Sets; colors denote magnitude of the number of
employees.
Conclusions and Future Work
In this chapter, we carried out a financial benchmarking of food companies in Turkey using
DEA, and compare and combine the DEA results with the results of two other popular
benchmarking/analysis methods, namely financial ratio analysis and k-means clustering.
Three different information visualization techniques (starfield visualization, tile visualization
and graph visualization) were implemented for visual comparisons and analysis.
The study confirms the earlier results in literature in the sense that the results of various
methodologies are not necessarily consistent with each other. Therefore, in order to perform
comprehensive and fair benchmarking, one needs to employ multiple methods of
benchmarking, and interpret the results of these methods together to get the correct results.
However, it is difficult to choose the methodologies to employ and interpret them in an
acceptable way. The integrated benchmarking framework presented in this chapter serves to
this purpose. To the best of our knowledge, such a comprehensive integrated framework does
not exist in the existing literature. Therefore, this study makes a methodological contribution
to the benchmarking literature, besides presenting a benchmark of the Turkish food industry.
This work can be followed with a crucial research area in which one can carry out formal
statistical tests to prove or disprove the hypotheses that are claimed. Moreover, choosing
more appropriate financial data, one can use a larger set of financial ratios to compare
against the DEA results in order to obtain deeper insights and to compare the different
methodologies integrated here.
18
References
Akcay, A.E., Ertek, G., & Buyukozkan G., (2012). Analyzing the solutions of DEA through
information visualization and data mining techniques: SmartDEA framework. Expert
Systems with Applications, 39, 7763–7775.
Andersen, B., & Jordan, P. (1998). Setting Up A Performance Benchmarking Network.
Journal of Production Planning & Control: The Management of Operations, 9(1), 13-19.
Bakırcı, F. (2006). Sektörel Bazda Bir Etkinlik Ölçümü: VZA ile bir analiz. İktisadi ve İdari
Bilimler Dergisi, 20(2).
Banker, R., D., Charnes, A. & Cooper, W. W. (1984). Some models for estimating technical
and scale inefficiencies in data envelopment analysis. Journal of Management Science,
30, 1078–1092.
Barnes, A.P. (2006). Does multi-functionality affect technical efficiency? A non-parametric
analysis of the Scottish Dairy Industry. Journal of Environmental Management, 80, 287
– 294.
Banaeian, N., Omid, M. & Ahmadi, H. (2011) Application of Data Envelopment Analysis to
Evaluate Efficiency of Commercial Greenhouse Strawberry. Research Journal of
Applied Sciences, Engineering and Technology 3(3), 185-193.
Biener, C., & Eling, M. (2011). The Performance of Microinsurance Programs: A Data
Envelopment Analysis. The journal of Risk and Insurance, 78(1), 83-115.
Bragg, S.M. (2006). Financial Analysis: A Controller’s Guide,Wiley, 2nd Edition.
Chambers, J. M., Cleveland, W. S., Kleiner, B., & Tukey, P. A. (1983). Graphical methods for
data analysis. Pacific Grove, CA: Wadsworth & Brooks/Cole Publishing Company.
Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making
units. European Journal of Operational Research, 2(6), 429-444
Cooper, W. W., Seiford, L. M., & Tone, K. (2006). Introduction to data envelopment analysis
and its uses. New York, NY: Springer.
Çiftçi, H. (2004). Türk Sigorta Sektörünün Sorunları; DEA Analizi ile Türk Sigorta
Şirketlerinin Etkinlik Düzeylerinin Belirlenmesi. Çukurova Üniversitesi Sosyal Bilimler
Enstitüsü Dergisi, 13(1), 121-149.
Dadura, A. M & Lee, T. R. (2011) Measuring the Innovation ability of Taiwan’s food industry
using DEA. Innovation: The European Journal of Social Science Research 24(1)-2,
151-172.
Ertek, G., Can, M. A., & Ulus, F. (2007). Benchmarking the Turkish apparel retail industry
through data envelopment analysis (DEA) and data visualization. EUROMA 2007,
Ankara, Turkey.
19
Ertek, G., Tunç, M.M., Kurtaraner, E., & Kebude, D. (2012). Insights into the Efficiencies of
On-Shore Wind Turbines: A Data-Centric Analysis, Accepted for INISTA 2012
Conference. July 2-4, 2012, Trabzon, Turkey (indexed in IEEE Electronic Library).
Feroz, E.H, & Kim, S. (2003). Financial Statement Analysis: A Data Envelopment Analysis
Approach. Journal of the Operational Research Society, 54, 48-58.
Galagedera, D. U. A., & Silvapulle, P. (2002). Australian mutual fund performance appraisal
using data envelopment analysis. Journal of Managerial Finance, 28(9), 60–73.
Han J. & Kamber M. (2006) Data mining: concepts and techniques, Morgan Kaufmann
Publishers.
Hollingsworth, B., & Smith, P. (2003). Use of Ratios in Data Envelopment Analysis. Applied
Economics Letters, 10, 733-735.
ISE. Istanbul Stock Exchange. http://www.ise.org.
ISO (2011). Istanbul Sanayi Odası Dergisi, August 2011, issue:545, ISSN 1307-6418. Istanbul
Chamber of Industry, Istanbul, Turkey.
Jayanthi S., Kocha B. & Sinha, K.K. (1999). Competitive Analysis of Manufacturing Plants: An
application to the US processed food industry. European Journal of Operational
Research, 118, 217-234.
Karsak, E.E., & İşcan, F. (2000). Çimento Sektöründe Göreli Faaliyet Performanslarının
Ağırlık Kısıtlamaları ve Çapraz Etkinlik Kullanılarak Veri Zarflama Analizi ile
Değerlendirilmesi.. Endüstri Mühendisliği Dergisi, 11(3), 2-10.
Keramidou, I., Mimis A. & Pappa, E. (2011) Estimating Technical and Scale Efficiency of
Meat Products Industry: The Greek Case. Journal of Applied Sciences 11(6), 971-
979.
Kumar M. & Basu P., (2008). Perspectives of productivity growth in Indian food industry: a
data envelopment analysis. International Journal of Productivity and Performance
Management, 57(7), 503 – 522.
Miner3D. http://www.miner3d.com/index.php.
Omniscrope. http://www.visokio.com/omniscope/.
Pink, G.H., Holmes, G.M., D’Alpe, C., Strunk, L.A., McGee P. & Slifkin, R.T. (2006) Financial
Indicators for Critical Access Hospitals. The Journal of Rural Health, 22(3), 229-236.
Samoilenko, S., & Osei-Bryson, K.M. (2008). Increasing the Discriminatory Power of DEA in
the presence of the sample heterogeneity with cluster analysis and decision trees. Expert
Systems with Applications, 34, 1568-1581.
Sharma, K.R., Leung, P. & Zaleski, H.M. (1997) Productive Efficiency of the Swine Industry in
Hawaii: Stochastic Frontier vs. Data Envelopment Analysis. Journal of Productivity
Analysis, 8, 447-459.
20
Sharma, M.J., & Jin, Y.S. (2011). Self Organizing Maps Infusion with Data Envelopment
Analysis. Retrieved on 24.05.2012, Available under:
http://cdn.intechweb.org/pdfs/13293.pdf
Ulucan, A. (2002). ISO500 Şirketlerinin Etkinliklerinin Ölçülmesinde Veri Zarflama analizi
Yaklaşımı: Farklı Girdi Çıktı Bileşenleri ve Ölçeğe göre Getiri Yaklaşımları ile
Değerlendirmeler. Ankara Üniversitesi SBF Dergisi, 57(2), 185-202.
Ulus, F., Kose, O., Ertek, G., & Sen, S. (2006). Financial benchmarking of transportation
companies in the New York Stock Exchange (NYSE) through data envelopment analysis
(DEA) and visualization. 4th International Logistics and Supply Chain Congress, İzmir,
Turkey.
Weill, L.( 2004). Measuring cost efficiency in European banking: a comparison of frontier
techniques. Journal of Productivity Analysis, 21, 133–152.
Wen, H.J., Lim, B., & Huang, H.L. (2003). Measuring e-commerce efficiency: a data
envelopment analysis (DEA) approach. Industrial Management & Data Systems, 103(9),
703-710.
Wongchai, A., Tai C.F., & Peng, K.C. (2011) An Application of Super-efficiency and Tobit
Method
for Financial Efficiency Analysis of Food Industrial Companies in Taiwan. Emergency
Management and Management Sciences (ICEMMS), 2nd IEEE International
Conference.
Yang, Z. (2010). Cross System Bank Branch Evaluation Using Clustering and Data
Envelopment Analysis. Lecture Notes in Computer Science (LNCS), 6215, 238-245.
Yeh, Q.J. (1996). The Application of Data Envelopment Analysis in Conjunction with
Financial Ratios for Bank Performance Evaluation. The Journal of Operational Research
Society, 47(8), 980-988.
yWorks. http://www.yworks.com/en/index.html.
Zhou, Z., Huang, Y., & Hsu, M.K. (2008). Using Data Envelopment Analysis to Evaluate
Efficiency: An Explanatory Study of the Chinese Hotel Industry. Journal of Quality
Assurance in Hospitality & Tourism, 9(3), 240-256.
KEY TERMS & DEFINITIONS
Financial Benchmarking: The benchmarking of companies using financial data.
Data Envelopment Analysis (DEA): A non-parametric analytical method for benchmarking a
group of entities.
Information Visualization: The field of computer science that works with the visualization of
large-scale complex data for discovering new useful knowledge.
21
Financial Ratios: Ratios used in finance; they are computed as the ratio of two financial
metrics.
K-Means Clustering: An unsupervised machine learning / data mining method for clustering
a set of entities into clusters of similar entities.

More Related Content

PDF
LeanCor Consulting Webinar: How to Deploy Continuous Improvement in the Wareh...
PDF
Value Stream Mapping in Office and Service
PPT
Operational risk management (2)
PPTX
PPTX
Enterprise Risk Management in Healthcare Organisations “Going Beyond Patient ...
PPTX
Vendor Management
PDF
Operational Excellence Roadmap
PPTX
AIR ARABIA Mini Case Study
LeanCor Consulting Webinar: How to Deploy Continuous Improvement in the Wareh...
Value Stream Mapping in Office and Service
Operational risk management (2)
Enterprise Risk Management in Healthcare Organisations “Going Beyond Patient ...
Vendor Management
Operational Excellence Roadmap
AIR ARABIA Mini Case Study

What's hot (20)

PDF
Operational Excellence model
PPT
Risk Management Fundamentals
PDF
Operational Excellence
PPT
محاضرة توعية 9001 عربى
PDF
Capa A Five Step Action Plan
PPTX
المحور الأول ..مبادئ وأساسيات الجودة
PPT
ERM Presentation
PDF
Enterprise Risk Management - Aligning Risk with Strategy and Performance
PPTX
Mmogle
DOC
Enterprise data architect performance appraisal
PPTX
Operations_Excellence_Presentation_Promotional
PDF
Quality 1
PDF
New 7 QC Tools Module 5 | Prioritization matrices
PPT
Yellow Belt Training
PPT
Introduction to Risk Management
PDF
3M Scotchcast Flexible & Trailing Power Cable Splice Kits 82-F
PDF
Risk Appetite
PDF
Risk and Control Self Assessment - IRM India Affiliate
PDF
Risk Management Process And Procedures PowerPoint Presentation Slides
PPTX
MRO PROCUREMENT: 3 KEYS TO MANAGING THE COMPLEXITY
Operational Excellence model
Risk Management Fundamentals
Operational Excellence
محاضرة توعية 9001 عربى
Capa A Five Step Action Plan
المحور الأول ..مبادئ وأساسيات الجودة
ERM Presentation
Enterprise Risk Management - Aligning Risk with Strategy and Performance
Mmogle
Enterprise data architect performance appraisal
Operations_Excellence_Presentation_Promotional
Quality 1
New 7 QC Tools Module 5 | Prioritization matrices
Yellow Belt Training
Introduction to Risk Management
3M Scotchcast Flexible & Trailing Power Cable Splice Kits 82-F
Risk Appetite
Risk and Control Self Assessment - IRM India Affiliate
Risk Management Process And Procedures PowerPoint Presentation Slides
MRO PROCUREMENT: 3 KEYS TO MANAGING THE COMPLEXITY
Ad

Viewers also liked (7)

PDF
Optimizing the electric charge station network of EŞARJ
PDF
Text Mining with RapidMiner
PDF
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
PDF
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
PDF
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
PDF
Modelling the supply chain perception gaps
PDF
Demonstrating Warehousing Concepts Through Interactive Animations
Optimizing the electric charge station network of EŞARJ
Text Mining with RapidMiner
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Modelling the supply chain perception gaps
Demonstrating Warehousing Concepts Through Interactive Animations
Ad

Similar to Industrial Benchmarking through Information Visualization and Data Envelopment Analysis: A New Framework (20)

PDF
The effect of firm specific characteristics on segment information disclosure...
PPTX
VALUE ADDED REPORTING
 
PDF
Eps and eva forecasting ability for industrial jordanian companies
PDF
Idiosyncratic Effect of Corporate Solvency Management Strategies on Corporate...
PDF
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
PDF
Paper 4 (ali mansouri)
PDF
222-424-1-PB.pdf
PDF
The ability of accounting information systems to support profitability and gr...
PDF
Employee-Productivity-and-Service-Quality-Enhancement-Using-ERP-With-Knowledg...
PDF
The Effect of Information Technology and Total Quality Management on Organiza...
PDF
The Impact of Entrepreneurial Orientation on Business Performance: A Study of...
PDF
DEA-Based Benchmarking Models In Supply Chain Management: An Application-Orie...
PDF
A hybrid approach to achieve oa
DOCX
PDF
The Strategic Of It Capability to Support Innovativeness for Firm Performance
PDF
Critical-success-factors-for-business-intelligence-implementation-in-an-enter...
PPTX
New Microsoft PowerPoint Presentation.pptx
PDF
Antecedents to effective sales and operations planning. swaim, maloni, bow...
PDF
JGSI 3-1-15, Shah Nawaz, P 5-14
PDF
The factors determining the profitability of low cost airlines, march 2020
The effect of firm specific characteristics on segment information disclosure...
VALUE ADDED REPORTING
 
Eps and eva forecasting ability for industrial jordanian companies
Idiosyncratic Effect of Corporate Solvency Management Strategies on Corporate...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Paper 4 (ali mansouri)
222-424-1-PB.pdf
The ability of accounting information systems to support profitability and gr...
Employee-Productivity-and-Service-Quality-Enhancement-Using-ERP-With-Knowledg...
The Effect of Information Technology and Total Quality Management on Organiza...
The Impact of Entrepreneurial Orientation on Business Performance: A Study of...
DEA-Based Benchmarking Models In Supply Chain Management: An Application-Orie...
A hybrid approach to achieve oa
The Strategic Of It Capability to Support Innovativeness for Firm Performance
Critical-success-factors-for-business-intelligence-implementation-in-an-enter...
New Microsoft PowerPoint Presentation.pptx
Antecedents to effective sales and operations planning. swaim, maloni, bow...
JGSI 3-1-15, Shah Nawaz, P 5-14
The factors determining the profitability of low cost airlines, march 2020

More from ertekg (20)

PDF
Rule-based expert systems for supporting university students
PDF
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
PDF
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
PDF
Visual and analytical mining of transactions data for production planning f...
PDF
A Tutorial On Crossdocking
PDF
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
PDF
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
PDF
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
PDF
A Framework for Visualizing Association Mining Results
PDF
Compiere kurulumu
PDF
Application of the Cutting Stock Problem to a Construction Company: A Case Study
PDF
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
PDF
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
PDF
An Open Source Java Code For Visualizing Supply Chain Problems
PDF
Dairy Logistics: A Tutorial
PDF
The Bullwhip Effect In Supply Chain Reflections After A Decade
PDF
A Taxonomy Of Logistics Innovations
PDF
Teaching Warehousing Concepts through Interactive Animations and 3-D Models
PDF
Impact of Cross Aisles in a Rectangular Warehouse: A Computational Study
PDF
Innovation in Product Form And Function: Customer Perception Of Their Value
Rule-based expert systems for supporting university students
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Visual and analytical mining of transactions data for production planning f...
A Tutorial On Crossdocking
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
A Framework for Visualizing Association Mining Results
Compiere kurulumu
Application of the Cutting Stock Problem to a Construction Company: A Case Study
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
An Open Source Java Code For Visualizing Supply Chain Problems
Dairy Logistics: A Tutorial
The Bullwhip Effect In Supply Chain Reflections After A Decade
A Taxonomy Of Logistics Innovations
Teaching Warehousing Concepts through Interactive Animations and 3-D Models
Impact of Cross Aisles in a Rectangular Warehouse: A Computational Study
Innovation in Product Form And Function: Customer Perception Of Their Value

Recently uploaded (20)

PDF
How to Get Funding for Your Trucking Business
PPT
Chapter four Project-Preparation material
PPTX
New Microsoft PowerPoint Presentation - Copy.pptx
PDF
Deliverable file - Regulatory guideline analysis.pdf
PPT
Data mining for business intelligence ch04 sharda
PDF
WRN_Investor_Presentation_August 2025.pdf
PDF
Business model innovation report 2022.pdf
PDF
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
PDF
Roadmap Map-digital Banking feature MB,IB,AB
PDF
Ôn tập tiếng anh trong kinh doanh nâng cao
PPTX
ICG2025_ICG 6th steering committee 30-8-24.pptx
PDF
COST SHEET- Tender and Quotation unit 2.pdf
PDF
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
PDF
Traveri Digital Marketing Seminar 2025 by Corey and Jessica Perlman
PDF
IFRS Notes in your pocket for study all the time
PPTX
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
PDF
Reconciliation AND MEMORANDUM RECONCILATION
PDF
Laughter Yoga Basic Learning Workshop Manual
PPTX
Probability Distribution, binomial distribution, poisson distribution
PDF
Nidhal Samdaie CV - International Business Consultant
How to Get Funding for Your Trucking Business
Chapter four Project-Preparation material
New Microsoft PowerPoint Presentation - Copy.pptx
Deliverable file - Regulatory guideline analysis.pdf
Data mining for business intelligence ch04 sharda
WRN_Investor_Presentation_August 2025.pdf
Business model innovation report 2022.pdf
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
Roadmap Map-digital Banking feature MB,IB,AB
Ôn tập tiếng anh trong kinh doanh nâng cao
ICG2025_ICG 6th steering committee 30-8-24.pptx
COST SHEET- Tender and Quotation unit 2.pdf
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
Traveri Digital Marketing Seminar 2025 by Corey and Jessica Perlman
IFRS Notes in your pocket for study all the time
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
Reconciliation AND MEMORANDUM RECONCILATION
Laughter Yoga Basic Learning Workshop Manual
Probability Distribution, binomial distribution, poisson distribution
Nidhal Samdaie CV - International Business Consultant

Industrial Benchmarking through Information Visualization and Data Envelopment Analysis: A New Framework

  • 1. 1 Ertek G., Sevinç, M., Ulus, F., Köse, Ö., Şahin, G. (2014) “Industrial Benchmarking through Information Visualization and Data Envelopment Analysis: A New Framework”, in I. Osman, A.L. Anouze and A. Emrouznejad (eds.) Strategic Performance Management and Measurement Using Data Envelopment Analysis, DOI: 10.4018/978-1-4666-4474-8, ISBN13: 9781466644748, IGI_Global (In Press). Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as above. You can download this final draft from the following websites: http://research.sabanciuniv.edu http://ertekprojects.com/gurdal-ertek-publications/ Industrial Benchmarking through Information Visualization and Data Envelopment Analysis: A New Framework Gürdal Ertek, Mete Sevinç, Firdevs Ulus, Özlem Köse, Güvenç Şahin Sabanci University, Manufacturing Systems/Industrial Engineering Program Orhanli, Tuzla, Istanbul, 34956, Turkey ABSTRACT We present a benchmarking study on the companies in the Turkish food industry based on their financial data. Our aim is to develop a comprehensive benchmarking framework using Data Envelopment Analysis (DEA) and information visualization. Besides DEA, a traditional tool for financial benchmarking based on financial ratios is also incorporated. The consistency/inconsistency between the two methodologies is investigated using information visualization tools. In addition, k-means clustering, a fundamental method from machine learning, is applied to understand the relationship between k-means clustering and DEA.
  • 2. 2 Finally, other relevant data, apart from the financial data, is introduced to the analysis through information visualization to discover new insights into DEA results. This study uses information visualization to both explore and reveal the relationships between the different methodologies of financial benchmarking and gain practical insights on the Turkish food industry. The results show that the framework developed is a comprehensive and effective strategy for benchmarking; it can be applied in other industries as well. As a result, our study contributes to the DEA benchmarking literature with a novel methodology that integrates the various benchmarking methods from the fields of operations research, machine learning, and financial analysis. Keywords: Financial Benchmarking; Data Envelopment Analysis (DEA); Information Visualization; Financial Ratios; K-Means Clustering; Tile Visualization; Graph Visualization; Turkish Food Industry; Istanbul Chamber of Commerce (ISO). INTRODUCTION Benchmarking enables companies to see their positions relative to their competitors in order to explore the opportunities to improve their market position. The response to how benchmarking deals with this problem is disguised in its definition: “Benchmarking is the process of continuously measuring and comparing one’s business processes against comparable processes in leading organizations to obtain information that will help the organization identify and implement improvements” (Andersen & Jordan, 1998). Various fields of science including computer science, management and operations research, apply alternative methods of benchmarking, such as information visualization (e.g. Self Organizing Maps, SOM), financial ratios (provided that the data is appropriate for financial benchmarking), and analytical methods (e.g. Data Envelopment Analysis, DEA). This study proposes a framework that integrates a multitude of methods for the purpose of comprehensive benchmarking. As an analytical approach, DEA is employed due to its eligibility among all other methods for its advantage of being a nonparametric technique requiring fewer assumptions (Weill, 2004). Weill (2004) shows that DEA is consistent with standard measures of performance such as the Stochastic Frontier Approach (SFA) and the Distribution-Free Approach (DFA). In addition to its consistency, DEA can be employed to investigate the reasons for a company’s inefficiency while showing how much change is needed to achieve efficiency (Galagedera & Silvapulle, 2002). We combine and compare the DEA results with the insights obtained by other methodologies, including financial ratios, and the results of k-means clustering, as well as other relevant data. Information visualization schemes are extensively and primarily used in
  • 3. 3 a second phase of the analysis in order to bring together the outcomes from all mentioned methods and make them more comprehensive. The steps of the analysis are formalized within an integrated framework. Via this framework, a researcher can combine different benchmarking approaches and data mining techniques to obtain useful insights, as well as to compare the results of the different approaches against each other. The benchmarking data used in this study is derived from the 2010 Istanbul Chamber of Industry (ISO) List that ranks the top 500 Turkish companies of different industries, and provides financial and other relevant data regarding these companies. In this study, the food industry is selected because it is one of the most developed industries in Turkey and a significant number of companies from this industry are listed in the ISO 500 list. The complete ISO 500 dataset (for 2000), that includes all the industries, has been subject of earlier research (Ulucan, 2002), and we have selected our inputs and outputs in coherence with this earlier study. While we have not been able to find the efficiency analysis of food industry in Turkey, there have been studies focusing on other industries, such as insurance (Çiftçi, 2004), cement (Karsak and Iscan, 2000), and automative (Bakırcı, 2006). Meanwhile, there exists literature that uses DEA for the benchmarking of food industry or its subsectors in other countries such as agriculture industry in Scotland (Barnes, 2006), food manufacturing plants in the USA (Jayanthi, 1999), productivity growth of Indian food industry (Kumar and Basu, 2008), swine industry in Hawaii (Sharma, Leung and Zaleski, 1997), meat products industry in Greece (Keramidou, Mimis and Pappa, 2011), strawberry greenhouses in Iran (Banaeian, Omid and Ahmadi, 2011), and food industrial companies in Taiwan (Wongchai, Tai and Peng, 2011). An interesting related study by Dadura and Lee (2011) uses DEA for benchmarking the innovativeness of Taiwanese food companies. Turkish Food Industry Dataset Sixty-three companies in the Turkish food industry from the 2010 ISO-500 list are included in the study. Each company is presented by its symbol in the Istanbul Stock Exchange (ISE); for the unlisted companies, symbols, which evoke the name of the company, are given. Based on the main food groups, the information regarding the subsector(s) that the companies are involved are appended to the original dataset. These subsectors are organized as Dairy, Flour, Beverage, Sugar, Sea Products, Meat, Poultry, Snack, Oil, Hazelnut, Canned Products, Animal Feed, and other. Data for companies such as Unilever Turkey, a large player in the food industry, was omitted due to missing data. Similarly, Arbel, a leading agricultural producer, was dropped out of calculations due to negative value for one of the inputs. The financial data used in DEA are the Equity, Net Asset, Gross Added Revenue, Profit (from operations) and Export values of each company. Apart from the financial ones, some other
  • 4. 4 useful data such as (number of) Employees and Chamber of Commerce of each Company, are included in our visual analysis. These variables can be used for the evaluation of companies in any industry and this is exactly what we intend. The main contribution of the paper is not the analysis of the food industry in Turkey, but the development and the demonstration of a new DEA based analysis framework. Thus, we intended our inputs and outputs to be such that the framework can be applied or adapted to any industry, including food industry, as well as others, such as automobile, machinery, logistics, construction, and many others. The selected inputs and outputs are the drivers and enablers, as well as the basic constructs of the framework. The selected inputs and outputs are among the most basic and popular financial and operational attributes used for benchmarking in any industry, and this is precisely the reason why the Istanbul Chamber of Commerce (ISO) selects and publishes the values for these attributes from among many others. Thus, the selection of the attributes by the Istanbul Chamber of Commerce is a reliable validation for our selection. Furthermore, these attributes are also crucial for the calculation of the financial ratios, which are essential to the multi-perspective nature of our framework. From among the selected attributes, Employees, Equity and Net Assets have been selected as inputs since these are operational and financial resources a company uses in conducting its business. On the other hand, Gross Added Revenue and Profit have been selected as outputs, since these are the financial performance measures for assessing the performance of a company. METHODOLOGY Data Envelopment Analysis (DEA) DEA is an approach used to measure efficiency (or productivity) of an activity or an entity, denoted as a Decision Making Unit (DMU), (Cooper et al., 2006). Typically, a multitude of inputs affect a multitude of outputs for a DMU. It is a challenge to assess how much weight an individual input has with respect to the determination of outputs. DEA does not make any assumptions to choose these weights; instead, the weights are directly derived from the data. The DEA model itself chooses the “best” weights, which have the most adequate properties for the DMU benchmarking. Thus, these weights are not given, but they are defined as variables. Since the weights of the inputs and outputs are determined from the data, there is no need for the units to be congruent. In other words, the units can be of different types (e.g. number of workers, number of machines, dollars). The DMUs with the highest rank of efficiency levels together determine the “efficient frontier” while the efficiencies of other (inefficient) DMUs are calculated according to this efficient frontier. The efficient DMUs that
  • 5. 5 reside on the efficient frontier and have an efficiency score of 1 are either same as or superior to the inefficient DMUs (that have efficiency less than 1) in all productivity dimensions; however, they are inferior to each other in at least one dimension. Each inefficient DMU has its reference set which consists of the efficient DMUs that are used to calculate the efficiency of inefficient DMU. The objective of the input-oriented model is to minimize the input by concurrently keeping the output at least at the current level. The objective of the output-oriented DEA models is to maximize the output level by making use of at most the same amount of inputs. In our analysis, output-oriented BCC model has been applied. The BCC model by Banker, Charnes & Cooper (1984) is a well-known DEA model; it has almost all of the characteristics of the earlier CCR model. However, unlike the CCR Model, the BCC Model enables the problem to be variable-returns-to-scale. This property is assured through a convexity constraint, which makes the production frontier to be spanned by the DMU a convex hull. As a result, we employ the BCC Model as it can accommodate variable- returns-to-scale without restriction to constant-returns-to-scale; when all the inputs increase by a constant, the output levels do not change. The input and output determination is one of the challenging steps in performing a DEA study. The dataset for this case study includes financial data, and unfortunately, a standard scheme for the input and output selection does not exist in the financial benchmarking literature. Therefore our choice of input and output measures are based on the idea that achieving more of an output with less of an input is always preferred in DEA. The inputs are selected as Employees, Equity and Net Assets; outputs are selected as Gross Added Revenue and Profit (from operations) In our study, DEA computations have been carried out using the Smart DEA Solver software (Akcay et al., 2012), which generates the DEA results in a tabular structure, making it easy to analyze the results using readily available data mining and information visualization software. Financial Ratios Can we determine a company’s success by analyzing its financial data only? Although one possible way is to examine the growing revenue or positive feedback from customers, one needs more concrete ways to both measure performance and make comparisons within a set of companies. One of these methods is benchmarking companies through the comparison of financial ratios. Financial ratios are ratios calculated based on financial data extracted from the balance sheet and/or the income statement (Bragg, 2006). Financial ratios are commonly used in comparing companies with respect to the standardized industry level. They are also used in evaluating a company in terms of its past and current performances.
  • 6. 6 In general, financial ratios can be categorized in five groups:  Group I measures how efficient a company uses its assets to make profit. Some ratio examples for Group 1 are Gross Margin, Efficiency Ratio, and Operating Leverage.  Group II assesses the utilization of assets to generate profits. Some examples of ratios in Group 2 are Return on Net Assets, and Return on Equity.  Group III determines the company’s leverage level. Debt to equity ratio and the interest coverage ratio fit are included in this group.  Group IV measures the liquidity level of the company. A popular ratio used for this purpose is the Current Ratio.  Group V is used for stock valuation; examples include price/earnings ratio, and price/sales ratio. It is easy to find several case studies where financial ratios are used as means of benchmarking. A study on Motorola Company (Collier, H.W., 2004) uses financial ratio analysis to study the operating characteristics of Motorola by comparing the company to two other. Another example considers 20 different financial ratios in evaluating the Critical Access Hospitals (CAH) in Baltimore, Maryland (Pink et al., 2006), using health care information system (Flex Monitoring Team, 2005). The authors discover that CAHs without long-term care make more profit, have fewer liabilities, are more liquid and have higher beneficial use of beds when compared with the long-term care CAHs. Our study uses the turnover ratio which is the ratio of sales revenue to net assets. The justification for this preference is two-fold. First of all, the turnover ratio helps to make logical comparisons among companies in terms of their efficiencies. It is the measure of the overall sales for each dollar of asset that the company has. If the turnover ratio of the company is high, it shows that they manufacture fast and put their products out on the market quickly. Secondly, the turnover ratio data is widely available, or easily computable from widely available sales revenue and net assets data. A review on the use of financial ratios for measuring efficiency yields several results. The first group of studies uses both ratio analysis and DEA by comparing and/or combining them. Biener and Eling (2011) use DEA in order to evaluate the performance of micro-insurance programs, and also financial techniques such as bootstrapping and truncated regression analysis, and they compare the two results. Feroz et al. (2003) claim that DEA can provide additional benefits for the analyst besides ratio analysis. Hollingsworth and Smith (2003) explain why sometimes DEA results are not sufficient while ratio analysis might yield better results, and they also discuss why an analyst has to use BCC model in DEA when taking financial ratios into the model as inputs. The second group of papers uses financial ratios as inputs and/or outputs in their DEA models. Yeh (1996) also constructs a DEA model using
  • 7. 7 financial ratios as inputs and evaluates the performances of Taiwanese banks. Wen, Lim and Huang (2003) evaluate a DEA model to measure the efficiency of e-commerce business models, again using financial ratios as outputs in the DEA model. K-Means Clustering One of the most popular data mining techniques, k-means clustering, is an algorithm to cluster entities into k partitions based on their attributes (Han and Kamber, 2006). This technique starts with a set of entities and their attributes. It is initiated by defining a specified number of clusters randomly or by using some heuristic data to find the centroid (mean point) of each cluster. Then, the algorithm builds new clusters in which all entities belong to the nearest centroid. The “distance” here is the aggregate distance between the cluster centroid and the actual attribute values of entities. Subsequently, the algorithm finds the centroids of the new clusters and repeats itself until the total number of moves of all data objects from a cluster to another is zero. At the end of an epoch, which consists of N iterations (where N is the number of elements in the dataset), the algorithm may end up with fewer clusters than the number specified at the beginning. In this study, the attribute set is selected based on our input-output measures for the DEA study since we aim to analyze the relationship between these two approaches. The algorithm is initially applied with 20 clusters while it finally ends up with five. In the related literature, there are several studies that combine DEA with clustering methodologies. Sharma and Jin (2011) cluster the DEA results in their paper using self organizing maps. Yang (2010) clusters DEA results in order to perform cross-system evaluation of 982 branches of a Canadian bank. Zhou, Huang and Hsu (2008) integrate the DEA results as inputs into k-means clustering to evaluate the efficiencies of hotel services in 31 regions of China. Samoilenko and Osei-Bryson (2008) perform a clustering of DEA results and also utilize decision trees in order to visualize the resulting clusters. Data and Information Visualization Data visualization is an indispensible method of data analysis, which is highly useful for detecting outliers, finding patterns that may not be seen by some standard methods of data mining, and accordingly, asserting new hypotheses. The biggest advantage of data visualization is that it enables even the novice analysts to interact with and learn from data due to the advanced cognitive capabilities of the visual brain. One of the well-known references in data visualization literature is Chambers et al. (1983), which includes plenty of earlier statistics-related visualization techniques, such as quantile plots, histograms, box plots, symmetry plots, and scatter plots. Nevertheless, most of these techniques could not
  • 8. 8 pass beyond supporting statistical analysis; there was not effective usage of colors and patterns, and it was very difficult to visualize larger datasets in most cases. In 1990’s, with the developments in the field of computer science, the graphical methods of 1980’s have elevated into a new level, being referred to as information visualization. The idea in information visualization is to identify undiscovered patterns, come up with actionable insight and constitute maximum benefit out of data, particularly when the data is large-scale and complex. The new era in information visualization is built on ideas from data mining, statistics, and computer graphics, and it compensates for the shortcomings of graphical methods of 1980’s. Well-known references on information visualization include Spence (2001), Soukup & Davidson (2002), Keim (2002), and Hoffman and Grinstein (2002). This study employs three types of visualization tools. The tool known as “colored scatter plot”, developed in the 1980’s, is also referred to as “starfield display” in the literature. Even though this tool is very simple, it can facilitate comprehension of the data to a maximum extent. We use the Miner3D software (Miner3D) to perform the starfield display. In addition to the starfield display, “the tile visualization” is applied to the dataset, which makes it possible to group the data according to some key characteristics and obtain further insights about it. The Visokio Omniscope software is used for the tile visualization (Visokio Omniscope) tool. Finally, a “graph structure” is implemented to identify relationships between the DMUs, according to the DEA results. Via the implementation of this visualization tool, some additional analyses, which the study is unable to carry out by using the other two tools, are performed. The yEd - Graph Editor (yWorks), a software drawing graphs by using several layouts, is employed to obtain the graphs. PROPOSED FRAMEWORK The proposed framework is illustrated in Figure 1. The database that contains the complete data is queried to select the data for DEA and clustering. Clustering, financial ratio analysis and DEA are independently conducted, and the results are obtained in tabular form. Then different visualizations (VIS 1a, 1b, 2, 3, and 4) are constructed using the various combinations of the results and other data in the database in order to derive deeper insight about the benchmarked DMUs and the domain. Earlier studies in literature have extensively used information visualization for the analysis of DEA results. Ulus et al. (2006) analyze the transportation companies traded in NYSE (New York Stock Exchange) by visualizing the efficiency scores against various attributes of the companies, implementing VIS1a in Figure 1. Ertek et al. (2007) analyze the apparel retail industry in Turkey, introducing VIS1b in Figure 1 in addition to VIS1a. Ertek et al. (2012) introduce the graph visualization of the reference sets, corresponding to VIS4 in Figure 1.
  • 9. 9 Osther studies that employ visualization for understanding DEA results are reviewed in Ertek et al. (2012). To perform the complete analyses, we have followed the instructions as outlined in the framework, and the resulting visualizations are categorized into four main types. First three types of visualizations are displaying the relationship between the efficiency scores of DEA and (i) other relevant data taken from the company database (VIS1a and VIS1b), (ii) the turnover ratios of each company (VIS2), and (iii) the cluster indices by k-means clustering (VIS3). Finally, in the fourth type of visualization, reference sets of each DMU, which are found as a result of DEA analysis, are combined with company related data and interpreted accordingly (VIS4). We now present the results based on four different visualization methods that rely on the results of our three benchmarking techniques in the subsequent sections.
  • 10. 10 Figure 1. The proposed analysis framework. DEA Results vs. Relevant Data from ISO 500 (VIS1) With the DEA analysis, it is usually expected that the efficiency scores should increase as the output values increase and input values decrease. However, since DEA does not take only one output or one input value into consideration, this expectation does not have to hold all the time. In order to see the relationship between the input/output values and the efficiency
  • 11. 11 scores, we use starfield visualization. Figure 2 shows the relationship between the sales revenue and the efficiency values. Figure 2. Efficiency vs. SalesRevenueM; red colored circles indicate the SNACK subsector (VIS1a). In Figure 2 (VIS1a), the X-axis denotes to Sales Revenue value in millions, whereas the Y-axis denotes the Efficiency scores. The circles colored in red are denoting the companies in the Snack subsector. Figure 2 shows that the DMUs having a sales revenue figure larger than 1000 million TL are all inefficient. Companies OZYLMZ, KEREVIT, PROGIDA, NATURA, KESKINK form a cluster on the upper left side of the graph having low Sales Revenue (lower than 400), and the only efficient company is NATURA. As one can see in the above figure, the companies NATURA and ULKERB in Snack industry are located on the top area and are efficient with notably lower Sales Revenue. Another insight reveals that company ETI has the highest Sales Revenue with 692 million TL, and it is not efficient, indeed. This shows that having a higher sales revenue does not necessarily guarantee efficiency. On the contrary, one can observe that those companies with relatively higher sales revenues are generally inefficient in the Snack subsector. KENT and ULKERC are represented as two red circles on the lower side of the chart. These two companies have high sales revenues of 464 million and 674 million, respectively, but they are the two least efficient companies among the Snack companies from the ISO 500 list.
  • 12. 12 Figure 3. Efficiency vs. Employees; red colored circles indicate the DAIRY subsector (VIS1a). In Figure 3 (VIS1a), the X-axis denotes to the number of Employees in a company, whereas the Y-axis denotes to the Efficiency scores. The circles colored in red denote the companies in Dairy industry. The two efficient companies are SUTAS and NATURA. We can note that the number of employees of a Dairy company, is not an indicator of its efficiency or inefficiency. While SUTAS has the highest number of employees among all Dairy companies, and NATURA has low employee value of 322, both of them are efficient companies. The two companies with more than 3750 employees are inefficient; CAY and TRSEKER are both state-owned companies. Following CAY and TRSEKER, the two companies with the highest number of employees are BANVIT and SUTAS, and they are both efficient. In Figure 3, we can also remark that SUTAS is the most efficient company with the largest number of employees in the Dairy subsector.
  • 13. 13 Figure 4. Rank2010 vs. Rank2009; lighter colors denote higher efficiency values. In Figure 4 (VIS1a), the X-axis denotes the rank of the company in the ISO 500 list in 2009, whereas the Y-axis denotes the rank of the company in the ISO 500 list in 2010. The shades of grey indicate efficiency values; lighter coloring denotes higher efficiency while the white circles represent the most efficient companies. ORKA, ELITA, AKOVA, KEREVIT, TORUN, MATLI, TURYAG, NAMET, BESLER, lined up on the right hand side of the chart were not in the list in 2009, and they made it to the list in 2010. The only efficient company among those is KEREVIT, represented in the graph with white. Figure 5 (VIS1b) illustrates the tile visualization of ISO 500 Food Companies, grouped according to Provincial Chamber of Industry. Each company is represented by a rectangle, and the size of the rectangle denotes the net asset value of the company. This visualization enables the analyst to focus on the patterns within a city to compare patterns of different cities. For example, comparing the companies in Istanbul Chamber of Commerce to others, we can observe that the total net assets is very large in Istanbul, and there is only a single company, ULKERC, that has very low efficiency score. Within Istanbul, EFES is efficient, while having a great amount of net assets. Ülker Company Group, the largest food group in Turkey, operates in Snack industry with two brands, ULKERC and ULKERB, which produce chocolate and biscuits respectively, and the efficiency gap between the two brands is excessive.
  • 14. 14 Figure 5. Grouping by Chamber, color denotes efficiency whereas size denotes net assets Analysis of DEA Results vs. Financial Ratios In Figure 6 (VIS2), the X-axis denotes the turnover ratio, while the Y-axis denotes the Efficiency Score. The circles colored in red are in Hazelnut (and its byproducts) subsector. Turnover ratio is obtained by dividing the Net Assets of a company by its Sales Revenue. Turnover ratio is a success indicator as to how efficient a business is in terms of generating sales revenue by using its net assets. We can observe that the two outlier companies PROGIDA and OZYLMZ are not only efficient in the Hazelnut subsector , but are also the two efficient companies with the highest turnover ratios (87% and %34 respectively) within the entire dataset. Financial ratios and DEA results produced consistent results for these two companies. However, this may not always hold true, and considering the input and output relations multi-dimensionally using DEA might yield significantly different results. For instance, the two companies with negative turnover ratios on the right side of the chart are KEREVIT and NATURA. This particular outcome shows that DEA results are not by
  • 15. 15 themselves sufficient for the analysis of efficiency, and financial ratios should also be considered. Yet, a multi-dimensional benchmarking method such as DEA is necessary, in addition to ratio analysis. For example, even though SEKERP has higher turnover ratio, it’s not very efficient according to DEA results. Figure 6. Efficiency vs. Turnover; red color denotes companies in the hazelnut subsector (VIS 2). Figure 7. Efficiency vs. ClusterID; lighter colors denote higher efficiency values.
  • 16. 16 Analysis of DEA Results vs. k-Means Clustering Results In Figure 7 (VIS3), the X-axis denotes the ClusterID, while the Y-axis and the coloring denotes Efficiency scores. This chart portrays the results of DEA together with the results of k-means clustering. K-means clustering was applied with Miner3D software using 20 initial clusters, and running the algorithm for 200 iterations. The variables used in clustering are the five inputs and outputs used in DEA model. The clustering algorithm ended with five clusters, three of which (clusters 4, 8, 19) consist of only a single company. Clustering considers the magnitude of the variable values, ignoring the input-output relationships. Therefore, it is expected that clustering results may not be consistent with DEA results. This expectation is observed in our case study; clusters 2 and 3, which contain many companies, have a high level of variability in efficiency values within the clusters. For example, in cluster 2, where 15 food industry companies with similar magnitudes are clustered, one can observe very inefficient companies such as KENT and ULKERC, besides a very efficient company such as BANVT. Similarly, cluster 3 contains the very inefficient POYRAZ, as well as the efficient NATURA. Figure 8 clearly shows that DEA analysis brings a much better understanding upon the clustering results. Meanwhile, clustering helps us interpret the results of DEA even better. For example, a known property (and also a typical shortfall) of DEA is that it might declare outlier observations as efficient, due to their extreme values in one or two dimensions. In Figure 8, EFES, an efficient company, is seen as the single member of cluster 19, which suggests that the efficiency of EFES might be due to its being an outlier in magnitude in comparison to the other companies. Similarly, CAY and TRSEKER are obviously outlier companies, which appear as the single members of two different clusters. These two companies are inefficient, suggesting that their low scores might be due to the extremely high values of some of its inputs. Analysis of Reference Sets According to the DEA Results Figure 8 (VIS4) presents the visualization of the reference sets for the Snack companies within the food industry; each node represents a company, and the darker colors denote larger number of employees in the company. The arcs emanating from node i to node set J denote that J is the reference set for company i. The thickness of an arc denotes the weight for that reference. Efficient companies are those that have arcs emanating and terminating at themselves. One can observe two groups of companies in Figure 8. The first group has ETI, NATURA and ULKERB as the role model, and the second group takes TAM and ORKA as the role model. NATURA exhibits an efficient business model that should be followed by other companies, as it manages to be efficient with relatively low number of employees. The nodes with the same inbound and outbound arc signifies that these companies are efficient.
  • 17. 17 Figure 8. Visualization of Reference Sets; colors denote magnitude of the number of employees. Conclusions and Future Work In this chapter, we carried out a financial benchmarking of food companies in Turkey using DEA, and compare and combine the DEA results with the results of two other popular benchmarking/analysis methods, namely financial ratio analysis and k-means clustering. Three different information visualization techniques (starfield visualization, tile visualization and graph visualization) were implemented for visual comparisons and analysis. The study confirms the earlier results in literature in the sense that the results of various methodologies are not necessarily consistent with each other. Therefore, in order to perform comprehensive and fair benchmarking, one needs to employ multiple methods of benchmarking, and interpret the results of these methods together to get the correct results. However, it is difficult to choose the methodologies to employ and interpret them in an acceptable way. The integrated benchmarking framework presented in this chapter serves to this purpose. To the best of our knowledge, such a comprehensive integrated framework does not exist in the existing literature. Therefore, this study makes a methodological contribution to the benchmarking literature, besides presenting a benchmark of the Turkish food industry. This work can be followed with a crucial research area in which one can carry out formal statistical tests to prove or disprove the hypotheses that are claimed. Moreover, choosing more appropriate financial data, one can use a larger set of financial ratios to compare against the DEA results in order to obtain deeper insights and to compare the different methodologies integrated here.
  • 18. 18 References Akcay, A.E., Ertek, G., & Buyukozkan G., (2012). Analyzing the solutions of DEA through information visualization and data mining techniques: SmartDEA framework. Expert Systems with Applications, 39, 7763–7775. Andersen, B., & Jordan, P. (1998). Setting Up A Performance Benchmarking Network. Journal of Production Planning & Control: The Management of Operations, 9(1), 13-19. Bakırcı, F. (2006). Sektörel Bazda Bir Etkinlik Ölçümü: VZA ile bir analiz. İktisadi ve İdari Bilimler Dergisi, 20(2). Banker, R., D., Charnes, A. & Cooper, W. W. (1984). Some models for estimating technical and scale inefficiencies in data envelopment analysis. Journal of Management Science, 30, 1078–1092. Barnes, A.P. (2006). Does multi-functionality affect technical efficiency? A non-parametric analysis of the Scottish Dairy Industry. Journal of Environmental Management, 80, 287 – 294. Banaeian, N., Omid, M. & Ahmadi, H. (2011) Application of Data Envelopment Analysis to Evaluate Efficiency of Commercial Greenhouse Strawberry. Research Journal of Applied Sciences, Engineering and Technology 3(3), 185-193. Biener, C., & Eling, M. (2011). The Performance of Microinsurance Programs: A Data Envelopment Analysis. The journal of Risk and Insurance, 78(1), 83-115. Bragg, S.M. (2006). Financial Analysis: A Controller’s Guide,Wiley, 2nd Edition. Chambers, J. M., Cleveland, W. S., Kleiner, B., & Tukey, P. A. (1983). Graphical methods for data analysis. Pacific Grove, CA: Wadsworth & Brooks/Cole Publishing Company. Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making units. European Journal of Operational Research, 2(6), 429-444 Cooper, W. W., Seiford, L. M., & Tone, K. (2006). Introduction to data envelopment analysis and its uses. New York, NY: Springer. Çiftçi, H. (2004). Türk Sigorta Sektörünün Sorunları; DEA Analizi ile Türk Sigorta Şirketlerinin Etkinlik Düzeylerinin Belirlenmesi. Çukurova Üniversitesi Sosyal Bilimler Enstitüsü Dergisi, 13(1), 121-149. Dadura, A. M & Lee, T. R. (2011) Measuring the Innovation ability of Taiwan’s food industry using DEA. Innovation: The European Journal of Social Science Research 24(1)-2, 151-172. Ertek, G., Can, M. A., & Ulus, F. (2007). Benchmarking the Turkish apparel retail industry through data envelopment analysis (DEA) and data visualization. EUROMA 2007, Ankara, Turkey.
  • 19. 19 Ertek, G., Tunç, M.M., Kurtaraner, E., & Kebude, D. (2012). Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Analysis, Accepted for INISTA 2012 Conference. July 2-4, 2012, Trabzon, Turkey (indexed in IEEE Electronic Library). Feroz, E.H, & Kim, S. (2003). Financial Statement Analysis: A Data Envelopment Analysis Approach. Journal of the Operational Research Society, 54, 48-58. Galagedera, D. U. A., & Silvapulle, P. (2002). Australian mutual fund performance appraisal using data envelopment analysis. Journal of Managerial Finance, 28(9), 60–73. Han J. & Kamber M. (2006) Data mining: concepts and techniques, Morgan Kaufmann Publishers. Hollingsworth, B., & Smith, P. (2003). Use of Ratios in Data Envelopment Analysis. Applied Economics Letters, 10, 733-735. ISE. Istanbul Stock Exchange. http://www.ise.org. ISO (2011). Istanbul Sanayi Odası Dergisi, August 2011, issue:545, ISSN 1307-6418. Istanbul Chamber of Industry, Istanbul, Turkey. Jayanthi S., Kocha B. & Sinha, K.K. (1999). Competitive Analysis of Manufacturing Plants: An application to the US processed food industry. European Journal of Operational Research, 118, 217-234. Karsak, E.E., & İşcan, F. (2000). Çimento Sektöründe Göreli Faaliyet Performanslarının Ağırlık Kısıtlamaları ve Çapraz Etkinlik Kullanılarak Veri Zarflama Analizi ile Değerlendirilmesi.. Endüstri Mühendisliği Dergisi, 11(3), 2-10. Keramidou, I., Mimis A. & Pappa, E. (2011) Estimating Technical and Scale Efficiency of Meat Products Industry: The Greek Case. Journal of Applied Sciences 11(6), 971- 979. Kumar M. & Basu P., (2008). Perspectives of productivity growth in Indian food industry: a data envelopment analysis. International Journal of Productivity and Performance Management, 57(7), 503 – 522. Miner3D. http://www.miner3d.com/index.php. Omniscrope. http://www.visokio.com/omniscope/. Pink, G.H., Holmes, G.M., D’Alpe, C., Strunk, L.A., McGee P. & Slifkin, R.T. (2006) Financial Indicators for Critical Access Hospitals. The Journal of Rural Health, 22(3), 229-236. Samoilenko, S., & Osei-Bryson, K.M. (2008). Increasing the Discriminatory Power of DEA in the presence of the sample heterogeneity with cluster analysis and decision trees. Expert Systems with Applications, 34, 1568-1581. Sharma, K.R., Leung, P. & Zaleski, H.M. (1997) Productive Efficiency of the Swine Industry in Hawaii: Stochastic Frontier vs. Data Envelopment Analysis. Journal of Productivity Analysis, 8, 447-459.
  • 20. 20 Sharma, M.J., & Jin, Y.S. (2011). Self Organizing Maps Infusion with Data Envelopment Analysis. Retrieved on 24.05.2012, Available under: http://cdn.intechweb.org/pdfs/13293.pdf Ulucan, A. (2002). ISO500 Şirketlerinin Etkinliklerinin Ölçülmesinde Veri Zarflama analizi Yaklaşımı: Farklı Girdi Çıktı Bileşenleri ve Ölçeğe göre Getiri Yaklaşımları ile Değerlendirmeler. Ankara Üniversitesi SBF Dergisi, 57(2), 185-202. Ulus, F., Kose, O., Ertek, G., & Sen, S. (2006). Financial benchmarking of transportation companies in the New York Stock Exchange (NYSE) through data envelopment analysis (DEA) and visualization. 4th International Logistics and Supply Chain Congress, İzmir, Turkey. Weill, L.( 2004). Measuring cost efficiency in European banking: a comparison of frontier techniques. Journal of Productivity Analysis, 21, 133–152. Wen, H.J., Lim, B., & Huang, H.L. (2003). Measuring e-commerce efficiency: a data envelopment analysis (DEA) approach. Industrial Management & Data Systems, 103(9), 703-710. Wongchai, A., Tai C.F., & Peng, K.C. (2011) An Application of Super-efficiency and Tobit Method for Financial Efficiency Analysis of Food Industrial Companies in Taiwan. Emergency Management and Management Sciences (ICEMMS), 2nd IEEE International Conference. Yang, Z. (2010). Cross System Bank Branch Evaluation Using Clustering and Data Envelopment Analysis. Lecture Notes in Computer Science (LNCS), 6215, 238-245. Yeh, Q.J. (1996). The Application of Data Envelopment Analysis in Conjunction with Financial Ratios for Bank Performance Evaluation. The Journal of Operational Research Society, 47(8), 980-988. yWorks. http://www.yworks.com/en/index.html. Zhou, Z., Huang, Y., & Hsu, M.K. (2008). Using Data Envelopment Analysis to Evaluate Efficiency: An Explanatory Study of the Chinese Hotel Industry. Journal of Quality Assurance in Hospitality & Tourism, 9(3), 240-256. KEY TERMS & DEFINITIONS Financial Benchmarking: The benchmarking of companies using financial data. Data Envelopment Analysis (DEA): A non-parametric analytical method for benchmarking a group of entities. Information Visualization: The field of computer science that works with the visualization of large-scale complex data for discovering new useful knowledge.
  • 21. 21 Financial Ratios: Ratios used in finance; they are computed as the ratio of two financial metrics. K-Means Clustering: An unsupervised machine learning / data mining method for clustering a set of entities into clusters of similar entities.