Download Visualizing market segmentation using self

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Marketing wikipedia , lookup

Marketing channel wikipedia , lookup

Service parts pricing wikipedia , lookup

Multicultural marketing wikipedia , lookup

Marketing research wikipedia , lookup

Bayesian inference in marketing wikipedia , lookup

Target audience wikipedia , lookup

Advertising campaign wikipedia , lookup

Grey market wikipedia , lookup

Darknet market wikipedia , lookup

Global marketing wikipedia , lookup

Neuromarketing wikipedia , lookup

Product planning wikipedia , lookup

Market penetration wikipedia , lookup

Market analysis wikipedia , lookup

Marketing strategy wikipedia , lookup

Target market wikipedia , lookup

Market segmentation wikipedia , lookup

Segmenting-targeting-positioning wikipedia , lookup

Transcript
Expert Systems with Applications 38 (2011) 198–205
Contents lists available at ScienceDirect
Expert Systems with Applications
journal homepage: www.elsevier.com/locate/eswa
Visualizing market segmentation using self-organizing maps and Fuzzy Delphi
method – ADSL market of a telecommunication company
Payam Hanafizadeh *, Meysam Mirzazadeh
Department of Industrial Management, Allameh Tabataba’i University, Teheran, Iran
a r t i c l e
i n f o
Keywords:
Market segmentation
Clustering
Visualization
Neural networks
Self-organizing map
a b s t r a c t
A common problem for marketing strategists is how to appropriately segment the market and select segment-specific marketing strategies. This paper presents a novel approach which integrates Fuzzy Delphi
method, self-organizing maps (SOM) and a visualization technique to cluster customers according to
their various characteristic variables and visualize segments by producing colorful market maps. These
market maps not only help the managers to see fully visualized clusters of market but also reveal mutual
non-linear correlations between different customers’ characteristic variables. In this research we studied
ADSL service market of an Iranian Telecommunication Company. SOM algorithm and visualizing technique were implemented in MATLAB environment to produce market maps of data set.
Ó 2010 Elsevier Ltd. All rights reserved.
1. Introduction
Market segmentation refers to the identification of distinct subsets of customers, where any subset may be selected as a market
target to be reached with a distinct marketing mix (Kotler, 1980).
To identify these subsets consumers are put into homogeneous
groups that marketing managers can select segment-specific marketing mixes and use them for effective targeting and predicting
potential customers. The value of performing marketing segmentation analysis includes better understanding of the market to properly position a product in the marketplace, identifying the
appropriate segments for target marketing, finding opportunities
in existing markets, and gaining competitive advantage through
product differentiation (Kotler, 1980). Although it was introduced
into the academic marketing literature in the 50s, market segmentation continues to be an important focal point of ongoing research
and marketing practices (e.g., Chaturvedi, Carroll, Green, & Rotondo, 1997).
A common problem for marketing strategists is how to appropriately segment the market and package differentiated products
and services for target segments. Segmentation is a methodological
process of dividing a market into distinct groups that might require
separate experiences or marketing service mixes (Venugopal &
Baets, 1994). Customer clustering is one of the most important
techniques used to identify these segments (Saarenvirta, 1998).
Various clustering techniques are used for segment identification.
These techniques generally take into consideration the identification and assessment of various customer characteristics (such as
* Corresponding author.
E-mail address: hanafizadeh@gmail.com (P. Hanafizadeh).
0957-4174/$ - see front matter Ó 2010 Elsevier Ltd. All rights reserved.
doi:10.1016/j.eswa.2010.06.045
demographics, socioeconomic factors, and geographic location)
and product related behavioral characteristics (such as purchase
behavior, consumption behavior, and attitudes towards and preference for attractions, experiences and services). Target marketing is
a strategy that aims at grouping a destination’s markets into segments so as to aim at one or more of these by developing products
and marketing programs tailored to each (Kotler, 2001).
Although much of the marketing literature has proposed various market segmentation techniques, clustering techniques are
frequently used in practice (Wedel & Kamakura, 2000). k-Means
algorithm and artificial neural networks are some techniques
widely considered in clustering problems (Hung & Tsai, 2008).
k-Means clustering is the most frequently used market segmentation technique among the other clustering techniques (Gehrt &
Shim, 1998; Kuo, Chang, & Chien, 2004). However, the major drawback of k-means clustering is that it often falls in local optima and
the result largely depends on the initial cluster centers (Kim & Ahn,
2008). Prior studies pointed out this limitation and tried to integrate k-means clustering and global search techniques including
genetic algorithms. For instance Kuo et al. integrated k-means,
neural networks and genetic algorithm to analyze web browsing
paths (Kuo, Liao, & Tu, 2005) or Kim and Ahn used GA and k-means
for a recommender system (Kim & Ahn, 2008).
Artificial neural networks (ANNs) have been recently applied to
a wide variety of business areas (Vellido, Lisboa, & Vaughan, 1999),
such as sales forecasting (Kuo & Xue, 1998), and bankruptcy prediction (Cadden, 1991). One type of unsupervised neural networks,
the self-organizing maps (SOM), can project high-dimensional
input space on a low-dimensional topology, allowing one to visually determine out the clusters (Lee, Slagle, & Blum, 1977; Pykett,
1978). Unlike the supervised learning methods, the SOM can be
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
used for clustering data without knowing the class memberships of
the input data.
Despite the fact that some previous research has used SOM
algorithm for market segmentation, its potential power in visualization and knowledge acquisition in marketing field is not fully
developed. The main contribution of this research is the integration
of Delphi Fuzzy method, Kohonens algorithm and a visualization
technique, and applying them for market segmentation problem.
We were motivated to use SOM for market segmentation by the
fact that this algorithm can be simultaneously used to reduce the
amount of data by clustering and to project the data non-linearly
onto a lower-dimensional subspace. The topological relationships
between the features automatically extracted by SOM from multi-spectral images are formed into a neural network in a meaningful order. This approach is efficient for contemporary market
segmentation approaches which consider all customer characteristics and utilize data warehouses with huge amount of data to identify market segments. Moreover the output maps generated by
SOM algorithm not only help the market managers to easily recognize all the market segments precisely but also enables them to
compare market maps over time and monitor market responses
of every segment.
Telecommunication industry is a growing industry in Iran
which has an enormous potential market for different telecommunications services. However many telecommunication companies
has not conducted considerable market research yet. Hence they
neither follow clear strategies nor have market mixes. In this research, SOM is employed to visualize ADSL market segments of a
reputable Iranian telecommunication company. Data saved in selling system database is utilized to form dataset or research.
The rest of the paper is organized as follows: Section 2 reviews
market segmentation literature and describe clustering problems,
comparing its techniques for market segmentation. Section 3 presents SOM algorithm and describes its learning method and visualization technique. Empirical study and data analysis is presented in
Section 4. Last but not least, in Section 5 conclusions and implications are considered.
2. Market segmentation
The critical role of segmentation in choosing true marketing
strategies has been widely argued (Wedel & Kamakura, 2000).
Market segmentation involves a broad variety of approaches (see
Wedel & Kamakura, 2000 for a review). Essentially, these approaches can be divided to two main groups. First group approaches are based on known characteristics and groups are
selected from a population in advance and declared as ‘segments’
199
(e.g. socio-demographic characteristics or band width of an ADSL
service use). In contrast, in the second group approaches, the
post-hoc methods, empirical investigation is conducted using multivariate analysis to identify segments (Anable, 2005). To identify
segments, respondents are clustered according to their similarity
on multivariate profiles on any number of combinations of variables. These approaches may include various mixtures of attitudinal, behavioral or personality characteristics.
Among the post-hoc segmentation methods, clustering methods are relatively powerful and frequently used in practice (Dillon,
Kumar, & Borrero, 1993; Wedel & Kamakura, 2000). The rest of section reviews common clustering techniques used for market segmentation problems and outlines their strengths and weaknesses
for market segmentation.
2.1. Clustering techniques
Cluster analysis is an effective tool in scientific or managerial
inquiry. It groups a set of data in d-dimensional feature space to
maximize the similarity within the clusters and minimize the difference between two different clusters. Clustering methods can be
roughly classified into hierarchical method and partitioning method (Witten & Frank, 2000). Partitioning clustering can be divided
into exclusive and overlapping methods depending on which set
theory the algorithm is built on. Exclusive clustering (e.g., k-means,
SOM) is built on classic set theory where an element is an exclusive
member of a set. Overlapping clustering (e.g., Fuzzy c-means) is
based on fuzzy set theory where an element can be a member of
one or more sets (Hwang & Thill, 2007).
The k-means (MacQueen, 1966) clustering method is probably
the most well known clustering technique. The accuracy of the kmeans procedure is very dependent upon the choice of the initial
seeds (Milligan & Cooper, 1980) and it often falls in local optima.
This can be a major disadvantage for post-hoc market segmentation cases because accurate designation of market clusters is not
possible for market managers. In addition, k-means poverty in data
visualization makes it comparatively an improper method for market segmentation.
Fuzzy c-means (Bezdek, 1981; Roubens, 1982) is an overlapping
clustering method based on fuzzy set theory. Contrary to the kmeans method the fuzzy c-means is more flexible because it shows
those objects that have some interface with more than one cluster
in the partition. However it has the same major drawbacks as the
k-means for market segmentation problem.
Kohonen algorithm and its variants have been used for different
clustering problems. Kiang, Hu, and Fisher (2006) proposed an extended self-organizing map network to forecast market segment
Table 2.1
Comparison and review of clustering methods used for market segmentation.
Work and year
Clustering algorithm
Application area
Pros and Cons
1
Kim and Ahn (2008)
GA k-means
Online shopping market
2
Kiang, Hu, and Fisher (2007)
Extended SOM
Telecommunication
3
4
Fuzzy clustering
SOM and GA k-means
Urban housing
Fright transport industry
5
Hwang and Thill (2007)
Kuo, An, Wang, and Chung
(2006)
Bloom (2004)
Weak visualization – using GA to identify initial seed – non-overlapping
clustering
Non-visualized – non-overlapping clustering – independent from
sample size
Non-visualized – overlapping clustering – predefined cluster numbers
Non-overlapping clustering – non-visualized
Tourist market
Non-overlapping clustering – non-visualized
6
Cheung (2003)
SOM and Back propagation
ANN
k*-Means
7
Jang, Morrison, and O’Leary
(2002)
Hruschka and Natter (1999)
k-Means
Gaussian distribution
dataset
Japanese travel market
k-Means
Household cleaners
Non-overlapping clustering – visualized algorithm – no cluster number
predefinition
Predefined cluster numbers – non-overlapping clustering – weak
visualization
Predefined cluster numbers – non-overlapping clustering – weak
visualization
8
200
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
membership. The extended SOM network was used to group the
nodes on the output map into a user specified number of clusters.
Hung and Tsai (2008) used a hierarchical SOM segmentation model
for the market of multimedia on demand. Table 2.1 reviews and
compares different clustering methods used for market
segmentation.
3. Self-organizing maps
SOM algorithm was introduced for the first time by Kohonen
(Kohonen, 1981) in 1981 and it was practically used in 1984 for
voice recognition (Kohonen, Mäkisara, & Saramäki, 1984).
Although SOM is frequently used in data mining (Vesanto, 1997),
complex spaces display (Vesanto, 1999), clustering the spaces with
high dimensions and particularly in image processing, process control, project management, financial analysis and industrial detections and medical diagnoses, comparatively it is less used in
managerial and business administration related fields(Oja, Kaski,
& Kohonen, 2002). An extensive list of engineering usages of
SOM is presented by Kohonen, Oja, Simula, Visa, and Kangas
(1996).
The basis of SOM is to map spaces with high dimensions into
two- or three-dimension space in a way that minimum information is lost and the hidden information in relations among the data
can be discovered and showed. This method can show the correlation between data, information and their mutual effects on each
other.
The SOM network typically has two layers of nodes, the input
layer and the map layer. Each map includes a set of neurons that
are put together in a two-dimensional grid which is fully connected to the input layer. The lattice of grid is either hexagonal
or rectangular. Fig. 3.1 presents topology of traditional self-organizing map with hexagonal lattice grid.
Each neuron of the map layer is corresponding to an information vector with the dimension numbers equal to the dimension
number of the information space under analysis. In other words,
each neuron is the representative of one part of the information
space.
3.1. SOM algorithm
The SOM basic algorithm is based on a competitive unsupervised learning algorithm known as ‘‘winner takes all”. During the
training process, input data are fed to the network through the
nodes in the input layer. As the training process proceeds, the neu-
Fig. 3.2. Updating the best matching unit (BMU) and its neighbors towards the
input sample marked with x. the black and gray circles correspond to situation
before and after updating, respectively. The lines show neighborhood relations
(Vesanto, 2000).
rons adjust their weights values according to the topological relations in the input data. The neuron with the minimum distance is
the winner called best matching unit (BMU) and adjusts its weights
to be closer to the value of the input pattern. Updating BMU and its
neighbors is illustrated in Fig. 3.2.
The network undergoes a self-organization process through a
number of training cycles, starting with randomly chosen weights
for the neurons in the map layer. During each training cycle, every
input vector is considered in turn and the winner neuron is determined. The weight vectors of the BMU and the nodes in the neighborhood are updated using a weight adaptation function.
SOM training algorithm includes the following steps (Kohonen,
2001):
1. Selecting map parameters such as the dimensions and the initialization weight vector corresponding to each neuron.
2. The network is fed with the data under analysis to find the best
matching unit for each input data vector. Each record such as X
includes quantitative values of n attributes that are shown as
follows:
X ¼ ½X 1 ; X 2 ; . . . ; X n 2 Rn
If weight vector of ith neuron is defined as bellow:
mi ¼ ½mi1 ; mi2 ; . . . ; min 2 Rn
Input Layer
ð3-1Þ
ð3-2Þ
Xj
Then, corresponding to each input record, the best matching unit or
winner neuron is identified based on Eq. (3-3)
mij
c ¼ arg minfdðX; mi Þg
i
ð3-3Þ
In which c indicates the winner neuron and d(X, mt) is the Euclidian
distance between the record and the weight vector of ith neuron
that is calculated by Eq. (3-4)
dðX; YÞ ¼ kX Yk
ð3-4Þ
3. Updating the weight vector corresponding to each neuron using
Eq. (3-5)
mi ðt þ 1Þ ¼ mi ðtÞ þ aðtÞhci ðtÞ½XðtÞ mi ðtÞ
Map Layer
Fig. 3.1. View of traditional SOM topology with hexagonal lattice grid.
ð3-5Þ
In which, 0 < a < 1 is the learning rate and hci(t) indicates the neighborhood rate of ith neuron with the cth neuron (winner neuron).
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
The neighborhood rate of ith neuron with the winner neuron is obtained from Eq. (3-6) which is Gaussian function
hct ¼ e
krc ri k2
2r2 ðtÞ
ð3-6Þ
In Eq. (3-6), r is the controller of the function domain and is eventually decreased during the training process. Also ri and rc, as it is
presented in Fig. 3.3, are respectively the positions of ith and cth
neurons in SOM grid (Kohonen, 2001).
Since the SOM training algorithm uses Euclidian distance to
identify the BMU, the data of each dimension of the input vector
should be normalized and standardized separately before feeding
to the network. At the end of SOM training stage, a neuron map
is obtained that is, in fact, a summary of the space under analysis
of the network.
3.2. SOM visualization technique
Visualization technique used in this research was firstly introduced by Ultsch and Siemon (1990). In this method relating to each
attribute’s value in the weight vector, a RGB (Red–Green–Blue)
vector and consequently a color is considered in a way that all values can be shown in a colored spectrum from dark blue (for lowest
values) to dark red (for highest values). In this way, for each attribute, the color of each neuron is identified and the map related to
that attribute is obtained. With the attribute’s maps, it is then possible to evaluate the mutual relation between them (correlation
test). The same color of the corresponding parts of two maps indicates the correlation of the corresponding attributes in those maps.
The intensity of color’s difference or similarity between maps can
1
2
3
Fig. 3.3. An example of topological neighborhood sets (at the radius 1, 2 and 3)
(Kohonen, 2001).
201
show the correlation rate between two variables in different parts
of the space. More over, quantitative criteria can be calculated for
that. It is possible that the intensity and even the kind of correlation between two attribute s in various parts of the space are different and affected by the values of other attributes that all can
be truly shown using SOM (Kohonen, 2001).
Fig. 3.4 indicates a sample of SOM usage in analyzing complex
models and exhibiting the simultaneous effects of different variables on each other. As it is clear in this figure, the space under
analysis has five dimensions. Comparing the maps with each other,
the following results can be inferred:
1. Variables V2 and V5 and also V1 and V4 have inverse correlation
in their entire change domain. (When V2 is red (takes high values), V5 is blue (takes low values) and vise versa.) Although the
correlation rate of V2 and V5 is fixed almost in all parts of the
space, the same is not applicable to variables V1 and V4.
2. Variables V3 and V5 are inversely correlated but the correlation
rate in different parts of the space is diverse and less than the
correlation rate of V2 and V5.
3. Variables V2 and V3 are directly correlated but their correlation
rate is dependent to the values of other variables.
4. The correlation of variables V1 and V4 with V5 is completely
non-linear and its rate in various parts of the space is different.
The important issue is that maximum and minimum values of
V5 are probably occurred in medium values of V1 and V4.
A popular way to visualize clustering is to compute the distance
between adjacent map units and present the result as a U-matrix1
(Iivarinen, Kohonen, Kangas, & Kaski, 1994; Ultsch & Siemon, 1990).
Clustering matrix and corresponding to that, the clustering map or
U-matrix are the major outputs of SOM. The elements of U-matrix
show the distance of weight vector of adjacent neurons. If the attributes of two parts of the space under analysis are similar, then the
distance between the weight vectors of the neurons related to them
is not too much and in other words, both neurons are in the same
cluster of the space under analysis. On the other hand, the more
the algebraic distance between the neighbor neurons, the more different their corresponding spaces will be. Thus, they can be categorized into two separate clusters (Kohonen, 2001). Fig. 3.5 indicates a
U-matrix with clusters and sub-clusters from a 62-dimension space.
4. Empirical study
In this research ADSL market of a reputable Iranian telecommunication company was studied. Fanava Co. is a holding company
which provides various telecommunication services, but ADSL services involve a major volume of its market. Fanava offers three different types of ADSL services with different specifications to cover
the various needs of its customers. These services differ in sharing
type of bandwidth, total data transmission limitation and billing
method. Our methodology for conducting this research and the
case study has been reviewed in the following section.
4.1. Research methodology
The research has been done in three stages is presented in
Fig. 4.1:
Stage 1: Model development
A. Defining the problem, specifying research goals, scope, and
methodology.
B. Providing a list of demographic, psychological, behavioral and
Fig. 3.4. SOM usage in simultaneous analysis of non-linear relations between
variables.
1
Unified distance matrix.
202
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
Fig. 3.5. U-matrix. Clusters are named by capital letters.
geographic variables used for segmentation in marketing literature and selecting the most effective variables in our
empirical case market structure using Fuzzy Delphi method
Stage 2: SOM network development
A. Model deployment, network construction and configuration
of parameters.
B. Making the dataset using obtained data from Fanava customers by means of an electronic survey and customers’ data
available at the database of Fanava sales system.
C. Data preprocessing by standardizing and normalizing the
dataset.
D. Setting initial weights and training the network using dataset.
E. Generating U-matrix map and each variable map.
F. Distinguish and label segments and sub-segments.
Stage 3: Segments analysis
A. Selecting a sample customer set from each segment.
B. Investigating and calculating the loyalty of each sample customer set to each service of Fanava.
C. Analysis of segments behavior.
4.2. Model development
To find critical variables for segmentation, Fuzzy Delphi method
was employed to identify the most effective factors in our marketing context. The Delphi method originated in a series of studies
that the RAND Corporation conducted in the 1950s (Okoli & Pawlowski, 2004). The objective was to develop a technique to obtain
the most reliable consensus of a group of experts (Dalkey & Helmer, 1963). Although we could select the critical variables through
the traditional statistical analysis methods, Delphi method was
used as a stronger methodology for to achieve consensus of our
market managers on final variables set. Okoli and Pawlowski
(2004) compared and contrasted the strengths and weaknesses of
a Delphi study versus the traditional survey approach as a research
strategy (Okoli & Pawlowski, 2004). Considering this comparison,
we selected the Delphi method for the following reasons:
Firstly our study is an investigation of factors that would affect
customer behavior in our market. This complex issue requires
knowledge of people who understand the different aspects of our
market structure. Thus, a Delphi study answers the study questions
more appropriately. Secondly because we have a limited number of
market experts, our sample size for an acceptable statistical analysis is small therefore, in these conditions, Delphi technique will
lead to more exact results and finally among other group decision
analysis methods Delphi is desirable in that it does not require the
experts to meet physically, it save time and cost required for collecting experts opinions.
The traditional Delphi method has always suffered from low
convergence expert opinions, high execution cost, and the possibility that opinion organizers may filter out particular expert opinions
(Murry, Pipino, & Gigch, 1985) thus proposed the concept of integrating the traditional Delphi method and the fuzzy theory to improve the vagueness and ambiguity of Delphi method. Hsu and
Yang (2000) applied triangular fuzzy number to encompass expert
opinions and establish the Fuzzy Delphi method. The max and min
values of expert opinions are taken as the two terminal points of triangular fuzzy numbers, and the geometric mean is taken as the
membership degree of triangular fuzzy numbers to derive the statistically unbiased effect and avoid the impact of extreme values.
This method may create a better effect of criteria selection. It features the advantage of simplicity, and all the expert opinions can
be encompassed in one investigation (Okoli & Pawlowski, 2004).
The Fuzzy Delphi questionnaire contained a list of 19 variables
frequently mentioned in market segmentation literature which are
divided into four categories labeled demographic, psychological,
behavioral and geographic. The questionnaire was distributed to
seven managers of the sales and marketing unit. The respondent
were asked to indicate on a five-point Likert scale to what extend
each variable influences customer purchasing behavior, according
to their contact with ADSL customers. To make triangular fuzzy
number for each variable steps were followed:
sAj ¼ ðLAj ; MAj ; U Aj Þ
ð4-1Þ
LAj ¼ minðX Aij Þ
ð4-2Þ
U Aj ¼ maxðX Aij Þ
qffiffiffiffiffiffiffiffiffiffi
n
MAj ¼ Pni¼1 X Aij
ð4-3Þ
ð4-4Þ
Eq. (4-1) represents triangular fuzzy number of jth variable. LAj is
the minimum value of Aj that is evaluated by the experts. UAj is
the maximum value of Aj that is evaluated by the experts. MAj is
geometric mean of expert evaluation from Aj and XAij is the evaluation of ith expert from ith variable.
Geometric mean of each triangular number corresponds to consensus of experts on the value of variable. If the geometric mean is
more than a predefined threshold the variable will be selected. For
the threshold value r, the 80/20 rule was adopted (Kuo & Chen,
2007) with r set as 4. This indicated that among the factors for
selection, ‘‘20% of the factors account for an 80% degree of importance of all the factors’’.
According to fuzzy numbers, six variables identified as the most
effective factors on the consensus of market experts. Table 4.1 presents the most effective variables in Fanava market segmentation
context.
4.3. SOM network development
4.3.1. Dataset
For the dataset, 195 data items were collected through a survey
sent to 647 private customers of Fanava via email. The question-
Model Development
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
203
Problem Definition
Variables Selection
Survey
NetworkConstruction
Segments Analysis
SOM Network Development
Customer
Data
Training The Network
Data Preparation
Customer
Survey
Generating Self
Organizing Maps
Segments Identification
Customer
Data
Segments Sampling
Segments Behavior
Analysis
Fig. 4.1. Research methodology steps.
naire was pre-tested by interviewing a few customers to avoid
probable misunderstanding. The questionnaire includes three
dimensions, i.e. demographics, psychographics and geographical.
Demographic dimension collects the respondents’ personal information. Given psychographic variables, each component is represented by five-point Likert’s rating scales with values ranging
from 1 (strongly disagree) to 5 (strongly agree). This dimension attempts to find out information on respondents’ characteristics,
their lifestyles, buying incentives, etc.
4.3.2. Data preprocessing
Scaling variables is of special importance in the SOM network
development since the SOM algorithm uses Euclidean metric to
measure distances between vectors. If one variable has values in
the range of [0. . .1000] and another in the range of [0. . .1] the for-
Table 4.1
most effective variables in Fanava market segmentation context.
Variable category
Variable
Rank
Geographical
Location
1
Demographic
Age
Education
Annual income
6
3
5
Behavioral
Purpose of service usage
2
Psychographic
Life style
3
mer will almost completely dominate the map organization because of its greater impact on the distances measured. To this
end, we normalized dataset and scaled all variables linearly so that
their variances were equal to one.
4.3.3. Network construction and training
The training dataset consisted of 195 six-dimensional vectors
each corresponding to one customer data. A 15 15 sized SOM
was trained with this data.
The networked was developed and trained in MATLAB 7.0 environment. Initial weights of network were randomly assigned.
Sequential training algorithm and Gaussian neighborhood function
were used for training process. Training was performed in two
phases. In the first phase, relatively large initial learning rate (a0)
and neighborhood radius (r0) were used. In the second phase both
learning rate and neighborhood radius was small right from the
beginning. These phases correspond to first tuning the SOM
approximately to the same space as the input data and then fine
tuning the map.
4.3.4. Visualization
Fig. 4.2 presents Fanava ADSL market. The U-matrix shows segments of the market and each variable map illustrates the distribution of values of the corresponding variable.
Variable maps reveal knowledge underlying in customers data,
some rules can be inferred from the maps are:
204
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
Fig. 4.2. Fanava ADSL market map.
Income and education are positively correlated in the earlier
mentioned market. In other words, wherever in the map there
is high education (dark red area of education map) the income
level is high as well (dark blue area of income map), and vice
versa.
Most of our customers are highly educated (red and yellow
area) generally with comparatively high annual income (dark
blue and blue area). The customers set do not contain too old
or too young people – normally they are in a specific range of
age.
Highly educated customers (red area of education map) generally use high-speed internet for research purposes (red area of
usage purpose) and educated ones (orange and light blue area)
for business and entertainment. Low-educated customers’ purpose is mainly entertainment.
Those who have moderate income and are educated or low educated is use the Internet with the purpose of doing business.
Fig. 4.4. Labeled U-matrix according to loyalty rate of segments.
U-matrix represents segments of the market. Every cluster of
the map corresponds to a segment of the market. Nine different
segments can be identified roughly from the Fanava U-matrix.
Fig. 4.3 shows identified clusters. Segments have been separated
by black line.
4.4. Segments analysis
Fig. 4.3. Identified segments of market on SOM map.
Customers of each segment have particular characteristics. For
example, the segment located on the top right corner of the U-matrix represents those customers who live in densely populated cities, are comparatively educated. In addition, their age average is
not so high and they high annual income. This information can pro-
P. Hanafizadeh, M. Mirzazadeh / Expert Systems with Applications 38 (2011) 198–205
vide us with a clear view of the market segments and their specifications for market managers who are generally interested in recognizing their target segment or segments. Target segment
contains the most loyal and profitable customers. If a company selects the target segment correctly, it will earn the most profit from
its target segment.
To identify target segments of Fanava, behavioral aspect of loyalty was investigated, then according to loyalty rate of each segment customers, it was labeled in the map. Samples (15–20)
were randomly selected from each cluster and their purchasing
histories were retrieved from Fanava sales system based on their
identification code. According to frequency of renewing purchase
convention by customer, the rate of loyalty of each segment was
estimated. Fig. 4.4 is labeled U-matrix according to loyalty rate of
market segments.
5. Conclusion
Market segmentation fulfills a crucial role in modern marketing
paradigms, but not much has be done to make an effective instrument for visualizing market segments in a way which not only is
easily understood by market managers but also show all the
knowledge underlying the dataset. In this research, a visualization
approach was proposed based on neural networks and SOM to
market segmentation which has advantages over other clustering
methods such as easiness to understand, ability to reveal mutual
relationships among different customer’s characteristics variables,
and its independence from predefining market segments number.
In this paper, ADSL market of a reputable Iranian telecommunication company was studied. Critical variables of their market context was identified using a survey and the dataset for training the
network was obtained from the survey questionnaire and customers’ data saved in the sales system database. After constructing and
training the SOM network, U-matrix and variable maps were produced that revealed not only the market segments but also mutual
relationships between all market variables.
References
Anable, J. (2005). ‘Complacent car addicts’ or ‘aspiring environmentalists’?
Identifying travel behaviour segments using attitude theory. Transport Policy,
12, 65.
Bezdek, J. C. (1981). Pattern recognition with fuzzy objective function algorithms. New
York: Plenum Press.
Bloom, J. Z. (2004). Tourist market segmentation with linear and nonlinear
techniques. Tourism Management, 25, 723–733.
Cadden, D. T. (1991). Neural networks and the mathematics of chaos – an
investigation of these methodologies as accurate predictors of corporate
bankruptcy. In First international conference on artificial intelligence applications
on wall street (pp. 582–589). EEE Computer Society Press.
Chaturvedi, A., Carroll, J. D., Green, P. E., & Rotondo, J. A. (1997). A feature-based
approach to market segmentation via overlapping k-centroids clustering.
Journal of Marketing Research, 34, 370–377.
Cheung, Y. (2003). k*-means: A new generalized k-means clustering algorithm.
Pattern Recognition Letters, 24, 2883–2893.
Dalkey, N., & Helmer, O. (1963). An experimental application of the Delphi method
to the use of experts. Management Science, 9(3), 458–467.
Dillon, W. R., Kumar, A., & Borrero, M. S. (1993). Capturing individual differences in
paired comparisons: An extended BTL model incorporating descriptor variables.
Journal of Marketing Research, 30, 42–51.
Gehrt, K. C., & Shim, S. (1998). A shopping orientation segmentation of French
consumers: Implications for catalog marketing. Journal of Interactive Marketing,
12(4), 34–46.
Hruschka, H., & Natter, M. (1999). Comparing performance of feed forward neural
nets and k-means of cluster-based market segmentation. European Journal of
Operational Research, 114, 346–353.
Hsu, T. H., & Yang, T. H. (2000). Application of fuzzy analytic hierarchy process in the
selection of advertising media. Journal of Management and Systems, 7, 19–39.
Hung, C., & Tsai, C. (2008). Market segmentation based on hierarchical selforganizing map for markets of multimedia on demand. Expert Systems with
Applications, 34, 780–787.
Hwang, S., & Thill, J. C. (2007). Using fuzzy clustering methods for delineating urban
housing submarkets. In Proceedings of the 15th international symposium on
advances in geographic information systems.
205
Iivarinen, J., Kohonen, T., Kangas, J., & Kaski, S. (1994). Visualizing the clusters on the
self-organizing map. In C. Carlsson, T. J̈arvi, & T. Reponen, (Eds.). Proceedings of
the conference on artificial intelligence res. in Finland, number 12 in conference
proceedings of Finnish Artificial Intelligence Society (pp. 122–126), Helsinki,
Finland: Finnish Artificial Intelligence Society.
Jang, S. C., Morrison, A. M., & O’Leary, J. T. (2002). Benefit segmentation of Japanese
pleasure travelers to the USA and Canada: Selecting target markets based on the
profitability and risk of individual market segments. Tourism Management, 23,
367–378.
Kiang, M., Hu, M., & Fisher, D. (2006). An extended self-organizing map network for
market segmentation – A telecommunication example. Decision Support
Systems, 42, 34–36.
Kiang, M., Hu, M., & Fisher, D. (2007). The effect of sample size on the extended selforganizing map network—A market segmentation application. Computational
Statistics and Data Analysis, 51, 5940–5948.
Kim, K., & Ahn, H. (2008). A recommender system using GA K-means clustering in
an online shopping market. Expert Systems with Applications, 34, 1200–1209.
Kohonen, T., (1981). Automatic formation of topological maps of patterns in a selforganizing system. In E. Oja, & O. Simula, (Eds.). Proceedings of 2SCIA, Scand.
conference on image analysis, Helsinki, Finland (pp. 214–220).
Kohonen, T. (2001). Self-organizing maps (3rd ed.). Springer series in information
sciences (Vol. 30). Berlin, Heidelberg, New York: Springer.
Kohonen, T., Mäkisara, K., & Saramäki, T. (1984). Phonotopic maps – Insightful
representation of phonological features for speech recognition. In Proceedings of
7ICPR, international conference on pattern recognition, Los Alamitos, CA
(pp. 182–185). IEEE Computer Society Press.
Kohonen, T., Oja, E., Simula, O., Visa, A., & Kangas, J. (1996). Engineering applications
of the self-organizing map. Proceedings of the IEEE, 84(10), 1358–1384.
Kotler, P. (1980). Marketing management – Analysis, planning, and control (4th ed.).
Prentice-Hall.
Kotler, P. (2001). Marketing management. Englewood Cliffs: Prentice-Hall.
Kuo, R. J., An, Y. L., Wang, H. S., & Chung, W. J. (2006). Integration of self-organizing
feature maps neural network and genetic K-means algorithm for market
segmentation. Expert Systems with Applications, 30, 313–324.
Kuo Y. F., & Chen, P. C. (2007). Constructing performance appraisal indicators for
mobility of the service industries using Fuzzy Delphi method. Expert Systems
with Applications. doi:10.1016/j.eswa.2007.08.068.
Kuo, R. J., Chang, K., & Chien, S. Y. (2004). Integration of selforganizing feature maps
and genetic-algorithm-based clustering method for market segmentation.
Journal of Organizational Computing and Electronic Commerce, 14(1), 43–60.
Kuo, R. J., Liao, J. L., & Tu, C. (2005). Integration of ART2 neural network and genetic
K-means algorithm for analyzing Web browsing paths in electronic commerce.
Decision Support Systems, 40(2), 355–374.
Kuo, R. J., & Xue, K. C. (1998). A decision support system for sales forecasting
through fuzzy neural network with asymmetric fuzzy weights. Journal of
Decision Support Systems, 24(2), 105–126.
Lee, R. C. T., Slagle, J. R., & Blum, H. (1977). A triangulation method for the sequential
mapping of points from N-space to two-space. IEEE Transactions on Computers,
26, 288–292.
MacQueen, J. (1966). Some methods for classification and analysis of multivariate
observations. In L. M. L. Cam & J. Neyman (Eds.). Proceedings of the fifth Berkeley
symposium on mathematical statistics and probability (Vol. I, pp. 281–297).
University of California Press.
Milligan, G. W., & Cooper, M. C. (1980). An examination of the effect of six types of
error perturbation on fifteen clustering algorithms. Psychometrika, 45(3),
159–179.
Murry, T. J., Pipino, L. L., & Gigch, J. P. (1985). A pilot study of fuzzy set modification
of Delphi. Human Systems Managements, 5, 76–80.
Oja, E., Kaski, S., & Kohonen, T., (2002), Bibliography of self-organizing map (SOM)
Papers: 1998–2001 Addendum, Neural computing surveys (Vol. 3, pp. 1–156).
Okoli, C., & Pawlowski, S. (2004). The Delphi method as a research tool: An example,
design considerations and applications. Information and Management, 42, 15–29.
Pykett, C. E. (1978). Improving the efficiency of Sammon’s nonlinear mapping by
using clustering archetypes. Electronics Letters, 14, 799–800.
Roubens, M. (1982). Fuzzy clustering algorithms and their cluster validity. European
Journal of Operational Research, 10, 294–301.
Saarenvirta, G. (1998). Mining customer data. DB2 Magazine, 3(3), 10–20.
Ultsch, A., & Siemon, H. P. (1990). Kohonen’s self-organizing feature maps for
exploratory data analysis. In Proceedings of the international neural network
conference, INNC’90 (pp. 305–308). Dordrecht, Netherlands: Kluwer.
Vellido, A., Lisboa, P. J. G., & Vaughan, J. (1999). Neural networks in business: A
survey of applications (1992–1998). Expert Systems with Applications, 17(1),
51–70.
Venugopal, V., & Baets, W. (1994). Neural networks and statistical techniques in
marketing research: A conceptual comparison. Marketing Intelligence and
Planning, 12(7), 30–38.
Vesanto, J., (1997). Data mining techniques based on the self-organizing map.
Helsinki University of Technology.
Vesanto, J. (1999). SOM-based data visualization methods. Intelligent Data Analysis
Journal, 3, 111–126.
Vesanto, J. (2000). Using SOM in data mining, Licentiates Thesis, Helsinki University
of Technology.
Wedel, M., & Kamakura, W. A. (2000). Market segmentation conceptual and
methodological foundations. Kluwer Academic Publishers.
Witten, I. H., Frank, E., (2000). Data mining: Practical machine learning tools and
techniques. Morgan Kaufmann series in data management systems.