Download healthcare knowledge management using data mining techniques

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Nonlinear dimensionality reduction wikipedia , lookup

Transcript
Advances in Computational Research
ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, 2015, pp.-274-278.
Available online at http://www.bioinfopublication.org/jouarchive.php?opt=&jouid=BPJ0000187
HEALTHCARE KNOWLEDGE MANAGEMENT USING DATA MINING TECHNIQUES
MAHAMUNE M.1*, INGLE S.2, DEO P.3 AND CHOWHAN S.1
1School
of Computational Sciences, S. R. T. M. University, Nanded - 431606, MS, India.
of Computer Science & Engineering, Shreeyash College of Engineering & Technology, Aurangabad - 431 005, MS, India.
3Department of Commerce and Management, S. B. E. S. College, Aurangabad - 431 001, MS, India.
*Corresponding Author: Email- mohnish.mahamune@gmail.com
2Department
Received: December 18, 2014; Revised: January 05, 2015; Accepted: January 15, 2015
Abstract- Knowledge Management (KM) in any organization in integration with information technology supports system strategies with business strategies to attain the vision and mission very precisely. In today’s era knowledge management is crucial in all aspects of organization;
Hence knowledge discovery from massive database is today’s need. Knowledge Discovery in Databases (KDD) helps organizations turn their
data collection into valuable information. Organizations that take advantage of KDD will find that they can lower the healthcare costs while
improving healthcare quality by using fast and better clinical decision making. The data collected by healthcare organization might be structured or unstructured. Hence it is essential to use some technology to gain knowledge from massive databases which will increase the access
to knowledge for the people working in organization as well as competitiveness among organizations. Though the healthcare sector relies
heavily on knowledge and evidence based medicine is expected to be implemented in daily healthcare activities; the quality of care relies on
the exchanging the knowledge between the organization. The Data Mining comes up against major scientific and technological type of tool for
knowledge discovery and is also considered as significant field in Knowledge Management. This paper explores the Data Mining techniques
which found suitable for discovering the hidden knowledge from the healthcare massive database and their applications which supports the
KM process in healthcare sector, its advantages, disadvantages and challenges.
Keywords- Knowledge Management, Data Mining Applications, Knowledge Discovery in Healthcare, Knowledge Discovery in Databases
Citation: Mahamune M., et al (2015) Healthcare Knowledge Management using Data Mining Techniques. Advances in Computational Research, ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, pp.-274-278.
Copyright: Copyright©2015 Mahamune M., et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution and reproduction in any medium, provided the original author and source are credited.
Introduction
In today’s information era, organizations want to seek competitive
advantage. This gives rise to Knowledge Management (KM). Many
organizations collected and stored huge amount of data however
they are unable to discover valuable information hidden in the data
by transforming these data into valuable and useful knowledge [1].
The information technology plays a vital role in Knowledge Management to create, share, integrate and distribute the knowledge; it’s
actually a process of data usage [2]. In today’s era health organizations collecting vast amount of data which would benefit them by
application of KDD tools and techniques. These techniques are
helpful for discovering the valuable knowledge from large datasets
for decision making [3]. Data Mining can be use as tool; Data Mining is an essential part of KM [2]. The study of data mining has an
objective of finding meaningful knowledge from huge datasets. Data
mining becoming popular in healthcare field because it fulfils the
need of efficient analytical methodology for detecting unknown and
valuable information in data. The data mining techniques provide
several benefits in health industry like detection diseases cause,
medical treatment methods identification, constructing drug recommendation systems, individuals health profile development,
healthcare research policies, even detection of fraud in health insur-
ance etc [4]. For decision making regarding patient health is difficult
due to vast and complex data. El-Sappagh, et al [22] stated that the
clinical decisions are often made based on doctor’s intuition and
experience rather than on the knowledge rich data hidden in the
database. This practice leads to unwanted biases and excessive
medical costs which affects the quality of service provided to patients [5,21]. The data may consist of patients health reports, medical claims, drug prescriptions, etc. hence there is a need of powerful tool which can generate important information from complex
data. The data mining technologies can provide benefits to
healthcare by grouping the patients having similar types of diseases
so that the healthcare organizations provide them better treatments
[7]. Data Mining can be useful for KM in two main manners: (i) to
share common knowledge of business intelligence (BI) context
among data miners. (ii) to use data mining as a toll to extend human knowledge [2]. As a part of Data Mining research, this paper
focuses on surveying data mining techniques for Knowledge Management.
Literature Survey
Koh & Tan [8] mainly discusses data mining and its applications
with major areas like Treatment effectiveness, Management of
Advances in Computational Research
ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, 2015
|| Bioinfo Publications ||
274
Healthcare Knowledge Management using Data Mining Techniques
healthcare, Detection of fraud and abuse, Customer relationship
management [8].
Ranjan [9] presents how data mining discovers and extracts useful
patterns from large data to find observable patterns. This paper
states the ability of Data mining in improving the quality of the decision making process in pharma industry.
Rani, et al [10] discusses mainly examine the potential use of classification based data mining techniques such as Rule Based, Decision tree, Naïve Bayes and Artificial Neural Network to the massive
volume of healthcare data.
Singh & Nagpal [11], presents an experiment based on clustering
data mining technique to discover hidden patterns in the dataset of
liver disorder patients. The system uses the SOM network’s internal
parameters and k-means algorithm for finding out patterns in the
dataset. The research has shown that meaningful results can be
discovered from clustering techniques by letting a domain expert
specify the input constraints to the algorithm.
Santhi, et al [12] proposed a framework where they used the heart
attack prediction data for finding the performance of clustering algorithm. Result shows the performance of classifier algorithm using
prediction accuracy and the visualization of cluster assignments
shows the relation between the error and the attributes.
Apiletti, et al [13] proposed a flexible framework to perform real-time
analysis of physiological data and to evaluate people’s health conditions. Patient or disease-specific models are built by means of data
mining techniques. Models are exploited to perform real time classification of physiological signals and continuously assess a person’s
health conditions. The proposed framework allows both instantaneous evaluation and stream analysis over a sliding time window for
physiological data. But dynamic behavior of the physiological signals is not analyzed also the framework is not suitable for ECG type
of signals.
Knowledge Management
Knowledge will be a major asset if managed properly; it is complex
and fluid concept. It can be either explicit or tacit in nature. Explicit
knowledge can be easily articulated and transferred to others. In
contrast tacit knowledge, which is personal knowledge, residing in
individual’s minds, is very difficult to articulate, codified and communicates [33,34]. The process of knowledge management focuses
on creation, share, dissemination and acquisition; application can
be facilitated by information technology.
 KM technologies can be classified in seven categories:
Data Mining
“Data Mining (DM) is the process of discovering actionable and
meaningful patterns, profiles and trends by sniffing through your
data using pattern recognition technologies such as neural networks, machine learning and genetic algorithms”. DM tool can solve
real complex problems; can search hidden patterns for finding predictive information. Data mining is an essential step in knowledge
discovery in databases (KDD) [4]. The terms of KDD and data mining are different. KDD refers to the overall process of discovering
useful knowledge from data. Data mining refers to discover new
patterns from a wealth of data in databases by focusing on the algorithms to extract useful knowledge [4]. According to Fayyad at al.,
“The knowledge discovery process are structured in various stages
where data selection is the first stage where data is collected from
various sources, then in second stage the selected data is preprocessed, in third stage data is transformed in specific format for further processing, in fourth stage suitable data mining technique is
applied on the data for extracting valuable information and at last
evaluation of data for gaining Knowledge [Fig-1], [15, 16].
Fig. 1- Data Mining and the KDD Process
Source: Fayyad, et al [15]
The CRISP-DM (Cross Industry Standard Process for Data Mining)
is a framework for carrying out Data Mining activities which are
divided into six different phases. The first phase understand the
business activities, second phase collect the data required for business activities and analyzed, third phase pre-processed it, in fourth
phase modeling is done, fifth phase evaluates the model, and in last
sixth phase model deployed. McGregor et al., proposed extended
CRISPDM framework clinical care through integrating the temporal
and multidimensional aspects. This model supports the process
mining in critical care [Fig-2], [7,17].
 KM Framework
 Knowledge-Based System (KBS)
 Data Mining
 Information and Communication Technology
 Artificial Intelligence (AI)/ Expert Systems (ES)
 Database Technology (DT)]
 Modelling
Knowledge Management in Healthcare
Knowledge Management is also known as “the right information to
the right people at the right time in the right form” [14]. In healthcare
there are lot of restrictions for information and knowledge sharing. It
results inefficient use or loss of resources in the organization.
Fig. 2- CRISP-TDM Model
At present all the public and private organizations are producing
enormous amount of data which are very difficult to manage. Hence
there are need of powerful tool for analyzing and producing useful
Advances in Computational Research
ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, 2015
|| Bioinfo Publications ||
275
Mahamune M., Ingle S., Deo P. and Chowhan S.
information from it. This information found valuable for understanding and making decisions for the cause of diseases and cost effective treatments to patients.
Tomar, et al [7], had proposed the concept of Data Mining in the
field of healthcare which offers novel information regarding
healthcare which in turn helpful for making administrative as well as
medical decision such as estimation of medical staff, decision regarding health insurance policy, selection of treatments, disease
prediction etc [7,18-21]. The various Data Mining techniques such
as classification, clustering and association are found useful for
healthcare organization to increase their ability for making decision
regarding patient health.
Data Mining Techniques
Data mining techniques can be classified in six main functions:
Classification: Finding models that analyze and classify a data
item into several predefined classes.
Regression: Mapping a data item to a real-valued prediction variable.
Clustering: Identifying a finite set of categories or clusters to describe the data.
Dependency Modeling (Association Rule Learning): Finding a
model which describe significant dependencies between variables.
Deviation Detection (Anomaly Detection): Discovers the most
significant for a subset of data.
Summarization
Finding a compact description for a subset of data. Data mining has
two primary objectives; prediction and description. Prediction involves using some variables in data set in order to predict unknown
values of other relevant variables (e.g. classification, regression,
and anomaly detection) Description involves finding human understandable patterns and trends in the data (e.g. clustering, association rule learning, and summarization) [5]. The extraction of hidden
predictive information from large datasets is a powerful new technology with great potential to help companies focus on the most
important information in their data warehouses. Data mining tools
predict future trends and behaviors, allowing business to make
proactive, knowledge driven decisions. DM techniques can be implemented rapidly on existing software and hardware platforms to
enhance the value of existing information resources, and can be
integrated with new products and systems as they are brought online. The most commonly used DM techniques in data mining are:
Artificial Neural Networks: A nonlinear predictive model that
learns through training and resembles biological neural networks in
structure.
Decision Trees: Tree-shaped structures that represent sets of
decision. These decisions generate rules for the classification of a
dataset.
Generic Algorithms: They are optimization techniques that use
process such as genetics combination, mutation, and natural selection in a design based on concepts of evolution.
Rule Induction: The extraction of useful if-then rules from data
based on statistical significance.
Regression Methods: This tries to identify the best linear pattern in
order to predict the value of one characteristic we are studying in
relation to another.
Data Mining Process Model in Healthcare
Knowledge discovery is a continuous process, El-Sappagh, et al
[22] proposed that as other processes KDD also has its own environment and phases that runs under certain constraints. In the
healthcare environment, the clinical database can be queried directly using SQL [22]. A Data Warehouse (DW) can be created to integrate data from many sources and enhance data quality. Analysts
can apply OLAP tools on the DW. Data warehouse is not enough
for data analysis. Data mining is required to discover hidden patterns in DW. KDD is an iterative or cyclic process that involves a
number of stages. Although the specific techniques may vary from
project to project, the basic process is the same for all KDD problems [22].
Fig. 3- Decision Making Environment;
Source: El-Sappagh, et al [22]
KDD process helps in analysis of some aspects that might be forgotten or neglected. The following figure shows basic phases of
KDD process [23].
Fig. 4- KDD Phases
Selection Phase targets database, Noise, inconsistency, incompleteness solved by preprocessing. Then the data transformed to
DM tasks, where DM techniques are used to generate a set of patterns. From these patterns only interesting and useful patterns are
evaluated/ interpret. These patterns represent the discovered
knowledge [22].
Applications of Data Mining for Knowledge Management in
Healthcare Data mining provides several benefits to healthcare
organizations. It can be applied for detection and diagnosis with
high performance of many diseases like cancer [24], diabetes [25],
[26]. Kidney [27], heart diseases [22], etc.
Following are the several applications of Data Mining in healthcare:
Treatment Effectiveness
Data mining applications found can develop to evaluate the effectiveness of medical treatments. Data mining can deliver an analysis
Advances in Computational Research
ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, 2015
|| Bioinfo Publications ||
276
Healthcare Knowledge Management using Data Mining Techniques
of which course of action proves effective by comparing and contrasting causes, symptoms, and courses of treatments.
she is a woman. This type of domain knowledge needs to be collected and added to the data mining system [22].
Effective Management of Hospital Resource
It is possible by using data mining techniques to find out complications of the patient disease priorities and they get better treatment
in timely and accurate manner. Healthcare organization offers patients to online access their medical information, online fill the prescription form and allow safe exchanging of treatment details with
the doctors.
Conclusion
Knowledge Management as a concept is very attractive, many organizations are associated with. As we continue our fight for improving quality service using every possible way, Knowledge Management is one of them. As more and more data collected by the
healthcare organization and required more powerful tool to interpret
the data into valuable knowledge through which the healthcare
organizations will earn extra benefit, such as it acts as key driver of
organizations efficiency, maximizing organization’s potential, competitive advantage, managing intellectual capital. Since we find that
the applications of data mining brings a lot of advantages like optimization of resources, better treatment, policy planning, etc. in this
paper we also have discussed the data mining tasks and techniques can be integrated into KM and enhance the KM process with
better knowledge. It is clear that the data mining techniques will
have a major impact on the practice of KM, and will present significance challenges for future knowledge and information research.
Knowledge Management is not a short term quick fix. It is a long
term commitment to changing the culture of healthcare industry to
become more collaborative, more transparent, and more proactive.
Hospital Ranking
Different data mining techniques are used to analyze the various
hospital details in order to determine their ranks [28]. Ranking of
hospitals determines by analyzing how the hospitals handles the
high and low risk priorities patients.
Smarter Treatment Techniques
Doctors and patients can easily compare various treatment techniques. They can analyze the effectiveness of available treatments
and find out which one is better and cost effective. The side effects,
hazard of particular treatment, and developing smart methodologies
are also possible by the data mining techniques.
Improved Patient Care
In order to analyze the hospitals massive data, a predictive model is
constructed using data mining that discover valuable information
from it and make decision to improve healthcare quality [7]. Data
mining helps the healthcare providers to identify the present and
future requirements of patients and their preferences to enhance
their satisfaction levels.
Health Policy Planning
Data mining play an important role for making effective policy of
healthcare in order to improve the health quality as well as reducing
the cost for health services.
Data Mining Challenges in Healthcare
Knowledge Management using data mining faces many challenges
such as [29,30]. Need for very high accurate algorithms.
For Active Data Mining
An automated data mining technique is needed on each data updating. And for embedding the discovered knowledge to healthcare
information system.
Every time various data mining techniques are used and compare
for most appropriate.
For the modeling and model evaluation process previously gained
knowledge must be taken in to account for evaluation phases [31].
Managing complex data, like mining knowledge from graphs or
charts and non relational data such as text and images requires
multi-relational data mining.
KDD systems today work with relational data bases. But now it
needs to work with Object-oriented and multimedia database.
In distributed data mining, mining across multiple heterogeneous
data sources must be possible.
Collecting the default knowledge is a big challenge for example if a
person is pregnant, then the system must by default understand the
Conflicts of Interest: None declared.
References
[1] Berson A., Smith S.J. & Thearling K. (1999) Building Data Mining Applications for CRM, New York: McGraw- Hill.
[2] Dawei J. (2011) IEEE Computer Society International Conference on Management of e-Commerce and e-Government, 7-9.
[3] Koh H.C. & Tan G. (2005) Journal of Healthcare Information
Management, 19(2).
[4] Fayyad U., Piatetsky-Shapiro G. & Smyth P. (1996) AI magazine, 17(3), 37-54.
[5] Gorunescu F. (2011) Data Mining: Concepts, models and techniques, Springer, 12.
[6] Hwang H.G., Chang I.C., Chen F.J. & Wu S.Y. (2008) Expert
Systems with Applications, 34(1), 725-733.
[7] Tomar D. & Agrawal S. (2013) International Journal of BoiScience and Bio-Technology, 5(5), 241-166.
[8] Koh H.C. & Tan G. (2011) Journal of Healthcare Information
Management, 19(2), 64-72.
[9] Ranjan J. (2009) Journal of Theoretical and Applied Information
Technology, 9(1), 60-70.
[10] Rani B.K., Srinivas K., Reddy B.R. & Govardhan A. (2011)
International Journal of Engineering and Technology, 3(2), 100105.
[11] Singh V. & Nagpal S. (2010) International Journal of Computing
and Business Research, 1(1).
[12] Santhi P. & Bhaskaran V.M. (2010) International Journal for
Advances in Computer Science, 2(1), 26-31.
[13] Apiletti D., Baralis E., Bruno G. & Cerquitelli T. (2009) IEEE
Transactions on Information Technology in Biomedicine, 13(3),
313-321.
[14] Rector A. (2001) From an information poor to an information
rich health environment, A research study in health informatics
Advances in Computational Research
ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, 2015
|| Bioinfo Publications ||
277
Mahamune M., Ingle S., Deo P. and Chowhan S.
for the UK institute for health informatics.
[15] Fayyad U., Piatetsky-Shapiro G. & Smyth P. (1996) Communications of the ACM, 39(11), 27-34.
[16] Fayyad U., Piatetsky-Shapiro G. & Smyth P. (1996) AI magazine, 17(3), 24-26.
[17] Rodrigues P., Pechenizkiy M., Gaber M. & Gama J. (2011)
Learning from medical data streams: an introduction, Learning
from Medical Data Streams: LEMEDS'11.
[18] Silver M., Sakata T., Su H.C., Herman C., Dolins S.B. & O Shea
M.J. (2001) Journal of Healthcare Information Management, 15
(2), 155-164.
[19] Harper P.R. (2005) Health Policy, 71(3), 315-331.
[20] Stel V.S., Pluijm S.M., Deeg D.J., Smit J.H., Bouter L.M. & Lips
P. (2003) Journal of the American Geriatrics Society, 51(10),
1356-1364.
[21] Bellazzi R. & Zupan B. (2008) International Journal of Medical
Informatics, 77(2), 81-97.
[22] El-Sappagh S.H., El-Masri S., Raid A.M. & Elmogy M. (2013)
International Journal of Engineering Research and Applications,
3(3), 900-906.
[23] Fayyad U.M., Piatetsky-Shapiro G. & Smyth P. (1996) Second
International Conference on Knowledge Discovery and Data
Mining, 96, 82-88.
[24] Li L., Tang H., Wu Z., Gong J., Gruidl M., Zou J., Tockman M. &
Clark R.A. (2004) Artificial Intelligence in Medicine, 32(2), 7183.
[25] Breault J.L., Goodall C.R. & Fos P.J. (2002) Artificial Intelligence in Medicine, 26(1), 37-54.
[26] Lee N., Laine A.F., Hu J., Wang F., Sun J. & Ebadollahi S.
(2011) First IEEE International Conference on Healthcare Informatics, Imaging and Systems Biology, 250-257.
[27] Kusiak A., Dixon B. & Shah S. (2005) Computers in Biology and
Medicine, 35(4), 311-327.
[28] Obenshain M.K. (2004) Infection Control and Hospital Epidemiology, 25(8), 690-695.
[29] Yang Q. & Wu X. (2006) International Journal of Information
Technology & Decision Making, 5(4), 597-604.
[30] Canlas R.D.Jr. (2009) Data mining in healthcare: Current applications and issues, Masters thesis, Science in Information
Technology, Carnegie Mellon University, Australia.
[31] Alonso F., Martínez L., Pérez A. & Valente J.P. (2012) Expert
Systems with Applications, 39(8), 7524-7535.
[32] Acharyulu G.V.R.K. (2011) International Journal of Innovation,
Management and Technology, 2(6), 534-537.
[33] Mahamune M. & Ingle S. (2014) Application of data mining
techniques in Knowledge Management system, NCI2TM, 184187.
[34] Dalkir K. (2013) Knowledge management in theory and practice, Routledge.
Advances in Computational Research
ISSN: 0975-3273 & E-ISSN: 0975-9085, Volume 7, Issue 1, 2015
|| Bioinfo Publications ||
278