You are on page 1of 4

> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 1

Review of Data Gathering Techniques on Blood


Donors Data Mining
Nor Syuhada Che Khalid
Faculty of Information Technology & Communications
University Teknikal M alaysia M elaka
Durian Tunggal, M alaysia
norsyuhadachekhalid@gmail.com


Abstract—This paper focuses on data gathering techniques and II. LIT ERAT URE REVIEW
recent application on blood donors data mining. It expresses an
overview of data gathering for past few years and its intentions. Some data gathering techniques have influenced by data
Data gathering is one of initial research procedures which is characteristics, given by blood donors cases. Then, data
helpful to understand the blood donors data mining study. gathering techniques would affect data mining techniques that
Therefore, data gathering techniques could provide more useful have applied against data collected. Therefore, data gathering
information on the way of data mining researchers providing one knowledge and comprehension on blood donors data mining
of the research instrument, which is data analysis based on blood are helpful to researchers.
donors dataset. This paper has recommendation on how to
execute data gathering techniques based on authors patterns for Most past study on data gathering and blood donors data
future research. mining have included in this section. Furthermore, goal,
importance and limitation of both fields could be useful as
Index Terms—Data gathering, data collection, blood donors, main highlights.
data mining
A. Data Gathering
Data gathering or known as data collection is a procedure
I. INT RODUCT ION that involved collecting facts or information from specific
sources and using some systematic methods, mainly for
D ATA mining implementation on blood donors dataset has
involved many types of datasets that have various
characteristics such as data scale, data representation, size of
research [1]. It is extremely significant to flow of research
because it could affect the validity research problem, design
data, relationship and data value. These characteristics have instruments and results [2]. Data is a package of values, either
led by numerous research backgrounds and contributed to quantitative or qualitative variables. Qualitative values are
immeasurable by number, asking ways (how) and reasons
many styles of collecting and analyzing data. Data mining
(why) and produced from human perspectives such as "good
study towards blood donors problem has just started actively
or bad" and "agree or disagree", survey and interview
in past six years and involved different countries around the
world. Its' main purpose is to extract concealed information questionnaires. Quantitative values are measurable by number,
from blood donors dataset to be used by blood bank, questioning quantities (how many) and produced from
healthcare expert and statistician to attract more blood donors experimental instruments such as timer and electronic devices,
and increase blood stocks. Some useful information of blood also survey and interview results or population. Data is
donors data are their motivation of blood donatio n, their collected to save as record, commercial purpose through profit
altruism, behavior, knowledge, attribution and risk. Different gain, present and guide in decision making process
useful information should affect the technique of data Data gathering significance could complete if the researchers
can recognize data sources, instruments and analysis. Data
gathering used.
source is origin of information retrieval. Research process
The paper is organized as follows: Section II is introduced
usually collected data from primary source or secondary
literature review on data gathering and blood donors data
mining. Section III has present about analysis methodology. source. Primary source is referring to collection of data
Section IV is arranged with analysis results . Section V directly by researcher and gathered through many ways such
discusses about results. Section VI has stated about as questionnaire, interview, observation and experiment.
conclusions and recommendations. Secondary source is data retrieved, collect and compiled or
written by another researcher through acknowledged reference
such as books, journal and newspaper. Methods used to collect
primary source data such as interviews, survey,
questionnaires, observation and experimentation. Then, data
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 2

should be organized properly using suitable instruments. Data needed data mining as useful tool to recognize blood donors
is usually presented by chart, table or graph to give better patterns and rules through various factors. Region, habit,
understanding on data. Data analysis has many method and purpose and problem could influence type of data produced
instruments such as statistical analysis that involved measures and data gathering techniques too.
of central tendency, frequency and data testing validation
through T-test, calculated manually or using software such as
MATLAB, WEKA and SPSS and summary sheet or checklist III. A NALYSIS M ET HODOLOGY
[3]. An analysis has been conducted through some researched
Blood donors data should produce information through data articles for past 6 years. Details observed are based on data
mining analysis. Various data gathering techniques have been gathering techniques which are year published, author, data
done through many studies of blood donors data mining. Most source, method of collecting primary data, data organization
observation on past studies has concluded about researchers’ and data analysis. Details are arranged as shown in table I.
preference. They have used primary and secondary sources. T ABLE I
Method used to collected data for all datasets originally using DETAILS OF DATA GATHERING T ECHNIQUES
survey, questionnaires and observation. Data analyzed has
No Properties Details
organized using checklist or summary sheet, calculated
manually or automatically using software and statistical 1 Year (Y) Range from 2009 until 2014
2 Author (A) Name of article author
analysis. 3 Data source (S) 1: Primary source
2: Secondary source
B. Blood Donors Data Mining
4 Method of collecting 1: Interviews
Data mining is an analysis of observational dataset to search primary data (M) 2: Survey and questionnaires
concealed relationships such as models or patterns and 3: Observation
4: Experimentation
simplified the data to become comprehensive and useful. The 5 Data organization (O) 1: Chart
dataset analyzed often has large size. Types of dataset 2: Graph
involved such as data matrix including real number, category, 3: T able
6 Data analysis (DA) 1: Summary sheet / checklist
missing, noise, text, sequence, structure, pictures and 2: Automatically using tool
transactions. Predictive models of data mining are supervised 3:Statistical analysis
learning like classification, regression, prediction and time
series analysis and unsupervised learning such clustering,
association rule, sequence discovery and summarization. Data
mining is a part of knowledge discovery in data process Based on details, statistical analysis would conduct based
(KDD). KDD function is to recognize valid, novel, on measures central tendencies to extract mean, mode and
promisingly useful and absolutely comprehensive patterns or T ABLE II
rules in data. Stages of KDD process are selecting target data, P AST STUDIES OF DATA GATHERING TECHNIQUES ON BLOOD DONORS DATA
MINING
preprocessing, transforming data, data mining and interpreting
assesses structures. However, KDD processes actually Y A S M O DA
involved more on preparing data instead of mining them [4]. 2009 Mohamed M Mostafa [5] 1 2 2,3 1,3
Therefore, data gathering stage is vital to initiate the process 2010 T Santhanam and Shyam 2 2 2,3 1,3
of KDD, data mining and as proof either the information Sundaram [6]
produced are useful or not. 2011 Fatemeh Zabihi, et al [7] 2 3 2,3 1,3
2011 P. Ramachandran, et al [8] 2 3 2,3 1,3
Blood donors is involved through blood bank or healthcare 2011 T Santhanam and Shyam 2 3 2,3 1,2,3
centre especially hospital activities regarding to increase blood Sundaram [11]
supplies for many critical medical conditions, such as massive 2011 Shyam Sundaram and T 2 3 2,3 1,2,3
Santhanam [12]
blood loss from trauma injuries, blood replacement for 2011 Vikram Singh and Sagna 2 3 2,3 1,3
treatment, surgery and organ transplant. Multiple activities Nagpal [13]
have been established to promote blood donation. However, 2012 Wen-Chen Lee and Bor-Wen 2 3 2,3 1
Cheng[14]
blood donation rate is consistently low compared to 2012 Ankit Bhardwaj, et al [15] 1 3 3 1,2
community population in some countries Low donation is not 2012 Arvind Sharma and P.C. Gupta 2 3 2,3 1,2,3
suitable for current and future demands. This condition should [16]
2012 T om Franks [17] 2 3 2,3 1,2,3
lead to higher tendency of future blood shortage due to 2013 Wijai Boonyanusith and 1 2 2,3 1,2,3
increment in human population, introduction of new medical Pyongchai Jittamai [18]
and surgery procedures and further individual deferrals as 2014 Nor Syuhada Che Khalid, et al 2 3 3 1
[19]
contamination risk to blood supply [5].
2014 Ritika and Aman Paul [20] 2 3 3 1,2,3
Therefore, the study of blood donation factors and blood 2014 S. Asha Rani and S. Hari 2 3 2,3 1,2,3
donors attributes are important to device successful strategies Ganesh [21]
to encourage blood donation habit for current and future blood
supply despite of all challenges . Actually, blood donation has Abbreviation of Y, A, S, M O, DA, and their values can refer to tableI
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 3

median. From there, percentage of samples and graph may V. DISCUSSIONS


summarize all data gathering techniques. Data collected is Data of details are tabulate as recorded in table II. From
shown in table II. table III, figure 1 and figure 2, authors tend to use secondary
source rather than primary source. It is because those data has
IV. A NALYSIS RESULT S been collected and verified on expert on blood donors and
Data of details are tabulate as recorded in table III. From donation in healthcare field rather, especially when authors are
this table, 16 articles have been extracted. After that, coming from computer science, information technology or
measures of central tendencies have been applied. technical background. Besides that, data that already prepared
T ABLE III usually less anomaly and error that could reduce error of data
MEASURES OF CENTRAL TENDENCIES OF DATA GATHERING TECHNIQUES ON mining that will apply later and produce real measurement of
BLOOD DONORS DATA MINING data mining.
Variables Mean Mode Median Data gathering medium usually come from observation,
which is medical record collected or kept by blood transfusion
S1 0.2667 0 0
S2 0.7333 1 1 centre or blood bank for future reference. Second choice of
M1 0 0 0 data gathering medium is survey and questionnaires.
M2 0.2 0 0 Generally, authors prefer to them from blood donors
M3 0.8 1 1
M4 0 0 0 personally or through center.
O1 0 0 0 The researchers agreed on using table for their findings.
O2 0.7333 1 1 Table is systematic presentation and easily understood by
O3 1 1 1
DA1 1 1 1 readers. Graph is next choice of data organization. Pattern or
DA2 0.5333 1 1 distribution of data can be shown directly as useful
DA3 0.8 1 1 information to readers. Many varieties of graph can be
retrieved for many kind of information. Chart is unpopular as
its function is limited. Most data mining research have used
many variables or attributes of data to analyze. Therefore, a lot
Comparison of each variable has show through figure 1. of charts are needed to explain many variables involved which
can be explain easily by using table or graph. Selection of
chart are limited too.
Data analysis is main contribution of providing research
output. Authors are clearly accepted checklist and summary
sheet. Summary sheet acted as an effective way to deliver idea
for explanation and description to data analysis. Next, most
authors also used statistical analysis on their results. It is
specific observation on uses numbers as evidence. As
example, measures of central tendency, frequency, range, T-
test and many more have applied. Some authors mentioned
about software or automatic tool application on their analysis.
They can help authors to calculate or analysis data results
easier.
Therefore, blood donors data mining for past 6 years have
gathered data mainly from secondary source, based on
Fig. 1Data gathering techniques in blood donors data mining articles.
observation, using table and summary sheet or checklist.
Comparison of each variable using percentage of data These method could be used by another researchers as
gathering techniques has present using figure 2. reference to collect data later.

VI. CONCLUSION
Data gathering techniques could be utilize for sake of research
to provide better research on blood donors data mining. That
useful information would become important analysis for any
medical experts to search ways to avoid lack of blood
donation in future. This paper can become guide to incoming
data mining experts that interested in blood donation field.
This work should be extended as a way to improve and
enhance data collection in future. Additionally, increment in
different cases of blood donors and blood donation may
change data gathering technique patterns and contribute to
Fig. 2Percentage of data gathering techniques in blood donors data mining
articles
> REPLACE THIS LINE WITH YOUR PAPER IDENTIFICATION NUMBER (DOUBLE-CLICK HERE TO EDIT) < 4

information expansion, especially to handle blood donation [9] “Classifying Blood Donors Using Data Mining Techniques,” vol. 1, no.
1, pp. 10–13, 2011.
issues and risks. [10] A. I. T echnology, “A COMPARISON OF BLOOD DONOR
CLASSIFICAT ION,” vol. 30, no. 2, pp. 2–5, 2011.
A CKNOWLEDGMENT [11] W. Lee, “An Intelligent System for Improving Performance of Blood
Donation,” vol. 18, no. 2, pp. 173–185, 2011.
The author are grateful to University Teknikal Malaysia [12] V. Singh and S. Nagpal, “Interactive Knowledge Discovery in Blood
Melaka for give support. Thanks to Dr Mohd Saari bin T ransfusion Data Set,” vol. 1, no. 8, pp. 541 –547, 2011.
Mohamad Isa to give me encouragement to writing this paper [13] S. Sundaram and T . Santhanam, “Real-Time Blood Donor Management
Using Dashboards Based on Data Mining Models,” Int. J. Comput. Sci.
as class assignment. Issues, vol. 8, no. 5, pp. 159–163, 2011.
[14] A. Bhardwaj, A. Sharma, and V. K. Shrivastava, “ Data Mining
REFERENCES T echniques and T heir Implementation in Blood Bank Sector – A
Review,” vol. 2, no. August, pp. 1303–1309, 2012.
[1] (2014, September) Data Collection Methods. [Online]. [15] W. Boonyanusith and P. Jittamai, “Blood Donor Classification Using
http://people.uwec.edu/piercech/researchmethods/data%20collection%2 Neural Network and Decision T ree T echniques,” vol. I, 2012.
0methods/data%20collection%20methods.htm [16] A. Sharma and P. C. Gupta, “Predicting the Number of Blood Donors
[2] Barbara B. Kawulich. (2005, May) Forum: Qualitative Social Research. through their Age and Blood Group by using Data Mining T ool,” vol.
[Online]. http://www.qualitative- 01, no. 6, pp. 6–10, 2012.
research.net/index.php/fqs/article/view/466/996 1/ [17] W. Boonyanusith and P. Jittamai, “Blood Donor Classification Using
[3] Mohd Saari Mohamad Isa. (2014, November) DAT A GAT HERING. Neural Network and Decision T ree T echniques,” vol. I, 2012.
Slide. [18] T .Franks, “Predicting the Number of Blood Donors through their Age
[4] Jerzy Stefanowski. (2008) Data Mining Analiza i eksploracja danych. and Blood Group by using Data Mining Tool,” vol. 2, no. 2, pp. 8 –14,
Slide 2012.
[5] Mahtab Maghsudlu Leila Kasraian, "Blood donors' attitudes towards [19] N. Syuhada, M. A. Burhanuddin, S. Razali, and S. Sakinah,
incentives: infl uence on motivation to donate," Blood Transfus, vol. 10, “Implementation of Classification Techniques in Blood Donors Sector –
pp. 187-190, 2012. A Review, 2013”
[6] M. M. Mostafa, “ Expert Systems with Applications Profiling blood [20] S. A. Rani and S. H. Ganesh, “A comparative study of classification
donors in Egypt : A neural network analysis,” Expert Syst. Appl., vol. 36, algorithm on blood transfusion,” vol. 3, no. 6, pp. 57 –60, 2014.
no. 3, pp. 5031–5038, 2009. [21] C. Science and S. Engineering, “Prediction of Blood Donors ‟ Popul
[7] T . Nadu, “ Application of CART Algorithm in Blood Donors ation using Data Mining Classification T echnique,” vol. 4, no. 6, pp.
Classification T . Santhanam and Shyam Sundaram PG and Research 634–638, 2014.
Department of Computer Science , DG Vaishnav College ,” vol. 6, no. 5,
pp. 548–552, 2010.
[8] F. Zabihi, M. Ramezan, M. M. Pedram, and A. Memariani, “Rule
Extraction for Blood Donators with Fuzzy Sequential Pattern Mining,”
vol. 1, no. 1, pp. 37–43, 2011.

You might also like