Data Technology Plan
Assignment ‐ 03
SUBMITTED TO : DR. MINA LABIB
Student Name
STUDENT #
Based on the Data Collection Plan, the technology to be used at each stage of the process is
finalized according to key criteria. The technical and business criteria are explained, and a
solution to the identified challenges is presented.
STORAGE:
Key Technical Criteria:
Healthcare Data: Data is collected from online forms filled in by healthcare employees and offline forms filled in by patients.
‐>This data has to be merged and stored.
‐>The inflow of data is large, so the storage system must be able to handle it.
‐>The data arrives in different forms: data entered directly in online forms, and offline handwritten forms that are sometimes unstructured or semi-structured.
Sales Data: The organisation's sales data is entered into the company's sales record by employees in different regions using shared spreadsheets.
‐>Overwriting of data should not be allowed, and the edit history has to be maintained.
‐>Large-scale insertion and update of data must be supported.
Key Business Criteria:
‐>The storage system should be low cost.
‐>Data arrives in different forms; for example, data entered in online forms is stored directly in the storage system.
‐>The technical team that manages this has cloud storage and data warehouse expertise and can be trained in Hadoop skills such as MapReduce and HDFS.
‐>The technical team has expertise in converting forms to JSON documents and updating them in a database.
‐>The technical team also has JavaScript skills and relational database (RDBMS) expertise, and can easily adopt a NoSQL database interface.
Technology Solution:
‐>To meet the business constraints and storage requirements, data can be stored in Hadoop (HDFS), which distributes large datasets across a cluster so that they can be stored and processed quickly.
‐>HDFS can store unstructured data as well as structured and semi-structured data.
‐>Hadoop is open source, which helps meet the low-cost requirement.
‐>The forms can be converted into JSON documents and stored in CouchDB, which supports storage of semi-structured and unstructured data.
‐>CouchDB also supports insertion and update at large scale.
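The form-to-JSON conversion and the "no overwrite, keep edit history" criteria listed above can be sketched as follows. This is a minimal illustration under stated assumptions, not the assignment's actual implementation: the function `form_to_document`, the class `RevisionedStore`, and the field names are hypothetical, and the store is an in-memory stand-in that only mimics the idea of revision-checked updates. It does not call CouchDB.

```python
import json
import uuid


def form_to_document(form_fields, source):
    """Turn one submitted form into a JSON-serializable document (hypothetical helper)."""
    doc = {
        "_id": uuid.uuid4().hex,   # CouchDB documents are keyed by "_id"
        "type": "healthcare_form",  # hypothetical label
        "source": source,           # e.g. "online" or "offline" (assumed values)
    }
    # Keep only the fields that were filled in, so semi-structured forms
    # with missing answers still produce a valid document.
    doc["fields"] = {k: v for k, v in form_fields.items() if v not in (None, "")}
    return doc


class RevisionedStore:
    """In-memory stand-in: rejects stale updates and keeps earlier versions."""

    def __init__(self):
        self._current = {}  # doc id -> (revision number, document)
        self._history = {}  # doc id -> list of earlier (revision, document) pairs

    def save(self, doc, expected_rev=None):
        doc_id = doc["_id"]
        if doc_id in self._current:
            current_rev, current_doc = self._current[doc_id]
            # An update must name the revision it was based on; otherwise it
            # could silently overwrite someone else's change.
            if expected_rev != current_rev:
                raise ValueError("stale update rejected")
            self._history.setdefault(doc_id, []).append((current_rev, current_doc))
            new_rev = current_rev + 1
        else:
            new_rev = 1
        self._current[doc_id] = (new_rev, dict(doc))
        return new_rev

    def history(self, doc_id):
        """Return the earlier versions of a document, oldest first."""
        return list(self._history.get(doc_id, []))


if __name__ == "__main__":
    doc = form_to_document({"name": "A", "cholesterol": "", "glucose": 5.4}, "online")
    print(json.dumps(doc["fields"]))
```

CouchDB applies a similar check through each document's `_rev` value and rejects an update that carries a stale revision. Older revisions may be removed by compaction, so a durable edit history would likely need to be stored explicitly, as the `history` list does here.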
PRE‐PROCESSING :
Key Technical Criteria:
‐>Different attributes of customer data, such as diabetes, cholesterol and fat levels, should be displayed. This gives a better understanding of the different types of customers, which in turn supports sales of different health supplements (such as sugar‐free products).
‐>Sales data from previous years has to be visualized in order to analyse it and set targets.
‐>To improve sales, this could be further broken down into sales of different products such as protein supplements, weight loss products, and diabetic control mix.
Key Business Criteria:
‐>Feedback data may not always be honest, which can skew the visualization and make it misleading. Validating the reliability of the data is important and can be done on a sample of the data.
‐>Stakeholders could use the visualization to stabilize production and improve sales.
Technology Solution:
‐>Bar charts could be used to compare different customer attributes and show which types of customer are most numerous.
‐>Sales data from past years could be displayed in a line chart with year on one axis and revenue on the other, which will help in understanding the sales trend.
‐>Sales of different products so far can be displayed in a pie chart with a different color per product, to show the percentage share of high-selling and low-selling products.
‐>All of these visualizations can be created in Tableau. It does not require coding skills, so sales and marketing staff can use it easily.
‐>Data from spreadsheets as well as from Hadoop can be connected to Tableau, so it can visualize anything from a small dataset to the full data.
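The line chart and pie chart described above each rest on a simple aggregation: total revenue per year, and each product's percentage share of total revenue. The sketch below shows those two calculations on made-up rows. The function names, field names and figures are illustrative assumptions, not data from the assignment. In Tableau the same aggregation would be done through its interface without code.

```python
from collections import defaultdict


def revenue_by_year(sales):
    """Total revenue per year, sorted by year (the points of the line chart)."""
    totals = defaultdict(float)
    for row in sales:
        totals[row["year"]] += row["revenue"]
    return dict(sorted(totals.items()))


def product_share(sales):
    """Each product's percentage of total revenue (the slices of the pie chart)."""
    totals = defaultdict(float)
    for row in sales:
        totals[row["product"]] += row["revenue"]
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}  # no revenue recorded, so no meaningful shares
    return {p: round(100 * t / grand_total, 1) for p, t in totals.items()}


# Made-up rows for illustration only; not real sales figures.
sample_sales = [
    {"year": 2019, "product": "protein supplement", "revenue": 100.0},
    {"year": 2019, "product": "weight loss product", "revenue": 50.0},
    {"year": 2020, "product": "protein supplement", "revenue": 150.0},
    {"year": 2020, "product": "diabetic control mix", "revenue": 100.0},
]

if __name__ == "__main__":
    print(revenue_by_year(sample_sales))
    print(product_share(sample_sales))
```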
Conclusion:
Based on the ideas presented above, each stage of data processing uses the technology
specified for it (Hadoop for storage, Data Cleaner for preprocessing, Tableau for visualization) to
make the process effective.