You are on page 1of 2

large amounts of data that can become outdated very quickly.

The real danger here is falling behind your competition. If you are not analyzing
the right data, you won’t be drawing the right insights that will provide value. Meanwhile,
your competitors most likely will be running their own data projects. And if they are
getting it right, they’ll take the lead. A healthcare client I recently worked with created a
217-page report for senior management. A lot of the data in the report would have been
useful, but it was drowned out by irrelevant background noise. Working with them, I was
able to show them how to cut the report down to 20 pages, mostly info graphics, which
clearly showed the relevant data while omitting a lot of the noise.
That’s just a simple checklist of the risks that every big data project needs to
account for before one cent is spent on infrastructure or data collecting. Businesses of all
sizes should engage wholeheartedly with big data projects. If they don’t, they run the
serious risk of being left behind. But they also should be aware of the risks and enter into
big data projects with their eyes wide open.

1.6 STRUCTURE OF BIG DATA:

Figure: Big Data structures, models and their linkage at different processing stages.

1.7 CHALLENGES OF CONVENTIONAL SYSTEMS:

In the past, the term ‘Analytics' has been used in the business intelligence world to
provide tools and intelligence to gain insight into the data through fast, consistent,

8
interactive access to a wide variety of possible views of information. Data mining has
been used in enterprises to keep pace with the critical monitoring and analysis of
mountains of data. The main challenge in the traditional approach is how to unearth all
the hidden information through the vast amount of data.
⦁ Traditional Analytics analyzes on the known data terrain that too the data that is
well understood. It cannot work on unstructured data efficiently.
⦁ Traditional Analytics is built on top of the relational data model, relationships
between the subjects of interests have been created inside the system and the
analysis is done based on them. This approach will not adequate for big data
analytics.
⦁ Traditional analytics is batch oriented and we need to wait for nightly ETL
(extract, transform and load) and transformation jobs to complete before the
required insight is obtained.
⦁ Parallelism in a traditional analytics system is achieved through costly hardware
like MPP (Massively Parallel Processing) systems
⦁ Inadequate support of aggregated summaries of data

Apart from these challenges others are categorized as


Data challenges
-Volume, velocity, veracity, variety
-Data discovery and comprehensiveness
-Scalability
Process challenges
-Capturing data
-Aligning data from different sources
-Transforming data into suitable form for data analysis
-Modeling data (mathematically, simulation)
-Understanding output, visualizing results and display issues on mobile devices
Management challenges
-Security
-Privacy
-Governance
-Ethical issues
-Traditional/ RDBMS challenges
-Designed to handle well structured data
-traditional storage vendor solutions are very expensive
-shared block-level storage is too slow
-read data in 8k or 16k block size
-Schema-on-write requires data be validated before it can be written to disk.
-Software licenses are too expensive
-Get data from disk and load into memory requires application

1.8 WEB DATA:

You might also like