You are on page 1of 12

Format for Report Submission

Capstone Project
On
“<Title of the Report>” (Times New Roman 14, Title Case, Bold)

Submitted

In the partial fulfillment of the degree of


<Degree /Program Name>
< Semester >

By

<Student Full name> (14, Bold)


< (Enrollment no.) >

Under the Guidance of: (14, Bold)


< Name of Faculty >
< Faculty Designation >

Submitted To: (14, Bold)


Ganpat University- Centre for Management Studies and Research

< (Month and Year) >


<Second Page>

STUDENT DECLARATION (On plain paper)

This is to certify that we have completed the Project titled “title of the project” under the guidance of
Faculty Name in partial fulfillment of the requirement for the award of < Degree / Program Name >
at Ganpat University- Centre for Management Studies and Research. This is an original piece of
work & we have not submitted it earlier elsewhere.

University
Sr. No Student’s Name Student’s Signature
Enrollment No.
1.

2.

Date:
Place:
:
< Third Page>

CERTIFICATE FROM THE INSTITUTE GUIDE

This is to certify that the project titled “__(Title of your report)_____________”is an academic
work done by “_______(Your Full Name)____________” submitted in the partial fulfillment
of the requirement for the award of the < Degree / Program Name > at Ganpat University-
Centre for Management Studies and Research under my guidance & direction. To the best of my
knowledge and belief the data & information presented by him/her in the project has not been
submitted earlier.

Name of the Faculty Guide:


Designation:
Faculty Guide Signature:
“ AN EMPIRICAL STUDY ON BIG DATA ON
BUSINESS DECISION MAKING”

ABSTRACT
Big Data Analytics results can play a major role in corporate decision-making
allowing companies to achieve competitive advantage and make improved decisions.
This paper describes a systematic literature review (SLR) on the role of the results of
Big Data Analytics in corporate decisions. Initially, 1652 papers were identified from
various sources. Filtering through the 5-step process, 20 relevant studies were selected
for analysis in this SLR. The findings of this study are fourfold in the area of: (a)
usage of the results of Big Data Analytics in corporate decision-making; (b) the types
of business functions where analytics has been fruitfully utilised; (c) the impact of
analytics on decision-making; and (d) the impediments to using Big Data Analytics in
corporate decision-making. Also, on the management front, two important issues
identified are: (i) aligning data-driven decision-making with business strategy and (ii)
collaboration across business functions for effective flow of Big Data and information.
On the technical front, big data present some challenges due to the lack of tools to
process such properties of Big Data as variety, veracity, volume, and velocity. We
observe from this analysis that, thus far, little scientific research has focused on
understanding how to address the analytics results in corporate decision-making. This
paper ends with some recommendations for further research in this area. Big Data
Analytics has been gaining much focus of attention lately as researchers from industry
and academia are trying to effectively extract and employ all possible knowledge from
the overwhelming amount of data generated and received. Traditional data analytic
methods stumble in dealing with the wide variety of data that comes in huge volumes
in a short period of time, demanding a paradigm shift in storage, processing and
analysis of Big Data. Owing to its significance, several agencies including U.S.
government have released huge funds for research in Big Data and allied fields in
recent years. This paper presents a brief overview of research progress in various areas
associated to Big Data Processing and Analytics and conclude with a discussion on
research directions in the same area.

KEYWORDS : big data , data analytics , corporate decision making , data


mining ,big data processing
CHAPTER-1 INTRODUCTION
1. INTRODUCTION
1.1 WHAT IS BIG DATA ?
The definition of big data is data that contains greater variety, arriving
in increasing volumes and with more velocity. This is also known as the three Vs.

Put simply, big data is larger, more complex data sets, especially from new data
sources. These data sets are so voluminous that traditional data processing software
just can’t manage them. But these massive volumes of data can be used to address
business problems you wouldn’t have been able to tackle before.

Following are the types of Big Data:

1. Structured
2. Unstructured
3. Semi-structured

Structured
Any data that can be stored, accessed and processed in the form of fixed format is
termed as a ‘structured’ data. Over the period of time, talent in computer science has
achieved greater success in developing techniques for working with such kind of data
(where the format is well known in advance) and also deriving value out of it.
However, nowadays, we are foreseeing issues when a size of such data grows to a
huge extent, typical sizes are being in the rage of multiple zettabytes.

Unstructured
Any data with unknown form or the structure is classified as unstructured data. In
addition to the size being huge, un-structured data poses multiple challenges in terms
of its processing for deriving value out of it. A typical example of unstructured data is
a heterogeneous data source containing a combination of simple text files, images,
videos etc. Now day organizations have wealth of data available with them but
unfortunately, they don’t know how to derive value out of it since this data is in its
raw form or unstructured format.

Semi-structured
Semi-structured data can contain both the forms of data. We can see semi-structured
data as a structured in form but it is actually not defined with e.g. a table definition in
relational DBMS Example of semi-structured data is a data represented in an XML
file.
1.2 IMPORTANCE OF BIG DATA
Big Data importance doesn’t revolve around the amount of data a company has.
Its importance lies in the fact that how the company utilizes the gathered data.

Every company uses its collected data in its own way. More effectively the company
uses its data, more rapidly it grows.

The companies in the present market need to collect it and analyze it because

1.2.1 COST SAVING

Big Data tools like Apache Hadoop, Spark, etc. bring cost-saving benefits to
businesses when they have to store large amounts of data. These tools help
organizations in identifying more effective ways of doing business.
1.2.2 TIME SAVING

Real-time in-memory analytics helps companies to collect data from various


sources. Tools like Hadoop help them to analyze data immediately thus helping in
making quick decisions based on the learnings.
1.2.3 UNDERSTAND THE MARKET CONDITION

Big Data analysis helps businesses to get a better understanding of market


situations.

For example, analysis of customer purchasing behavior helps companies to identify


the products sold most and thus produces those products accordingly. This helps
companies to get ahead of their competitors.
1.2.4 SOCIAL MEDIA LISTENING

Companies can perform sentiment analysis using Big Data tools. These enable
them to get feedback about their company, that is, who is saying what about the
company.

Companies can use Big data tools to improve their online presence.
1.2.5 THE DRIVER OF INNOVATIONS AND PRODUCT DEVELOPMENT

Big data makes companies capable to innovate and redevelop their products.
Here are some more examples of how big data is used by organizations:

1.In the energy industry, big data helps oil and gas companies identify potential
drilling locations and monitor pipeline operations; likewise, utilities use it to track
electrical grids.

2.Financial services firms use big data systems for risk management and real time
analysis of market data.
3. Manufacturers and transportation companies rely on big data to manage their supply
chains and optimize delivery routes.
4.Other government uses include emergency response, crime prevention and smart
city initiatives.

Breaking down the V's of big data


Volume is the most commonly cited characteristic of big data. A big data environment
doesn't have to contain a large amount of data, but most do because of the nature of
the data being collected and stored in them. Clickstreams, system logs and stream
processing systems are among the sources that typically produce massive volumes of
data on an ongoing basis.

Big data also encompasses a wide variety of data types, including the following:

 structured data, such as transactions and financial records;


 unstructured data, such as text, documents and multimedia files; and
 semistructured data, such as web server logs and streaming data from sensors.

Various data types may need to be stored and managed together in big data systems. In
addition, big data applications often include multiple data sets that may not be
integrated upfront. For example, a big data analytics project may attempt to forecast
sales of a product by correlating data on past sales, returns, online reviews and
customer service calls.
Velocity refers to the speed at which data is generated and must be processed and
analyzed. In many cases, sets of big data are updated on a real- or near-real-time basis,
instead of the daily, weekly or monthly updates made in many traditional data
warehouse Managing data velocity is also important as big data analysis further
expands into machine learning and artificial intelligence where analytical processes
automatically find patterns in data and use them to generate insights.
CHAPTER -2 : LITERATURE REVIEW

You might also like