DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

Online Shopping – Log Analysis for Big-Data Systems

Abstract:
Web mining, the process of extracting useful knowledge from web resources, is a
challenging task for organizations today, because every organization generates vast
amounts of data from various sources. Web servers record user activity in log files,
and the challenge for E-commerce companies is to understand customer behavior by
analyzing these web log files in order to improve their business. An E-commerce
website can generate tens of petabytes of data in its web log files. This paper
discusses the importance of log files in the E-commerce world and how their analysis
is used to learn user behavior in an E-commerce system. Analyzing such large web log
files requires parallel processing and a reliable data storage system. The Hadoop
framework provides reliable storage through the Hadoop Distributed File System (HDFS)
and parallel processing of large data sets through the MapReduce programming model.
These mechanisms process log data in parallel and compute results efficiently, which
reduces both the response time and the load on the end system. This work proposes a
predictive prefetching system based on preprocessing of web logs using Hadoop
MapReduce, which aims to provide accurate results with minimal response time for
E-commerce business activities.
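
The map and reduce steps described in the abstract can be illustrated with a small
sketch. It is not the project's implementation: it runs in a single Python process
rather than on Hadoop, it assumes the logs are in Apache Common Log Format (the
abstract does not name a format), and the names map_line, shuffle, reduce_counts and
count_page_hits are hypothetical. It counts successful requests per page, the kind
of per-URL statistic a prefetching step could build on.

```python
import re
from collections import defaultdict

# Assumption: Apache Common Log Format; the source does not specify the log format.
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) (?P<size>\S+)'
)


def map_line(line):
    """Map step: emit (path, 1) for each 2xx request; skip malformed lines."""
    match = LOG_PATTERN.match(line)
    if match and match.group("status").startswith("2"):
        yield match.group("path"), 1


def shuffle(pairs):
    """Group values by key, as the framework does between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(key, values):
    """Reduce step: sum the counts emitted for one path."""
    return key, sum(values)


def count_page_hits(lines):
    """Run map, shuffle and reduce in-process over an iterable of log lines."""
    pairs = (pair for line in lines for pair in map_line(line))
    return dict(reduce_counts(key, values) for key, values in shuffle(pairs).items())
```

On a real cluster the map and reduce functions would run on many nodes over blocks
stored in HDFS; the in-process shuffle here only stands in for that grouping step.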

Big data poses four challenges, often called the four V's:

• The Volume challenge exists because most businesses generate much more data than
their systems were designed to handle.
• The Velocity challenge exists if a company's data analysis or data storage runs
slower than its data generation, for example when customer clicks on the website
or thousands of sales transactions arrive every second. This is a good problem to
have.
• The Variety challenge exists because of the need to process different types of
data to produce the desired insights. This could include, for example, analyzing
data from social networks, databases and customer service call records at the
same time.
• The Value challenge applies to deriving valuable insights from data, and is
arguably the most important of the V's. A company can usually collect all the data,
but the challenge is to ask the right questions to get value from it.

The Benefits of Using Big Data in E-commerce


1. Improve shopper analysis.
2. Improve customer service.
3. Personalize customer experience.
4. Provide more secure online payment processing.
5. Better target advertising.

Submitted by
-V.NAGESHWAR RAO (16R25A05H7)
-G.VARUNDHAR (16R21A05H8)
-A.VARDHAN (16R21A05C3)
-K.VIVEK KUMAR REDDY (16R21A05E6)

Internal Guide Project Coordinator HOD CSE


(B.MADHURVANI)
