TECHNOLOGY
Entertain Purchase
• Sure, it sounds impressive that Facebook's data warehouse stores upwards of
300 petabytes of data, but the velocity at which new data is created should be
taken into account. Facebook claims 600 terabytes of incoming data per day.
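To put the two figures in proportion, here is a rough back-of-the-envelope sketch. It assumes decimal units (1 PB = 1,000 TB) and a constant ingest rate; the source states neither, and the function name is illustrative only.

```python
# Rough sketch: how long would it take to accumulate the quoted warehouse
# size at the quoted daily ingest rate?
TB_PER_PB = 1_000  # assumption: decimal petabytes

warehouse_pb = 300       # "upwards of 300 petabytes"
ingest_tb_per_day = 600  # "600 terabytes of incoming data per day"


def days_to_accumulate(size_pb: float, rate_tb_per_day: float) -> float:
    """Days needed to ingest size_pb petabytes at rate_tb_per_day."""
    return size_pb * TB_PER_PB / rate_tb_per_day


days = days_to_accumulate(warehouse_pb, ingest_tb_per_day)
print(f"{days:.0f} days (~{days / 365:.1f} years)")
```

Under these assumptions, the whole warehouse corresponds to roughly 500 days of ingest at the quoted rate, which is why velocity matters as much as total volume.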
ISRAEL
• A big data application was designed by Agro Web Lab to aid
irrigation regulation.
• Through SNA:
Can now accurately predict when one of their clients is
expecting a baby
Insurance companies
• Can actually understand how you drive
Eng. N.F Thusabantu
eBay.com
• Uses two data warehouses of 7.5 PB and 40 PB, as well as a 40 PB
Hadoop cluster, for search, consumer recommendations, and merchandising
Amazon.com
• Handles millions of back-end operations every day, as well as queries from
more than half a million third-party sellers. The core technology that keeps
Amazon running is Linux-based, and as of 2005 it had the world's three
largest Linux databases, with capacities of 7.8 TB, 18.5 TB, and 24.7 TB.
FACEBOOK
• Handles over 50 billion photos from its user base
GOOGLE
• Was handling roughly 100 billion searches per month as of August 2012.
Phase 1: Discovery
Phase 2: Data Preparation
Phase 3: Model Planning
Phase 4: Model Building
Phase 5: Communicate Results
Phase 6: Operationalize
o Assess the structure of the data – this dictates the tools and analytic
techniques for the next phase
o Ensure the analytic techniques enable the team to meet the business
objectives and accept or reject the working hypotheses
o Determine if the situation warrants a single model or a series of
techniques as part of a larger analytic workflow
o Research and understand how other analysts have approached this or a
similar kind of problem
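As an illustration of the first point above (assessing the structure of the data), the following minimal sketch profiles each column of a small CSV sample as numeric, text, or missing. The sample data and the helper names `infer_kind` and `profile` are hypothetical, made up for this example; a real project would more likely use a dedicated data-profiling tool.

```python
import csv
import io
from collections import Counter

# Hypothetical sample data, invented for illustration only.
SAMPLE_CSV = """customer_id,age,region,monthly_spend
1,34,north,120.50
2,,south,80.00
3,45,north,
4,29,east,310.25
"""


def infer_kind(value: str) -> str:
    """Classify a raw cell as 'missing', 'numeric', or 'text'."""
    if value.strip() == "":
        return "missing"
    try:
        float(value)
        return "numeric"
    except ValueError:
        return "text"


def profile(csv_text: str) -> dict[str, Counter]:
    """Count, per column, how many cells are numeric, text, or missing."""
    reader = csv.DictReader(io.StringIO(csv_text))
    summary = {name: Counter() for name in reader.fieldnames}
    for row in reader:
        for name, value in row.items():
            summary[name][infer_kind(value or "")] += 1
    return summary


for column, counts in profile(SAMPLE_CSV).items():
    print(column, dict(counts))
```

A profile like this hints at which techniques are candidates for the next phase: mostly numeric columns suit regression-style methods, while text columns and missing values signal extra preparation work.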
Key findings
o More data will be needed in the future
o Some data were sensitive
o A parallel initiative needs to be created to improve basic BI
activities
o A mechanism is needed to continually reevaluate the model
after deployment
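One possible shape for the reevaluation mechanism in the last finding is a rolling accuracy monitor, sketched below. The class name, window size, and threshold are assumptions chosen for illustration, not something the source prescribes.

```python
from collections import deque


class RollingAccuracyMonitor:
    """Track accuracy over the most recent predictions and flag degradation."""

    def __init__(self, window: int = 100, threshold: float = 0.8):
        # Each entry is True where the prediction matched the actual outcome.
        self.outcomes = deque(maxlen=window)
        self.threshold = threshold

    def record(self, predicted, actual) -> None:
        """Store whether one deployed prediction turned out to be correct."""
        self.outcomes.append(predicted == actual)

    def accuracy(self) -> float:
        """Fraction of correct predictions in the current window."""
        if not self.outcomes:
            return float("nan")
        return sum(self.outcomes) / len(self.outcomes)

    def needs_review(self) -> bool:
        """True once the window is full and accuracy is below the threshold."""
        full = len(self.outcomes) == self.outcomes.maxlen
        return full and self.accuracy() < self.threshold


# Usage with made-up (predicted, actual) pairs.
monitor = RollingAccuracyMonitor(window=4, threshold=0.75)
for predicted, actual in [(1, 1), (0, 0), (1, 0), (0, 1)]:
    monitor.record(predicted, actual)
print(monitor.accuracy(), monitor.needs_review())
```

Waiting until the window is full avoids raising an alarm on the first few predictions, at the cost of reacting more slowly right after deployment.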