You are on page 1of 11

This is your database

John !
Single point of truth
Inmon methodology (SB will be
Kimball)
70 users monthly
30 active users
8 team members
2TB (2011.) 10TB (2015.)
Improving betting odds models
Support for all business functions
Basis for advanced method
machine learning, segmentation
models...
Production Analytic zone Reporting

Bookie module (bookmakers)

Betinvent
Table software
partitioning

Archive 2014-2015 Archive 2016 - 2017

np

Betshops and online np np


Offer Server

3rd party
Other data
systems:
Shell
Bit8 Manual export scripts
Sdmc
Nsoft .csv files
Zendesk
...
np
Initial architecture
proposal:

- hadoop for low level


aggregations
- vertica as columnar
data store
- power bi with d3js as
reports
- R and python for
machine learning and
ad hoc analysis
But after meeting in
Zagreb we decide to
try party like it is 2017.
Yes @Amazon
Few minutes after...
we were up and
running with Redshift
single node cluster
with

- automatized backup
- performance graphs
- queries logs with
response times

Database comes free


for two months
enough for testing
against Vertica
Few more things we found on Amazon

AWS Glue
AWS Glue is a fully managed extract, transform, and load (ETL) service that
makes it easy for customers to prepare and load their data for analytics. You can
create and run an ETL job with a few clicks in the AWS Management Console.
You simply point AWS Glue to your data stored on AWS, and AWS Glue
discovers your data and stores the associated metadata (e.g. table definition and
schema) in the AWS Glue Data Catalog. Once cataloged, your data is
immediately searchable, queryable, and available for ETL. AWS Glue generates
the code to execute your data transformations and data loading processes.

Plus: Kinesis, EMR, Athena...


Remember...

There are many ways to look at your


important measures