Professional Documents
Culture Documents
SAP
An Inttroduc
ction to
t SAP
P HAN
NA by Dr.
D Vis
shal Sikka
S
UNIT 1: Intrroduction and Backgro
ound of SAP
P HANA
00:00:00
00:00:21
s very straigh
htforward. Th
he relational database
d
waas designed in the late
So the basic idea was
80s, earlyy 90s,
00:00:30
when SQL
L was a relattional algebra
a, and SQL started
s
to be
ecome populaar. People wanted
w
to get
away from
m file-based data
d
manage
ement
00:00:38
and mana
agement of specialized sttructures and
d so forth, an
nd objects, a nd get into a more
structured
d relational way
w to manag
ge data.
00:00:48
00:00:57
the hardw
ware that implemented rellational databases was significantly
s
ddifferent than
n the
hardware that is availa
able now.
00:01:04
a
10, 11, 12 years a
ago, when we
w started thinking about re-doing the database,
Or even already
the hardw
ware was alre
eady very diff
fferent then.
00:01:13
00:01:25
00:01:32
00:01:46
00:01:58
00:02:05
M was much
h more expen
nsive and mu
uch smaller than it is now
w.
The DRAM
00:02:13
00:02:25
w largely a
attained beca
ause of impro
ovement in thhe single corre CPU
And so Moore's Law was
itself.
00:02:32
And already by 2003 it was clear tthat this was running into a physical w
wall, into a ha
ard
limitation,
00:02:40
00:02:48
in order to
o continue the performan
nce and manu
ufacturing be
enefits of Mooore's Law. So
S already
by 2003 also,
a
it was cllear that a co
ompletely ne
ew kind of database paraddigm was within our
reach.
00:03:03
And when
n we started thinking abo ut this, it bec
came clear th
hat if SAP haas to build a new
database,, then it has to
t be built arround the new
w reality of hardware:
h
00:03:15
That is mu
ulticore proce
essing; masssively larger and cheaper main memoory; and the advent of
columnar structures.
00:03:25
on why colum
mnar structurres are intere
esting is beca
ause alreadyy by that time
e, about 10
The reaso
or so yearrs ago, OLTP
P and OLAP had become
e very differe
ent kinds of inndustries, in fact.
00:03:36
00:03:45
00:03:55
00:04:04
00:04:12
And colum
mnar technology itself is n
not new. Dattabases like for example our own Syb
base IQ are
more than
n 20 years old,
00:04:19
00:04:30
00:04:41
00:04:48
00:04:56
The reaso
on for it was very straighttforward. Am
mong other things, first of all, a comple
etely new
paradigm in databases is possible
e.
00:05:01
00:05:06
l
at the tim
me line of wh
here we have been, way back here inn the past wa
as 1977,
So if you look
and Rudi Munz did his
s thesis,
00:05:16
00:05:29
00:05:41
And some
e of the work
k that Franz h
had done wa
as already known by that time. I went and I talked
to one of my
m PhD advisors, Gio W
Wiederhold, an
nd he introdu
uced me to S
Sang Cha
00:05:52
00:06:00
s
gene
eration datab
base. And Fra
anz and Steffan and the gguys built the
e EUCLID
Sort of a second
project, which
w
was demonstrated a
and was the winner of the
e very first D
DKOM that SA
AP ever did,
00:06:13
back in 20
003. When Franz
F
showed
d the EUCLID
D running a billion recordds in one sec
cond, that
was prettyy remarkable
e.
00:06:25
So Hasso
o started to te
each these th
hings at HPI around 2004
4, and startedd to launch a real
investigation into how all of these tthings could be combined into a full ddatabase.
00:06:36
And in Au
ugust of 2006
6, about 30 m
metres from here,
h
in his office,
o
he toldd me about th
his idea that
he had, th
hat we could rewrite Fina ncials.
00:06:47
And he wa
anted to rew
write Financia
als for the fou
urth time in his life. And thhis time, bec
cause of this
amazing ability
a
to calc
culate things on the fly, and the drama
atic performaance that we
e get
00:06:58
00:07:10
00:07:19
00:07:29
00:07:37
00:07:46
And so we
e started worrking on thatt. In 2009, Ha
asso presentted his pape r on the in-m
memory
database,, this column
nar in-memorry database,
00:07:59
at SIGMO
OD in New En
ngland. And it was extrem
mely well rec
ceived. That Fall, we starrted the
HANA devvelopment project
00:08:12
for buildin
ng the HANA product. Occtober of 2009. On Decem
mber 1st of 22010, we launched
HANA.
00:08:21
00:08:31
00:08:36
A became gen
nerally availa
able in June of 2011, sho
ortly after Sappphire, back
k then. At
So HANA
Sapphire, I had showe
ed the first 25
5 or so custo
omers that we had workeed with
00:08:48
d all kinds o
of amazing th
hings with HA
ANA. Since thhen it has be
een an
who were starting to do
unbelievable journey.
00:09:00
00:09:09
As we are
e taping this it is Septemb
ber of 2013, so it's about 2 years andd 3 months siince the
launch of HANA. Actually, it's exacctly 2 years and
a 3 months
s! It was Junne 20th of 2011 that
ally available
e.
HANA beccame genera
00:09:21
00:09:29
A billion dollars.
d
Yes, that
t
is 1 with
h 9 zeros
00:09:37
00:09:45
And more
e than 2,000 customers h
have already purchased HANA.
H
We h ave something like 1100
implemen
ntations going
g already.
00:09:54
00:10:08
00:10:19
It is now 10
1 years sinc
ce we started
d working witth Intel. I rem
member whenn the first Wo
oodcrest
chip came
e out,
00:10:29
which had
d two CPU co
ores, and Da
aniel Schneis
ss had called
d me and toldd me that TR
REX was
running so
omething like
e 85% fasterr because it had
h 2 cores.
00:10:38
We just re
ecompiled the software fo
or the Woodc
crest chip. Actually, it waas the same code.
c
Then
it was clea
ar that this multicore
m
ben
nefit that we get
g from HAN
NA
00:10:50
was some
ething that was going to rreally... So, thanks to ourr friends at Inntel. You are the best!
Without yo
ou, HANA would not be p
possible.
00:11:00
So 10 harrdware vendo
ors, IBM, HP
P, Cisco, Fujitsu,
00:11:08
Dell, Leno
ovo, Hitachi, all kinds of ccompanies are making se
ervers for HA
ANA.
00:11:17
And we ha
ave all kinds of storage p
partners and stuff like this
s. Almost 10,,000 differen
nt
consultants from around the indusstry have bee
en trained on HANA: Thaat's quite spectacular.
00:11:28
00:11:39
g, training aro
ound HANA, so it's been quite an incredible journney.
consulting
00:11:45
00:11:54
or is in the
e process of running on H
HANA. And one
o of the most amazing things that our
o team did
recently, that
t
we are really, really p
proud of,
00:12:03
is ISP. Ou
ur internal ER
RP system n ow runs on HANA.
H
And this
t
has basiccally more th
han 60,000
users in our
o company.
00:12:16
Our whole
e company depends
d
on th
he ISP syste
em, and ISP now, for the last roughly 5 weeks
has been running on HANA.
H
00:12:26
mazing testam
ment to this product.
p
And it was origin
nally
So that is something. Quite an am
d to be simply
y an analyticc.
supposed
00:12:35
00:12:44
00:12: 56
UNIT 2A: S
SAP HANA Technology:
T
Parallelism
m
00:00:00
So let's ta
alk some HAN
NA technolog
gy. And yes, every once in a while, w
we have to ta
alk
technolog
gy.
00:00:16
We can't be
b all about PowerPoint and traffic lig
ghts and things of this soort all the time
e.
00:00:24
00:00:32
00:00:47
00:01:00
act that we re
e-thought evverything, and
d worked tog
gether with ccustomers to do this.
plus the fa
00:01:09
00:01:22
That is ba
asically the se
ecret for how
w HANA cam
me about. The
e power thatt HANA deriv
ves is from
the fact th
hat it runs ma
assively para
allel.
00:01:34
00:01:42
00:01:54
2 terabyte
es of DRAM, and you can
n put maybe 5 or even more terabytees of SSD as the
persistencce on the serrver.
00:02:06
00:02:18
e assumption
ns of the passt, where peo
ople used to try
t to keep thhe CPU cons
sumption
Unlike the
low, and so
s forth, here
e, our belief iis the more you
y burn the CPUs the faaster you gett the results,
00:02:29
00:02:42
0 CPUs, roug
ghly 3 gigahe
ertz clock spe
eed on every
y single one oof them, this is 240
This is 80
gigahertz of clock spe
eed available
e to you.
00:02:51
believable am
mount of com
mputing capa
acity. And you have 2 teraabytes of data in DRAM.
It's an unb
So everything on HAN
NA was desig
gned to take maximum ad
dvantage of these two th
hings.
00:03:02
00:03:17
Three and
d a half billion integer sca
ans per seco
ond per core. This meanss that basicallly, the scale
is nearly unlimited
u
00:03:26
00:03:38
m
g run to plan
n, or some sa
ales analytics
s to run, or iff you were to
o run any
or some manufacturin
kind of a thing,
t
00:03:47
Seattle,
00:03:59
00:04:07
o less in one
e second on a hundred co
ores. This is basically whhere the power is derived
or more or
from. We also do in ad
ddition to the
ese scans,
00:04:15
o the native algorithm in the operators, we do about 12.5 to 115 million agg
gregations
because of
per secon
nd per core.
00:04:24
00:04:31
So, the co
ore of HANA is built arou nd these prin
ncipals of parallelism in thhe operators
s. All the
major ope
erators in calculations, in joins, in sca
ans, all use parallelism.
00:04:43
In fact, the
ey use what we call intra
a-operator pa
arallelism. That is, in a coocktail party you
y can
throw thatt word around: Vishal saiid intra-opera
ator parallelis
sm.
00:04:54
What thatt means is that basically tthat not only do we take a little job annd distribute that across
processorrs, we can ev
ven take, witthin the opera
ator itself, we
e can take a part of the jo
ob
00:05:05
00:05:18
00:05:30
00:05:41
00:05:51
That is so
ort of the icon
n. When you look at Hass
so's notes an
nd stuff, that is the icon fo
or
parallelism
m. So that is number one
e.
UNIT 2B: S
SAP HANA Technology:
T
Row and C
Column Storres
00:00:00
The secon
nd big one in
n HANA is th e row and co
olumn stores
s.
00:00:20
The colum
mn stores are
e basically likke that.
00:00:26
The colum
mns, they are
e not all unifo
orm, and I'll get
g into the reason for whhy that is.
00:00:33
But basica
ally, you take
e a relation w
which sort of looks like a table or an E
Excel spreadsheet, and
store everrything about it in column
ns.
00:00:44
00:00:57
00:01:08
And so we
e do latch-fre
ee index travversal. So this was a new
w data structuure that Sang
g and his
team had designed, and so on and
d so forth.
00:01:14
00:01:23
is that you
u can do ana
alytics and re
eads dramatically faster. How much faaster? Well, I mentioned
already th
he number.
00:01:31
I hope you
u guys remembered this:: three and a half billion scans
s
per seecond per corre. And
when you think about enterprise d ata structure
es,
00:01:41
ple, or the ma
anufacturing order, or thee account se
egments, the
one of our sales orderrs, for examp
BSEG,
00:01:51
o of the two
o core tabless; BKPF is th
he other one,, in our Finanncials applica
ation. This iss
which is one
the heade
ers and this is
s the line item
ms.
00:02:02
The accou
unting line ite
ems. And the
e BSEG table
e has something like 3200 fields in it.
00:02:08
00:02:20
So it is a very,
v
what we
w would call wide data sttructure. And
d when you hhave such wiide data
structuress and whenev
ver somebod
dy, a normal human being,
00:02:32
wants to know
k
someth
hing about, le
et's say, acco
ounting line items,
i
our brrains have th
he ability to
handle ma
aybe 10, 20 out of these 320 fields.
00:02:43
So in a co
olumnar data
a structure, th
hat means th
hat you just pick
p out the oones that you
u need, that
you are in
nterested in getting
g
inform
mation on, qu
uickly assemble them intoo the result,
00:02:54
and demo
onstrate that.. In a row sto
ore, in a tradiitional disk-based row stoore, you are going to the
disk, you are grabbing
g things row by row, and then after yo
ou have retrieeved all the rows,
r
00:03:02
a identifyin
ng the colum ns out of the
e rows that yo
ou are lookinng for, and th
hen pulling
then you are
this inform
mation up. So
o it's dramati cally slower..
00:03:08
Here, not only do you get just the ccolumns that you are inte
erested in, buut, in fact, do
o that
cause you ca
an assign diffferent cores to grab the different colu
umns.
massivelyy parallel, bec
00:03:19
In fact, yo
ou can take more
m
than on
ne core for on
ne particular column. Andd this is the fancy
f
cocktail th
hing that I told you about,, the intra-op
perator parallelism.
00:03:29
So that's basically
b
the idea. Now trraditionally, one
o of the myths of our ccolumn stores was that
they were
e slow when it came to tra
ansactions,
00:03:37
and our te
eams worked
d super hard over the yea
ars to make sure
s
that thiss is not the case.
c
And
the way we
w do that is actually a bu
unch of quite
e clever, ama
azing techniqques.
00:03:48
So you ha
ave the basic
c column sto re here. In addition, we have
h
a less ooptimized collumn store
that we ca
all the delta store.
s
00:03:59
00:04:09
00:04:19
00:04:29
And if eve
erything that you are lookking for is in the
t main, the
en of course it is really, re
eally fast.
And one of
o the things that we have
e added rece
ently into this
s design of thhe delta and the main
00:04:40
is we have
e added a co
oncept of wh
hat we call an
n L1 delta, which is a variiation on this
s row store,
00:04:49
00:05:00
that is flyin
ng in the sky
y, and all the engines of every
e
airplane in the sky are sending out events
at the sam
me time to the ground. Yo
ou want to ab
bsorb like a million
m
eventts per second
d,
00:05:08
or you wa
ant to capture
e every trade
e that is going
g on in Wall Street, or yoou want to ca
apture, you
know, eve
ery piece of instrumentatiion that is co
oming from all the tractorss of John De
eere that are
on the fielld,
00:05:19
or MRI ma
achines of Philips, or Sie
emens, or Ge
eneral Electric, or any of tthese kinds of
o things
which are
e sending outt super-fast ttransactions,
00:05:28
those can
n come into th
he L1 delta, and as the trransactions are
a closed doown or when
n the system
gets some
e breathing room,
r
you du
ump them intto the delta or
o into the maain, as things
s will
happen,
00:05:37
ns that come
e in, you do jooins across these
t
three
and then in the meanttime, if there are question
o this is an ex
xtremely novvel and super-high perforrmance archiitecture
things. So
00:05:46
which ena
ables us to ru
un very, veryy fast queries
s and column
nar operationns while at th
he same
time prese
erving the be
enefits of the
e row store,
00:05:56
00:06:06
00:06:14
00:06:21
sign of the ce
entral structu
ures of HANA
A. It's the co lumn store th
hat's our
So this is the main des
main inve
ention, and in
n addition the
e parallel row
w store
00:06:33
00:06:43
00:06:50
ou can do ro
ow analytics and
a transactions in the roow store, you
u can do
this is not the case. Yo
a transacttions in the ccolumn store,, and we hav
ve some attri butes of the row store
analytics and
inside the
e column storre,
00:07:01
00:07:10
h
decade
es of legacy that
t
they hav
ve to protect.. And it is mu
uch more
and the otther people have
difficult for them to do so than for u
us.
UNIT 2C: S
SAP HANA Technology:
T
Projections
s, Dynamic Aggregation, Integrated
d Compress
sion
00:00:00
00:00:23
a
, and lightwe
eight compres
ssion, or... le
et's call it inteegrated comp
pression.
Dynamic aggregation,
00:00:43
And when
n looking for the icons, th is is the projections icon,, it goes som
mething like th
hat.
00:00:56
Like that, and some off these thing s are filled, and
a then you
u do from theere into just those two
g
Like
e that.
that you grabbed.
00:01:04
00:01:16
00:01:26
00:01:38
00:01:48
e 320 fields in
n here in the column storre, like that,
So grabbing those outt of the 320, so you have
his,
and more going like th
00:02:00
00:02:13
p
On
ne of the prin
nciples that we
w teach, thaat we recomm
mend, that
So this is a dynamic projection.
o applicatio
on programm
mers to get, is
s to do this projection
p
as often as they need,
we want our
00:02:26
00:02:33
00:02:42
00:02:52
00:03:01
00:03:12
So if you want
w
to calcu
ulate weekly sums, instea
ad of having a batch proccess that tak
kes the raw
informatio
on and then calculates
c
aw
week's worth
h of totals ou
ut of that,
00:03:22
00:03:31
00:03:44
and interm
mediate value
es, and so fo
orth. You can
n calculate th
hem on the flly. So dynam
mic
aggregatio
ons this is
s hugely imp
portant in ana
alytics, espec
cially in analyytics on raw
10
transactio
onal data,
00:03:57
because you
y can think
k about aggrregations, yo
ou can think about
a
any kinnd of way tha
at you want
to group things together and add tthings togeth
her and enable people to do this comp
pletely in an
ed manner,
unrestricte
00:04:10
do that lim
mited only by
y their imagin
nation. So if you
y want to know
k
not justt the weekly totals but
the weekly totals in Ch
hina, except for the big to
op 5 cities, yo
ou can do thhat on the fly..
00:04:19
00:04:30
t
you are
e limited to th
hose question
ns that some
ebody thoughht about for you.
y
You
Because then
have to be
e limited to as
a fresh as th
hat informatio
on is.
00:04:38
Here, dire
ectly, on-draw
w information
n, you can ag
ggregate thin
ngs any timee you think ab
bout, any
time you feel
f
like, on the
t fly.
00:04:46
00:04:56
00:05:04
00:05:14
00:05:21
an create a dictionary,
d
ho
old the values
s for these 200 countriess, and then ju
ust have an
So you ca
encoding for which co
ountry that is in these billion columns.
00:05:30
et a dramatic
c compressio
on improvement without compromisin
c
g performance. And that
So you ge
is something quite am
mazing about HANA,
00:05:41
00:05:49
00:06:00
mpression in H
HANA gives us an ability
y to do tremeendous savings in space,
So the integrated com
mes to thingss like in analy
ytics,
especiallyy when it com
00:06:09
00:06:18
00:06:27
routinely get
g ten times
s, twenty time
es, even thirtty times com
mpression. Ouur own ERP system, you
u
know, up until the time
e when it use
ed to run on DB2,
00:06:38
d was that it w
here between 11.5 and 112 terabytes of data. And
what I was always told
was somewh
e I looked, it is never morre than 2 tera
abytes, inclu
uding the worrking memorry, so today
every time
it was like
e 1.8 terabyte
es.
00:06:49
And out of
o this, sometthing like 1.1 terabytes is
s the actual database sizee, and the remaining
700 gigab
bytes is the working
w
mem
mory of HANA
A.
11
00:06:59
00:07:08
00:07:19
00:07:29
00:07:39
So that wa
as projection
ns, dynamic a
aggregation,, and the inte
egrated comppression of HANA.
H
12
UNIT 2D: S
SAP HANA Technology:
T
Insert Only
y, Partitionin
ng & Scale-O
Out, Active aand Passive
e Storage
00:00:00
00:00:18
00:00:31
And those
e are things like INSERT ONLY, partitioning and scale-out,
s
00:00:43
00:00:53
00:01:02
And so yo
ou add new columns
c
in th
here. Adding a new entry
y in here meaans taking a part of that
record here and just making
m
the rig
ght insert in the appropria
ate place.
00:01:15
00:01:25
then in co
olumn storage
e, it is very a
advantageous to simply create
c
a new
w entry, and then as a
separate process, inva
alidate the p revious entry
y that you ha
ad.
00:01:36
n fact dep
pending on how you invallidate this, hoow you have
e your
The benefit of this is in
on strategy it is possib
ble to recreate histories.
invalidatio
00:01:45
It is possible to create
e audit trails. It is possible
e to do time travels.
t
How did something change
an extraordin
narily valuab
ble capability .
over a period of time? And that is a
00:01:55
And in HA
ANA, we get this natively.. In fact, the update sequ
uence operattion is implem
mented in
HANA
00:02:04
bination of an
n insert and then an inva
alidation of th
he thing that'ss not valid an
nymore.
as a comb
00:02:15
One of the
e things in HANA that is e
extremely im
mportant: In many
m
databaases, they did
d this after
the fact, with
w HANA we
w did this rig
ght at the beg
ginning, from
m scratch,
00:02:25
00:02:37
h drew it lik
ke that, and t hen this has four pieces, and then thiis one is like
e a zillion
...maybe he
pieces, an
nd stuff like that.
00:02:46
I don't kno
ow how to drraw that.... likke that... So you'll see tha
at. When youu see that ico
on in
Hasso's slides,
s
that's Sanjay
S
at wo
ork, who cam
me up with this idea of paartitioning.
00:02:56
But basica
ally, what is says
s
is, if yo u have, let us
u say, a lot of
o informationn, and you need to
partition th
hat across no
odes,
00:03:04
across ma
achines that are connecte
ed to each other,
o
you can dynamicallly partition th
hem so that
part of the
e data is in one machine,, and partly in
n another.
00:03:13
00:03:25
13
00:03:33
or multiple
e servers. An
nd of course,, you can partition columns, so you h ave one piec
ce of the
database in one mach
hine, and so on, or a com
mbination of both
b
of thesee things.
00:03:42
And then unleash the cores that a re sitting on each one of these mach ines into all of
o them.
wesome perfformance.
Then you get really aw
00:03:50
ale-up and sc
cale-out prettty much lineearly.
Typically, we see perfformance sca
00:03:57
00:04:05
00:04:12
But HANA
A gives us these native ca
hat other data
abases don'tt have, which
h is to be
apabilities th
able to run workloads across mach
hines.
00:04:21
And some
e of the more
e extreme exxamples that we have ach
hieved: We oonce took rettail data,
00:04:29
which is anonymous
a
retail
r
data fro
om one of our largest, sup
per, super laarge Retail cu
ustomers,
00:04:37
one of the
e biggest com
mpanies in th
he world. And
d they were doing
d
sometthing like 330
0 million
transactio
ons per day.
00:04:47
And we to
ook 10 years' worth of da
ata. Wow! So
o that is how much data? That is apprroximately
1,200 billion records.
00:05:00
2 trillion, in ca
ase you are ccounting, tha
at's 1.2 and then 12 zeross after that.
That's 1.2
00:05:09
That's a very,
v
very larg
ge number. IIt's one and a half times the
t populatioon of the worrld. One and
three quarter times the
e population of the world.
00:05:18
00:05:32
00:05:45
00:05:57
00:06:04
00:06:13
00:06:21
00:06:33
and so a year,
y
of courrse, is typicallly a little bit longer than one
o year, beecause you want
w
to keep
the open items from th
he previous yyear that cam
me into this year
y
and werre carried forrward,
00:06:41
as well ass the items from this yearr that you will carry forward into the neext year: So maybe 14
months off information for one yearr. And then iff you want to
o do year oveer year comp
parison,
00:06:49
so then yo
ou'll do also the
t previouss year. So tha
at's it. You do
on't need to hold more th
han that in
14
c
store the
e rest of it, m
meaning put it into Flash, into SSD, annd then you get
g
You can cold
additional compressio
on that way.
00:07:06
an take 42 ye
ears of appliccation know--how and brin
ng that to baare to organiz
ze data in
So you ca
this kind of
o a tiered strrategy, with h
hot and cold data,
00:07:17
so that we
e can get eve
en more com
mpression, th
hen we can get
g even betteer performan
nce. And
HANA ena
ables us to do
d these kind
ds of things natively,
n
insid
de the databaase.
00:07:28
So that wa
as INSERT ONLY,
O
partit ioning, and scale-out,
s
and hot and coold.
15
00:00:00
00:00:16
That's the
e language to
o access HAN
e beauty of HANA
H
is that we did all off this stuff
NA. Now the
completely redesigned
d from scratcch, ground up.
00:00:27
00:00:33
Anyhow, here
h
in the Valley,
V
there is a lot of talk about no SQL
S
and stuffff like this. Th
here are like
25 million people arou
und the world
d who know how
h
to progrram SQL.
00:00:42
y know, to
ons and tons and tons of usage of SQ
QL. The entire
re enterprise world, and
There is, you
even in th
he consumer world, all ou
ur ability to as
sk questions
s and do trannsactions bas
sically
depends on
o SQL.
00:00:53
00:01:05
and the ro
ow store, and
d both of thesse have the SQL front en
nd on top of tthem.
00:01:16
00:01:26
00:01:37
00:01:48
00:01:56
00:02:03
s
map--reduce operrations, which is what typ
pically the N o SQL movement
We also support
people are
e so fascinatted by.
00:02:10
I mean ba
asically, when you think a
about it, a ma
ap is like a sc
can that you distribute ov
ver a large
amount off data when you want to map an operation,
00:02:17
if you wan
nt to, let's say
y, increment everything or
o do an aggregation oveer something, or add
10% to alll your plan, or
o something
g like that.
00:02:29
That is a map
m operatio
on that applie
es widely, an
nd we have a map API. Itt reduces typ
pically the
inverse off that, that yo
ou want to re
educe someth
hing, you wa
ant to do an aaggregation,
00:02:39
00:02:51
00:03:00
e can write co
ode, low-leve
el code direc
ctly in L, and have that bee attached into HANA.
so people
We have, of course HANA itself is written in C+
++, and there
e are lots of C++ libraries
s in HANA,
00:03:10
es for GIS da
ata, for text d
data, for wha
at we call the Business Fuunction Libra
ary, and the
the librarie
16
Predictive
e Analytics Liibrary.
00:03:20
We integrrate R, which
h is a statisticcal package in there, we have IMSL.. ., let's see... all kinds of
function libraries like this,
00:03:30
p with a gene
eric way, wha
at we call the
e AFL, as a w
way to integrrate people,
and we arre coming up
anybody'ss library, insid
de HANA in a safe way.
00:03:40
00:03:46
00:03:53
00:04:04
00:04:13
with LiveC
Cache, we ha
ad lots of exp
perience with
h the system
m crashing beecause of corruption of
memory and
a things lik
ke that,
00:04:22
because the
t program area of the ssystem and the
t data area
a of the systeem used to collide
c
with
each othe
er every once
e in a while, a
and stuff like
e that.
00:04:29
00:04:40
working on
o any databa
ase that you can think off in the world, you can goo and, withou
ut any
training whatsoever,
w
get
g up to spe ed and running on HANA
A. It's fully staandards-com
mpliant.
00:04:50
00:04:58
00:05:05
hink about th
he power of HANA,
H
the ovverall picture
e that we
So the overall picture, when you th
have sort of looks like this.
00:05:15
00:05:26
00:05:35
00:05:44
h
the grap
ph store thatt we are work
king on,
We also have
00:05:52
00:06:02
00:06:13
00:06:21
00:06:31
17
And so on
n. So there are all these e
engines that are inside HANA.
H
00:06:49
00:07:01
00:07:12
e have the ab
bility to fully vvisualize the
e plans that we
w make in S
SQL and interactively be
In fact, we
able to ch
hange them. These are th
he kinds of amazing thing
gs that we haave the ability to do in
HANA.
00:07:21
00:07:31
00:07:41
00:07:49
00:07:59
00:08:09
All kinds of
o things to manage
m
userr sessions, us
ser authentic
cation and auuthorization, memory for
users, and
d stuff like th
hat.
00:08:19
00:08:26
e for MDM, o
or data servic
ces for extrac
cting, transfoorming, and loading
or master data service
he rule engin
ne.
data, or th
00:08:41
00:08:51
So as you
u can see, HA
ANA goes fa
ar beyond a database,
d
an
nd into becom
ming a real platform
p
for
us and forr our future. And
A that is ssomething tha
at is super-powerful.
00:09:05
18
UNIT 3: SA
AP HANA Pe
erformance Benchmarks
B
s
00:00:00
00:00:15
00:00:24
and betwe
een structure
ed processin g of informattion and proc
cessing of unnstructured information,
or betwee
en being able
e to build new
w application
ns on legacy applicationss, and so on,
00:00:35
we have to...
t
one of my
m conclusion
ns is that we have to also
o rethink the notion of performance
itself. We have to rethink the notio
on of benchm
marks, and so
o on.
00:00:45
00:01:01
mazing thing
g that I found
d was that we
e have now 27
2 or 28 custtomers who run
and the am
something
g in HANA att least 10,00 0 times faste
er than they did
d before.
00:01:14
00:01:25
A couple of
o years ago, Fujisawa sa
an told me th
hat he is the head of IT
T and operations at
Yodobash
hi, and he's also
a
the son o
of the ownerr of the firm. He's from thee founding fa
amily
00:01:37
so his gra
andfather founded the com
mpany and his
h father is now
n
the headd of the company. So he
told me th
hat they have
e 22 million to
otal custome
ers in Japan.
00:01:51
So that's 22
2 million. And out of tho
ose, 5 million are loyalty. So this is thee total. And what
w
they
do is, theyy used to do in our ERP ssystem on an Oracle database,
00:02:04
once a mo
onth, they wo
ould calculatte what incen
ntives to pay these guys, based on th
he
purchasess made by th
hem as well a
as the purchases made by
b everybodyy.
00:02:16
00:02:23
And when
n they ran this thing on HA
ANA, this ran instead of three days, iin 2 seconds
s. So that is
a performance improv
vement of ap proximately 125,000 time
es.
00:02:35
l
numbe
er. That is diffficult for the human mindd to comprehend this. So
Which is a very, very large
when you look at 10,0
000 times perrformance im
mprovement,
00:02:44
00:02:56
00:03:03
00:03:18
10,000 tim
mes faster than a snail. T
That's an inte
eresting way to think abouut it. Well any
yhow, so 28
customerss in the 10,00
00 club.
00:03:27
And the re
eason this ha
appens is tha
at if you look
k at the Yodo
obashi exampple, they werre doing this
on data th
hat is operational in naturre.
00:03:35
19
They wan
nt to calculate
e the purchasses made by
y everybody who bought the same things that
this perso
on did. So wh
hen you take a combination of a comp
plex query
00:03:50
and this la
arge amount of data, it is 22 million re
ecords, purch
hases made by 22 million
n people, so
that is pro
obably a few billion record
ds.
00:03:59
So a large
e amount of data,
d
comple
ex questions, and on una
aggregated, oon real opera
ational data
that is cha
anging as we
e speak, this is the kind of
o a combination that enaables HANA to
t achieve
dramatic performance
p
e.
00:04:12
So how do
o we think ab
bout this in a more scientific way? We
ell, my sensee is that in th
he age of
HANA, we
e have to rethink the con cept of perfo
ormance itself,
00:04:23
and rethin
nk the notion of benchma
arks for inform
mation proce
essing system
ms. And I wro
ote a paper
about thiss at the ICDE
E Conference
e in Australia
a earlier this year,
y
00:04:32
00:04:42
So there are
a five dime
ensions of pe
erformance These
T
are Drr. Sikka's fivee dimensions
s of
performan
nce, by Dr. Sikka.
S
00:04:50
00:05:00
00:05:07
00:05:18
finding me
edians, perce
entiles, doing
g clustering analyses,
a
and other kindss of complex
x analytics
on data. The
T more com
mplex our qu
uestions are, the longer itt takes, the sslower the pe
erformance
of the system, and so forth.
00:05:31
00:05:44
A fourth dimension
d
is, is the data p
prepared, or is it raw? An
nd finally, it iss the respons
se time.
00:05:55
00:06:04
that the hu
uman brain starts
s
to lose
e attention; ty
ypically, we start
s
to lose aattention at 8 seconds.
At less tha
an 3 seconds
s, we can ca
arry out a task more or les
ss continuouusly,
00:06:16
00:06:27
This is the
e way our bra
ains, our sen
nses are wire
ed. So, how much
m
is the rresponse? I mean, look
at these fiive dimensions. And I be lieve that HA
ANA,
00:06:34
the more of these five dimensions are in there, the more HANA's perforrmance stan
nds out.
00:06:43
ple way to th
hink about the
w
HANA would reallyy, really demo
onstrate
So, a simp
e value, of where
tremendous value wou
uld be when we take morre and more of these fivee dimensions
s
00:06:53
k
of scen
narios do we
e have which exercise theese five dime
ensions,
and think about what kinds
00:07:00
20
on is our only
y limitation w
when we think about the kinds
k
of thinggs that we ca
an apply this
Imaginatio
to.
00:07:16
00:07:22
And we've
e been startin
ng this recen
nt effort to rethink the con
ncept of perfoormance ben
nchmark
itself; give
en the abilitie
es of HANA:
21
UNIT 4: SA
AP HANA Ro
oadmap and Re-thinking
g Software Developmen
D
nt
00:00:00
So that wa
as performan
nce. And wh en we think about what are
a we goingg to do with th
his, what
are we do
oing with the HANA techn
nology,
00:00:19
the roadm
map of this is very straighttforward. We
e are bringing
g this to everry single product in
SAP. So, bar none. Eiither everyth ing already runs
r
on HAN
NA,
00:00:29
un on HANA
A. And of course that mea
ans we start ffirst with the Business
or it is on the way to ru
d the Business Suite now
w runs on HA
ANA.
Suite. And
00:00:39
00:00:48
H
as I sp
peak for approximately 6 weeks, 5 annd a half wee
eks. And,
has been running on HANA
ur books on H
HANA.
knock on wood, we arre about to cllose the quarter, close ou
00:01:00
han 60,000 e
employees, time
t
recordin
ng, Financialss, Support, Service,
S
HR,
So everything; more th
o things are running on H
HANA now.
all kinds of
00:01:12
CRM, of course
c
our ow
wn ICP syste
em that Rob and Bill run the Sales org
rganization on, is alreadyy
on HANA for the last 6 and a half m
months, sinc
ce March of this year.
00:01:24
00:01:35
We have some
s
amazing things witth Ariba and SuccessFac
ctors alreadyy running in the Cloud,
BusinessB
ByDesign, Sa
alesOnDema
and. Hybris has
h already demonstrate
d
ed some of th
he great
scenarios that run on HANA.
00:01:47
e bringing HA
ANA to everyy single product, every single applicattion that we have.
So we are
Beyond th
hat, all the technology pro
oducts.
00:01:59
So, back in
i Novemberr 2011, we d id quite an amazing
a
thing
g, and it wass one of our big
b honours
that Franzz and Stefan and the team
m delivered BW on HANA
A.
00:02:11
00:02:19
that we co
ould non-disrruptively put HANA underneath BW, but
b in a way that dramatically
accelerate
ed it. So the content coulld remain unchanged. It got
g compiledd into HANA.
00:02:30
We have things
t
in BW
W now that ru n hundreds, even thousa
ands of timess faster. Man
ny of the BW
W
reports run 500+ times
s faster.
00:02:41
And the lo
oading time into BW has been made parallel, so parallel
p
loadinng. That mea
ans things
like the DS
SO activation
ns and the P
PSA activatio
ons. These arre the staginng areas.
00:02:55
00:03:03
And we've
e been building every sin
ngle thing, fro
om the Rules
s engines, PII, messaging
g, and so
forth, the data
d
services, ETL tools,, master data
a services an
nd MDM,
00:03:18
00:03:30
I have a nice
n
picture here
h
that I wa
ant to show you.
y
Here it is. And so yoou can see in
n this
22
ed, BW le
picture. Evverything here I mentione
et me change
e the color off this guy to white,
w
00:03:43
00:03:52
The ABAP
P 7.4 is optim
mized for HA
ANA. Lots of innovation th
hat was donee by our application
platform and
a HANA te
eams togethe
er, for examp
ple shared memory,
00:04:03
lots of thin
ngs in the de
esign time en
nvironment off ABAP that were optimizzed for HANA
A. Also
that's all there.
00:04:09
00:04:20
00:04:31
00:04:41
00:04:51
Because of
o the nearly
y unlimited sccalability of HANA,
H
we do
on't need thiss anymore, and
a therefore
e
the physiccal scalability
y of HANA iss independen
nt of the kind of services tthat are runn
ning inside
it.
00:05:02
00:05:14
00:05:26
00:05:37
00:05:46
00:05:58
he native, wh
hich is the XS
S engine and
d the Integra
ator environm
ment. There is what we
There is th
call the inttegrated env
vironment
00:06:08
00:06:21
can talk to
o HANA. And
d whether it'ss a .NET, Clo
oud Foundry, Heroku, Foorce.com, Py
ython, PHP,
Perl,
00:06:29
nment is thes
se days to wrrite their codee in, could in
ntegrate
whatever people's favorite environ
d run with HA
ANA. This is native integrated open.
with, could
00:06:40
00:06:47
23
00:07:06
So what our
o team did was, along tthe five dime
ensions that I talked abouut of performa
ance,
started to look at whatt are the mosst valuable, most
m
resourc
ce-intensive kkinds of scen
narios that
e running on
n HANA, on ttheir databas
ses today?
people are
00:07:21
00:07:32
that's whe
ere the value is, and startted to optimiz
ze these. And some amaazing things have
showed up there. Bill of
o materials explosion in manufacturing, and MRP
P run.
00:07:40
c
the MR
RP run can rrun thousand
ds of times fa
aster. And thaat is really ex
xtraordinary
In many cases,
when you think about, you know, e
everything arround us, you know,
00:07:48
00:07:58
00:08:08
00:08:19
w, the R in ou
ur history wass always for realtime, and
d with HANA
A we have a fourth
f
You know
iteration of
o realtime no
ow,
00:08:26
00:08:35
And these
e can start to
o be replaced
d by realtime operations, interactive ooperations, in
n HANA.
Well as so
oftware deve
elopers, when
n you think about
a
softwarre developm
ment itself, software
developm
ment is a batc
ch job today.
00:08:48
And when
n you think ab
bout the plattform, one ob
bvious question that surfaaces is: How
w can we
rethink so
oftware devellopment itsellf with HANA
A?
00:08:58
00:09:08
00:09:15
00:09:25
00:09:38
e browser as
s well as in E
Eclipse. And the
t team in Israel, Jake aand Ariel and
d the team
both in the
are workin
ng on River
00:09:47
as a way to
t rethink an
nd simplify th e developme
ent experienc
ce with instaant feedback,,
24
realtime collaboration
c
with Jam, and
d so forth. It's
s just an extrraordinary op
pportunity
integrated w
that we ha
ave to rethink
k software de
evelopment itself
00:10:05
00:10:15
Every sing
gle thing thatt we do is be
eing rethough
ht and refactored on the H
HANA platfo
orm.
25
UNIT 5: SA
AP HANA in Practice and
d Summary
y
00:00:00
00:00:24
00:00:35
ations. And w
with the recen
nt merger of the Custom Developmen
nt
is new cusstom applica
organizatiion with Abdul's team, thiis gives us a tremendous
s opportunityy for growth.
00:00:48
00:00:58
and doing
g logistics and route calcu
ulations, and
d stuff like this, which is vvery, very harrd.
00:01:06
Route calculations, forecasting of utilization of containers; these are thee kinds of problems
e can run thin
ngs tens of th
housands of times faster.
where we
00:01:15
00:01:23
00:01:33
and using
g signals to figure out the health of wh
hat the CFO of eBay callss the eBay economy.
There are
e 300 people, analysts, in
n eBay who work
w
on analyzing signalss,
00:01:45
00:01:56
00:02:08
With HAN
NA we were able
a
to find th
he same sign
nal automatic
cally within 114 minutes. And
A these
are the kin
nds of extrao
ordinary achiievements th
hat we can ge
et to.
00:02:18
00:02:28
00:02:36
on running
g the gene trreatment thin
ng, and align
nment, and va
ariant callingg, and so fortth, dozens
to hundreds of times faster
f
with HA
ANA.
00:02:45
00:02:55
00:03:05
once som
mebody ends up in the ho
ospital, the re
ethinking of th
he hospital m
managementt system, the
e
patient ma
anagement system.
s
00:03:11
So this is an incredibly
y exciting are
ea, to open up
u new frontiers for us in areas that were
w
frankly
ble before.
not possib
26
00:03:20
00:03:29
They spen
nd hundreds
s of millions o
of dollars eve
ery year on doing
d
exploraation of oil, in
n predictive
analysis fo
or drills, for drilling.
d
00:03:41
Every time
e a drill gets stuck, there is tens of millions of dollars of expennses that hav
ve to be
incurred to
o remove the
e drill,
00:03:48
00:04:02
And this iss something that is quite extraordinarry. So if you are thinking about this, I think just
think abou
ut something
g that is desirrable, feasible, and viable
e for custom ers.
00:04:11
Think abo
out something
g that bringss a combinatiion of this larrge volume oof data, comp
plexity of the
e
question, the rate of change of datta. And bring
g the power of
o HANA to thhese kinds of
o problems.
00:04:22
o imaginatio
on is the onlyy limitation th
hat is holding
g us back onn being able to
t build
Frankly, our
amazing, amazing app
plications tha
at change the
e world.
00:04:34
00:04:46
00:04:53
t broader e
ecosystem around
a
us, there is a trem
mendous amo
ount of
But when you look at the
on that is outt there.
imaginatio
00:05:00
My biggesst experience
e with HANA
A over the las
st three years
s has been thhat in buildin
ng out the
amazing applications,
a
frankly,
00:05:09
our imagin
nation has no
ot been as vvivid and as extraordinary
e
y as I would hhave thought. And to
some deg
gree that is to
o be expecte
ed.
00:05:18
You know
w, we have bu
uilt our appliccations over the last 4 de
ecades, so inn many ways
s, our
thinking becomes
b
dom
minated by, o
or constraine
ed by the kind
d of things w
we always did
d.
00:05:27
So refacto
oring, and rethinking, and
d re-imaginin
ng these is so
omething thaat comes natturally to us.
But there is a ton of am
mazing thing
gs that can be done with this technoloogy.
00:05:39
And so we
e have a thriving ecosysttem of partne
ers, of compa
anies that haave been buiilding
solutions around this, and we need
d to think about new way
ys to bring th ese innovations to
market.
00:05:50
And so we
e've been thiinking about new areas, for example,, things that w
we did like HANA
H
One,
00:05:59
which is th
he deployme
ent of HANA that is availa
able on AWS
S, but also onn Korea Tele
ecom, on
Portugal Telecom,
T
and
d many othe rs; on VMwa
are, and so on.
o
00:06:12
And we re
ecently starte
ed working o
on HANA as a Service in our own dataa center. And
d of course,
what I believe is the co
ornerstone o
of our future is the HANA Enterprise C
Cloud,
00:06:24
s and comple
ex applications, mission-c
critical appliccations on HA
ANA in our
to run ourr applications
Cloud;
00:06:35
27
00:06.46
00:06:55
and be ab
ble to get the benefits of rrunning multiple workload
ds on a pool of resources
s without
compromise.
00:07:02
00:07:15
Accenture
e has done a great set off things aroun
nd retail and thinking aboout consume
er proximity,
customer segmentatio
on, and thing
gs like this. Deloitte
D
has been
b
buildingg applications
s.
00:07:27
00:07:41
Infosys, Wipro,
W
they're
e all building amazing applications. IB
BM of coursee is our distinguished
partner in implementation and so o
on.
00:07:50
e
o
of companies
s around HAN
NA. But the oone thing in our
o
So there is a thriving ecosystem
m that I perso
onally find th
he most excitting is a prog
gram that we started last year
ecosystem
00:08:04
00:08:15
t
more than 800 starrtup companies that are building
b
appllications on HANA.
H
we have today
00:08:24
800. It's ju
ust an amazing achievem
ment that Kau
ustav and the
e team have . And this program
started in February of last year, an
nd we are on
n track to get to a thousannd startup co
ompanies on
HANA,
00:08:35
ng their produ
ucts on HAN A by the end
d of this yearr. So if you loook at these startup
for buildin
companie
es, they are doing
d
all kind
ds of amazing
g things. So here is Warw
wick Analytic
cs, for
example,
00:08:47
hinking manu
ufacturing, there is Zettas
set back therre, there is Whodini,
W
that is working on reth
a
set o
of things aro
ound Internet of Things.
ThingWorrx doing an amazing
00:09:00
00:09:09
00:09:17
My God, there
t
are all these compa
anies doing 42stats,
4
they
y do customeer proximity analysis,
a
and thingss like this.
00:09:25
00:09:35
00:09:48
00:09:56
00:10:04
And we ha
ave all this data.
d
So wha
at Taodyne do
oes is they do
d visualizatioons of star fields. So
you can do
d a 3D mode
el of the wayy the universe
e looks,
28
00:10:12
00:10:21
00:10:32
00:10:45
Or this co
ompany Mobiilistic MIBS, tthis is Mobilistic Innovativ
ve Business Solutions, based
b
out of
India,
00:10:53
00:11:06
Every yea
ar, around this time, you kknow, in the Summer, the
e monsoonss come in Ind
dia, and they
bring a lott of disease malaria, tyyphoid, chole
era and th
here is alwayys a shortage
e of relief
workers,
00:11:19
00:11:31
T
also loo
ok at the inte
egrity of the medicine
m
suppply chain an
nd things like
e
It is really incredible. They
ust amazing kinds of thing
gs that peop
ple are doing.
that. It's ju
00:11:39
Here is Fa
an Appz, which looks at o
of course the
e fan experie
ence, and... m
more than 80
00
companie
es already on
n their way to
o a thousand of them in 57
5 different ccountries,
00:11:49
building th
heir applications on HANA
dibly inspirattional. So whhen you think
k about this
A. It is incred
amazing, this inspirational work th ese sort of companies
c
arre doing
00:11:58
ourney that we
w have been
n on: On the one hand it feels like wee have been doing this
and the jo
for three years,
y
but wh
hen I think ab
bout it, when
n I think abou
ut the power of what is in front of us,
00:12:08
it is very clear
c
that in many
m
ways, w
we have only
y just begun. The HANA revolution is
s still in front
of us. We have just be
egun.
00:12:23
j
begun. And
A with HAN
NA,
We have just
00:12:32
00:12:48
k about some
e amazing th
hings that can
n be done wiith this techn
nology.
So go outt there. Think
Think abo
out simplifying things arou
und us, doing
g great, new
w, unprecedennted things around
a
us.
00:12:59
00:13:08
o short to do the
t same old
d things. The
e future, you know Alan K
Kay, one of th
he great
Life is too
teachers of
o my life, on
nce told me rright here, ou
utside this ba
alcony,
00:13:22
that the fu
uture does no
ot have to be
e an increme
ent of the pas
st. It is sometthing that we
e can build.
You know
w, we are dev
velopers, we are software
e industry bu
uilders.
00:13:33
00:13:40
29
00:13:51
00:14:00
30
www.sap.c
w
com
2014
4 SAP AG or an SAP affiiliate company. All rights reserved.
No part of this publication may bbe reproduced or transmitted in any form or for
any purpose without the expresss permission of SAP AG or an SAP affiliate
company.
SAP an
nd other SAP products annd services mentioned he
erein as well as their
respective logos are trademarkss or registered trademarks of SAP AG (or an
SAP afffiliate company) in Germa
many and other countries. Please see
http://w
www.sap.com/corporate-een/legal/copyright/index.epx#trademark for
additional trademark informationn and notices. Some softw
ware products
marketted by SAP AG and its disstributors contain propriettary software
components of other software ve
vendors.
may vary.
Nationa
al product specifications m
These materials are provided by SAP AG or an SAP
P affiliate company for
informa
ational purposes only, w
without representation or warranty of any kind,
and SAP AG or its affiliated companies shall not be
b liable for errors or
omissio
ons with respect to the m
materials. The only warrranties for SAP AG or
SAP afffiliate company productss and services are those that are set forth in the
express warranty statements aaccompanying such pro
oducts and services, if
any. No
othing herein should be cconstrued as constituting an additional warranty.
In particular, SAP AG or its affilia
iated companies have no obligation to pursue
any course of business outlinedd in this document or any related presentation,
or to de
evelop or release any funnctionality mentioned therein. This document, or
any related presentation, and SA
AP AGs or its affiliated companies strategy
and po
ossible future developmennts, products, and/or platfform directions and
functionality are all subject to chhange and may be changed by SAP AG or its
ed companies at any timee for any reason without notice. The information
affiliate
in this document
d
is not a commiitment, promise, or legal obligation
o
to deliver
any ma
aterial, code, or functional
ality. All forward-looking sttatements are subject
to vario
ous risks and uncertaintiees that could cause actual results to differ
materia
ally from expectations. Reeaders are cautioned not to place undue
reliance on these forward-lookinng statements, which speak only as of their
dates, and they should not be reelied upon in making purc
chasing decisions.