a Netcetera Company
Bad SQL
Non-technical users
OLAP tools exist because this sort of query is traditionally slow on relational systems
Figure: example schema with a Bridge Table. Labels: 8191 nodes, 12 levels, 4096 leaves; table sizes 400 K rows, 500 K rows, 300 K rows.
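As a rough illustration of why a bridge table helps with hierarchy roll-ups, the sketch below builds a tiny hierarchy in SQLite and aggregates facts per ancestor. It assumes the bridge table holds one row per (ancestor, descendant) pair; the table names `org`, `org_bridge`, and `fact`, the column names, and the sample data are all hypothetical and not taken from the benchmark schema.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
    CREATE TABLE org (node_id INTEGER PRIMARY KEY, parent_id INTEGER);
    CREATE TABLE org_bridge (ancestor_id INTEGER, descendant_id INTEGER);
    CREATE TABLE fact (node_id INTEGER, amount INTEGER);
""")

# Hypothetical hierarchy: 1 is the root, 2 and 3 are its children,
# 4 and 5 are the children of 2.
cur.executemany("INSERT INTO org VALUES (?, ?)",
                [(1, None), (2, 1), (3, 1), (4, 2), (5, 2)])

# Facts are attached to the leaves 3, 4, and 5.
cur.executemany("INSERT INTO fact VALUES (?, ?)",
                [(3, 10), (4, 20), (5, 30)])

# Fill the bridge table with every (ancestor, descendant) pair,
# including each node paired with itself.
cur.execute("""
    WITH RECURSIVE closure(ancestor_id, descendant_id) AS (
        SELECT node_id, node_id FROM org
        UNION ALL
        SELECT c.ancestor_id, o.node_id
        FROM closure c JOIN org o ON o.parent_id = c.descendant_id
    )
    INSERT INTO org_bridge SELECT ancestor_id, descendant_id FROM closure
""")

bridge_rows = cur.execute("SELECT COUNT(*) FROM org_bridge").fetchone()[0]

# Roll-up: one join through the bridge table gives the total for every
# node at every level of the hierarchy.
totals = dict(cur.execute("""
    SELECT b.ancestor_id, SUM(f.amount)
    FROM org_bridge b JOIN fact f ON f.node_id = b.descendant_id
    GROUP BY b.ancestor_id
    ORDER BY b.ancestor_id
"""))
```

The roll-up needs no recursion at query time, but the join fans out each fact row once per ancestor, which is one reason such queries get expensive on large fact tables.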
Realistic scenario
Data Generation
create_dim(
  p_bf    => 2,
  p_depth => 12,
  p_name  => 'org',
  p_cols  => 'org01,org02,org03,org04,org05,org06,org07,org08,org09,org10',
  p_types => 't10,t10,t10,t10,t10,t10,t10,t10,t10,t10'
);
Generated from a helper table with just integers and random numbers
Started out as PL/SQL, now a Perl script that works with any DB
Generated by BI tool
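A minimal sketch of what a dimension generator with a branching factor and a depth might do, written in Python rather than the PL/SQL or Perl mentioned above. The function `make_dim` and its row layout are hypothetical; the sketch assumes that the depth counts the levels below a single root node.

```python
def make_dim(bf, depth):
    """Return the rows of a complete tree and the ids of its leaves.

    Each row is (node_id, parent_id, level); the root has level 0.
    """
    rows = [(1, None, 0)]
    frontier = [1]          # nodes on the level generated last
    next_id = 2
    for level in range(1, depth + 1):
        new_frontier = []
        for parent in frontier:
            for _ in range(bf):
                rows.append((next_id, parent, level))
                new_frontier.append(next_id)
                next_id += 1
        frontier = new_frontier
    return rows, frontier


rows, leaves = make_dim(bf=2, depth=12)
node_count = len(rows)
leaf_count = len(leaves)
```

With a branching factor of 2 and a depth of 12 this yields 8191 nodes and 4096 leaves.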
Query timings (table columns: OS, DBMS, 16 Mio, 1 Mio, 3500, Description)
OS and DBMS entries: AIX with Oracle 10G; Oracle 10 G (2005); MS SQL Server; Exacluster (Linux Microkernel) with Exasol
Timings in listed order: 1200 sec, 168 sec, 167 sec (Expensive Production Server); 1023 sec, 205 sec; 741 sec, 699 sec, 293 sec; 1432 sec, 413 sec; 22 sec, 2 sec, 0 sec (In Memory DB)
Further labels: DD CRA, HP, IBM, Exa
The Contenders
Oracle 11 G
MySQL
MonetDB
LucidDB
Exasol
16 GB RAM
Teradata
Oracle ExaData
Netezza
Vertica
Infobright
Kognitio
The field is very active and new products and approaches keep entering the market.
MonetDB
Origin:
Open Source: Yes
Free of Charge: Yes
Remarks:
o Recent publicity through a paper in Communications of the ACM: Breaking the Memory Wall in MonetDB
o Constantly changing as research progresses
o Easy to get into direct contact with the developers
Quote from the website:
MonetDB is an open-source database system for high-performance applications in data mining, OLAP, GIS, XMLQuery, text and multimedia retrieval.
LucidDB
Origin:
Open Source: Yes
Free of Charge: Yes
Remarks:
o Emphasizes ease of configuration and maintenance
o Mostly written in Java
Greenplum
Origin:
Open Source:
Free of Charge: No
Remarks:
o Based on a hardware architecture similar to that of Exasol
o Highly configurable and tunable, lots of features
o Column store is an option, default is row store
Exasol
Origin:
Open Source: No
Free of Charge: No
Remarks:
o Based on a hardware architecture similar to that of Greenplum
o Pure column store DB
o Emphasizes ease of administration
o No need to create indexes or gather statistics
o Imitates some Oracle-isms for compatibility
Quote from the website:
The database has been specially developed for analysis and is being used successfully for data warehousing, Web analytics, data mining applications and more. In contrast with universal databases, this specialization means that the data to be analyzed can be made available to analysis tools virtually in real time.
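To make the row store versus column store distinction concrete, here is a small Python sketch. It is only a conceptual illustration of the two layouts with made-up data and says nothing about how Exasol or Greenplum store data internally.

```python
from array import array

# Hypothetical sample records: (id, region, amount)
rows = [
    (1, "north", 120),
    (2, "south", 80),
    (3, "north", 200),
]

# Row layout: to sum one attribute, every full record is visited.
row_total = sum(record[2] for record in rows)

# Column layout: each attribute is kept in its own contiguous sequence,
# so an aggregate over "amount" reads only that column.
columns = {
    "id": array("q", [record[0] for record in rows]),
    "region": [record[1] for record in rows],
    "amount": array("q", [record[2] for record in rows]),
}
col_total = sum(columns["amount"])
```

Analytical queries typically aggregate a few columns over many rows, which is the access pattern the column layout favours.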
Chart: results for Oracle, MySQL, LucidDB, MonetDB, Greenplum, and Exasol (axis from 0 to 2000). Bar labels: 2280, 460, 226, 31, 13, 10.
Oracle on a new 64-bit box is 4 times faster than on an average 32-bit box
Both Oracle and LucidDB were twice as fast after dropping all indexes on the fact table (those are the times in the chart)
We did not manage to tune MySQL to get acceptable performance
For a free system, LucidDB has good performance and little hassle
MonetDB needed a fix in the optimizer before it could cope with the query
Next-generation in-memory DBs are at least one order of magnitude faster
Performance Scaling
Chart: run time in seconds at 16, 160, and 320 (axis from 0 to 400). Series: Greenplum [sec]; Exasol [sec] (untuned, comparable hardware); Exasol [sec] (local dimensions, comparable hardware); Exasol [sec] (public demo system). Data labels: 364, 288, 210, 183, 133, 105, 97, 54, 26, 13, 6, 3.
Conclusion
Big Lessons
By adopting the new technologies, you can totally revolutionize the way you access
your data
Small Lessons
If you run Oracle on a 32-bit system, move to a 64-bit architecture. It will give
you a speedup of about a factor of 4 without any pain
The commercial systems are worth their money. Their SQL is more compatible, and
they are more stable