You are on page 1of 3

22/5/2014 Advantages | Spider Schema

http://spider-schema.info/?page_id=139 1/3
Advantages
Reduced data storage is achieved by the reduction of Foreign Keys in the Fact Tables. In the example below we
took two years of financial data from System-X and normalized it into both the Spider and Star Schemas. The
only difference between the two databases is that the Spider Schema contains the Intermediate Dimension,
and the Star Schema does not. With the Foreign Keys being removed from the Fact Tables in the Spider
Schema and put into a single Intermediate Dimension, a 24% savings in data was realized. This was due to two
Fact Tables sharing the same common dimensions.
Here is what the screenshot for the properties of Policy Spider Schema database looked like. Notice the 73.88
MB size.
Here is the screenshot for the Policy Star Schema. Notice the 97.13 MB size compared to the 73.88 MB of the
Spider Schema.
Spider Schema
A robust alternative to the Star Schema
22/5/2014 Advantages | Spider Schema
http://spider-schema.info/?page_id=139 2/3
Big Data, refers to the use of storing large amounts of data that can later be retrieved and processed. The
main goal behind Big Data storage is the use of smaller tables. A smaller table is a table with fewer columns in
it as database engines can process many rows of data, but when the table contains numerous columns, the
processing is slowed down.
The Spider Schema allows data to be broken down into smaller, more normalized data sets, thereby reducing
the number of columns in each table. This is particularly true with the fact table, which typically contains
more records of data than the other tables in a set of data. Since the Spider Schema removes the Foreign Keys
from the Fact Table, and allows for a distinct set of those keys, and since the possibility exists to create more
than one Intermediate Dimension; the ability to Load Balance Data and truly store Big Data becomes a reality.
The creation of Intermediate Dimensions allows Dimensional Data to be associated together based upon how
they relate to each other naturally and not a set of Facts. In creating an Intermediate Dimension, it is possible
to create different levels of granularity in the data. For example in the Sample-X System, there are Policies,
with Terms and Risks, which have Coverages and sometimes Deductions. While a Policy may not always
have a Deduction, it will always have Risks, Terms, and Coverages. Therefore normalizing data into the
Intermediate Dimension becomes important as this determines the level of granularity in the Fact Table. Since
more than one Intermediate Dimension may be referenced in the Fact Table, it is possible to create re-useable
Intermediate Dimensions that are used across all Fact Tables where needed, and it becomes the combination
of the Intermediate Dimensions that determine the level of granularity.

22/5/2014 Advantages | Spider Schema
http://spider-schema.info/?page_id=139 3/3
Share this:
>>> Comments
Spider Schema
Email Digg Facebook Twitter Google LinkedIn
Proudly powered by WordPress.

You might also like