Professional Documents
Culture Documents
Assumptions Guide
Document Purpose
This document is intended to help Alation’s customers and prospects understand the assumptions needed for
the Data Catalog Value Index model.
General Assumptions
The following assumptions are the inputs for our DCVI model. These are the only inputs needed along with
Alation Analytics data from a customer in order to generate a DCVI report. Prospects only need the applicable
assumptions and to run a simulator to project value for the first year of their program.
Caveat: The assumptions are trying to decompose complex measures into a single number. The real life data for assumptions
#8-10 take the shape of something called the Generalized Pareto Distribution (GPD) a.k.a. 80/20 or 90/10 rule. Meaning that most
searches will save you time on the order of minutes or hours and some searches will save you days of effort. Keep in mind that
these assumptions are the averages of their respective distributions of data.
Some of these metrics also require data to be supplied via a direct Alation Analytics connection and extract
instead of using the Compose query method of extract.
In most cases, the majority of the value generated by DVCI will be related to baseline and analyst productivity,
so these data governance specific assumptions can be ignored until a customer has a very mature program.
3. Title Case
○ Allowed character set for titles: upper, lower, mixed