Teradata

Leaders in Enterprise Data Warehousing
John Tulley
Vice President, Teradata Canada


Email: John.tulley@ncr.com
Office: 905-478-8997
2
2004 Revenue
by Business Unit
NCR Corporate Overview
• Fortune 500 company
• Global operations in more than 100
countries & territories
• 28,500 employees

• 2004 Revenue $5.984B
• 1999-2004 >51% revenue growth
Teradata
Financial
Retail
Systemedia
Customer Service
Payment & Imaging
Other
Teradata
Data Warehouse
Retail
Solutions
Financial
Solutions
Worldwide
Customer
Services
Systemedia
3
50% of Top 10
Global Retailers
60% of Top 10
Most Admired
Global Companies
80% of Top 10
Global Telco Firms
60% of Top 10
Global Airlines
50% of the Top 10
Transportation
Logistic Firms
Top Industry Leaders Rely on Teradata
• Leading industries
> Banking
> Government
> Insurance & Healthcare
> Manufacturing
> Retail
> Telecommunications
> Transportation Logistics
> Travel

• World class customer list
> More than 800 customers
> Over 1200 installations

• Global presence
> Over 100 countries

• 4,000 world-wide professionals
dedicated to data warehousing
FORTUNE Global Rankings, July 2005
Teradata Top 10
4
The Teradata Difference
What We Do….
• Enterprise data warehouse
• Windows 2003/Unix/Linux scales from Intel laptop to MPP
• Analytic capabilities transform data into information.
• Extreme high availability
• Industry leader in analytical applications
• Integration with SAP, Siebel, Hyperion
• Partnerships include Accenture, Bearingpoint,
CAPGemini, Deloitte, EDS, Lockheed Martin
• Strong customer references

All we do is Data Warehousing!

5
Best Worst
HP
HP9000
HP-UX
Oracle
IBM SP
RS/6000
AIX
DB2 EEE
Sun
Enterprise
Solaris
Oracle
Generic
Intel IA-32
Win2000
SQL Server
Unisys
ES7000
Win2000
SQL Server
IBM
S/390
OS/390
DB2 EEE
Compaq
Alpha
Tru64
Oracle
Teradata
Data Mgmt.
Query Perform.
Scalability
and Suitability
Concurrent
Query Mgmt.
DW Track
Record
Data Admin.
Source: Gartner ASEM Ratings 2004
Teradata - the recognized leader in data warehousing
and high-performance decision analytics.
….Gartner ASEM
6
Industry Leadership Recognition
• Gartner - “Dominant Lead” – 5
th
Consecutive Year
> “DBMS is surely the place where NCR Teradata sets the gold standard. As
in previous years, the Teradata score was 98%, leaving little scope (and need) for
improvement.”
– Gartner's [Application Server Evaluation Model] ASEM Data Warehouse Server Update, A. Butler, K. Strange, J.
Enck, M. Chuba, November 2004

> Teradata[database management system] DBMS capabilities remain
unchallenged by its competitors in the market.”
– Gartner’s Magic Quadrant for Data Warehouse DBMSs, 2004, Kevin H. Strange, June 2004

> “Teradata continues to drive a strong vision.”
– Gartner Research, MarketScope: Customer Relationship Marketing, 1Q04, G. Herschel, J. Radcliffe, Feb 2004

> Gartner Dataquest recognized Teradata as the growth leader in the RDBMS
market, with above market growth of 17.4%. 2005

> Teradata is rated “Positive” in Gartner’s MarketScope for Campaign Management,
the highest rating awarded 2005

• META Group
> “Teradata has displayed unmatched (but often copied) strength of vision
and focus in the [enterprise data warehouse] EDW market.”
– METAspectrum Market Summary, Enterprise Data Warehouse METAspectrumSM Evaluation, 2004

7
BI Excellence Award
Sponsor: Gartner Group
•Continental Airlines - winner
•Cardinal Health - finalist


Technology Leadership
Award
Sponsor: Frost & Sullivan
•Teradata selected for
Leadership Award – CRM
Analytics


TDWI Best Practices
Award
•sunrise TDC Switzerland AG
– winner - Customer
Relationship Management
NEXUS
Awards
1to1 Impact Award
Sponsor: Peppers & Rogers
Continental Airlines recognized
as Technology Optimization winner

Editors’ Choice Awards
Sponsor: Intelligent Enterprise
•Teradata selected for the
“Dozen” Most Influential
BI Companies
•Winner, Customer Analytics category

NEXUS Awards
Sponsor: New Zealand
Direct Marketing Association
•Bank of New Zealand,
silver award - data mining & analytics;
bronze award - data management








Industry Awards and Recognition - 2005
8
Government Agencies with Teradata Presence
• US Air Force
• US Navy
• US Transportation
Command
• Defense Commissary
Agency
• Army, Air Force
Exchange
• Intelligence
Community
• US Postal Service
• Italian Post Office


• Dept. of Justice
• Dept. of Housing and
Urban Development
• Dept. of Agriculture
• Arizona, Iowa, Florida,
Texas, Illinois, New
York, Utah, Michigan
• RAMQ – Quebec
• Australian Tax Office
• South African Tax
Office

9
Teradata Solutions Methodology
Project Management
Data Mapping
Application
Requirement
Infrastructure
& Education

Logical Model
Business
Value
EDW
Roadmap
Opportunity
Assessment
Enterprise
Assessment
Value
Assessment
User
Training
Production
Install
Components
for Testing
Initial Data
System Test
Acceptance
Testing
Custom
Component
System
Architecture
Test Plan
Package
Adaptation
Education
Plan

User
Curriculum
Information
Exploitation
Physical
Database
Operational
Applications
ECTL
Application
Backup &
Recovery

Availability
SLA
HW/SW
Upgrade
System
Performance
System DBA
Help Desk
Business
Continuity
Capacity
Planning
Solution
Architect
Data
Migration
Support
Management

Hardware
Platform

Operational
Mentoring

Software
Platform

Technical
Education
Strategy Integrate Analyze Research Design Build Equip Manage
Technology Neutral Services
Information
Sourcing
Teradata’s success is the combination of hardware, software and
methodology
10
W
o
r
k
l
o
a
d

C
o
m
p
l
e
x
i
t
y

Data Sophistication
Data Warehouse Needs Will Evolve
OPERATIONALIZING
WHAT IS happening?
Event-Based
Triggering
Takes Hold
ACTIVATING
MAKE it happen!
Continuous Update/Short Queries
Event-Based Triggering
Primarily Batch &
Some Ad Hoc Reports
Increase in
Ad Hoc Analysis
ANALYZING
WHY
did it happen?
REPORTING
WHAT
happened?
Analytical
Modeling
Grows
PREDICTING
WHAT WILL
happen?
Batch
Ad Hoc
Analytics
• Query complexity grows
• Workload mixture grows
• Data volume grows
• Schema complexity grows
• Depth of history grows
• Number of users grows
• Expectations grow
11
Enterprise Analytical Topologies
Sources
Users
DW
Sources
Users
DW
Marts
Sources
Users
Marts
Sources
Users
Middleware
Data Mart
Centric
Virtual,
Distributed,
Federated
Hub-and-
Spoke Data
Warehouse
Enterprise
Data
Warehouse
Independent Data
Marts
Leave Data Where it
Lies
Dependent Data
Marts
Centralized
Integrated Data
With Direct Access
P
r
o
s
• Easy to Build
Organizationally
• Easy to Build
Technically
• No need for ETL
• No need for separate
platform
• Allows easier
customization of user
interfaces & reports
• Enterprise view
• Design consistency &
data quality
• Data reusability
C
o
n
s
• Business Enterprise
view unavailable
• Redundant data costs
• High ETL costs
• High App costs
• High DBA and
operational costs
• No ETL
• Meta data issues
• Network bandwidth and
join complexity issues
• Only viable for low
volume
• Business Enterprise
view challenging
• Redundant data costs
• High DBA and
operational costs
• Data latency
• ODS duplication
• Requires vision
• Requires Data Owners
to willingly participate
ODS
12
Typical Data Warehouse Architecture
What’s wrong with
this picture?
1. There are too many
copies of the data.
Will they all be the
same?
3. The solution is too
complex. Every line
on the chart
represents an ETL
process that
requires $$ for Life
Cycle Maintenance
4. The solution is too
expensive. There
are numerous
components that
lead to increased
costs. Costs often
hidden in
distributed
organization.
Operational Data Stores
Central store, Hub, Clearing house
Data Marts
Transaction Systems
2. There is too much
latency - too long to
get the data to the
people who need it.
Everyone sees
different inconsistent
points in time
13
Teradata’s Enterprise Data Warehouse
An Integrated, Centralized Data Warehouse Solution
Transactional Data
Decision Users
Transactional Users
Data Transformation
Operational
Data Store (ODS)
“Enterprise”
Data Warehouse
Data Replication
Data Marts
E
n
t
e
r
p
r
i
s
e
,

S
y
s
t
e
m
,

&

D
a
t
a
b
a
s
e

M
a
n
a
g
e
m
e
n
t

M
e
t
a
d
a
t
a












L
o
g
i
c
a
l

D
a
t
a

M
o
d
e
l












P
h
y
s
i
c
a
l

















D
a
t
a

B
a
s
e

D
e
s
i
g
n

M
i
d
d
l
e
w
a
r
e
/
E
n
t
e
r
p
r
i
s
e

M
e
s
s
a
g
e

B
u
s


B
u
s
i
n
e
s
s


&

T
e
c
h
n
o
l
o
g
y



C
o
n
s
u
l
t
a
t
i
o
n

S
u
p
p
o
r
t

&

E
d
u
c
a
t
i
o
n

S
e
r
v
i
c
e
s


Strategic
Users
Tactical
Users
Reporting
OLAP Users
Event-driven/
Closed Loop
Data
Miners
Optional
ETL Hub
Optional
Optional
CUSTOMER
CUSTOMERNUMBER
CUSTOMERNAME
CUSTOMERCITY
CUSTOMERPOST
CUSTOMERST
CUSTOMERADDR
CUSTOMERPHONE
CUSTOMERFAX
ORDER
ORDER NUMBER
ORDER DATE
STATUS
ORDER ITEM BACKORDERED
QUANTITY
ITEM
ITEM NUMBER
QUANTITY
DESCRIPTION
ORDER ITEM SHIPPED
QUANTITY
SHIPDATE
Optional ELT
SALES
PERIODKEY
PRODUCT KEY
CUSTOMERKEY
MARKET KEY
DOLLARS
UNITS
PERIOD
PERIODKEY
DATE
DAY
MONTH
YEAR
QUARTER
TRIMESTER
CUSTOMER
CUSTOMERKEY
CUSTOMERNAME
CUSTOMERCITY
CUSTOMERPOST
CUSTOMERST
CUSTOMERADDR
CUSTOMERPHONE
CUSTOMERFAX
PRODUCT
PRODUCT KEY
PRODUCT NAME
DISTRIBUTOR
PRODUCT DESCRIPTION
PRODUCT HEIGHT
PRODUCT WIDTH
PRODUCT DEPTH
PRODUCT WEIGHT
MARKET
MARKET KEY
CITY
STATE
ZIP
ZIP4
DISTRICT
REGION
COUNTRY
Logical
(Views) Application
Dimensional
Co-Located
Dependent DM
Virtual Views
Single version of data
14
TERADATA is an Open System
TERADATA
CORBA
ODBC
IIOP
.NET
OLE-DB
ASP
WEB
TERADATA
Utilities
Queues
Adapter(s)
M
e
s
s
a
g
e

B
u
s

P
u
b
l
i
s
h

&

S
u
b
s
c
r
i
b
e

TAP Appl
JDBC
JSP
EJB
JDBC
JMS
Adapter(s)
TERADATA
Utilities
Messages
Virtually
any application
or middleware
framework can be
integrated with
TERADATA !!!
JAVA
JDBC
JMS
16
Information Exchange
Fast Export
Decision Making Environment Transactional Environment
Web Services
Enterprise Application
Integration
T-Pump, MQ Adapter
Direct Data Access
MQ Adapter
Legacy
Systems
Secure
DOD
Network
Secure
DOD
Network

Front
Line
Business Services
Secure Wireless
Warfighter
Support
Base
Supply
DOD Supplier

Web-
Sphere
Tibco
(EAI)
.NET
OLAP
Queries
Intel
Agents
Event
Engine
Rules
Engine
Teradata Active Data Warehouse in action
1.Continuous Transaction
feeds on supplies usage
2. Conditioning &
Loading of trans
data
3.Stored
Procedures
trigger based
event
detection
sends alert
to
Warfighter,
Warfighter
Support, &
DOD Supplier
via MSTR
Narrowcaster
Strategic
& Tactical
Queries
5.Warfighter receives alert via
Secure Blackberry, adjusts Battle
Plans to align with rush replenishment
4. and or DOD
Vendor notified
and reorders
T-Pump, MQ Adapter
Ascential
Informatica
Data Acquisition
Fast Load, Multi Load
TERADATA
Stored Procedures
Q Tables
UDF, Triggers

So what is Teradata ?


18
What is Teradata?
• RDBMS designed to run the world’s
largest databases
• Latest Intel technology nodes
• UNIX-MP-RAS, Windows 2003
• Linux in Fall 2005
• Scales linearly from Laptop to MPP
• Has a parallel aware optimizer that
allows multiple complex queries to run
concurrently
• Standard access language (SQL)
• Uses a “Shared-Nothing” architecture
• Unlimited, unconditional parallelism
• Linear Scalability allows for increased
workload without decreased throughput.

19
Teradata Hardware Architecture
• SMP Nodes
> Latest Intel SMP CPUs
> Configured in 2 to 8 node
cliques
> Windows, Unix or Linux
• BYNET Interconnect
> Fully scalable bandwidth
> 1 to 1024 nodes
• Connectivity
> Fully scalable
> Channel - ESCON
> LAN, WAN
• Storage
> Independent I/O
> Scales per node
• Server Management
> One console to view
the entire system



Server Management
PE
SMP Node
1

AMP PE
AMP AMP AMP
PE
SMP Node
2

AMP PE
AMP AMP AMP
PE
SMP Node
3

AMP PE
AMP AMP AMP
PE
SMP Node
4

AMP PE
AMP AMP AMP
BYNET Interconnect
20
Teradata Shared Nothing Architecture
• Similar to Large SMP, except Interconnect runs at I/O Rates and not
Memory Rates
• Longer Lifetime: I/O Interfaces have a 3-5 Year Lifetime
• Scaling Is By Increasing Link Data Rates and Parallel Links
P
Memory
FSB
I/O
P
P
Memory
FSB
I/O
P
P
FSB
P
Memory I/O
P
FSB
P
Memory I/O
21
SMP vs. MPP: The Teradata Advantage
• 2-Way SMP
> 1.8 Relative CPU’s
> 4 GB Memory
> 3.2 GB/Sec BUS
> 3.2 GB/Sec Memory
> 1.5 GB/Sec I/O
• 4-Way SMP
> 3.1 Relative CPU’s
> 4 GB Memory
> 3.2 GB/SEC BUS
> 3.2 GB/Sec Memory
> 1.5 GB/Sec I/O
• 2 2-Way Teradata Nodes
> 3.6 Relative CPU’s
> 8 GB Memory
> 6.4 GB/Sec BUS
> 6.4 GB/Sec Memory
> 3 GB/Sec I/O

• 32 2-Way Teradata Nodes
> 57.6 Relative CPU’s
> 128 GB Memory
> 102.0 GB/Sec BUS
> 102.0 GB/Sec Memory
> 48 GB/Sec I/O

22
• Rows are distributed evenly by hash partitioning
> Done in real-time as data are loaded, appended, or changed.
> No reorgs, repartitioning, space management
• Shared nothing software:
> Each VAMP owns an equal slice of the data.
> Each VAMP works exclusively & independently on its rows
> Nothing centralized: No single point of control for any operation (I/O,
Buffers, Locking, Logging, Dictionary)
Teradata Data Distribution
Dividing the Work
VAMP1 VAMP2 VAMP3 VAMP4 ………………………………………………………VAMPn
Table A Table B Table C
Prime Index
Teradata Parallel Hash Function
P
D M
P
D M
P
D M
P
D M
P
D M
P
D M
P
D M
P
D M
P
D M
RowHash (Hash Bucket) Data Fields
23
File System
• File system architecture is fundamentally different
> Broke all the rules
> No Pages, BufferPools, TableSpaces, Extents,...
> Data location and management are entirely automatic
> Space allocation is entirely dynamic
• Absolutely minimal labor required
> No reorgs
– Don’t even have a reorg utility
> No index rebuilds
> No re-partitioning
> No detailed space management
> Easy database and table definition
> Minimum ongoing maintenance
– All performed automatically
Self Managing Architecture

• Teradata’s self-managing philosophy provides the lowest
total cost of ownership of any RDBMS
> Automatic, random and even data distribution
> Parallel-aware optimizer eliminates query tuning
> Parallel utilities with low setup and checkpoint restart
> Single operational view of entire MPP complex (AWS)
> Single point of control for the DBA (Teradata Manager)
> SQL-ready database management information (log files)
25
Teradata DBAs Don’t Worry About!
1. Install the Database
2. Understand, monitor and tune extensive operating system
parameters
3. Understand, monitor and tune extensive database parameters
4. Determine the size and physical location and/or space allocations
of tables and index partitions
5. Perform periodic table and index re-orgs
6. Manually restart multi-step load process when failure occurs
7. Ability to run queries and data maintenance 24x7
8. Sort data before loading
9. Calculate and configure fail-over plans in a clustered
multiprocessing environment
10. Spend a lot of time planning and expanding the system
11. Query tuning for decision support
26
Teradata High Availability
• Teradata software
provides high availability
beyond other databases
> Compensates for
hardware failures:
– Automatic failover for
dynamic
workload rebalancing
(migrating VPROCS)
– Online, continuous
backup
(Fallback)
> Recycles before
the operating system
completes its reboot
(multi-node system)


PE
SMP Node
1

AMP PE
AMP AMP AMP
PE
SMP Node
2

AMP PE
AMP AMP AMP
PE
SMP Node
3

PE
AMP AMP
PE
SMP Node
4

AMP PE
AMP AMP AMP
BYNET Interconnect
AMP
AMP
28
Teradata’s Multidimensional Scalability
(It’s more than just big data)
Amount of Detailed Data
Concurrent Users
CUSTOMER
CUSTOMER NUMBER
CUSTOMER NAME
CUSTOMER CITY
CUSTOMER POST
CUSTOMER ST
CUSTOMER ADDR
CUSTOMER PHONE
CUSTOMER FAX
ORDER
ORDER NUMBER
ORDER DATE
STATUS
ORDER ITEM BACKORDERED
QUANTITY
ITEM
ITEM NUMBER
QUANTITY
DESCRIPTION
ORDER ITEM SHIPPED
QUANTITY
SHIPDATE
Multiple Subject Areas
• Simple Direct at the start
• Moderate Multi-table Join
• Regression analysis
• Query tool support
Sophisticated Queries
29
Data Volume
(Raw, User Data)
Schema
Sophistication
Query
Freedom
Query
Complexity
Query
Concurrency
Mixed
Workload
Query Data Volume
Data
Freshness
EDW Requires Multi-dimensional
Scalability
30
Data Volume
(Raw, User Data)
Competition Scales
One Dimension at the
Expense of Others

Limited by Technology!
Schema
Sophistication
Query
Freedom
Query
Complexity
Data
Freshness
Query Data Volume
The Teradata Difference
“Multi-dimensional Scalability”
Query
Concurrency
Mixed
Workload
Teradata can Scale
Simultaneously Across
Multiple Dimensions
Driven by Business!
31
Data Volume
(Raw, User Data)
Competition Scales
One Dimension at the
Expense of Others

Limited by Technology!
Schema
Sophistication
Query
Freedom
Query
Complexity
Data
Freshness
Query Data Volume
The Teradata Difference
“Multi-dimensional Scalability”
Query
Concurrency
Mixed
Workload
Teradata can Scale
Simultaneously Across
Multiple Dimensions
Driven by Business!
The
Teradata
Difference!
32
The Teradata Difference
“Multi-dimensional Scalability”
Workload
Mix
Query
Complexity
Active Data Warehousing
3-5 Way
Joins
Normalized
TBs
MBs
GBs
Query Data
Volumes
10 TB
Others
100’s TBs +
Teradata
15 TB
20 TB
Multiple, Integrated
Stars and Normalized
15+ way Joins +
OLAP operations +
Aggregation +
Complex “Where”
constraints +
Views
Parallelism
Batch Reporting,
Repetitive Queries
“Iterative”, Ad Hoc Queries
Data Analysis/Mining
Near Real Time Data Feeds
Simple
Star
Multiple,
Integrated
Stars
Data Storage
(raw, user data)
Schema
Sophistication
5-10 Way
Joins
5 TB
# of
Concurrent
Queries
1,000’s
33
Implementation Summary
•Integrated data from nine separate health-related agencies
•Managed and used by agency subject matter/programmatic
experts, not by the IT department
•Over 200 users in Medicaid and 8,000 state-wide


Realizations and ROI
• Estimated annual savings of $75 million–$100 million due to
advanced health care analysis
• Medicaid administrative costs have been reduced by 25 percent
• Recoveries for Medicaid Fraud has doubled
• Maximized Medicaid program savings while sustaining quality
care
• Warehouse helped Michigan go from “last to first” in child
immunization rates
• Track and substantiate savings in Medicaid pharmacy costs
• 2004 TDWI Best Practice Award Winner – Government and
Non-Profit Category

Teradata Customer Since 1991
Business Solutions

• Data warehouse integrates
claims/encounters; beneficiary
eligibility data; provider data; birth
records; death records; long-term
care assessments; WIC data;
immunizations; lead screening;
newborn screening; & notifiable
diseases.
•Fraud & abuse
•Contract management with health
plans
•Healthcare cost & quality
assessment
•Overpayment & COB analysis
•Program effectiveness
•Predict State’s healthcare needs
•Prioritize health initiatives
for future
Customer Profile
As the largest department in the State of Michigan, DCH is responsible for managing delivery of health care
services to more than 1.2 million clients and overseeing an annual budget of $9.5 billion. DCH administers many of
the state’s most critical programs, including Medicaid, WIC, and child immunizations.

State of Michigan, Department
of Community Health (DCH)
34
Implementation Summary
•More than five years of History
•1.3 Billion Claims
•650 users from 17 counties that is expected to grow to
thousands

Realizations and ROI
• First year in operation paid for entire implementation of
the DW!
• Better analysis of integrated data resulted in recoveries
in the millions!
• $16m - Coordination of Benefits, $5m - duplicate
payments, $1 million - overpayments
• $187 million saved due to better policy decisions based
on medical and pharmaceutical analysis
• Millions saved due to efficiency of analysis such as Audit
process reduced to 2 hours from 8 weeks
• 2004 NASCIO Award – Best Information Architecture
Category


Teradata Customer Since 1999
Business Solutions
New York is making more rapid, informed
decisions about programs, policies, and
people across its vast Medicaid system.
• Fraud & abuse
• Tracking bio-terrorism indicators daily by
pharmaceutical purchases with acute
illness data from hospital emergency
rooms
• Determining disease patterns and trends
and the best possible treatment
• Tracking drug pattern usage to prevent
abuse
• Program effectiveness
• Service delivery effectiveness
• Enhanced audit control
• Forecasting the cost and utilization of
expensive prescription drugs
• Identification of overpayments
• Responding quickly to legislative
inquiries
Customer Profile
New York’s Medicaid program provides critical health care services to more than 3.7 million participants – 2.4
million in New York City alone. To serve this constituency, the state processes and analyzes more than 300 million
claims totaling more than $38 billion annually. It is the largest Medicaid program in the US.

The New York State
Department of Health (DoH)
35
Iowa Department of Revenue
Tax Compliance
• Have more accurate leads because of better information
• Experienced substantial savings; staff can --
> Analyze greater volumes of data
> Manage a greater number of cases
> Exercise a higher level of control over taxpaying behavior
> Before the EDW, this additional work would have caused
for a 20-25% increase of the audit staff
• Generated $69.7M in incremental collections and refund
reductions in 2003
> $30.6M through office examinations
> $17.4M in refund reductions
> $ 9.1M from tax gap revenues
> $ 7.5M in out-of-state audits of multi-state businesses
> $ 5.1M from in-state field audits Business Benefits


36
The Teradata Mission

Teradata Active Data Warehousing


strategic
tactical
event-driven
decision making in a single
centralized
mission-critical
up-to-date
version of the enterprise data


“Any Question, By Any User, At Any Time”
All Decision Making…from One Copy of the Data.
strategic
tactical
Sources
Users
Active Data Warehouse
37
john.tulley@ncr.com
The Industry Leader in Data Warehousing