Professional Documents
Culture Documents
01b - DBApplications - CSC 343 PDF
01b - DBApplications - CSC 343 PDF
1
Table
SQL Database Systems Information Resource Management
• Managing descriptions of DB’s and applications
~ $5.4B in new SQL DB licenses in 1998 • Reverse engineer & import code and DB Customer Update
Marketing
dept
Entry
dno
schema
Scheduled Bill
Customer
dna
Delivery
Salesperson
select all
Schedule
Delivery
Inventory
2/3 transaction processing, 1/3 decision support • Catalogue management and browser tools ERD
Customer
Scheduled
Delivery
Order
Salesperson
Product
Forms Spec
Architecture
Transaction processing
Tables
VB Code VC++ Code
• Applications
135K txns/min (tpmC), $97/tpmC, $13.1M system cost
• Metadata reporting- find relevant databases
20K txns/min (tpmC), $15/tpmC, $305K system cost
• Year 2000 - find date fields
7x txn rate, for 43x the price! (Scale out vs. scale
• Database translation - move to a different
up)
DBMS
TP monitors are becoming Internet application
servers • Technology – entity-relationship model on SQL
They’re at the core of all big e-commerce sites. DBMS
Query optimization is excellent, but there’s still much • Platinum (CA), Viasoft (R&O)
room for improvement • No automatic integration or categorization
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 5 ©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 6
Packaged Applications
Example of DB Reengineering Packaged applications (Enterprise Resource Planning
The relation systems) drive a lot of DB business -- there is little
custom application development at large companies
Employee(emp#,cname,dept,hiredt,sin# Commercial applications - finance, manufacturing,
,sal) distribution, human resources, order processing ...
might be mapped into Date Vendors include SAP ($5.1B), Peoplesoft ($1.3B), Baan
hires ($700M), Oracle apps ($700M), growing at 20%
M 1
They all have big built-in repositories of models to
Company
Company
Employee
Employee manage DB mapping, customization, & application
M 1
Name upgrades.
Sin# employs
Models drive package integration
Emp#
Emp# Load models from legacy apps; then write adapters
Dept Interfaces on which to build custom front ends
Sal
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 7 ©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 8
2
Data Warehousing The Killer Model Mgmt App
Business goal - decision support for everyone Creating and maintaining a Data Warehouse is hard. You need
Problem - Ad hoc query on production DBs doesn’t work tools, which require lots of models.
Approach - Create snapshot DB for decision support
= a data warehouse Inconsistent data formats Data quality & timeliness
One of main growth areas of the DB business
¾ $2B revenue in 1995, grew to $8B in 1998 (h/w + s/w)
Production Data Data Users Missing or invalid data Relate technical and
Databases Warehouse Marts business models
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 9 ©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 10
cust
emp
Order Product
Authorize
Credit Order
Entry
dept
dna Salesperson
Schedule
Forms Spec
Version schemas and transformations for Web-based portals are essential to core business,
ERD
Customer
Order Product
Scheduled
Delivery
Salesperson
Architecture
Tables
VB Code VC++ Code
3
Internet Databases Data Mining
The other big growth area of today’s DB business
Internet content-oriented meta-data is big business Extract patterns and models from the data
-- indexing, categorization hierarchies, comparison Automatically, from large data sets
shopping, … Use statistics, pattern recognition, machine
Structural metadata is just starting to get attention learning, visualization
XML interoperation requires libraries of DTD’s Applications - finance, fraud detection, astronomy,
Need tools to map to related DB schemas, forms, marketing analysis, medical diagnosis, biology …
… A small market, with much projected growth
E-commerce application deployment requires Must define and manage data subsets to mine
configuration definition and requirements -- TP And retain measures, derived data, and lineage
monitor and ORB technology is weak. Like scientific experiments, with irregular
Vision - Plug in a DB or application and it’s analysis steps
accessible and manageable; Improve hot links, Versioning is important.
search engines, and categorizations
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 13 ©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 14
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 15 ©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 16
4
Scientific Databases
What Was Left Out
CERN laboratory planning a new
Digital libraries
generation of experiments in particle
physics which will generate 1-10PB per Document management
year starting in 2005. Directory services
These data will be distributed world-wide Advanced file systems
to be analyzed by scientists Software databases and repositories
(public.web.cern.ch/Public/ Configuration management systems
To deal with the data, there is an
...
international Data Grid project e.g.,
www.cacr.caltech.edu/ppdg/,
grid.web.cern.ch/grid/
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 17 ©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 18
©2005 John Mylopoulos CSC343 Introduction to Databases — University of Toronto Applications & Technologies — 19