Professional Documents
Culture Documents
A Project-Oriented Data Warehouse For Construction: Thammasak Rujirayanyong, Jonathan J. Shi
A Project-Oriented Data Warehouse For Construction: Thammasak Rujirayanyong, Jonathan J. Shi
www.elsevier.com/locate/autcon
Abstract
A construction organization generates a great amount of operational data that are distributed across various functional systems to support its
daily operations. Although those data may be potentially useful for future projects, they are not widely collected and centrally stored in the
organization. This research presents a Project-oriented Data Warehouse (PDW) for contractors. PDW is designed with dimensional data models
consisting of 26 tables. Sixteen of the tables are dimension tables for storing general descriptive information, and the other ten are fact tables for
detailing various facts that are captured in the lifecycle of construction projects. PDW can be directly populated with data from existing
operational systems, such as P3 files, MS Access, P3/e databases, and Excel files. It maintains each data in the context of its associated project so
that a user can retrieve a specific piece of information plus any background information of the related project. PDW has been populated with three
sample project data. Through the user interface, a user can generate interested query reports as needed. The presented warehouse structure and data
models are scalable. They may be adopted by medium or large contractors for developing company-level data facilities.
2005 Elsevier B.V. All rights reserved.
Keywords: Database; Information system; Decision support system; Construction; Data warehousing
1. Introduction
Everyday organizations large or small create billions of bytes
of data about various aspects of their business, such as
customers, products, operations and people [1]. An organization
has the need to access a variety of information to support either
its daily operations or business decisions. Construction
organizations also generate a great amount of operational data
that are distributed across various functional databases. These
data play an important role in securing a project's completion on
time, within budget and meeting design specifications [2].
The information systems in an organization are generally
divided into two categories: operation support systems (OSS)
and decision support systems (DSS). OSS serve the need of
running the daily operations of the business. DSS provide
historical information for analyzing the business so that
important business decisions can be made appropriately. Many
companies have realized the importance of the hidden treasure of
information, which can significantly improve the quality of
Corresponding author.
E-mail address: jonathan.shi@iit.edu (J.J. Shi).
0926-5805/$ - see front matter 2005 Elsevier B.V. All rights reserved.
doi:10.1016/j.autcon.2005.11.001
decisions [3]. Moreover, unlike consumable resources, information as an organization's intangible asset can be reused over
and over without losing its value. Instead, it may even be
enriched in the process. Therefore, it is the interest of an
organization to collect and store the information for future use.
Data warehousing is a new technology evolved in the last
decade. It intends to provide all users in an organization with
timely access to whatever level of information as needed [1]. A
data warehouse provides an architectural model for the flow of
data from operational systems already in place to decisionsupport environments [4,5]. It is periodically populated with
data from operational systems such as equipment managements,
accounting systems, material inventory systems and customer
management systems. Essentially, a data warehouse collects all
of the relevant data into one central system, organizes the data
efficiently so it is consistent and easy to retrieve, keeps old
data for historical analysis, and enables access to and use of data
conveniently so that users can do it themselves [6]. William
Inmon, who coined the term data warehouse in 1990, defined
a data warehouse as a subject oriented, integrated, nonvolatile,
and time variant collection of data in support of management
decision [7].
Data Source
801
Data Access
Project
performance
Query/Reporting
Estimate
Data
Staging Area
Material
ETL
PDW
ETL
Analysis
Contract&
Bidding
Data Mart
Data Mart
External
DB
Data Mining
802
Project
Calendar
Date
Month
Year
Activity
Activity_key
Activity_name
Activity_code
Acitivity_type
Cost Fact
Date
Project_key
WBS_key
Activity_key
Cost
Duration
Quantity
Project_key
Project_name
Description
Category
WBS
WBS_key
WBS_name
WBS_code
Project
Calendar
Date
Month
Year
Activity
Activity_key
Activity_name
Activity_code
Acitivity_type
803
Cost Fact
Date
Project_key
WBS_key
Activity_key
Cost
Duration
Quantity
Project_key
Project_name
Description
Category_key
Category
Category_key
Category
WBS
WBS_key
WBS_name
WBS_code
essential data of each activity includes its as-planned and asbuilt schedules, costs, resources, and change orders incurred
during construction.
After a careful examination of the available project data and
the essential data for potential future uses, the PDW is designed
with 26 tables: 16 dimension tables and 10 fact tables. The 16
dimension tables as summarized in Table 1 provide descriptive
information about a construction project including: its owner,
geographical location, time, cost accounts, materials, suppliers,
relationships between activities, organizational responsibility,
subcontractors, and activities. The 10 fact tables as summarized
in Table 2 detail a project information in 10 categories
including: bid, change order, contract, estimate, expense,
material, relationship, resource, schedule, and subcontract.
The relationships between the 26 tables are shown in Fig. 4
in which the boxes represent the fact tables and the ovals
represent the dimension tables. Some dimension tables are
shared by multiple fact tables such as project dimension,
activity dimension, and geography dimension.
The details of the 26 tables are not provided in this article due
to a limit on space. Interested readers may write to the authors
Table 1
The 16 dimension tables
Dimension table
Table descriptions
Activity
Activity predecessor
Awarded project
Calendar
Change extra
Cost account
Expense
Geography
Material
OBS
Owner
Project
Resource
Subcontractor
Supplier
WBS
Table descriptions
Bid
Change order
Contract
Estimate
Expense
Material
Relationship
Resource
Schedule
Subcontract
804
Schedule Facts
Activity
Estimate Facts
Calendar
Activity Predecessor
Relationship Facts
Contract Facts
Owner
Awarded-Project
Resource
Resource Facts
Bid Facts
Geography
Project
Expense
Expense Facts
Subcontract Facts
Subcontractor
Cost Account
Supplier
Change Extra
WBS
Material Facts
Material
OBS
five dimension tables, we can determine the project ID, the cost
account, the time the PO was placed, the time that goods were
received, and the suppliers.
2.4. The development of the PDW
A data warehouse is a central database residing on the
servers and users can access and retrieve data at their client
computers. The PDW must be constructed in a clientserver
Fig. 5. The material fact table and its supporting dimension tables.
Material_ID
Material_name
Unit
1
2
3
4
5
6
7
8
9
10
11
RB10
C101
PC103
HR1
C104
WM3
P005
BK213
CAP11
WA5
DR200
Steel rebar
Structural concrete
Barrier pre-cast concrete
Hand rails
Topping concrete
Wire mesh
Paint
Brick
Carpet
Interior wall
Door
ton
cu yd
each
lf
cu yd
sf
gal
sf
sf
sf
each
805
Cost_Account_wk
Order_Date_wk
Received_date_wk
Supplier_wk
Material_wk
PO_Number
Unit_price
Quantity
Discount
1
1
1
1
1
1
2
2
2
2
2
1
1
3
2
2
4
1
1
7
5
4
20020105
20020112
20020203
20020215
20020215
20020215
20020412
20020412
20020515
20020520
20020520
20020112
20020120
20020210
20020119
20020222
20020225
20020417
20020421
20020522
20020527
20020529
1
1
3
2
2
2
1
1
4
4
4
1
2
3
4
5
7
1
2
10
11
7
1001
1002
1003
1004
1004
1004
1005
1005
1006
1007
1007
$50.00
$65.00
$810.00
$10.00
$55.00
$25.00
$50.00
$65.00
$3.00
$150.00
$25.00
110.00
346.15
20.00
120.00
196.36
8.00
30.00
21.54
800.00
14.67
52.00
$2.20
$6.92
$0.00
$2.40
$3.93
$0.00
$0.60
$0.00
$16.00
$0.00
$1.04
806
807
[7] W.H. Inmon, Building the Data Warehouse, John Wiley and Sons, Inc.,
New York, 1993.
[8] R. Adhikari, Migrating legacy data, Software Magazine 16 (1) (1996
(January)) 7580.
[9] J. Kador, One on one, Midrange Systems 8 (20) (1995 (October)).
[10] L. Miller, S. Nilakanta, Data warehouse modeler: a CASE tool for
warehouse design, Proc. of the 31st Hawaii Intl. Conf. on Sys. Sciences,
IEEE Computer Society Press, 1998.
[11] P. Ponniah, Data Warehousing Fundamentals, John Wiley & Sons, Inc.,
New York, 2001.
[12] K.W. Chau, Y. Cao, M. Anson, J. Zhang, Application of data warehouse
and decision support system in construction management, Automation in
Construction (12) (2002) 213224.
[13] Primavera Systems, Inc., Primavera Enterprise: Administrator's Guide,
Pennsylvania, 2002.
[14] Primavera Systems, Inc., Primavera Enterprise: User's Guide, Pennsylvania, 2002.
[15] R.L. Kimball, L. Reeves, M. Ross, W. Thornthwaite, The Data Warehouse
Lifecycle Toolkit: Expert Methods for Designing, Developing, and
Deploying Data Warehouses, John Wiley & Sons, Inc., New York, 1998.
[16] M. Chaffin, B. Knight, T. Robinson, Professional SQL Server 2000 DTS,
Wrox Press Inc., Illinois, 2000.