You are on page 1of 28

Introduction to Information

Systems, 1 Edition
st

 Authors: Rainer, Turban and Potter


 Publisher: John Wiley & Sons, Inc.

 Slides by: Hellene Bankowski, Professor, Philadelphia University

Copyright 2007 John Chapter 4 1


Chapter 4

Data and Knowledge


Management

Copyright 2007 John Chapter 4 2


Chapter Outline

 4.1 Managing Data


 4.2 The Database Approach
 4.3 Database Management Systems
 4.4 Data Warehousing
 4.5 Data Visualization
 4.6 Knowledge Management

Copyright 2007 John Chapter 4 3


Learning Objectives

 Recognize the importance of data, issues involved


in managing data and their lifecycle.
 Describe the sources of data and explain how data
are collected.
 Explain the advantages of the database approach.
 Explain the operation of data warehousing and its
role in decision support.

Copyright 2007 John Chapter 4 4


Learning Objectives (Continued)

 Understand the capabilities and benefits of


data mining.
 Describe data visualization.
 Explain geographic information systems and
virtual reality as decision support tools.
 Define knowledge and describe the different
types of knowledge.

Copyright 2007 John Chapter 4 5


4.1 Managing Data

 Difficulties of Managing Data.


 Amount of data increases exponentially.
 Data are scattered and collected by many
individuals using various methods and devices.
 Data come from many sources including internal
sources, personal sources and external sources.
 Data security, quality and integrity are critical.

Copyright 2007 John Chapter 4 6


Managing Data (Continued)

 Clickstream data. Data that visitors and


customers produce when they visit a
Website.
 An ever-increasing amount of data needs to
be considered in making organizational
decisions.

Copyright 2007 John Chapter 4 7


Data Life Cycle

Copyright 2007 John Chapter 4 8


Data Hierarchy

 Bit (a binary digit): a circuit that is either on


or off.
 Byte: group of 8 bits, represents a single
character.
 Field: name, number, or characters that
describe an aspect of a business object or
activity.

Copyright 2007 John Chapter 4 9


Data Hierarchy (Continued)

 Record: collection of related data


fields.
 File (or table): collection of related
records.
 Database: a collection of integrated and
related files.

Copyright 2007 John Chapter 4 10


4.2 Database Approach

 Database management system (DBMS) provides all users


with access to all the data.
 DBMSs minimizes the following problems:
 Data redundancy: the same data stored in many places.

 Data isolation: applications cannot access data

associated with other applications.


 Data inconsistency: various copies of the data do not

agree.

Copyright 2007 John Chapter 4 11


Database Approach (Continued)

 DBMSs maximize the following issues:


 Data security.
 Data integrity: data meets certain constraints, no
alphabetic characters in zip code field.
 Data independence: applications and data are
independent of one another, all applications are
able to access the same data.

Copyright 2007 John Chapter 4 12


Designing the Database

 Data model. Diagram that represents the entities in


the database and their relationships.
 Entity is a person, place, thing or event.
 Attribute is a characteristic or quality of a particular entity.
 Primary key is a field that uniquely identifies that record.
 Secondary keys are fields that have identifying information
but may not identify with complete accuracy.

Copyright 2007 John Chapter 4 13


Entity-Relationship Modeling

 Database designers plan the database design in a


process called entity-relationship (ER) modeling.
 ER diagrams consists of entities, attributes and
relationships.
 Entity classes are a group of entities of a given
type, i.e. STUDENT.
 Instance is the representation of a particular entity,
i.e. STUDENT(John Smith, 123-45-6789, …).
 Identifiers are attributes unique to that entity
instance, i.e. StudentIDNumber.

Copyright 2007 John Chapter 4 14


4.3 Database Management
Systems
 Database management system (DBMS) is a set of
programs that provide users with tools to add,
delete, access and analyze data stored in one
location.
 Online transaction processing (OLTP) is when
transactions are processed as soon as they occur.
 Relational database model is based on the concept
of two-dimensional tables.
 Popular examples of relational databases are
Microsoft Access and Oracle.

Copyright 2007 John Chapter 4 15


Query Languages

 Structured query language (SQL) is the


most popular query language used to request
information.
 Query by example (QBE) is a grid or
template that a user fills out to construct a
sample or description of the data wanted.

Copyright 2007 John Chapter 4 16


Relational Database Management
Systems
 Normalization is a method for analyzing
and reducing a relational database to its most
streamlined form for:
 Mimimum redunancy;
 Maximum data integrity;
 Best processing performance.
 Normalized data is when attributes in the
table depend only on the primary key.

Copyright 2007 John Chapter 4 17


Virtual Databases

 Software applications that provide a way of


managing many different data sources as
though they were all one large database.
 Benefits of virtual databases include:
 Lower development costs;
 Faster development time;
 Less maintenance;
 Single point of entry into a company’s data.

Copyright 2007 John Chapter 4 18


4.4 Data Warehousing

 Data warehouse is a repository of historical


data organized by subject to support decision
makers in the organization and include:
 Online analytical processing which involves
the analysis of accumulated data by end users;
 Multidimensional data structure which allows
data to be represented in a three-dimensional
matrix (or data cube).

Copyright 2007 John Chapter 4 19


Benefits of Data Warehousing

 End users can access data quickly and easily


via Web browsers because they are located
in one place.
 End users can conduct extensive analysis
with data in ways that may not have been
possible before.
 End users have a consolidated view of
organizational data.

Copyright 2007 John Chapter 4 20


Data Marts & Data Mining

 Data mart is a small data warehouse,


designed for the end-user needs in a strategic
business unit (SBU) or a department.
 Data mining involves searching for valuable
business information in a large database, data
warehouse, or data mart.
 Used to predict trends and behaviors.
 Identify previously unknown patterns.

Copyright 2007 John Chapter 4 21


Data Mining Applications

 Retailing and sales. Predict sales, prevent theft and fraud,


determine correct inventory levels and distribution
schedules.
 Banking. Forecast levels of bad loans, fraudulent credit card
use, predict credit card spending by new customers, etc.
 Manufacturing and production. Predict machinery
failures, find key factors to help optimize manufacturing
capacity.
 Insurance. Forecast claim amounts, medical coverage costs,
predict which customers will buy new insurance policies.

Copyright 2007 John Chapter 4 22


Data Mining Applications
(Continued)
 Policework. Track crime patterns, locations,
criminal behavior; identify attributes to assist in
solving criminal cases.
 Health care. Correlate demographics of patients
with critical illnesses, develop better insight to
identify and treat symptoms and their causes.
 Marketing. Classify customer demographics to
predict how customers will respond to mailing or
buy a particular product.

Copyright 2007 John Chapter 4 23


4.5 Data Visualization
Technologies
 Geographic Information Systems (GIS) is a
computer-based system for capturing, integrating,
manipulating and displaying data using digitized
maps.
 Find locations for new restaurants.
 Emerging GIS applications integrated with global
positioning systems (GPSs).
 Virtual Reality is interactive, computer-generated,
three-dimensional graphics delivered to the user
through a head-mounted display.

Copyright 2007 John Chapter 4 24


4.6 Knowledge Management

 Knowledge management (KM) is a process that


helps organizations manipulate important
knowledge that is part of the organization’s
memory, usually in an unstructured format.
 Knowledge is information that is contextual,
relevant and actionable; information in action.
 Intellectual capital (or intellectual assets) is
another term often used for knowledge.

Copyright 2007 John Chapter 4 25


Knowledge Management
(Continued)
 Explicit knowledge deals with more objective, rational and
technical knowledge.
 Tacit knowledge is the cumulative store of subjective or
experiential learning.
 Knowledge management systems (KMSs) use modern
information technologies – Internet, intranets, extranets, data
warehouses - to systemize, enhance and expedite intrafirm
and interfirm knowledge management.
 Best practices are the most effective and efficient ways of
doing things, readily available to a wide range of employees.

Copyright 2007 John Chapter 4 26


Knowledge Management System
Cycle
 Create knowledge. Determine new ways.
 Capture knowledge. Identify as valuable.
 Refine knowledge. Make it actionable.
 Store knowledge. Store in a reasonable format.
 Manage knowledge. Verify it is relevant, accurate.
 Disseminate knowledge. Made available.

Copyright 2007 John Chapter 4 27


Copyright 2007
John Wiley & Sons, Inc.
All rights reserved. Reproduction or translation of this work
beyond that permitted in section 117 of the 1976 United
States Copyright Act without express permission of the
copyright owner is unlawful. Request for further
information should be addressed to the Permissions
Department, John Wiley & Sons, Inc. The purchaser may
make back-up copies for his/her own use only and not for
distribution or resale. The Publisher assumes no
responsibility for error, omissions, or damages caused by the
use of these programs or from the use of the information
herein.

Copyright 2007 John Chapter 4 28

You might also like