You are on page 1of 33

1

Chapter 6
Foundations of Business Intelligence: Databases and Information
Management

Learning Objectives

What are the problems of managing data resources in a traditional file environment and how are
they solved by a database management system?
What are the major capabilities of database management systems (DBMS) and why is a
relational DBMS so powerful?
What are some important principles of database design?
What are the principal tools and technologies for accessing information from databases to
improve business performance and decision making?
Why are information policy, data administration, and data quality assurance essential for
managing the firm’s data resources?

Chapter Outline
6.1 Organizing Data in a Traditional File Environment
File Organization Terms and Concepts
Problems with the Traditional File Environment
6.2 The Database Approach to Data Management
Database Management Systems
Capabilities of Database Management Systems
Designing Databases
6.3 Using Databases to Improve Business Performance and Decision Making
Data Warehouses
Business Intelligence, Multidimensional Data Analysis, and Data Mining
Databases and the Web
6.4 Managing Data Resources
Establishing an Information Policy
Ensuring Data Quality
6.5 Hands-On MIS
Management Decision Problems
Improving Operational Excellence: Building a Relational Database for Inventory
Management
Improving Decision Making: Online Databases for Overseas Business Resources

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


2

Key Terms

The following alphabetical list identifies the key terms discussed in this chapter. The page
number for each key term is provided.

Attributes, 210 Entity-relationship diagram, 220


Business intelligence (BI), 226 Field, 210
Data administration, 233 File, 210
Data cleansing, 234 Foreign key, 215
Data definition, 217 Information policy, 231
Data dictionary, 217 Key field, 214
Data governance, 233 Normalization, 219
Data inconsistency, 211 Object-oriented DBMS, 215
Data manipulation language, 217 Object-relational DBMS, 215
Data mart, 225 Online analytical processing (OLAP), 226
Data mining, 228 Predictive analysis, 229
Data quality audit, 234 Primary key, 214
Data redundancy, 211 Program-data dependence, 211
Data warehouse, 223 Record, 214
Database, 212 Referential integrity, 220
Database (rigorous definition), 212 Relational database, 213
Database administration, 233 Repeating data groups, 219
Database management system (DBMS), 212 Structured Query Language (SQL), 217
Database server, 230 Text mining, 229
Distributed database, 221 Tuple, 214
Entity, 210 Web mining, 229

Teaching Suggestions

The essential message of this chapter is the statement that “Organizations need to manage their
data assets very carefully to make sure that the data are easily accessed and used by managers
and employees across the organization.” Data have now become central and even vital to an
organization’s survival. You can illustrate these comments by referencing the opening case,

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


3

“Can HP Mine Success from an Enterprise Data Warehouse?,” in order to stress the importance
of data and database systems for success in business.

Without a consistent view of the enterprise, HP senior executives struggled with decisions on
matters such as the size of sales and service teams assigned to particular systems. HP had too
many different information system applications in too many computer centers. It had too many
different database technologies and way too many different databases. As with most
organizations, departments were allowed to create, manage and use their own databases without
regard towards sharing the data with other departments—islands of information at their finest.
HP CIO Randy Mott began consolidating hundreds of data marts into a single data warehouse.
He had three goals for the database: it had to always be up-to-date, consistent for the entire
enterprise, and complete. The Neoview system includes all of the data used by a company and
not just partial segments of data or the company.

What’s interesting and intriguing about the opening vignette is how it points out that every
organization, even a technology company like HP, struggles with the need to manage data and
information as an important resource. How businesses store, organize, and manage their data has
a tremendous impact on organizational effectiveness. Companies need to manage their data to
help them reduce costs, improve operational efficiency and decision making, and most of all,
boost profitability.

Section 6.1, “Organizing Data in a Traditional File Environment,” introduces basic key terms
like field, record, file, database, entity and attribute. Try using a simple spreadsheet print-out to
demonstrate these terms. The section points out the drawbacks and difficulties organizations
experience with traditional methods of file management. They are: systems grow independently
without a company-wide plan; data redundancy (duplicate data in multiple data files) and data
inconsistency (the same attribute may have different values); program-data dependence, lack of
flexibility in delivering information when it’s needed; poor security, and the lack of being able to
share data and have it easily available to users.

Section 6.2, “The Database Approach to Data Management.” This section introduces students
to more file organization terms and concepts. A database management system is comprised of
three components: a data definition language, data dictionary, and data manipulation language. If
you have access to a relational DBMS during class time, you can demonstrate several of the
concepts presented in this section.

Database design and management requirements for database systems are introduced. Help your
students see how a logical design allows them to analyze and understand the data from a business
perspective, while physical design shows how the database is arranged on direct access storage
devices. At this point, you can use the enrollment process at your university as an example. Have
your students prepare a logical design for the enrollment process. If you have time and as a class
activity, ask your students to prepare an entity-relationship diagram (using Figure 6-11 as a
guide), as well as normalize the data. Your students will need guidance from you to complete
this activity, but it will help them see and understand the logical design process.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


4

Section 6.3, “Using Databases to Improve Business Performance and Decision Making.” This
section focuses on how data technologies are actually used: data warehouses, data marts,
business intelligence, multidimensional data analysis, and data mining. Regardless of their
career choice, students will probably use some or all of these in their jobs. For example, data
warehouses and data marts are important to many people, partly because they are critical for
those who want to use data mining, which in turn has many uses in management analysis and
business decisions. Keep in mind as you teach this chapter that managing data resources can be
very technical, but many students will need and want to know the business uses and business
values. In the end, effectively managing data is the goal. Doing it in a way that will enable your
students to contribute to the success of their organization is the reason why most students are in
this course.

This section discusses text mining and Web mining that are taking on significance as more data
and information is stored in text documents and on the Web. Web mining is divided into three
categories: Web content mining, Web structure mining, and Web usage mining. Each one
provides specific information on patterns in Web data.

Interactive Session: Organizations: The Internal Revenue Service Uncovers Tax Fraud
with a Data Warehouse

Case Study Questions:

1. Why was it so difficult for the IRS to analyze the taxpayer data it had collected?

Initially, IRS data were stored in legacy systems designed to process tax return forms efficiently
and organized in many different formats, including hierarchical mainframe databases, Oracle
relational databases, and non-database “flat” files. The data in the older style hierarchical
databases and “flat” files were nearly impossible to query and analyze and could not easily be
combined with the relational data.

2. What kind of challenges did the IRS encounter when implementing its CDW? What
management, organization, and technology issues had to be addressed?

The challenges the IRS encountered when it implemented its CDW include:

Management: Convincing the organization to undergo a sweeping upgrade like a data


warehouse implementation was not easy, since government agencies are normally risk-adverse
and resist changes. Data warehouses require extensive effort to keep up-to-date.

Organization: The structure of data wasn’t consistent because of tax law changes through the
years. This made integration of the data a complicated process. The sheer amount of data that the
CDW was slated to manage was far more than anything the IRS had previously handled. Data
warehouses tend to require extensive amounts of money to keep up-to-date.

Technology: The CDW has grown in capacity from three terabytes at its creation in the late
1990s to approximately 150 terabytes of data. The most important feature of the data warehouse

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


5

was that it be sufficiently large to accommodate multiple terabytes of data, but also accessible
enough to allow queries of its data using many different tools. The components that the IRS
selected allowed CDW to do that. Conversion of the legacy data to the new system was not a
uniform process.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


6

3. How did the CDW improve decision making and operations at the IRS? Are there
benefits to taxpayers?

The CDW enables highly flexible queries against one of the largest databases in the world. IRS
researchers can now search and analyze hundreds of millions or even billions of records at one
time using a centralized source of accurate and consistent data instead of having to reconcile
information from multiple inconsistent sources. The CDW allows the agency to recoup many
billions of dollars in tax revenue that was lost under the old system. In 2006 the IRS collected
$59.2 billion in additional revenue via 1.4 million audits of taxpayers questioned for
underreporting taxes. Using the data warehouse, analysts are able to determine patterns in groups
of people most likely to cheat on their taxes. The data warehouse reduced the time it takes to
trace mistakes in claims and analyze data from six to eight months to only a few hours. The
CDW is more secure than the old legacy system storage tapes, thereby better protecting taxpayer
data.

4. Do you think data warehouses could be useful in other areas of the federal sector?
Which ones? Why or why not?

Other federal agencies that might find data warehouses useful include:
 Department of Defense: maintain all personnel data from all four branches of the military
including active duty, Guard, Reserve, and retired people. During times of war or
national emergencies the data warehouse could supply information on people most
qualified and available to respond to the emergency. All kinds of information and
analyses could be performed if the data were consistent and complete.
 Federal Trade Commission: could combine data on consumer-related activities into one
data warehouse that would be available to all branches of government and private
organizations. Data could help analyze economic situations and factors so that businesses
and governments could make faster and better decisions.

MIS In Action

1. Go to www.irs.gov and download a 1040 tax return form. What are some of the fields
from that form that would most likely be included in the CDW?

Obviously personal identification information like name, address, and social security number are
included in the CDW. Probably all of the rest of the information like filing status, wages,
exemptions, other sources of income, and taxes and credits are maintained in the CDW.

2. How could IRS researchers use these data to determine whether someone was
underreporting (cheating on) taxes?

Using data mining techniques, the IRS can determine typical income and deduction patterns. By
matching the submitted forms against the patterns, the IRS can deduce whether someone is
underreporting their taxes.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


7

Interactive Session: Technology: The Databases Behind MySpace

Case Study Questions

1. What kind of databases and database servers does MySpace use?

In its initial phases, MySpace operated with two Web servers communicating with one database
server and a Microsoft SQL Server database. The site continued adding Web servers to handle
increased user requests. After the number of accounts exceeded 500,000 the site added more
SQL Server databases: one served as a master database, the others focused on retrieving data for
user page requests. After two million accounts were activated, MySpace switched to a vertical
partitioning model in which separate databases supported distinct functions of the Web site.
After three million accounts, the site scaled out by adding many cheaper servers to share the
database workload.

It eventually switched to a virtualized storage architecture in which databases write data to any
available disk, thus eliminating the possibility of an application’s dedicated disk becoming
overloaded. MySpace later installed a layer of servers between the database servers and the Web
servers to store and serve copies of frequently accessed data objects so that the site’s Web
servers wouldn’t have to query the database servers with lookups as frequently.

2. Why is database technology so important for a business such as MySpace?

Almost everything MySpace receives from and serves to its users are data objects like pictures,
audio files and video files. The objects are very individualized and attached to a certain entity
(person). Its databases must make the objects readily available to anyone requesting access to
that entity. Database technology is the only technology that accomplishes the mission.

3. How effectively does MySpace organize and store the data on its site?

In its infancy, MySpace used two Web servers communicating with one database server. That
was adequate when the site had a small number of users who were updating or accessing
database objects. Obviously that won’t work with tens of millions of users. Unfortunately,
MySpace still overloads more frequently than other major Web sites. With a log-in error rate of
20 to 40 percent on some days, the site is not effectively organizing or storing data at all.

4. What data management problems have arisen? How has MySpace solved, or attempted
to solve, these problems?

Some of the problems MySpace has encountered are inadequate storage space on its database
servers, slow access or no access through its log-in application, and users’ inabilities to access
data. Over the years, MySpace has attempted to fix these problems by adding more Web servers
and more database servers. Some were simply “added on” without restructuring the entire
system to more efficiently use its hardware and software. Workloads were not distributed evenly

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


8

between servers which caused inefficient use of resources. MySpace developers continue to
redesign the Web site’s database, software, and storage systems, to keep pace with its exploding
growth, but their job is never done.

MIS In Action

Explore MySpace.com, examining the features and tools that are not restricted to
registered members. Then answer the following questions:

1. Based on what you can view without registering, what are the entities in MySpace’s
database?

Obviously, individual users are the main entity in MySpace’s databases. Other entities are video
files, audio files, blogs, forums, groups, events, favorites, and email.

2. Which of these entities have some relationship to individual members?

Which of the entities have a relationship to individual members depends on what the individual
decides. For instance, it’s possible that Sarah would have a list of films (video files) attached to
her profile. She may also participate in forums or groups. It’s possible that all the entities have
some relationship to individual members.

3. Select one of these entities and describe the attributes for that entity.

Films included in MySpace’s databases likely have these attributes: name, date produced, date
released, actors, actresses, director, subject, place it was filmed, musical scores included in the
film, awards given to the film, comments of film goers, and critics’ ratings.

Section 6.4, “Managing Data Resources.” This section introduces students to some of the
critical management issues surrounding corporate data. Students should realize that setting up
the database is only the beginning of the process. Managing the data is the real challenge. In
fact, the main point is to show how data management has changed and the reason why data must
be organized, accessed easily by those who need it, and protected from the wrong people
accessing, modifying, or harming the data.

Developing a database environment requires much more than selecting database technology. It
requires a formal information policy governing the maintenance, distribution, and use of
information in the organization. The organization must also develop a data administration
function and a data-planning methodology. Data planning may need to be performed to make
sure that the organization’s data model delivers information efficiently for its business processes
and enhances organizational performance. There is political resistance in organizations to many
key database concepts, especially the sharing of information that has been controlled exclusively
by one organizational group. Data governance, with special emphasis on promoting privacy,
security, data quality, and compliance with government regulations is becoming more important.
If you have time, you might have students brainstorm on a data governance policy they would

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


9

like to see implemented in your university. Creating a database environment is a long-term


endeavor requiring large up-front investments and organizational change.

Section 6.5, “Hands-On MIS”

Management Decision Problems

1. Emerson Process Management: data warehouse was full of inaccurate and redundant data
gathered from numerous transaction processing systems. The design team assumed all users
would enter data the same way. Users actually entered data in multiple ways. Assess the
potential business impact of these data quality problems. What decisions have to be made and
steps taken to reach a solution?

Managers and employees can’t make accurate and timely decisions about customer activity
because of inaccurate and redundant data. The company could be wasting resources pursuing
customers it shouldn’t and neglecting its best customers. The company could be experiencing
financial losses resulting from the inaccurate data.

Managers, employees, and data administrators need to identify and correct the faulty data and
then establish better routines for editing data when it’s entered. The company should perform a
data quality audit by surveying entire data files, surveying samples from data files, or surveying
end users for perceptions of data quality. The company needs to perform data cleansing
operations to correct errors and enforce consistency among the different sets of data at their
origin.

2. Industrial supply company: the company wants to create a single data warehouse by
combining several different systems. The sample files from the two systems that would supply
the data for the data warehouse contain different data sets.

1. What business problems are created by not having these data in a single standard format?

Managers are unable to make good decisions about the company’s sales and products because of
inconsistent data. Managers can’t determine which products are selling the best world-wide; they
can only determine product sales by region.

2. How easy would it be to create a database with a single standard format that could store the
data from both systems? Identify the problems that would have to be addressed.

It may not be too hard to create a database with a single standard format if the company used
middleware to pull both data sets into the consolidated database. The company should use
specialized data-cleansing software that would automatically survey data files, correct errors in
the data, and integrate the data in a consistent company-wide format. Problems that may occur
would stem from inconsistent data names like the Territory and Customer ID in the old sets and
data element names like Division in the new set. The data administrators, managers, and
employees may have to track the data conversion and manually convert some data.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


10

3. Should the problems be solved by database specialist or general business managers? Explain.

Both the database specialist and general business managers should help solve the problems. Data
administrators are responsible for developing information policy, planning for data, overseeing
logical database design and data dictionary development, and monitoring how information
system specialists and end-user groups use data. However, end-users and business managers
have the final decision-making authority and responsibility for the data.

4. Who should have the authority to finalize a single company-wide format for this information
in the data warehouse?

Owners and managers are the only ones who have the authority to finalize the format for the
information in the data warehouse. They could develop an information policy that specifies the
organization’s rules for sharing, disseminating, acquiring, standardizing, classifying, and
inventorying information.

Improving Operational Excellence, Building a Relational Database for Inventory


Management

Software skills: Database design, querying and reporting


Business Skills: Inventory Management

This exercise requires that students know how to create queries and reports using information
from multiple tables. The solutions provided here were created using the query wizard and report
wizard capabilities of Access. Students can, of course, create more sophisticated reports if they
wish.

The database would need some modification to answer other important questions about the
business. The owners might want to know, for example, which are the fastest-selling bicycles.
The existing database shows products in inventory and their suppliers. The owners might want to
add an additional table (or tables) in the database to house information about product sales, such
as the product identification number, date placed in inventory, date of sale, purchase price, and
customer name, address, and telephone number. Management could use this enhanced database
to create reports on best selling bikes over a specific period, the number of bicycles sold during a
specific period, total volume of sales over a specific period, or best customers. Students should
be encouraged to think creatively about what other pieces of information should be captured on
the database that would help the owners manage the business.

The answers to the following questions can be found in the Microsoft Access File named:
MIS11ch05solutionfile.mdb.

1. Prepare a report that identifies the five most expensive bicycles. The report should list the
bicycles in descending order from most expensive to lease expensive, the quantity on hand
for each, and the markup percentage for each.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


11

2. Prepare a report that lists each supplier, its products, their quantities on hand, and associated
reorder levels. The report should be sorted alphabetically by supplier. Within each supplier
category, the products should be sorted alphabetically.

3. Prepare a report listing only the bicycles that are low in stock and need to be reordered. The
report should provide supplier information for the items identified.

4. Write a brief description of how the database could be enhanced to further improve
management of the business. What tables or fields should be added? What additional reports
would be useful?

Improving Decision Making: Searching Online Databases for Overseas Business Resources

Software skills: Online databases


Business skills: Researching services for overseas operations

List the companies you would contact to interview on your trip to determine whether they
can help you with these and any other functions you think vital to establishing your office.

Student answers will vary based on the companies they choose to contact.

Rate the databases you used for accuracy of name, completeness, ease-of-use, and general
helpfulness.

The U.S. Department of Commerce Web site contains a fair amount of economic information.
However, it may be simpler to direct your students to go to http://www.aol.com. The Web site
for the Nationwide Business Directory of Australia is http://www.nationwide.com.au

What does this exercise tell you about the design of databases?

Students may not understand that the World Wide Web is one massive data warehouse, but in
non-technical terms that is exactly what it is. Remind them of this when they are completing this
assignment. This assignment may best be accomplished in groups, where they can consolidate
their findings into a written or oral presentation.

Review Questions

1. What are the problems of managing data resources in a traditional file


environment and how are they solved by a database management system?
‫ما هي مشاكل إدارة البيانات في بيئة ملفات تقليدية؟‬
‫تقنيات إدارة الملفات التقليدية تجعل من الصعب على المؤسسات‬

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


‫‪12‬‬

‫أج &&زاء البيان &&ات ال &&تي يس &&تخدمونها& بطريق &&ة منهجي &&ة وتنظيم& ه &&ذه البيان &&ات بحيث يمكن بس &&هولة لوص &&ول‪ .‬تم‬
‫تطوير مناطق ومجموعات وظيفية مختلفة بشكل مستقل‪. .‬‬
‫مع مرور& الوقت ‪،‬الوقت‪ ،‬فإن إدارة الملفات التقليدية هذه تتسم بالتكرار وعدم االتساق ‪،‬االتس&&اق‪ ،‬واالعتم&&اد‬
‫على البيان&&ات البرنامجية ‪ ،‬وع&&دم المرون&&ة ‪ ،‬وض&&عف& األمن ‪ ،‬واالفتق&&ار إلى تب&&ادل البيان&&ات والت&&وافر‪ .‬نظ&&ام‬
‫إدارة قواعد البيانات (‪ )DBMS‬يحل هذه المشاكل مع‬
‫ال &&برامج ال &&تي تس &&مح لمركزي &&ة البيان &&ات وإ دارة البيان &&ات أن الش &&ركات ل &&ديها مص &&در واحد لجمي &&ع احتياج &&ات‬
‫البيانات الخاصة بهم‪ .‬استخدام& نظام إدارة قواعد البيانات يقلل من التكرار وغير متناسق‬
‫قوائم االنتظار‪.‬‬
‫‪2.‬‬ ‫‪List and describe each of the components in the data hierarchy.‬‬
‫قائمة ووصف كل من المكونات في التسلسل الهرمي للبيانات‪.‬‬

‫يتض&&من التسلس&&ل اله&&رمي& للبيان&&ات وح&&دات الب&&ايت والب&&ايت والحق&&ول والس&&جالت والملف&&ات وقواع&&د البيان&&ات‪.‬‬
‫يتم تنظيم البيان &&ات في تسلس &&ل ه &&رمي يب &&دأ بـ ‪ ، bit‬وال &&ذي يتم تمثيل &&ه إم &&ا بـ ‪( 0‬إيق &&اف) أو ‪( 1‬على)‪ .‬يمكن‬
‫تجمي&&ع البت&&ات لتتش&&كل حس&&ب الح&&رف أو ال&&رقم أو الرم&&ز‪ .‬يمكن تجمي&&ع وح&&دات الب&&ايت لتش&&كيل حق&&ل ‪ ،‬مث&&ل‬
‫اسم أو تاريخ ‪ ،‬ويمكن تجميع الحقول ذات الصلة لتكوين سجل‪ .‬يمكن تجميع السجالت ذات الصلة لتش&&كيل‬
‫الملفات ‪ ،‬ويمكن تنظيم& الملفات ذات الصلة في قاعدة بيانات‪.‬‬

‫‪3.‬‬ ‫‪Figure 6–1 shows the data hierarchy. The data hierarchy includes bits,‬‬
‫‪bytes, fields, records, files, and databases. Data are organized in a‬‬
‫‪hierarchy that starts with the bit, which is represented by either a 0 (off) or‬‬
‫‪a 1 (on). Bits can be grouped to form a byte to represent one character,‬‬
‫‪number, or symbol. Bytes can be grouped to form a field, such as a name‬‬
‫‪or date, and related fields can be grouped to form a record. Related‬‬
‫‪records can be collected to form files, and related files can be organized‬‬
‫‪into a database.‬‬
‫‪4.‬‬
‫‪5.‬‬ ‫‪Define and explain the significance of entities, attributes, and key fields.‬‬

‫تعريف وشرح أهمية الكيانات والسمات والحقول الرئيسية‪.‬‬

‫• الكيان هو الشخص أو المكان أو الشيء أو الحدث الذي يمكن الحصول على المعلومات منه‪.‬‬
‫معينا‪.‬‬
‫كيانا ً‬
‫• السمة هي جزء من المعلومات التي تصف ً‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪13‬‬

‫• اﻟﺤﻘﻞ اﻟﺮﺋﻴﺴﻲ هﻮ اﻟﺤﻘﻞ اﻟﺬي ﻳﺘﻢ ﻓﻴﻪ ﺗﺤﺪﻳﺪ اﻟﺴﺠﻼت ﺑﺼﻮرة ﻓﺮﻳﺪة واﻟﺘﻲ ﻳﻤﻜﻦ اﺳﺘﺮدادهﺎ أو ﺗﺤﺪﻳﺜﻬﺎ‬
‫أو ﻓﺮزﻬﺎ ﻓﻘﻂ‪ .‬على سبيل المثال ‪ ،‬اسم الشخص فريد ألنه فريد‪ .‬كما يمكن تصميم منتج فري&د& ليك&&ون فري&دا‬
‫من نوعه‪.‬‬

‫‪6.‬‬ ‫‪Entity is a person, place, thing, or event on which information can be‬‬
‫‪obtained.‬‬
‫‪7.‬‬ ‫‪Attribute is a piece of information describing a particular entity.‬‬
‫‪8.‬‬ ‫‪Key field is a field in a record that uniquely identifies instances of that‬‬
‫‪unique record so that it can be retrieved, updated, or sorted. For‬‬
‫‪example, a person’s name cannot be a key because there can be another‬‬
‫‪person with the same name, whereas a social security number is unique.‬‬
‫‪Also a product name may not be unique but a product number can be‬‬
‫‪designed to be unique.‬‬
‫‪9.‬‬
‫‪10.‬‬ ‫‪List and describe the problems of the traditional file environment.‬‬
‫قائمة ووصف مشاكل بيئة الملفات التقليدية‪&.‬‬

‫مشاكل في بيئة البيانات التقليدي&ة والتك&رار& واالرتب&اك& ونقص المرون&ة وانع&دام& المرون&ة وقل&ة تب&ادل البيان&ات‪.‬‬
‫تكرار البيانات هو وجود& بيان&ات مك&ررة في ملف&ات بيان&ات متع&ددة‪ .‬في ه&ذه الحال&ة ‪ ،‬ينتج عن االرتب&اك& ألن‬
‫البيان &&ات يمكن أن يك &&ون له &&ا مع &&ان مختلف &&ة في ملف &&ات مختلف &&ة‪ .‬االعتم &&اد على بيان &&ات البرن &&امج ه &&و العالق &&ة‬
‫الض&&يقة بين البيان&&ات المخزن&&ة في الملف&&ات وال&&برامج المح&&ددة المطلوب&&ة‪ .‬ه&&ذه التبعي&&ة غ&&ير فعال&&ة للغاي&&ة ‪ ،‬مم&&ا‬
‫أدى إلى الحاجة إلى إجراء تغييرات في العديد من البرامج عندما يتغ&&ير مث&&ل ه&&ذه البيان&&ات المش&&تركة ‪ ،‬مث&&ل‬
‫حجم الرم& &&ز البري& &&دي‪ .‬يش& &&ير ع& &&دم المرون& &&ة إلى حقيق& &&ة أن& &&ه من الص& &&عب للغاي& &&ة إنش& &&اء بيان& &&ات جدي& &&دة‪ .‬من‬
‫المس&&تحيل إنش&&اء تق&&ارير مخصص&&ة ؛ ق&&د يتطلب تقري&&ر جدي&&د أك&&ثر من برن&&امج واح&&د وإ نش&&اء ملف&&ات وس&&يطة‬
‫لدمج البيانات من الملفات المتباين&ة‪ .‬النت&ائج األمني&ة الس&يئة من ع&دم الس&يطرة على البيان&ات واس&عة االنتش&ار‪.‬‬
‫تقريبا مشاركة البيانات ألنه يتم توزيعها& في العديد من الملفات المختلفة حول المؤسسة‪.‬‬
‫ً‬ ‫من المستحيل‬

‫‪11.‬‬ ‫‪Problems with the traditional file environment include data redundancy‬‬
‫‪and confusion, program-data dependence, lack of flexibility, poor security,‬‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


14

and lack of data sharing and availability. Data redundancy is the presence
of duplicate data in multiple data files. In this situation, confusion results
because the data can have different meanings in different files. Program-
data dependence is the tight relationship between data stored in files and
the specific programs required to update and maintain those files. This
dependency is very inefficient, resulting in the need to make changes in
many programs when a common piece of data, such as the zip code size,
changes. Lack of flexibility refers to the fact that it is very difficult to
create new reports from data when needed. Ad-hoc reports are impossible
to generate; a new report could require several weeks of work by more
than one programmer and the creation of intermediate files to combine
data from disparate files. Poor security results from the lack of control
over the data because the data are so widespread. Data sharing is virtually
impossible because it is distributed in so many different files around the
organization.
12.
13. Define a database and a database management system and
describe how it solves the problems of a traditional file
environment.
)‫تعريف قاعدة بيانات ونظام إدارة قاعدة بيانات ووصف كيف يحل مشاكل بيئة الملفات التقليدية& (غير مهم‬.

‫ ومن الممكن تخزينه& &&ا‬، ‫قاع& &&دة البيان& &&ات هي مجموع& &&ة من البيان& &&ات منظم& &&ة إلى ع& &&دد كب& &&ير من التطبيق& &&ات‬
‫)& ه&و برن&امج خ&اص يم ّكن‬DBMS( ‫ نظ&ام إدارة قواع&د البيان&ات‬.‫ كما أنه يقلل من البيانات المتك&ررة‬.‫بكفاءة‬
‫المؤسس &&ات من مركزي &&ة البيان &&ات وإ دارته &ا& بكف &&اءة وتوف &&ير الوص &&ول إلى البيان &&ات المخزن &&ة بواس &&طة ب &&رامج‬
.‫التطبيقات‬

‫ ويقل&&ل من تك&&اليف‬، ‫) أن يقل&&ل من تعقي&&د بيئ&&ة أنظم&&ة المعلوم&&ات‬DBMS( ‫يمكن لنظ&&ام إدارة قواع&&د البيان&&ات‬
‫ وتحس&ين الوص&ول‬، ‫ وي&تيح اس&ترجاع المعلوم&ات المخصص&ة‬، ‫ ويع&زز المرون&ة‬، ‫تطوير البيان&ات والص&يانة‬
.‫ واألمن‬، &‫ واستخدامها‬، ‫ والسماح لإلدارة المركزية للبيانات‬، ‫& المعلومات‬.‫والتوافر‬
A database is a collection of data organized to service many applications efficiently by
storing and managing data so that they appear to be in one location. It also minimizes
redundant data. A database management system (DBMS) is special software that permits an
organization to centralize data, manage them efficiently, and provide access to the stored data
.by application programs

A DBMS can reduce the complexity of the information systems environment, reduce data
redundancy and inconsistency, eliminate data confusion, create program-data independence,

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


‫‪15‬‬

‫‪reduce program development and maintenance costs, enhance flexibility, enable the ad hoc‬‬
‫‪retrieval of information, improve access and availability of information, and allow for the‬‬
‫‪.centralized management of data, their use, and security‬‬

‫‪14.‬‬ ‫‪2. What are the major capabilities of DBMS and why is a relational DBMS‬‬
‫?‪so powerful‬‬
‫قويا للغاية؟‬
‫ما هي القدرات الرئيسية لنظام إدارة قواعد البيانات (‪ )DBMS‬ولماذا يكون نظام ‪ DBMS‬العالئق ً‬
‫تتض & &&من الق & &&درات األساس & &&ية لنظ & &&ام إدارة قواع & &&د البيان & &&ات ( ‪ )DBMS‬إمكاني & &&ة البيان & &&ات ‪ ،‬وقابلي & &&ة ق & &&اموس‬
‫البيان &&ات ‪ ،‬ولغ &&ة معالج &&ة البيان &&ات‪ .‬تح &&دد ق &&درة تعري &&ف البيان &&ات الهيك &&ل ومحت &&وى قاع &&دة البيان &&ات‪ .‬ق &&اموس‬
‫البيان&ات ه&&و دلي&ل آلي يخ&زن المعلوم&ات ح&&ول البيان&ات الموج&&ودة في قاع&دة البيان&ات ‪ ،‬بم&&ا في ذل&ك األس&&ماء‬
‫والتعريفات واألشكال& واألوصاف لعناصر البيانات‪.‬‬
‫لغ&ة معالج&ة البيان&ات ‪ ،‬مث&ل لغ&ة االس&تعالمات البنيوي&ة ‪ ،‬هي لغ&ة متخصص&ة للوص&ول& والتعامل م&ع البيان&ات‬
‫في قاعدة البيانات‪.‬‬
‫لق&&د ك&&انت قاع&&دة البيان&&ات العالئقي&&ة هي الطريق&&ة األساس&&ية لتنظيم البيان&&ات والحف&&اظ عليها ألنه&&ا مرن&&ة للغاي&&ة‬
‫ويمكن الوصول إليها‪ .‬ينظم البيانات في الجداول ثنائية األبعاد‬
‫دعم العالقات مع الصفوف& واألعمدة‪ .‬يحتوي& كل جدول على بيانات حول الكيان وسماته‪.‬‬
‫يمث&&ل ك&&ل ص&&ف س&&جالً ويمث&&ل ك&&ل عم&&ود س&&مة أو حق&&ل‪ .‬ك&&ل ج&&دول يحت&&وي& أيضا حق&&ل رئيس&&ي لتعري&&ف ك&&ل‬
‫سجل بشكل فريد لالسترجاع& أو التالعب‪ .‬جداول قاعدة البيانات العالئقية‬
‫يمكن دمج عنصر بيانات مشترك‪ .‬أصبحت قواعد البيانات غير العالئقي&ة ش&ائعة إلدارة أن&واع البيان&ات ال&تي‬
‫ال يمكن التعامل معها‬
‫من خالل نموذج البيانات العالئقية‪ .‬كالهما عالقي وغير عالقي‬
‫خدمات الحوسبة السحابية‪.‬‬
‫مادي&&ا‪ .‬تص &&ميم& منطقي نم &&اذج قاع &&دة البيان &&ات من‬‫&ميما& ً‬‫منطقي&&ا وتص & ً‬
‫ً‬ ‫&ميما‬
‫يتطلب تص &&ميم قاع &&دة البيان &&ات تص & ً‬
‫منظ& &&ور& األعم& &&ال‪ .‬يجب أن يعكس نم& &&وذج بيان& &&ات المنظم& &&ة أعماله& &&ا الرئيس& &&ية العملي& &&ات ومتطلب& &&ات ص& &&نع‬
‫القرار‪ .‬عملية إنشاء صغيرة ومستقرة ومرنة ‪ ،‬والتكيف‬
‫يطلق على هياكل البيانات من مجموعات معقدة من البيانات عند تصميم قاعدة بيانات عالئقية التطبيع‪.‬‬
‫تحتوي قاع&دة بيان&&ات عالئقي&ة جي&&دة التص&&ميم على عالق&&ات أط&راف& ب&&أطراف كي&ان مح&دد لن ينطب&ق إال على‬
‫ه&&ذا الكي&&ان‪ .‬س&&يحاول تط&&بيق& قواع&&د التكام&&ل الم&&رجعي لض&&مان ذلك تظ&&ل العالق&&ات بين الج&&داول المزدوج&&ة‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪16‬‬

‫متناس&قة‪ .‬رس&م تخطيطي لعالق&ة كي&ان يص&ور بش&كل بي&اني العالق&ة بين الكيان&ات (الج&داول) في قاع&دة بيان&ات‬
‫عالئقية‪.‬‬

‫‪15.‬‬ ‫‪Name and briefly describe the capabilities of a DBMS.‬‬


‫اسم ووصف موجز إلمكانيات نظام ‪.DBMS‬‬

‫يتض&&من نظ&&ام ‪ DBMS‬ق&&درات وأدوات لتنظيم وإ دارة والوص&&ول& إلى البيان&&ات في قاع&&دة البيان&&ات‪ .‬تتض&&من‬
‫القدرات األساسية لنظام ‪ DBMS‬لغة البيانات وقاموس& البيانات ولغة معالجة البيانات‪.‬‬
‫تحدد لغة تعريف البيانات بنية ومحتوى قاعدة البيانات‪.‬‬
‫• قاموس البيان&ات ه&&و مل&ف آلي أو ي&دوي يتض&من معلوم&&ات ح&&ول البيان&ات الموج&&ودة في قاع&&دة البيان&&ات ‪،‬‬
‫بما في ذلك األسماء والتعريفات واألشكال& واألوصاف لعناصر البيانات‪.‬‬
‫• لغ&&&ة معالج&&&ة البيان&&&ات ‪ ،‬مث&&&ل ‪ ، SQL‬هي لغ&&&ة متخصص&&&ة للوص &&ول إلى البيان &&ات ومعالجته&&&ا في قاع&&&دة‬
‫البيانات‪.‬‬

‫‪A DBMS includes capabilities and tools for organizing, managing, and accessing the data in‬‬
‫‪the database. The principal capabilities of a DBMS include data definition language, data‬‬
‫‪.dictionary, and data manipulation language‬‬
‫‪.The data definition language specifies the structure and content of the database ‬‬
‫‪The data dictionary is an automated or manual file that stores information about the data ‬‬
‫‪.in the database, including names, definitions, formats, and descriptions of data elements‬‬
‫‪The data manipulation language, such as SQL, is a specialized language for accessing ‬‬
‫‪.and manipulating the data in the database‬‬

‫‪16.‬‬ ‫‪Define a relational DBMS and explain how it organizes data.‬‬


‫يحتوي تعريف ‪ DBMS‬العالئقية ويشرح كيفية تنظيم البيانات‪.‬‬

‫تع & &&د قاع & &&دة البيان & &&ات العالئقي & &&ة هي الطريق & &&ة األساس & &&ية لتنظيم البيان & &&ات والحف & &&اظ عليه & &&ا الي & &&وم في أنظم & &&ة‬
‫المعلومات‪ .‬ينظم البيانات في جداول ثنائية األبعاد م&ع ص&فوف وأعم&دة تس&مى العالق&ات‪ .‬يحت&وي ك&ل ج&دول‬
‫على بيانات حول الكيان وسماته‪ .‬يمثل كل ص&ف س&جالً ويمث&ل ك&ل عم&ود س&مة أو حق&ل‪ .‬يحت&وي& ك&ل ج&دول‬
‫أيضا على حقل رئيسي لتعريف& كل سجل بشكل فريد لالسترجاع& أو التالعب‪.‬‬
‫ً‬
‫‪The relational database is the primary method for organizing and maintaining data today in‬‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪17‬‬

‫‪information systems. It organizes data in two-dimensional tables with rows and columns‬‬
‫‪called relations. Each table contains data about an entity and its attributes. Each row‬‬
‫‪represents a record and each column represents an attribute or field. Each table also contains‬‬
‫‪a key field to uniquely identify each record for retrieval or manipulation.‬‬

‫‪List and describe the three operations of a relational DBMS.‬‬

‫قائمة ووصف العمليات الثالث من ‪ DBMS‬العالئقية‪.‬‬

‫في قاع&&دة بيان&&ات عالئقي&&ة ‪ ،‬يتم اس&&تخدام& ثالث عملي&&ات أساس&&ية لتط&&وير& مجموع&&ات مفي&&دة من البيان&&ات ‪select ، :‬‬
‫‪.project ، and join‬‬
‫• حدد عملية إنشاء مجموعة فرعية من جميع السجالت في الملف الذي يستوفي& المعايير المذكورة‪ .‬بمعنى آخ&&ر‬
‫‪ ،‬حدد إنشاء مجموعة فرعية من الصفوف التي تفي بمعايير معينة‪.‬‬
‫• االنضمام إلى عملية الجمع بين الجداول العالئقية والمعلومات المتوفرة في الجداول الفردية‪..‬‬
‫• تق &&وم عملي &&ة المش &&روع& بإنش &&اء مجموع &&ة فرعي &&ة من األعم &&دة في ج &&دول ‪ ،‬مم &&ا يس &&مح للمس &&تخدم بإنش &&اء‬
‫جداول جديدة تحتوي فقط‬
‫على المعلومات المطلوبة‬

‫‪17.‬‬ ‫‪Explain why non-relational databases are useful.‬‬


‫اشرح لماذا تكون قواعد البيانات الغير عالئقية مفيدة؟‬
‫اس& &&تخدام& نم& &&وذج بيان& &&ات أك& &&ثر مرون& &&ة وتم تص& &&ميمه إلدارة مجموع& &&ات البيان& &&ات الكب& &&يرة ع& &&بر العدي& &&د من اآلالت‬
‫الموزع&&ة وللتوسع& بس&&هولة ألعلى أو ألس&&فل‪ .‬ه&و مفي&&د لتس&&ريع االس&&تعالمات البس&&يطة مقاب&&ل مجل&&دات كب&&يرة‬
‫من الهيكلية بيانات غير منظمة ‪ ،‬بما في ذل&ك ال&ويب والوس&ائط االجتماعي&ة والرس&&ومات وأش&كال أخ&رى من‬
‫البيانات التي يصعب تحليلها باستخدام& األدوات التقليدية المستندة إلى ‪.SQL‬‬
‫هناك عدة أنواع مختلفة من قواعد بيانات ‪ ، NoSQL‬لكل منها قواعدها الخ‬

‫‪18.‬‬ ‫‪Define referential integrity and explain its role in relational database‬‬
‫‪systems.‬‬
‫تعريف التكامل المرجعي وشرح دوره في أنظمة قواعد البيانات العالئقية‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪18‬‬

‫‪19.‬‬ ‫‪Define and describe an entity-relationship diagram and explain its role in‬‬
‫‪database‬‬
‫‪20.‬‬ ‫‪design.‬‬
‫تعريف ووصف العالقة الكيان الرسم البياني وشرح دوره في قاعدة البيانات التصميم‪.‬‬

‫‪In a relational database, three basic operations are used to develop useful . .21‬‬
‫‪.sets of data: select, project, and join‬‬
‫‪Select operation creates a subset consisting of all records in the file that .22‬‬
‫‪meet stated criteria. In other words, select creates a subset of rows that‬‬
‫‪.meet certain criteria‬‬
‫‪Join operation combines relational tables to provide the user with more .23‬‬
‫‪.information that is available in individual tables‬‬
‫‪Project operation creates a subset consisting of columns in a table, .24‬‬
‫‪permitting the user to create new tables that contain only the information‬‬
‫‪.required‬‬
‫‪.25‬‬
‫‪26. 3. What are the principal tools and technologies for accessing‬‬
‫‪27.‬‬ ‫‪information from databases to improve business performance and decision‬‬
‫?‪making?What are some important database design principles‬‬
‫ما هي األدوات والتكنولوجيات الرئيسية للوصول إلى المعلومات من قواعد البيانات لتحسين األعمال األداء وصنع القرار؟‬
‫تحتوي& تكنولوجيا إدارة البيانات المعاصرة على مجموعة من األدوات ألنواع المختلفة من البيانات التي‬
‫تس &&تخدمها& الش &&ركات الي &&وم‪ ،‬بم &&ا في ذل &&ك البيان &&ات الكب &&يرة ش &&به المنظم &&ة وغ &&ير المنظم &&ة في كمي &&ات هائل &&ة‪.‬‬
‫وتش&&مل ه&&ذه الق&&درات مس&&تودعات البيان&&ات وبيان&&ات البيان&&ات ‪،‬البيان&&ات‪ ،Hadoop ،Hadoop ،‬الحوس&&بة في‬
‫ال & & &&ذاكرة ‪،‬ال & & &&ذاكرة‪ ،‬ومنص & & &&ات تحليلي & & &&ة‪ .‬تمث & & &&ل ‪ OLAP‬العالق & & &&ات بين البيان & & &&ات باعتباره & & &&ا بني & & &&ة متع & & &&ددة‬
‫األبعاد ‪،‬األبع&اد‪ ،‬وال&&تي يمكن تص&ورها كمكعب&&ات من البيان&&ات والمكعب&ات& داخ&&ل مكعب&&ات البيان&&ات ‪،‬البيان&ات‪،‬‬
‫مما يتيح المزيد من تحليل البيانات المعقدة‪.‬‬
‫تحليل البيانات يحلل مجموعات كبيرة من البيانات ‪،‬البيانات‪ ،‬بما في ذلك محتويات مستودعات البيانات‬
‫‪ ،‬للعث&&ور& على أنم&&اط القواع&&د ال&&تي يمكن اس&&تخدامها& للتنب&&ؤ بالس&&لوك& المس&&تقبلي وتوجي&&ه عملي&&ة ص&&نع الق&&رار‪.‬‬
‫أدوات التعدين النص تساعد الشركات تحليل واسع مجموعات البيانات غير منظم تتك&&ون من النص‪ .‬أدوات‬
‫التعدين على شبكة اإلنترنت تركز على تحليل األنماط المفيدة‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪19‬‬

‫والمعلومات من شبكة اإلنترنت العالمية المستخدمين وكذلك محتويات صفحات الويب‪ .‬يمكن ربط‬
‫قواع& &&د البيان& &&ات التقليدي& &&ة ع& &&بر الوس& &&يطة ب& &&الويب أو واجه& &&ة ويب لتس& &&هيل وص& &&ول المس& &&تخدم إلى البيان& &&ات‬
‫الداخلية للمؤسسة‪.‬‬

‫‪28.‬‬ ‫‪Define big data and describe the technologies for managing and‬‬
‫‪analyzing it.‬‬
‫تحديد البيانات الكبيرة ووصف التقنيات إلدارة وتحليل البيانات الضخمة‪.‬‬
‫تتكون البيانات الض&خمة من بيان&ات غ&ير منظم&ة في ج&داول وأعم&دة قاع&دة البيان&ات التقليدية ‪ ،‬وهي تتج&اوز&‬
‫قدرة نظ&ام إدارة قواع&د البيان&ات‪ .‬ه&ذه البيان&ات الض&خمة ليس&ت بيان&ات عادي&ة مث&ل المع&امالت ‪ ،‬فهي في ك&ل‬
‫مكان مثل ‪ ، tweets‬ورسائل& البريد اإللكتروني& ‪ ،‬وحركة المرور على شبكة اإلنترنت ‪ ،‬ك&&ل األش&&ياء ال&&تي‬
‫ال يمكن أن تكون بسهولة ويسيرة‪ .‬كما يتم إنتاج البيانات الكب&&يرة بكمي&&ات كب&&يرة وبس&&رعة أك&&بر من البيان&&ات‬
‫التقليدية ‪ ،‬مم&ا يجع&ل من الص&عب تتبعه&ا‪ .‬هن&اك أدوات مح&ددة لتحلي&ل البيان&ات وك&ذلك مس&تودعات البيان&ات‪.‬‬
‫مستودع البيانات عبارة عن قاعدة بيانات تخزن فقط المعلومات ‪ Hadoops ،‬التاريخية‪ A Hadoop .‬هو‬
‫إط&&ار برمجي&&ات مفت&&وح المص&&در ي&&تيح الحوس&&بة المتوازي&&ة لكمي&&ات كب&&يرة من البيان&&ات ع&&بر أجه&&زة كم&&بيوتر‬
‫متع&&ددة‪ .‬تأخ&&ذ ‪ Hadoop‬في االعتب&&ار مجموع&&ات البيان&&ات األص&&غر واألص&&غر ‪ ،‬حيث تتم معالجته&&ا ودمجه&&ا‬
‫في مجموع &&ات بيان &&ات أص &&غر‪ .‬على س &&بيل المث &&ال ‪ ،‬إذا كنت تبحث عن فن &&دق في الوالي &&ات المتح &&دة ‪ ،‬فمن‬
‫المحتمل أن تكون البيانات التي تبحث عنها هي األك&&ثر أهمي&&ة‪Define and describe normalization.‬‬
‫‪and referential integrity and explain how they contribute to a well-designed‬‬
‫‪.relational database‬‬

‫‪29.‬‬ ‫‪List and describe the components of acontemporary business‬‬


‫‪intelligence infrastructure‬‬
‫قائمة ووصف مكونات البنية التحتية للبيانات التجارية المعاصرة؟‬
‫تتك& &&ون البني& &&ة التحتي& &&ة للبيان& &&ات التجاري& &&ة من أج& &&زاء متع& &&ددة‪ .‬واح& &&دة من األج& &&زاء األولى هي مس& &&تودعات‬
‫مع&&ا‪ .‬س&&وق&‬
‫البيان&&ات وبيان&&ات الس&&وق‪ .‬يأخ&&ذ مس&&تودع& البيان&&ات البيان&&ات التاريخي&&ة من نظ&&ام& الش&&ركة ويس&&حبها ً‬
‫البيان&&ات ه &&و مجموع&&ة فرعي&&ة من مس &&تودع البيان&&ات وه &&و نفس البيان &&ات التاريخي&&ة ال&&تي تم س&&حبها من قب &&ل‬
‫المستودع‪ .‬أحد المكونات التالية هو ‪Hadoop‬‬
‫‪ Hadoop .‬ه&&و إط&&ار ب&&رمجي مفت&&وح المص&&در ي&&وفر حوس&&بة متوازي&&ة موزع&&ة لكمي&&ات كب&&يرة من البيان&&ات‬
‫ع&بر أجه&زة كم&&بيوتر متع&ددة‪ .‬ي&تيح ذل&ك معالج&&ة البيان&&ات ودمجه&ا& م&&ع مجموع&&ات البيان&ات األخ&&رى‪ .‬المك&ون‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪20‬‬

‫الراب&&ع ه&&و الحوس&&بة في ال&&ذاكرة‪ .‬ه&&ذا ه&&و نظ&&ام لالس&&تخدام في تخ&&زين البيان&&ات‪ .‬تمكن الحوس&&بة في ال&&ذاكرة‬
‫البيانات من مستودع& البيانات من اإلقامة فقط في أجهزة الكمبيوتر&‬

‫‪30.‬‬ ‫‪Describe the capabilities of online analytical processing (OLAP).‬‬


‫صف قدرات المعالجة التحليلية عبر اإلنترنت (‪.)OLAP‬‬

‫ض &ا مث&&ل المعالج&&ة التحليلي&&ة‬


‫ت&&دعم مس&&تودعات البيان&&ات تحلي&&ل البيان&&ات متع&&دد األبع&&اد ‪،‬األبع&&اد‪ ،‬المع&&روف أي ً‬
‫ع&&بر اإلن&&ترنت (‪ ،)، &)OLAP‬وال&&تي تمكن المس&&تخدمين من الع&&رض نفس البيان&&ات بط&&رق& مختلف&&ة باس&&تخدام&‬
‫أبع&&اد متع&&ددة‪ .‬كل ج&&انب من المعلوم&&ات يمث&&ل بع&&دا مختلف&&ا‪ .‬وتمث&&ل ‪ OLAP‬العالق&&ات بين البيان&&ات باعتباره&&ا‬
‫متع &&ددة األبع &&اد هيك &&ل ‪،‬هيك &&ل‪ ،‬وال &&تي يمكن تص &&ورها& كمكعب &&ات من البيان &&ات والمكعب &&ات داخل مكعب &&ات من‬
‫البيانات ‪،‬البيانات‪ ،‬مما يتيح تحليل البيانات أكثر تطورا&‬
‫‪ OLAP‬ي&&تيح للمس&&تخدمين الحص&&ول على إجاب&&ات لألس&&ئلة بش&&كل ع&&ادل من خالل توف&&ير& كمي&&ة كب&&يرة من‬
‫الوقت ‪ ،‬حتى عندما يتم تخزين البيانات الكبير جداً في قواعد البيانات‪.‬‬
‫المعالج&&ة التحليلي&&ة ع&&بر اإلن &&ترنت وتع&&دين البيان&&ات وتحلي&&ل كمي&&ات كب &&يرة من البيان&&ات من كث &&ير وجه&&ات‬
‫النظر ‪ ،‬على س&&بيل المث&&ال ‪ ،‬المبيع&&ات حس&&ب البند ‪ ،‬من قب&&ل اإلدارة ‪ ،‬من خالل متجر ‪ ،‬من قبل المنطقة ‪،‬‬
‫من أجل العثور& على أنم&اط في البيان&ات‪ .‬ه&ذه األنم&اط ص&عبة م&ع أس&اليب قاع&دة البيان&ات العادية ‪ ،‬وه&ذا ه&و‬
‫السبب في مستودع البيانات وتعدين البيانات ‪.OLAP‬‬

‫‪31.‬‬ ‫‪Define data mining, describing how it differs from OLAP and the types of‬‬
‫‪information it provides.‬‬
‫تحديد استخراج البيانات ‪ ،‬واصفا كيفية اختالفه عن ‪ OLAP‬وأنواع المعلومات التي تقدمها‪.‬‬
‫ي&&وفر& تع&&دين البيان&&ات إحص&&اءات ح&&ول البيان&&ات ال&&تي ال يمكن الحص&&ول عليه&&ا باس&&تخدام ‪ OLAP‬من خالل‬
‫إيجاد أنماط وعالقات مخفية في قواعد بيانات كب&&يرة‪ .‬تُس&&تخدم األنم&&اط والقواع&&د في اتخ&&اذ الق&رارات‪ .‬تش&مل‬
‫أن& &&واع المعلوم& &&ات ال& &&تي يتم الحص& &&ول& عليه& &&ا من اس& &&تخراج البيان& &&ات الجمعي& &&ات والتسلس& &&الت والتص& &&نيفات‬
‫والمجموعات والتنبؤات‪.‬‬

‫‪32.‬‬ ‫‪Explain how text mining and Web mining differ from conventional data‬‬
‫‪mining.‬‬
‫استكشاف التعدين& بيانات التعدين& والتعدين&‪.‬‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


‫‪21‬‬

‫يرك&&ز اس&&تخراج البيان&&ات التقليدي&&ة على البيان&&ات وق&&د تم تنظيم&&ه في قواع&&د البيان&&ات والملف&&ات‪ .‬يرك&&ز تع&&دين‬
‫النص على إيج&اد أنم&اط واتجاه&ات البيان&ات غ&ير المهيكل&ة الموج&ودة في الملف&ات النص&ية‪ .‬ق&د تك&ون البيان&ات‬
‫في البري&&&د اإللك&&&تروني والم&&&ذكرات& ونس&&&خ مرك&&&ز االتص&&&ال واالس &&تجابات& االستقص &&ائية والقض&&&ايا& القانوني&&&ة‬
‫وأوص&&اف ال&&براءات وتق&&ارير& الخدم&&ة‪ .‬أدوات اس&&تخراج النص اس&&تخراج العناص&&ر الرئيس&&ية من مجموع&&ات‬
‫البيانات الكبيرة غير منظم ‪ ،‬واكتشاف& أنماط والعالقات& ‪ ،‬وتلخيص المعلومات‪.‬‬

‫التعدين على شبكة اإلنترنت يساعد الشركات على فهم سلوك العمالء ‪ ،‬وتقييم فعالي&ة موق&ع ويب معين ‪ ،‬أو‬
‫تحديد مدى نجاح حملة التسويق‪ &.‬ويب التعدين يبحث عن أنماط في البيانات من خالل‬
‫• استخراج محتوى الويب‪ :‬استخراج المعرفة من محتوى صفحات الويب‬
‫• تعدين بنية الويب‪ :‬تحليل البيانات المتعلقة بهيكل موقع ويب معين‬
‫• تعدين استخدام الويب‪ :‬النظر في بيانات تفاعل المستخدم المسجلة بواسطة خادم الويب‬

‫‪33.‬‬ ‫‪Define a data warehouse, explaining how it works and how it benefits‬‬
‫‪organizations.‬‬
‫تحديد مستودع البيانات ‪ ،‬وشرح كيفية عمله وكيف يستفيد منها‪.‬‬
‫مس& &&تودع البيان& &&ات قاع& &&دة بيان& &&ات م& &&ع المحفوظ& &&ات‪ ،‬االس& &&تعالم‪ ،‬وأدوات التنقيب عن البيان& &&ات (أي األدوات‬
‫اإلحص&&ائية) ويس&&تخدم& لتخ&&زين البيان&&ات التاريخي&&ة والحالي&&ة وتنش&&أ البيان&&ات في العدي&&د من المن&&اطق التش&&غيلية‬
‫ويتم نس&&خها في مس&&تودع& البيان&&ات كلم&&ا دعت الحاج&&ة‪ .‬يتم تنظيم& البيان&&ات في المس&&تودع& وفقً&&ا للمع&&ايير على‬
‫مس&توى الش&ركة بحيث يمكن اس&تخدامها لتحلي&ل اإلدارة واتخ&اذ الق&رارات‪ .‬ت&دعم مس&تودعات البيان&ات بيان&ات‬
‫المنظم &&ة من خالل العدي &&د من وجه &&ات النظ &&ر أو االتجاه &&ات‪ .‬يجع &&ل مس &&تودع& البيان &&ات البيان &&ات متاح &&ة ألي‬
‫ض& & &ا مجموع & &&ة من األدوات‬
‫ش & &&خص ‪،‬ش & &&خص‪ ،‬ولكن ال يمكن تغييره & &&ا‪ .‬ي & &&وفر نظ & &&ام مس & &&تودع& البيان & &&ات أي ً‬
‫المخصص&&ة والموح&&دة واألدوات التحليلي&&ة ومراف&ق& إع&&داد التق&&ارير الرس&&ومية‪ .‬يس&&مح نظ&&ام مس&&تودع البيان&&ات‬
‫للم & & &&دراء ب & & &&االطالع على المنتج & & &&ات من قب & & &&ل العمالء ‪،‬العمالء‪ ،‬حس & & &&ب الس & & &&نة ‪،‬الس & & &&نة‪ ،‬من قب & & &&ل من & & &&دوب‬
‫جوهري&&ا عن البيان&&ات‪ .‬ال تس&&مح قواع&&د البيان&&ات التش&&غيلية العادي&&ة بمث&&ل‬
‫ً‬ ‫المبيع&&ات ‪،‬المبيع&&ات‪ ،‬وال&&تي تختل&&ف‬
‫هذه اآلراء المختلفة‪.‬‬
‫‪Normalization is the process of creating small stable data structures from complex groups of data when designing a relational‬‬
‫‪database. Normalization streamlines relational database design by removing redundant data such as repeating data groups. A‬‬
‫‪well-designed relational database will be organized around the information needs of the business and will probably be in some‬‬
‫‪.normalized form. A database that is not normalized will have problems with insertion, deletion, and modification‬‬

‫‪Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall‬‬


22

Referential integrity rules ensure that relationships between coupled tables remain consistent. When one table has a foreign
key that points to another table, you may not add a record to the table with the foreign key unless there is a corresponding
.record in the linked table

.Define a distributed database and describe the two main ways of distributing data

A distributed database is one that is stored in more than one physical location. A distributed database can be partitioned
or replicated. When partitioned, the database is divided, so that each remote processor has access to the data that it
needs to serve its local area. These databases can be updated locally and later justified with the central database. With
.replication, the database is duplicated at various remote locations. Figure 6–12 shows how a database is distributed

4. What are the principal tools and technologies for accessing information from
databases to improve business performance and decision making?

Define a data warehouse, explaining how it works and how it benefits organizations.

A data warehouse is a database with archival, querying, and data exploration tools (i.e.,
statistical tools) and is used for storing historical and current data of potential interest to
managers throughout the organization and from external sources (e.g., competitor sales or
market share). The data originate in many of the operational areas and are copied into the
data warehouse as often as needed. The data in the warehouse are organized according to
company-wide standards so that they can be used for management analysis and decision
making. Data warehouses support looking at the data of the organization through many
views or directions. The data warehouse makes the data available to anyone to access as
needed, but it cannot be altered. A data warehouse system also provides a range of ad hoc
and standardized query tools, analytical tools, and graphical reporting facilities. The data
warehouse system allows managers to look at products by customer, by year, by salesperson,
essentially different slices of the data. Normal operational databases do not permit such
.different views

Define business intelligence and explain how it is related to database technology.

Powerful tools are available to analyze and access information that has been captured and
organized in data warehouses and data marts. These tools enable users to analyze the data to see
new patterns, relationships, and insights that are useful for guiding decision making. These tools
for consolidating, analyzing, and providing access to vast amounts of data to help users make
better business decisions are often referred to as business intelligence. Principal tools for
business intelligence include software for database query and reporting tools for
multidimensional data analysis and data mining.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


23

Describe the capabilities of online analytical processing (OLAP).

Data warehouses support multidimensional data analysis, also known as online analytical
processing (OLAP), which enables users to view the same data in different ways using
multiple dimensions. Each aspect of information represents a different dimension.

OLAP represents relationships among data as a multidimensional structure, which can be


visualized as cubes of data and cubes within cubes of data, enabling more sophisticated
data analysis. OLAP enables users to obtain online answers to ad hoc questions in a fairly
rapid amount of time, even when the data are stored in very large databases. Online
analytical processing and data mining enable the manipulation and analysis of large
volumes of data from many perspectives, for example, sales by item, by department, by
store, by region, in order to find patterns in the data. Such patterns are difficult to find
with normal database methods, which is why a data warehouse and data mining are
usually parts of OLAP. OLAP represents relationships among data as a multidimensional
structure, which can be visualized as cubes of data and cubes within cubes of data, enabling
more sophisticated data analysis.

Define data mining, describing how it differs from OLAP and the types of information it
provides.

Data mining provides insights into corporate data that cannot be obtained with OLAP by
finding hidden patterns and relationships in large databases and inferring rules from them
to predict future behavior. The patterns and rules are used to guide decision making and
forecast the effect of those decisions. The types of information obtained from data mining
include associations, sequences, classifications, clusters, and forecasts.

Explain how text mining and Web mining differ from conventional data mining.

Conventional data mining focuses on data that have been structured in databases and files.
Text mining concentrates on finding patterns and trends in unstructured data contained in text
files. The data may be in email, memos, call center transcripts, survey responses, legal cases,
patent descriptions, and service reports. Text mining tools extract key elements from large
.unstructured data sets, discover patterns and relationships, and summarize the information

Web mining helps businesses understand customer behavior, evaluate the effectiveness of a
particular Web site, or quantify the success of a marketing campaign. Web mining looks for
patterns in data through
Web content mining: extracting knowledge from the content of Web pages 
Web structure mining: examining data related to the structure of a particular Web site 
Web usage mining: examining user interaction data recorded by a Web server 
whenever requests for a Web site’s resources are received

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


24

Describe how users can access information from a company’s internal databases
.through the Web

Conventional databases can be linked via middleware to the Web or a Web interface to
facilitate user access to an organization’s internal data. Web browser software on his/her
client PC is used to access a corporate Web site over the Internet. The Web browser software
requests data from the organization’s database, using HTML commands to communicate
with the Web server. Because many back-end databases cannot interpret commands written
in HTML, the Web server passes these requests for data to special middleware software that
then translates HTML commands into SQL so that they can be processed by the DBMS
working with the database. The DBMS receives the SQL requests and provides the required
data. The middleware transfers information from the organization’s internal database back to
the Web server for delivery in the form of a Web page to the user. The software working
between the Web server and the DBMS can be an application server, a custom program, or a
series of software scripts.

34. 54. Why are information policy, data administration, and data quality
assurance essential for managing the firm’s data resources?
‫ وضمان جودة البيانات األساسية إلدارة موارد بيانات‬،‫البيانات‬، ‫ وإ دارة البيانات‬،‫المعلومات‬، ‫لماذا سياسة المعلومات‬
‫الشركة؟‬
&‫ تحكم سياس&&ة المعلوم&&ات الرس&&مية ص&&يانة وتوزي&&ع واس&&تخدام‬.‫تط&&وير& بيئ&&ة قاع&&دة البيان&&ات وأنظم&&ة إدارة البيان&&ات‬
‫ يك &&ون نظ &&ام إدارة البيان &&ات الرس &&مي مس &&ؤوالً& عن‬،‫الكب &&يرة‬، ‫ في الش &&ركات الكب &&يرة‬.‫المعلوم &&ات في المنظم &&ة‬
.‫ ويتم استخدام& مراقبة البيانات في الشركة‬،‫المعلومات‬، ‫سياسة المعلومات‬
‫البيان&&ات غ&&ير الدقيق&&ة أو غ&&ير المكتمل&&ة أو غ&&ير المتس&&قة في س&&ياق تس&&عير المنتج وحس&&ابات العمالء وبيان&&ات‬
‫ يجب أن تتخ&&ذ‬.‫ ت&&ؤدي إلى ق&&رارات غ&&ير دقيق&&ة ح&&ول اإلج&&راءات ال&&تي يجب أن تتخ&&ذها الش&&ركة‬، ‫المخ&&زون‬
ٍ & ‫الش &&ركات خط &&وات خاص &&ة للتأك &&د من أنه &&ا تتمت &&ع بمس &&توى& ع‬
‫ وتش &&مل ه &&ذه المع &&ايير‬.‫&ال من ج &&ودة البيان &&ات‬
‫ وقواع &د& البيان &&ات المص &&ممة لتقلي &&ل البيان &&ات غ&&ير المتس &&قة‬، ‫اس &&تخدام مع &&ايير& البيان &&ات على نط &&اق المؤسس &&ة‬
.‫ وبرامج تنظيف& البيانات‬، ‫ وعمليات تدقيق& جودة البيانات‬، ‫وغير المتكررة‬

35. Describe the roles of information policy and data administration in


information management.
36.
‫وصف أدوار سياسة المعلومات وإ دارة البيانات في إدارة المعلومات‬.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


25

‫تح&&دد سياس&&ة المعلوم&&ات قواع&&د المنظم&&ة الخاص&&ة بمش&&اركة المعلوم&&ات ونش&&رها& والحص&&ول& عليه&&ا وتوحي&&دها‬
‫ ومن ه&&و المس&&ؤول عن تح&&ديث‬، ‫ حيث يمكن توزي&&ع المعلوم&&ات‬، ‫ سياس&&ة المعلوم&&ات‬.‫وتص&&نيفها وحص&&رها‬
‫المعلومات والحفاظ‬
‫إدارة البيان &&ات مس &&ؤولة عن السياس &&ات واإلج &&راءات المح &&ددة ال &&تي يمكن من خالله &&ا إدارة البيان &&ات كم &&ورد‬
‫ واإلش&راف& على تص&ميم‬، ‫ والتخطي&ط للبيان&ات‬، ‫ وتشمل هذه المسؤوليات& تطوير& سياسة المعلومات‬.‫تنظيمي‬
‫ ومراقب&&ة كيفي&&ة إخص&&ائيي نظم المعلوم&&ات ومجموع&&ات‬، ‫قواع&&د البيان&&ات المنطقي&&ة وتط&&وير& ق&&اموس البيان&&ات‬
‫ يك& &&ون نظ& &&ام إدارة البيان& &&ات الرس& &&مي مس& &&ؤوالً& عن سياس& &&ة‬، ‫ في الش& &&ركات الكب& &&يرة‬.‫المس& &&تخدمين النه& &&ائيين‬
.‫ ويتم استخدام مراقبة البيانات في الشركة‬، ‫المعلومات‬

37. Explain why an organization needs an information policy


‫اشرح لماذا تحتاج المؤسسة إلى سياسة معلومات‬
‫ وال تري&&د أن‬، ‫ بيان&&ات ش&&ركتك هي م&&ورد مهم‬.‫ يحت&&اج إلى سياس&&ة المعلوم&&ات‬، &‫ كب&&ير وص&&غير‬، ‫ك&&ل عم&&ل‬
‫ ومن ُيس&مح ل&ه بمش&اهدة البيان&ات أو‬، ‫ أنت في حاج&ة إلى قواعد تتم المحافظ&ة علي&ه‬.‫يفع&ل الن&اس تري&د معهم‬
.‫تغييرها‬
.‫نش &&ر واكتس &&اب وتوحي &&د وتص&&نيف وج&&رد& المعلوم&&ات‬، ‫تح&&دد سياس&&ة المعلوم&&ات قواع&&د المنظم&&ة للمش&&اركة‬
‫ تحدي&د المس&تخدمين والوح&دات التنظيمي&ة ال&تي يمكن‬، ‫سياسة المعلومات تح&دد إج&راءات ومس&ؤوليات مح&ددة‬
‫ ومن ه & & &&و المس & & &&ؤول لتح & & &&ديث والحف & & &&اظ على‬، ‫ حيث يمكن توزي & & &&ع المعلوم & & &&ات‬، ‫مش & & &&اركتها& المعلوم & & &&ات‬
.‫المعلومات‬
An information policy specifies the organization’s rules for sharing, disseminating,
acquiring, standardizing, classifying, and inventorying information. Information
policy lays out specific procedures and accountabilities, identifying which users and
organizational units can share information, where information can be distributed,
.and who is responsible for updating and maintaining the information

Data administration is responsible for the specific policies and procedures through which
data can be managed as an organizational resource. These responsibilities include
developing information policy, planning for data, overseeing logical database design and
data dictionary development, and monitoring how information systems specialists and end-
.user groups use data

In large corporations, a formal data administration function is responsible for information


policy, as well as for data planning, data dictionary development, and monitoring data
.usage in the firm

38. Explain why data quality audits and data cleansing are essential.
39.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


26

‫اشرح لماذا تعتبر عمليات تدقيق جودة البيانات وتطهير البيانات ضرورية‬.
‫البيان&&ات غ&&ير الدقيق&&ة أو غ&&ير المكتمل&&ة أو غ&&ير المتس&&قة في س&&ياق تس&&عير المنتج وحس&&ابات العمالء وبيان&&ات‬
‫ يجب أن تتخ&&ذ‬.‫ ت&&ؤدي إلى ق&&رارات غ&&ير دقيق&&ة ح&&ول اإلج&&راءات ال&&تي يجب أن تتخ&&ذها الش&&ركة‬، ‫المخ&&زون‬
ٍ & ‫الش &&ركات خط &&وات خاص &&ة للتأك &&د من أنه &&ا تتمت &&ع بمس &&توى& ع‬
‫ وتش &&مل ه &&ذه المع &&ايير‬.‫&ال من ج &&ودة البيان &&ات‬
‫ وقواع &د& البيان &&ات المص &&ممة لتقلي &&ل البيان &&ات غ&&ير المتس &&قة‬، ‫اس &&تخدام مع &&ايير& البيان &&ات على نط &&اق المؤسس &&ة‬
.‫ وبرامج تنظيف& البيانات‬، ‫ وعمليات تدقيق& جودة البيانات‬، ‫وغير المتكررة‬
‫ يمكن إج&&راء عملي&&ات‬.‫ ت&&دقيق ج&&ودة البيان&&ات ه&&و مس&&ح منظم للدق&&ة واكتم&&ال البيان&&ات في نظ&&ام المعلوم&&ات‬-
‫ت &&دقيق& ج &&ودة البيان &&ات من خالل مراقب &&ة ملف &&ات البيان &&ات أو مراقب &&ة المس &&تخدمين لمف &&اهيمهم الخاص &&ة بج &&ودة‬
.‫البيانات‬
‫ ال‬.‫تنظيف البيانات هي عملية غير صحيحة أو غير كاملة أو غ&ير منس&قة بش&كل ص&حيح أو غ&ير ض&رورية‬
‫ض&ا االتس&&اق& بين مجموع&&ات البيان&&ات‬
ً ‫ ب&&ل تف&&رض أي‬، ‫تقتص&&ر تنقي&&ة البيان&&ات على البيان&&ات الص&&حيحة فحس&&ب‬
.‫المختلفة التي تنشأ في أنظمة معلومات منفصلة‬
Data that are inaccurate, incomplete, or inconsistent create serious operational
and financial problems for businesses because they may create inaccuracies in
product pricing, customer accounts, and inventory data, and lead to inaccurate
decisions about the actions that should be taken by the firm. Firms must take
special steps to make sure they have a high level of data quality. These include
using enterprise-wide data standards, databases designed to minimize
.inconsistent and redundant data, data quality audits, and data cleansing software

A data quality audit is a structured survey of the accuracy and level of


completeness of the data in an information system. Data quality audits can be
performed by surveying entire data files, surveying samples from data files, or
.surveying end users for their perceptions of data quality

Data cleansing consists of activities for detecting and correcting data in a


database that are incorrect, incomplete, improperly formatted, or redundant. Data

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


27

cleansing not only corrects data but also enforces consistency among different
.sets of data that originated in separate information systems

Discussion Questions‫غير مهم‬

40. 1. It has been said that you do not need database management software to
create a database environment. Discuss.
‫ مناقشة‬.‫لقد قيل أنك ال تحتاج إلى برنامج إدارة قواعد البيانات إلنشاء بيئة قاعدة بيانات‬.

‫قاع&&دة البيان&&ات هي مجموع&&ة من البيان&&ات المنظم&&ة لخدم&&ة العدي&&د من التطبيق&&ات في نفس ال&&وقت عن طري&&ق‬
‫ ال يوص& &ى& ب &&أن تحت &&وي قاع &&دة البيان &&ات على نظ &&ام‬.‫تخ &&زين البيان &&ات وإ دارته &&ا بحيث تب &&دو في مك &&ان واح &&د‬
&‫ وه&&و نم&&وذج لتنظيم& المعلوم&&ات ال&&تي يمكن تخزينه &ا‬- ‫ واألك&&ثر& أهمي&&ة ه&&و مفه&&وم قاع&&دة البيان&&ات‬.DBMS
&‫ ال يك&ون نظ&ام‬، ‫ ب&&دون الع&رض الص&حيح لقاع&دة البيان&&ات ونم&&وذج البيان&&ات‬.‫والوص&&ول إليه&ا بمرون&&ة وكف&اءة‬
‫ يم ّكن تطبيق& &&ات األعم& &&ال‬.‫ص & &ا إلنش& &&اء قاع& &&دة بيان& &&ات وص& &&يانتها‬
ً ‫برنامج& & ا& خا‬
ً DBMS ‫ يع& &&د‬.ً‫ فع& &&اال‬DBMS
‫ ف&&إن اس&&تخدام نظ&&ام إدارة قواع&&د‬، ‫ وم&&ع ذل&&ك‬.‫الفردي&&ة الس&&تخراج البيان&&ات في ب&&رامج الكم&&بيوتر الخاص&&ة بهم‬
‫) يمكن أن يقل &&ل من اعتم &&اد البرن &&امج على البيان &&ات إلى ج &&انب تك &&اليف تط &&وير& البرن &&امج‬DBMS( ‫البيان &&ات‬
‫ يمكن زي&&&ادة إمكاني&&&ة الوص&&&ول إلى المعلوم&&&ات من قب&&&ل المس &&تخدمين ويمكن للم &&برمجين إج&&&راء‬.‫والص&&&يانة‬
‫) للمنظم &&ة‬DBMS ( ‫ يس &&مح نظ &&ام إدارة قواع &&د البيان &&ات‬.‫اس &&تعالمات مخصص &&ة للبيان &&ات في قاع &&دة البيان &&ات‬
A database is a collection of data organized .‫بإدارة البيانات واستخدامها& وأمنها بشكل مركزي‬
to service many applications at the same time by storing and managing data so
that they appear to be in one location. It is not mandated that a database have a
DBMS. What is most important is the concept of a database — a model for
organizing information so that it can be stored and accessed flexibly and
efficiently. Without the right vision of a database and data model, a DBMS is not
effective. A DBMS is special software to create and maintain a database. It
enables individual business applications to extract the data they need without
having to create separate files or data definitions in their computer programs.
However, the use of a DBMS can reduce program-data dependence along with

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


28

program development and maintenance costs. Access and availability of


information can be increased because users and programmers can perform ad-
hoc queries of data in the database. The DBMS allows the organization to
.centrally manage data, its use, and security

41. 2. To what extent should end users be involved in the selection of a


database management system and database design?
‫إلى أي مدى ينبغي أن يشارك المستخدمون في اختيار نظام إدارة قاعدة البيانات وتصميم قاعدة البيانات؟‬

‫ يتطلب‬.‫ينبغي إش&&راك المس&&تخدمين النه&&ائيين في اختي&&ار نظ&&ام إدارة قواع&&د البيان&&ات وتص&&ميم& قاع&&دة البيان&&ات‬
‫ يتطلب تغي&ير في موق&ف& الش&ركة تج&اه‬.‫تطوير بيئة قاعدة البيانات أكثر بكث&ير من مج&رد اختي&ار التكنولوجي&ا‬
‫ ق&&د تك&&ون مش&&اركة‬.‫ يجب على المنظم&&ة تط&&وير& وظيف&&ة إدارة البيان&&ات ومنهجي&&ة تخطي&&ط البيان&&ات‬.‫المعلوم&&ات‬
End users should be.‫المس& & &&تخدم النه& & &&ائي مفي& & &&دة في التخفي& & &&ف من المقاوم& & &&ة السياس& & &&ية للمنظم& & &&ات‬
involved in the selection of a database management system and the database
design. Developing a database environment requires much more than just
selecting the technology. It requires a change in the corporation’s attitude toward
information. The organization must develop a data administration function and a
data planning methodology. The end-user involvement can be instrumental in
mitigating the political resistance organizations may have to many key database
concepts, especially to sharing information that has been controlled exclusively by
.one organizational group

Video Cases

You will find video cases illustrating some of the concepts in this chapter on the
Laudon Web site at www.pearsonhighered.com/laudon along with questions to
.help you analyze the cases

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


29

Collaboration and Teamwork: Identifying Entities and Attributes in an Online


Database

With your team of three or four students, select an online database to explore,
such as AOL Music, iGo.com or the Internet Movie Database. Explore one of
these Web sites to see what information it provides. Then list the entities and
attributes that the company running the Web site must keep track of in its
databases. Diagram the relationship between the entities you have identified.
.Use Google Sites to post the results on the team’s Web site

Direct your students to these Web sites. In their analysis, students should quickly
articulate that many of these sites use the same entities and attributes to keep
.track of their database

There are hundreds of Internet Movie Databases so students will have to select
the one that interests them. The Web sites for AOL Music and iGo.com are
.listed below

/http://music.aol.com
/http://igo.com

Business Problem-Solving Case: Trouble with the Terrorist Watch List Database

?What concepts in this chapter are illustrated in this case .1

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


30

Consolidating multiple databases into a single cohesive one is a primary goal of


the FBI’s Terrorist Screening Center (TSC). The organization is integrating at
least 12 different databases; two years after the process began, 10 of the 12
have been processed. The remaining two databases are both fingerprint
.databases, and not technically watch lists

Using data warehouses to serve all the agencies that need information is the
second major concept taken from this chapter. Agencies can receive a data mart,
a subset of data, that pertains to its specific mission. For instance, airlines use
data supplied by the TSA system in their NoFly and Selectee lists for
prescreening passengers, while the U.S. Customs and Border Protection system
uses the watch list data to help screen travelers entering the United States
[presumably in transportation other than an airplane]. The State Department
screens applicants for visas to enter the U.S. and U.S. residents applying for
passports, while state and local law enforment agencies use the FBI system to
.help with arrests, detentions, and other criminal justice activities

Managing data resources effectively and efficiently is the third major concept in
this case. No information policy has been established to specify the rules for
sharing, disseminating, acquiring, standardizing, classifying, and inventorying
information. Data administration seems to be poor. Data governance that would
help the organizations manage the availability, usability, integrity, and security of
the data seems to be missing. It would help increase the privacy, security, data
quality, and compliance with government regulations. Lastly, data

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


31

quality audits and data cleansing are desperately needed to decrease the number
of inconsistent record counts, duplicate records, and records that lacked data
.fields or had unclear sources for the data

Why was the consolidated terror watch list created? What are the benefits of the .2
?list

The FBI’s Terrorist Screening Center, or TSC, was established to organize and
standardize information about suspected terrorists between multiple government
agencies into a single list to enhance communications between agencies. A
database of suspected terrorists known as the terrorist watch list was born from
these efforts in 2003 in response to criticisms that multiple agencies were
maintaining separate lists and that these agencies lacked a consistent process to
.share relevant information concerning the individuals on each agency’s list

Describe some of the weaknesses of the watch list. What management, .3


?organization, and technology factors are responsible for these weaknesses

Management: policies for nomination and removal are not uniform between
governmental departments. The size of the list is unmanageable – it has grown
to over 750,000 records since its creation and is continuing to grow at a rate of
200,000 records each year since 2004. However, obvious non-terrorists are
included on the list – a six-year old child and Senator Ted Kennedy. There is no
simple or quick redress process for removal from the list. The watch list has
drawn criticism because of its potential to promote racial profiling and
.discrimination

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


32

Organization: Integrating 12 different databases into one is a difficult process – 2


databases remain to be integrated. Balancing the knowledge of how someone is
added to the list is difficult – information about the process for inclusion must be
protected if the list is to be effective against terrorists. On the other hand, for
innocent people who are unnecessarily inconvenienced, the inability to ascertain
how they came to be on the list is upsetting. Criteria for inclusion on the list may
be too minimal. The government agencies lack standard and consistent
procedures for nominating individuals to the list, performing modifications to
.information, and relaying those changes to other governmental offices

Technology: The poor quality of the database leads to inaccurate data, redundant
data, and erroneous entries. The TSA needs to perform an intensive data quality
audit and data cleansing to help match imperfect data in airline reservation
systems with imperfect data on the watch lists. While government agencies have
been able to synchronize their data into a single list, there is still more work to be
done to integrate that list with those maintained by airlines, individual states, and
.other localities using more information to differentiate individuals

If you were responsible for the management of the TSC watch list database, what .4
?steps would you take to correct some of these weaknesses

Student answers will vary. Some suggestions include performing data quality
audits and using data cleansing software to correct many of the imperfections of
the data. Information policies and data governance policies need to be developed
to standardize the procedures for nomination and removal from the lists. The
policies could also address the problem with inconsistent record counts, duplicate
records, and records that lacked data fields or had unclear sources for their data.

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall


33

The TSA should develop consistent policies and methods to ensure that the
government, not individual airlines, is responsible for matching travelers to watch
.lists

Do you believe that the watch list represents a significant threat to individuals’ .5
?privacy or Constitutional rights? Why or why not

Most students will probably answer that the watch list does represent threats to
privacy and Constitutional rights. The TSA is developing a system called “Secure
Flight” but it has been continually delayed due to privacy concerns regarding the
sensitivity and safety of the data it would collect. Similar surveillance programs
and watch lists, such as the NSA’s attempts to gather information about
suspected terrorists, have drawn criticism for potential privacy violations. The
watch list has drawn criticism because of its potential to promote racial profiling
.and discrimination

Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall

You might also like