You are on page 1of 7

BUSINESS WOLVES MIS

Chapter 6
File Organization Terms and Concepts

•Database: Group of related files


•File: Group of records of same type
•Record: Group of related fields
•Field: Group of characters as word(s) or number(s)
•Entity: Person, place, thing on which we store information
•Attribute: Each characteristic, or quality, describing entity
•Database
–Serves many applications by centralizing data and controlling redundant data
•Database management system (DBMS)
– Interfaces between applications and physical data files
– Separates logical and physical views of data
– Solves problems of traditional file environment
Relational DBMS
• Represent data as two-dimensional tables
• Each table contains data on entity and attributes
•Table: grid of columns and rows
–Rows (tuples): Records for different entities
–Fields (columns): Represents attribute for entity
–Key field: Field used to uniquely identify each record
–Primary key: Field in table used for key fields
–Foreign key: Primary key used in second table as look-up field to identify
records from original table
Operations of a Relational DBMS
–SELECT
 Creates subset of data of all records that meet stated criteria
–JOIN
 Combines relational tables to provide user with more information than
available in individual tables
–PROJECT
 Creates subset of columns in table, creating tables with only the information
specified

1
BUSINESS WOLVES MIS
Non-Relational Databases and Databases in the Cloud
“No SQL”
–More flexible data model
–Data sets stored across distributed machines
–Easier to scale
–Handle large volumes of unstructured and structured data
•Databases in the cloud
–Appeal to start-ups, smaller businesses
–Amazon Relational Database Service, Microsoft SQL Azure,Private clouds
Capabilities of Database Management Systems
•Data definition capability
•Data dictionary
•Querying and reporting
–Data manipulation language : Structured Query Language (SQL)
•Many DBMS have report generation (Microsoft Access)
Designing Databases
-- Conceptual Design: abstract model of the database from a
business perspective
-- Physical Design: shows how the database is actually arranged on
direct-access storage devices
Problems with the Traditional File Environment
•Files maintained separately by different departments
•Data redundancy
•Data inconsistency
•Program-data dependence
•Lack of flexibility
•Poor security
•Lack of data sharing and availability
• Normalization
–Streamlining complex groupings of data to minimize redundant data
elements and awkward many-to-many relationships
• Referential integrity
–Rules used by RDBMS to ensure relationships between tables remain
consistent
•Entity-relationship diagram
•A correct data model is essential for a system serving the business well

2
BUSINESS WOLVES MIS
Blockchain
• Distributed ledgers in a peer-to-peer distributed database
• Maintains a growing list of records and transactions shared by all
• Encryption used to identify participants and transactions
• Used for financial transactions, supply chain, and medical records
• Foundation of Bitcoin, and other crypto currencies
Using Databases to Improve Business Performance and Decision Making
The Challenge of Big Data
• Big data
–Massive sets of unstructured/semi-structured data from web traffic, social
media, sensors, and so on
• Volumes too great for typical DBMS –Petabytes, exabytes of data
•Can reveal more patterns, relationships and anomalies
•Requires new tools and technologies to manage and analyze
Business Intelligence Infrastructure
•Array of tools for obtaining information from separate systems and from big
data
• Data warehouse
–Stores current and historical data from many core operational transaction
systems
–Consolidates and standardizes information for use across enterprise, but
data cannot be altered
–Provides analysis and reporting tools
• Data marts
–Subset of data warehouse
–Typically focus on single subject or line of business
• Hadoop
–Enables distributed parallel processing of big data across inexpensive
computers
–Key services
 Hadoop Distributed File System (HDFS): data storage
 MapReduce : breaks data into clusters for work
 Hbase: No SQL database

3
BUSINESS WOLVES MIS

• In-memory computing
–Used in big data analysis
–Uses computers main memory (RAM) for data storage to avoid delays in
retrieving data from disk storage
–Can reduce hours/days of processing to seconds
–Requires optimized hardware
• Analytic platforms
–High-speed platforms using both relational and non-relational tools
optimized for large datasets
Analytical Tools: Relationships, Patterns, Trends
•Tools for consolidating, analyzing, and providing access to vast amounts of
data to help users make better business decisions
–Multidimensional data analysis (OLAP)
–Data mining
–Text mining
–Web mining
Online Analytical Processing (OLAP)
•Supports multidimensional data analysis
–Viewing data using multiple dimensions
–Each aspect of information (product, pricing, cost, region, time period) is
different dimension
•OLAP enables rapid, online answers to ad hoc queries
Data Mining
•Finds hidden patterns, relationships in datasets
–Example: customer buying patterns
•Infers rules to predict future behavior
•Types of information obtainable from data mining:
–Associations –Sequences –Classification –Clustering –Forecasting
Text Mining and Web Mining
• Text mining
–Extracts key elements from large unstructured data sets
–Sentiment analysis software
• Web mining
–Discovery and analysis of useful patterns and information from web
–Web content mining –Web structure mining –Web usage mining

4
BUSINESS WOLVES MIS
Databases and the Web
•Many companies use the web to make some internal databases available to
customers or partners
• Typical configuration includes:
–Web server
–Application server/middleware/C G I scripts
–Database server (hosting DBMS)
•Advantages of using the web for database access:
–Ease of use of browser software
–Web interface requires few or no changes to database
–Inexpensive to add web interface to system
Establishing an Information Policy
•Firm’s rules, procedures, roles for sharing, managing, standardizing data
•Data administration
–Establishes policies and procedures to manage data
• Data governance
–Deals with policies and processes for managing availability, usability, integrity,
and security of data, especially regarding government regulations
• Database administration
–Creating and maintaining database

5
BUSINESS WOLVES MIS
QUESTIONS :
1. An Entity represents a person, place, thing or event about which a company stores
information.
2. A firm's information policy includes rules that lay out who is responsible for updating and
maintaining the information in a database system.
3. A Database Management System (DBMS) is a specific type of software for creating, storing,
organizing, and accessing data from a database.
4. A characteristic or quality describing an entity is called a(n):entity
5. A field identified in a table as holding the unique identifier of the table's records is called
the:foreign key .
6. In a table for customers, the information about a single customer would reside in a single:row
7. A data warehouse stores current and historical data of potential interest to decision makers
throughout the company.
8. Businesses use web mining tools to search and analyze unstructured data sets, such as
emails and memos.
9. Web mining is the discovery of useful patterns on the Web
10.A grouping of characters into a word, a group of words, or a complete number is called a
record.
11.Every record in a file should contain at least one key field
12.Both Oracle and Microsoft Access use SQL to retrieve information from the database
13.DBMS simplify how end users work with databases by separating the logical and physical
views of the data
14.Hadoop can process large quantities of any type of data, including video and Twitter feeds
15.A data warehouse may include information from legacy systems
16.A data warehouse is typically comprised of several smaller data marts
17.OLAP is used to find hidden patterns and relationships in large databases, and infer rules
from these to infer future behavior.
18.A foreign key is a field that links to a separate table
19.In-memory computing relies on a computer's main memory for storing data
20.The use of different terms for identifying data in a firm's various information systems is an
example of redundant data
21.Data cleansing is the same process as data scrubbing
22.The process of streamlining data to minimize redundancy and awkward many-to-many
relationships is called: normalization
23.Data mining is more discovery driven than OLAP
24.The tool that enables users to view the same data in different ways using multiple dimensions
is:OLAP
25.Businesses use WEB mining tools to search and analyze unstructured data sets, such as e-
mails and memos.
26.A physical view shows data as it is organized on the storage media .
27.Hadoop is open-source software framework designed for distributing data-processing over
inexpensive computers.
28.A foreign key is a field that links to a separate table.
29.predictive analysis is technique of data mining

6
BUSINESS WOLVES MIS
1. which of the following database types is useful for storing Java applets as well as
handling large numbers of transactions?
A)HIERACHAL DATABAS B) OBJECT–RELATIONAL database c) relational database d) OODBMS
2. The tool that enables users to view the same data in different ways using multiple
dimension is
A) OLAP B) SQL C) predicative analysis d) data mining
3. the major benefit of _______to the company is ability to provide accurate information
A) data warehouse b) analyatic model c) management information system
d) business intelligence
4. Access is a:
A) DBMS for small handheld computing devices.
B) popular open-source DBMS.
C) DBMS for midrange computers.
D) DBMS for desktop PC systems.
5. Which of the following best illustrates the relationship between entities and attributes?
A) The entity CUSTOMER with the attribute PRODUCT
B) The entity CUSTOMER with the attribute ADDRESS
C) The entity PRODUCT with the attribute PURCHASE
D) The entity PRODUCT with the attribute CUSTOMER
6. A characteristic or quality describing an entity is called a(n):
A) field. B) tuple. C) key field. D) attribute.
7. In a table for customers, the information about a single customer would reside in a
single: A) field. B) row. C) column. D) table.
8. In a relational database, a record is also called a(n):
A) tuple. B) row. C) entity. D) field.
9. A field identified in a table as holding the unique identifier of the table's records is
called the: A) primary key. B) key field. C) primary field. D) foreign key.
10. A field identified in a record as holding the unique identifier for that record is called
the: A) primary key. B) key field. C) primary field. D) foreign key
11. The process of streamlining data to minimize redundancy and awkward many-to-
many relationships is called: A) normalization. B) data scrubbing.
C) data cleansing. D) data administration.
12. MySQL is a:
A) DBMS for small handheld computing devices.
B) popular open-source DBMS.
C) mainframe relational DBMS.
D) DBMS for desktop systems
13. Data mining is more ________ than OLAP.
A) data focused B) multidimensional C) query oriented D) discovery driven
14. Detecting and correcting data in a database or file that are incorrect, incomplete, improperly
formatted, or redundant is called: A) data auditing. B) defragmentation.
C) data scrubbing. D) data optimization
15. A(n) ________ is a dedicated computer in a client/server environment that hosts a
DBMS. A) web server B) application server C) database server D) client server
16. Which of the following is not a typical feature of DBMS?
A) Query wizard tool b) report generation tools c) data manipulation language

You might also like