Professional Documents
Culture Documents
Chapter 6 – Summary
By: Dhivena Leanti Ardhan – 185020307141002
A. Organizing Data In A Traditional File Environment
An effective information system provides users with accurate, timely, and relevant
information. Accurate information is free of errors. Information is timely when it is available
to decision makers when it is needed. Information is relevant when it is useful and
appropriate for the types of work and decisions that require it.
A computer system organizes data in a hierarchy that starts with the bit, which represents
either a 0 or a 1. Bits can be grouped to form a byte to represent one character, number, or
symbol. Bytes can be grouped to form a field, and related fields can be grouped to form a
record. A grouping of characters into a word, a group of words, or a complete number (such
as a person’s name or age) is called a field. A group of related fields, such as the student’s
name, the course taken, the date, and the grade, comprises a record; a group of records of the
same type is called a file. A record describes an entity. An entity is a person, place, thing, or
event on which we store and maintain information. Each characteristic or quality describing a
particular entity is called an attribute.
Querying & Reporting: includes tools for accessing and manipulating information in
databases. Most DBMS have a specialized language called a data manipulation
language that is used to add, change, delete, and retrieve the data in the database.
3. Designing Databases
2. Data Marts
A subset of a data warehouse in which a summarized or highly foccused portion of the
organization’s data is placed in a separate database for a specific population of users.
3. Hadoop
An open source software framework managed by the Apache Software Foundation that
enables distributed parallel processing of huge amounts of data across inexpensive
computers.
4. In-Memory Computing
Another way of faciliating big data analysis which relies primarily on a cmputer’s main
memory (RAM) for data storage
5. Analytic Platforms
Using both relational and non-rational technology that are optimized for analyzing large data
sets.
2. Data Mining
More discovery-driven. Data mining provides insights into corporate data that cannot be
obtained with OLAP by finding hidden patterns and relationships in large databases and
inferring rules from them to predict future behavior.
The types of information obtainable form data mining include (1) Associations: occurences
linked to a single event. (2) Sequences: events are linked over time. (3) Classification:
recognizes patterns that describe the group to which an item belongs by examining existing
items that have been classified and by inferring a set of rules. (4) Clustering: works in a
manner similar to classification when no groups have yet been defined, and (5) Forecasting:
uses predictions in a different way. It uses a series of existing values to forecast what other
values will be.