Professional Documents
Culture Documents
Managing Data
Difficulties in Managing Data
- Amount of data increases exponentially:
According to the annual survey of the global digital output by International Data Corporation, the total
amount of global data was expected to pass 1.2 zettabytes.
- Data are scattered and collected by many individuals using various methods and devices:
These data are frequently stored in numerous servers and locations and in different computing systems, databases,
formats, and human and computer languages.
- Information systems that do not communicate with each other can result in inconsistent data
- Data rot:
Problems with media on which the data are stored.
Over time, temperature, humidity, and exposure to light can cause physical problems with storage media and thus
make it difficult to access the data. The second aspect of data rot is that finding the machines needed to access the
data can be difficult.
Data Governance
Data Governance: an approach to managing data across an entire organization.
Formal sets of policies that are designed to ensure that the data are collected, handled and protected in a
certain, well-defined fashion.
The objective is to make information available, transparent, and useful for the people who are authorized to access it,
from the moment it enters an organization until it is outdated and deleted.
The most important data in the organization that will be used by all the departments.
Example: if we take an example of one of the telecommunication companies like Omantel what is the most
important data that most likely that data will be stored most of the departments. we can see customer data,
customer data can be used by marketing department, it can be used let's say by human resource, it might be
used by many other departments who are involved in.
you know, doing the projects or innovating in new ways of reaching out to their customers. So, if you can
highlight what is your master data, then you can focus on that master data you will make the way to collect that
data and to maintain that data all unanimous across all the departments and accordingly you ensure that your
most important data is being taken care of.
Master data:
A set of core data [customer, employee, vendor, geographic location] that span all enterprise information
systems.
Data Hierarchy
Bit: a binary digit, or a “0” or a “1” - The smallest unit of data a computer can handle.
Byte: eight bits and represents a single character (e.g., a letter, number, or symbol)
Field: is a group of related characters (e.g., student’s name, age, mobile number)
Record: a group of logically related fields (e.g., student in a university database)
J M K P
Byte Byte
(1001010) (1001101) (1001011) (1010000)
Bit Bit O 1 O 1
Entity: a person, place, thing, or event about which information is maintained. [A record generally describes an
entity]
Attribute: a particular characteristic of a particular entity.
Primary key (Key field): a field that uniquely identifies a record, so that it can be retrieved and updated.
Secondary Key is another field that has some identifying information but typically does not identify the record with
complete accuracy.
Entity-Relationship Modeling
Database designers plan and create the database through a process called entity-relationship (ER) modeling.
ER diagrams consists of entities, attributes, and relationships. [illustrating relationships between database
entities]
Entity classes: groups of entities of a certain type
Instance (record): the representation of a particular entity
Identifiers (Attribute): attributes that are unique to that entity instance.
One-to-One [1:1]
One-to-Many [1:M]
Many-to-Many [M:M]
Entity-Relationship Diagram
Relational Databases
Multidimensional Database
Benefits of Data Warehousing:
End users can access data quickly and easily via Web browsers because they are located in one place.
End users can conduct extensive analysis with data in ways that may not have been possible before.
These benefits can improve business knowledge/ provide competitive advantage/ enhance customer service and
satisfaction/ facilitate decision making/ and streamline business processes.
Incorporating data from obsolete (old) mainframe systems can be difficult and expensive.
People in one department may be reluctant to share data with another department.
Data Marts:
Data mart: a small data warehouse, designed for the end-user needs in a strategic business unit (SBU) or a
department.
Example: Marketing and sale data mart to deal with customer information.
Advantage:
Far less costly than a data warehouse (around R.O. 40, 000)
Knowledge Management
Knowledge: information that is contextual, relevant, and actionable.
Another term for knowledge: Intellectual capital (or intellectual assets)
Best Practices: the most effective and efficient ways of doing things.
Knowledge management (KM): a process of accumulating and creating knowledge efficiently, so that it can
be applied effectively throughout the organization.
KM is not a technology. It a process supported by IS.
“Knowledge management involves efficiently connecting those who know with those who need to know
and converting personal knowledge into organizational knowledge.” (Peter Drucker)
KMS Cycle:
1. Create knowledge. Knowledge is created as people determine new ways of doing things or develop know-
how. Sometimes external knowledge is brought in.
2. Capture knowledge. New knowledge must be identified as valuable and be represented in a reasonable
way.
3. Refine knowledge. New knowledge must be placed in context so that it is actionable. This is where tacit
qualities (human insights) must be captured along with explicit facts.
4. Store knowledge. Useful knowledge must then be stored in a reasonable format in a knowledge repository
so that others in the organization can access it.
5. Manage knowledge. Like a library, the knowledge must be kept current. It must be reviewed regularly to
verify that it is relevant and accurate.
6. Disseminate knowledge. Knowledge must be made available in a useful format to anyone in the
organization who needs it, anywhere and anytime.
Entity-Relationship Modeling
Database designers plan and create the database through a process called entity-relationship (ER) modeling.
ER diagrams consists of entities, attributes, and relationships. [illustrating relationships between database
entities].
Entities:
An entity is an object or concept about which you want to store information.
Entity classes: groups of entities of a certain type.
Attributes:
An attribute describes the property of an entity.
There are four types of attributes:
1. Key attribute
2. Composite attribute
3. Multivalued attribute
4. Derived attribute
1- Key attribute: A key attribute can uniquely identify an entity from an entity set.
3- Multivalued attribute: An attribute that can hold multiple values is known as multivalued attribute.
Example: a person can have more than one phone number. So, the phone number attribute is considered
multivalued.
4- Derived attribute: A derived attribute is one whose value is dynamic and derived from another attribute.
What it does dynamic means? for example, like a person age is a derived attribute
why?
because it changes over time. So, and can be derived from another attribute which is date.
Relationships
It shows how two entities share information in the database.
Three types of Relationships:
1. One to One
2. One to Many
3. Many to Many
1- One-to-one (1-1): One entity from entity set X can be associated with at most one entity of entity set Y
and vice versa.
2- One-to-Many (1-M): One entity from entity set X can be associated with multiple entities of entity set Y,
but an entity from entity set Y can be associated with at least one entity.
3- Many-to-Many (M-M): One entity from X can be associated with more than one entity from Y and vice
versa.
Designing the Database
Primary key (Key field): a field that uniquely identifies a record, so that it can be retrieved and updated.
Foreign Key
A field in one table that uniquely identifies a row (record) of another table. It is used to establish and
enforce a link between two tables.
1 INFS Dr. 12
Zahran
2 ACCT Dr. 10
Fatma
Foreign key is a primary key in another table that has 1:M relationship with this table.
Foreign keys are primary keys from another table.
For every M:M relationship, a new table has to be created.
GOOD LUCK!
ادعوا لي بالتوفيق.