Professional Documents
Culture Documents
Answer :
Database design is the process of producing a detailed data model of database. This data
model contains all the needed logical and physical design choices and physical storage
parameters needed to generate a design in a data definition language, which can then be
used to create a database.
Answer :
The process of logical design involves arranging data into a series of logical relationships
called entities and attributes. An entity represents a chunk of information. In relational
databases, an entity often maps to a table. An attribute is a component of an entity and
helps define the uniqueness of the entity.
Answer :
Instead of having all the data in a list with a random order, a database provides a structure
to organize the data. One of the most common data structures is a database table. A
database table consists of rows and columns. A database table is also called a two-
dimensional array.
Answer :
A logical data model or logical schema is a data model of a specific problem domain
expressed independently of a particular database management product or storage
technology (physical data model) but in terms of data structures such as relational tables
and columns, object-oriented classes, or XML tags.
Answer :
Answer :
A physical database model shows all table structures, including column name, column
data type, column constraints, primary key, foreign key, and relationships between tables.
Features of a physical data model include: Specification all tables and columns. Foreign
keys are used to identify relationships between tables.
Answer :
This phase is called conceptual design. The result of this phase is an Entity-Relationship
(ER) diagram or UML class diagram. It is a high-level data model of the specific
application area. It describes how different entities (objects, items) are related to each
other.
Answer :
Answer :
Answer :
Theories are formulated to explain, predict, and understand phenomena and, in many
cases, to challenge and extend existing knowledge within the limits of critical bounding
assumptions. The theoretical framework is the structure that can hold or support a theory
of a research study.
11. Question 11. What Is The Conceptual Framework?
Answer :
Answer :
Answer :
Physical model (most commonly referred to simply as a model but in this context
distinguished from a conceptual model) is a smaller or larger physical copy of an object.
The object being modelled may be small (for example, an atom) or large (for example,
the Solar System).
Answer :
Answer :
Data modeling is often the first step in database design and object-oriented programming
as the designers first create a conceptual model of how data items relate to each other.
Data modeling involves a progression from conceptual model to logical model to
physical schema.
16. Question 16. What Are The Features Of A Physical Data Model?
Answer :
Answer :
18. Question 18. What Are The Features Of Conceptual Data Model?
Answer :
Answer :
1. In a logical data model, primary keys are present, whereas in a conceptual data
model, no primary key is present.
2. In a logical data model, all attributes are specified within an entity. No attributes
are specified in a conceptual data model.
3. Relationships between entities are specified using primary keys and foreign keys
in a logical data model. In a conceptual data model, the relationships are simply
stated, not specified, so we simply know that two entities are related, but we do
not specify what attributes are used for this relationship.
2. Question 20. What Are The Steps To Design Logical Data Model?
Answer :
The steps for designing the logical data model are as follows:
Data modeling
Question 1. What Is Data Modeling?
Answer :
Answer :
Logical Data Model: Entity, Attributes, Super Type, Sub Type, Primary Key, Alternate
Key, Inversion Key Entry, Rule, Relationship, Definition, business rule, etc
Physical Data Model: Table, Column, Primary key Constraint, Unique Constraint or
Unique Index, Non Unique Index, Check Constraint, Default Value, Foreign Key,
comment etc.
Answer :
A logical data model is the version of a data model that represents the business requirements
(entire or part of an organization). This is the actual implementation and extension of a
conceptual data model. Logical Data Models contain Entity, Attributes, Super Type, Sub Type,
Primary Key, Alternate Key, Inversion Key Entry, Rule, Relationship, Definition etc. The
approach by which logical data models are created is called as logical data modeling.
Answer :
Physical data model includes all required tables, columns, relationship, database properties for
the physical implementation of databases. Database performance, indexing strategy, and physical
storage are important parameters of a physical model. The important or main object in a database
is a table which consists or rows and columns. The approach by which physical data models are
created is called as physical data modeling.
Question 5. What Is The Difference Between A Logical And Physical Data Model?
Answer :
When a data modeler works with the client, his title may be a logical data modeler or a physical
data modeler or combination of both. A logical data modeler designs the data model to suit
business requirements, creates and maintains the lookup data, compares the versions of data
model, maintains change log, generate reports from data model and whereas a physical data
modeler has to know about the source and target databases properties.
A physical data modeler should know the technical-know-how to create data models from
existing databases and to tune the data models with referential integrity, alternate keys, indexes
and how to match indexes to SQL code. It would be good if the physical data modeler knows
about replication, clustering and so on.
Answer :
Data stored in form of rows and columns is called as table. Each column has datatype and based
on the situation, integrity constraints are enforced on columns.
Answer :
Column also known as field is a vertical alignment of the data and contains related information
to that column.
Answer :
Row also known as tuple or record is the horizontal alignment of the data.
Answer :
ER diagram is a visual representation of entities and the relationships between them. In a data
model, entities (tables) look like square boxes or rectangular boxes, which contain attributes and
these entities, are connected by lines (relationship).
Answer :
Primary key constraint is imposed on the column data to avoid null values and duplicate values.
Primary Key=Unique + Not Null. Example: social security number, bank account number, bank
routing number
Answer :
When more than one column is a part of the primary key, it is called as composite primary key
constraint.
Answer :
In normal practice, a numerical attribute is enforced a primary key which is called as surrogate
key. Surrogate key is a substitute for natural keys. Instead of having primary key or composite
primary keys, the data modelers create a surrogate key; this is very useful for creating SQL
queries, uniquely identify a record and good performance.
Answer :
Parent table has primary key and a foreign key constraint is imposed on a column in the child
table. The foreign key column value in the child table will always refer to primary key values in
the parent table.
Answer :
When group of columns are in a foreign key, it is called as composite foreign key constraint.
Question 15. What Are The Important Types Of Relationships In A Data Model?
Answer :
Identifying, Non-Identifying Relationship, Self-Recursive relationship are the types of
relationship.
Answer :
Usually, in a data model, parent tables and child tables are present. Parent table and child table
are connected by a relationship line. If the referenced column in the child table is a part of the
primary key in the child table, relationship is drawn by thick lines by connecting these two
tables, which is called as identifying relationship.
Answer :
Usually, in a data model, parent tables and child tables are present. Parent table and child table
are connected by a relationship line. If the referenced column in the child table is a not a part of
the primary key and standalone column in the child table, relationship is drawn by dotted lines
by connecting these two tables, which is called as non-identifying relationship.
Answer :
A standalone column in a table will be connected to the primary key of the same table, which is
called as recursive relationship.
Answer :
One to One, One to many, and many to many are different types of cardinalities. In a database,
high cardinality means more unique values are stored in a column and vice versa.
Question 20. What Is A Conceptual Data Model And Conceptual Data Modeling?
Answer :
Conceptual data model includes all major entities and relationships and does not contain much
detailed level of information about attributes and is often used in the initial planning phase. Data
Modelers create conceptual data model and forward that model to functional team for their
review. The approach by which conceptual data models are created is called as conceptual data
modeling.
Answer :
Enterprise data model comprises of all entities required by an enterprise. The development of a
common consistent view and understanding of data elements and their relationships across the
enterprise is referred to as Enterprise Data Modeling. For better understanding purpose, these
data models are split up into subject areas.
Answer :
Answer :
Question 24. What Is A Constraint? What Are The Different Types Of Constraint?
Answer :
Constraint is a rule imposed on the data. The different types of constraints are primary key,
unique, not null, foreign key, composite foreign key, check constraint etc.
Answer :
Unique constraint is imposed on the column data to avoid duplicate values, but it will contain
NULL values.
Question 26. How Many Null Values Can Be Inserted In A Column That Has Unique
Constraint?
Answer :
Many null values can be inserted in an unique constraint column because one null value is not
equal to another null value.
Answer :
Answer :
Answer :
Answer :
E.F. Codd gave some rules to design relational databases and the rules were focused on
removing data redundancy which helps to overcome normal data modeling problems. The
process of removing data redundancy is known as normalization.
Answer :
First normal form, Second normal form, third normal forms are three types of normalization used
in practice. Beyond these normal forms, Boyce-Codd fourth and fifth normal forms are also
available.
Answer :
De-Normalization is a process of adding redundancy to the data. This helps to quickly retrieve
the information from the database.
Question 33. What Is Data Model Meta Data?
Answer :
You can take a report of the entire data model, or subject or part of the data model. The data
about various objects in the data model is called as data model Metadata. .Data Modeling Tools
have options to create reports by checking the various options. Either you can create logical data
model Meta data of physical model Meta data.
Answer :
Data Model and its relevant data like entity definition, attribute definition, columns, data types
etc. are stored in a repository, which can be accessed by data modelers and the entire team.
Answer :
Forward Engineering is a process by which DDL scripts are generated from the data model. Data
modeling tools have options to create DDL scripts by connecting with various databases. With
these scripts, databases can be created.
Answer :
Reverse Engineering is a process useful for creating the data models from database or scripts.
Data modeling tools have options to connect to the database by which we can reverse engineer a
database into a data model.
Answer :
An entity can be split into many entities (sub-entities) and grouped based on some characteristics
and each sub entity will have attributes relevant to that entity. These entities are called subtype
entities. The attributes which are common to these entities are moved to a super (higher) level
entity, which is called as supertype entity.
Answer :
Consider any system where people use some kind of resources and compete for them. The non-
computer examples for preemptive scheduling the traffic on the single lane road if there is
emergency or there is an ambulance on the road the other vehicles give path to the vehicles that
are in need. The example for preemptive scheduling is people standing in queue for tickets.
Question 39. What Is The Difference Between Star Flake And Snow Flake Schema?
Answer :
Star Schema: Well in star schema you just enter your desired facts and all the primary keys of
your dimensional tables in Fact table. And fact tables primary is the union of its all dimension
table key. In star schema dimensional tables are usually not in BCNF form.
Snow Flake: Its almost like starschema but in this our dimension tables are in 3rd NF, so more
dimensions tables. And these dimension tables are linked by primary, foreign key relation.
Answer :
Data sparsity is term used for how much data we have for a particular dimension/entity of the
model.
It affects aggregation depending on how deep the combination of members of the sparse
dimension make up. If the combination is a lot and those combination do not have any factual
data then creating space to store those aggregations will be a waste as a result, the database will
become huge.
Question 41. What Is The Difference Between Hashed File Stage And Sequential File
Stage In Relates To Datastage Server?
Answer :
In datastage server jobs, can we use sequential filestage for a lookup instead of hashed filestage.
If yes ,then what’s the advantage of a Hashed File stage over sequential filestage
search is faster in hash files as you can directly get the address of record directly by hash
algorithm as records are stored like that but in case of sequential file u must compare all the
records.
Answer :
Denormalization is used when there are a lot of tables involved in retrieving data.
Denormalization is done in dimensional modelling used to construct a data ware house. This is
not usually done for data bases of transactional systems.
Answer :
Data models are tools used in analysis to describe the data requirements and assumptions in the
system from a top-down perspective. They also set the stage for the design of databases later on
in the SDLC.
Answer :
An entity is in the third normal form if it is in the second normal form and all of its attributes are
not transitively dependent on the primary key. Transitive dependence means that descriptor key
attributes depend not only on the whole primary key, but also on other descriptor key attributes
that, in turn, depend on the primary key. In SQL terms, the third normal form means that no
column within a table is dependent on a descriptor column that, in turn, depends on the primary
key.
For 3NF, first, the table must be in 2NF, plus, we want to make sure that the non-key fields are
dependent upon ONLY the PK, and not other non-key fields for its existence. This is very similar
to to 2NF, except that now you are comparing the non-key fields to OTHER non-key fields.
After all, we know that the relationship to the PK is good, because we established that in 2NF.
Question 45. Why Are Recursive Relationships Are Bad? How Do You Resolve Them?
Answer :
Recursive relationships are an interesting and more complex concept than the relationships you
have seen in the previous chapters, such as a one-to-one, one-to-many, and many-to-many. A
recursive relationship occurs when there is a relationship between an entity and itself. For
example, a one-to-many recursive relationship occurs when an employee is the manager of other
employeess. The employee entity is related to itself, and there is a one-to-many relationship
between one employee (the manager) and many other employees (the people who report to the
manager). Because of the more complex nature of these relationships, we will need slightly more
complex methods of mapping them to a schema and displaying them in a style sheet.
Question 46. Is This Statement True Or False? All Databases Must Be In Third Normal
Form?
Answer :
In general all organization data bases are normalized to 3nf in order to remove redundancy and
efficient access. A data base can also be created without normalization. Hence it is not a
mandatory that a database should be in 3nf.
Question 47. What Is An Artificial (derived) Primary Key? When Should It Be Used?
Answer :
Using a name as the primary key violates the principle of stability. The social security number
might be a valid choice, but a foreign employee might not have a social security number. This is
a case where a derived, rather than a natural, primary key is appropriate. A derived key is an
artificial key that you create. A natural key is one that is already part of the database.
Answer :
An entity is in the second normal form if all of its attributes depend on the whole (primary) key.
In relational terms, every column in a table must be functionally dependent on the whole primary
key of that table. Functional dependency indicates that a link exists between the values in two
different columns.
If the value of an attribute depends on a column, the value of the attribute must change if the
value in the column changes. The attribute is a function of the column. The following
explanations make this more specific:
If the table has a one-column primary key, the attribute must depend on that key.
If the table has a composite primary key, the attribute must depend on the values in all its
columns taken as a whole, not on one or some of them.
If the attribute also depends on other columns, they must be columns of a candidate key; that is,
columns that are unique in every row.
If you do not convert your model to the second normal form, you risk data redundancy and
difficulty in changing data. To convert first-normal-form tables to second-normal-form tables,
remove columns that are not dependent on the primary key.