You are on page 1of 22

4.

1 BIG
DATA
BASICS
You will:
Evaluate the positive and negative aspects of
using Bigdata.
What is a
big data?
4.1 BIG DATA
BASICS
The term “big data” refers not only to large data sets, but also to the frameworks, techniques, and
tools used to analyze it. It can be collected through any data generating process such as social media,
public utility infrastructure, and search engines. Big data may be either semistructured, structured,
or unstructured.
4.2 WHAT IS
DATABASE?
You will:
Explain the concept of "relational database".
WHY WE NEED
DATABASE?
WHAT IS RELATIONAL DATABASE?
Databases come in many forms, but a popular type of
database is called Relational Database. It store each kind of
data in a Table, which is kind of like storing data in a
spreadsheet. A row represents an item, and a column
represents properties about that item.
A software system used to maintain relational databases
is a relational database management system (RDBMS).
Virtually all relational database systems use SQL (Structured
Query Language) for querying and maintaining the database.
WHAT DO DATABASE SYSTEMS DO?

 They store data


 They store special information used to manage the data. This information is called metadata and it is
not shown to all the people looking at the data.
 They can solve cases where many users want to access (and possibly change) the same entries of
data. They manage access rights (who is allowed to see the data, who can change it)
 When there are many users asking questions to the database, the questions must be answered faster.
Certain attributes are more important than others, they can be used to find other data. This is called
indexing. An index contains all the important data and can be used to find the other data.
WHAT DO DATABASE SYSTEMS DO?

 They ensure that the data always has context.


 Databases help programmers focus on
algorithms and features by completing all the
data storage and search work.

The term “Database Management Systems” or


DBMS was coined in the late 1960s. Sometimes
you may even refer a DBMS as a ‘database’.
4.3 OVERVIEW
OF DATABASE
FIELDS
You will:
Formulate definitions of terms: field, record, index.
What is INDEX?
A database index allows a query to efficiently retrieve data from a
database. Indexes are related to specific tables and consist of one or
more keys. A table can have more than one index built from it. The keys
are a fancy term for the values we want to look up in the index. The
keys are based on the tables’ columns. By comparing keys to the index
it is possible to find one or more database records with the same value.
4.4 KEYS IN
DATABASE
You will:
Identify the primary key in the database.
WHAT IS THE KEY IN
DATABASE?
A key is a field, or combination of fields, in a database table
used to retrieve and sort rows in the table based on certain
requirements. Keys are defined to speed up access to data and, in
many cases, to create links between different tables.
Candidate Key
Candidate keys are defined as the minimal set of fields which
can uniquely identify each record in a table. It is an attribute or a set of
attributes that can act as a Primary Key for a table to uniquely identify
each record in that table. There can be more than one candidate key.
In our example, student_id and phone both are candidate keys
for table Student.
 A candiate key can never be NULL or empty. And its value
should be unique.
 There can be more than one candidate keys for a table.
 A candidate key can be a combination of more than one
columns(attributes).
Primary Key
A primary key is a special relational database table column
(or combination of columns) designated to uniquely identify all
table records. A primary key’s main features are:
• It must contain a unique value for each row of data.
• It cannot contain null values.
A primary key is either an existing table column or a column
that is specifically generated by the database according to a
defined sequence. Primary key is a candidate key that is most
appropriate to become the main key for any table. It is a key that
can uniquely identify each record in a table.
Composite Key
Key that consists of two or more attributes that uniquely identify any record in a table is called
Composite key. But the attributes which together form the Composite key are not a key independently
or individually.
The candidate key which are not
selected as primary key are known as
secondary keys or alternative keys.

You might also like