Homework 6

Saif Mahmud Ishtiak- B18231077, Section- A
1. A) Bit, B) Byte, C) Field, D) Record, E) File, F) Database

A bit, short for binary digit, is the most basic unit of data in telecommunications and
computing. Each bit is represented by either a 1 or a 0 which can be executed in various
systems through a two-state device. Bits may be used to describe a computer's processing
power in terms of the number of bits a computer can process at one time.
A byte is a storage unit that can represent a single character, such as a letter, number or
symbol. For large amounts of data, prefixes are added to bytes, for example, Kilobyte,
Megabyte, Gigabyte, Terabyte. Most computer apps compare size and transfer rates in the
aforementioned larger multiples of bytes. Some programming languages also uses bytes as
data type.
In C#, a field is a member of a class or an object of any type that represents a memory
location for storing a value. A field can be of static or instance type. Fields are used to store
data having access to multiple methods of a class and available throughout the lifetime of an
object.
A record is a grouping of fields and related data within a table that reference one particular
object within the same structure. The term record is used synonymously with row quite often
and the term is also known as a tuple.
A file is a container in a computer system for storing information. There are different types of
files such as text files, data files, directory files, binary and graphic files which store different
types of information. In a computer operating system, files can be stored in optical drives,
hard drives or other types of storage devices.
A database is defined as an electronic system that allows data to be easily organized,
accessed, manipulated and updated. It is used by organizations as a method of storing,
managing and retrieving information. Modern databases are managed using a database
management system (DBMS). A database structure typically stores data in a tabular format
and the structure may be external, internal or conceptual.
2. A) Primary Key, B) Foreign Key
A primary key is a special relational database table column designed to uniquely identify all
table records. It is used to quickly parse data within the database and find connections
between different tables. A relational database has one sole primary key.
A foreign key is a column or group of columns in a relational database table that provides a
link between data in two tables. Most tables in a relational database system follow the foreign
key concept. While a primary key can exist on its own, a foreign key must always reference
to a primary key.
3. Capabilities of DBMS
 Includes tools for organizing, managing, and accessing the data in the database
 Data definition, dictionary and manipulation
 Query ability
 Controlling of redundancy
 Computation
 Backup and replication
 Automated optimization
 Rule enforcement and security
 Provides multiple user interfaces
4. Normalization
Normalization is the process of reorganizing data in a database so that it all data is stored in
just one place and all related data items are stored together. Normalization is important
mostly because it allows databases to cover up as little disk space as possible, resulting in
increased performance.
5. Hadoop
Apache Hadoop is a freely licensed software framework developed by the Apache Software
Foundation and used to develop data-intensive, distributed computing. Hadoop is designed to
scale from a single machine up to thousands of computers. The concept was inspired by
Google MapReduce and Google File System papers.
6. A) Data Warehouse B) Data Mart
A data warehouse is a collection of corporate information and data derived from operational
systems and external data sources. It is designed to support business decisions by allowing
data consolidation, analysis and reporting at different aggregate levels through extraction,
transformation and loading.
A data mart is a subject-oriented archive that stores data and uses the retrieved set of
information to assist and support the requirements involved within a particular business
function or department. They exist within a single organizational data warehouse repository.
7. Online analytical processing (OLAP)
Online analytical processing (OLAP) is a high-level concept that describes a category of tools
that helps in the analysis multi-dimensional queries It became relevant in the 1970s as the
volume of business data became too heavy for adequate analysis through SQL queries. OLAP
can uncover data relationships between seemingly unrelated events and trends, thus
enhancing business decision making.
8. Data Mining
Data mining is the process of analyzing hidden patterns of data according to different
perspectives for categorizing into useful information, which is collected and assembled in
common areas, such as data warehouses, for efficient analysis, data mining algorithms,
facilitating business decision making and other information requirements to ultimately cut
costs and increase revenue. Data mining is also known as data discovery and knowledge
discovery.

Homework 6

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Homework 6

Uploaded by

Copyright:

Available Formats

Saif Mahmud Ishtiak- B18231077, Section- A

1. A) Bit, B) Byte, C) Field, D) Record, E) File, F) Database

You might also like