Professional Documents
Culture Documents
Definition
Sources of Digital Data
Types of Digital Data
Structured Data
Semi-structured Data
Unstructured Data
Definition and Meaning
of
Digital Data
Definition of Digital Data
Digital describes electronic technology that generates,
stores, and processes data in terms of two states: positive
and non-positive. Positive is expressed or represented by
the number 1 and non-positive by the number 0.
Spreadsheets
Structured Data
SQL
OLTP systems
SQL stands for Structured Query Language. SQL lets you access and manipulate databases.
Structured Data
3. Storage of Structured Data
Relational Database
Data Warehouse
Structured Data
Spreadsheet
Structured Data
Example of Structured Data
Structured Data
4. Characteristics of Structured Data
Conforms to a
data model
Data is stored in
form of rows and
Similar entities columns
are grouped (e.g., relational
database)
Structured
data
Definition, format
& meaning of data
is explicitly
known
Summary of Structured Data
Unstructured Data
1. Definition
Unstructured data is information that either does not have a
pre-defined data model or is not organized in a pre-defined
manner.
Unstructured data represents any data that does not have a
recognizable structure.
Unstructured data, in contrast, refers to data that doesn't fit
neatly into the traditional row and column structure of
relational databases.
Data which does not conform to a data model or is not in a
form which can be used easily by a computer program.
E.g. memos, chat rooms, PowerPoint presentations, images,
audios, videos, letters, researches, white papers, body of an
e-mail etc.
Formats of Digital Data
Unstructured Data
2. Sources of Unstructured Data
Web pages
Memos
Body of an e-mail
PowerPoint presentations
Chats
Reports
Whitepapers
Surveys
Unstructured Data
3. Challenges in Storage of Unstructured Data
Sheer volume of unstructured data and its unprecedented
growth makes it difficult to store. Audios, videos, images,
Storage Space etc. acquire huge amount of storage space
Update and delete Updating, deleting, etc. are not easy due to
the unstructured form
Possible solutions
RDBMS/ Store in relational databases which
BLOBs support BLOBs which is Binary
Large Objects
Does not
conform to any
data model
Cannot be
stored in form
Has no easily of rows and
identifiable columns as in a
structure database
Unstructured
data
Not in any
Does not particular
follow any format or
rules sequence
Not easily
usable by a
program
Semi-structured Data
1. Definition
Data which does not conform to a data model but has
some structure. It is not in a form which can be used easily
by a computer program.
It is structured data, but it is not organized in a rational
model, like a table.
Semi-structured data is information that does not reside in
a rational database but that have some organizational
properties that make it easier to analyze.
With some process, you can store them in the relation
database.
Semi-structured Data
2. Sources of Semi-structured
E-mail
XML
TCP/IP packets
Mark-up languages
Not sufficient
Metadata