Professional Documents
Culture Documents
information,
knowledge
and
processing
Data,
Information
&
Knowledge
KEYWORDS
o Data – Raw facts without meaning
o Information – Data items with context and meaning
o Knowledge – Understanding of what information is about.
Understanding information in such a way that it can be applied for a
specific purpose.
OR
Information ???
Example 2
Processing
Information ???
Example 3
111192, 111234
Raw Data
Information ???
Suggested answers to examples
• Example 1
• We could add up the yes and no responses and calculate the percentage of customers who
would buy product X at price Y. The information could be presented as a chart to make it
easier to understand.
• Example 2
• Adding Jayne’s scores would give us a mark out of 600 that could then be converted to an A
level grade. Alternatively we could convert the individual module results into grades.
• Example 3
• By subtracting the second value from the first we can work out how many units of gas the
consumer has used. This can then be multiplied by the price per unit to determine the
customer’s gas bill.
Knowledge
• Knowledge is the understanding of rules needed to interpret
information
• Information to which human experience has been applied
• Jayne’s teacher could analyse the results to determine whether it would be worth
her re-sitting a module
• Looking at the pattern of the customer’s previous gas bills may identify that the
figure is abnormally low and they are fiddling the gas meter!!!
Knowledge Workers
• Knowledge workers have specialist knowledge
that makes them “experts”
• Based on formal and informal rules they
have learned through training and
experience
Processing
Data – raw facts and figures
Information – data that has been processed (in a context) to give it meaning
Sources of data
Direct data
Sources of sources
Data
Indirect data
sources
Direct and
Indirect Data
source
• Direct data source (original
data) : data that is collected
for the purpose for which it
will be used
• Indirect data source
(secondary source: data has
been collected for a
particular reason but then
that data is used for
something else
Direct Data
Sources
What is it?
• Gathered for a specific
purpose or task. (e.g.
questionnaires or data
logging)
• gathered without having to
go to a third party. - ‘original
source data’.
Methods of collecting
direct data (sources)
• Questionnaires
• Interviews
• Observations
• Data logging
Questionnaires
• Questionnaires are a commonly used method of collecting
data from people. They are simple to administer and most
respondents are familiar with filling them in, either in a
paper format or online.
Cost
Time
• data that was collected for a particular reason but is then used
for something else.
• This second company might then email you with a list of related items you might be interested in.
• For example, you buy a computer game from one online company, then an email arrives from a
different company asking if you would like to buy a strategy book for the game.
Example 2 – Weather data
Weather data
• But this 'data set' may also be purchased by a local business who
wants to see how sales of their ice-cream relates to the weather.
Sampling bias
First, Second- & Third-Party Data
extra
First Party (direct)
extra
Second party (extra – not in syllabus)
extra
Third Party
extra
Can you do this?
Answer
Quality of Information
• accuracy
• relevance
• age (up-to-date, out-of-date)
• level of detail
• completeness.
Accuracy, Accuracy, Accuracy
further work
Validation
• Validation = 1 way trying to reduce errors in data entered into system.
• The validation is performed by the computer at the point when you enter data. It is the process of checking
the data against the set of validation rules which you set up when developing your new database or
spreadsheet system.
I *
• Presence Check • Format Check
• Length Check
Range check
• A range check is commonly used when you are working with data which consists of numbers, currency or
dates/times.
• A range check allows you to set suitable boundaries:
Type check
• When you begin to set up your new system you will choose the most appropriate data type for each field.
• A type check will ensure that the correct type of data is entered into that field.
• For example, in a clothes shop, dress sizes may range from 8 to 18. A number data type would be a suitable
choice for this data. By setting the data type as number, only numbers could be entered e.g. 10, 12, 14 and
you would prevent anyone trying to enter text such as ‘ten’ or ‘ten and a half’.
• Some data types can perform an extra type check.
• For example, a date data type will ensure that a date you have entered can actually exist e.g. it would not
allow you to enter the date 31/02/07.
Check Digit
• This is used when you want to be sure that a range of numbers has been entered correctly. There are many
different schemes (algorithms) for creating check digits.
• For example, the ISBN-10 numbering system for books makes use of 'Modulo-11' division. In modulo
division, the answer is the remainder of the division. For example
• 8 Mod 3 = 2 i.e. the remainder of dividing 8 by 3 is 2.
• Consider the ISBN number:
• ISBN 1 84146 201 2
• The check digit is the final number in the sequence, so in this example it is the final ‘2’.
• The computer will perform a complex calculation on all of the numbers and then compare the answer to the
check digit. If both match, it means the data was entered correctly.
Length Check
• Sometimes you may have a set of data which always has the same number of characters.
• For example a UK landline telephone number has 11 characters.
• A length check could be set up to ensure that exactly 11 numbers are entered into the field. This type of
validation cannot check that the 11 numbers are correct but it can ensure that 10 or 12 numbers aren't
entered.
• A length check can also be set up to allow characters to be entered within a certain range.
• For example, postcodes can be in the form of:
• CV45 2RE (7 without a space or 8 with a space) or
• B9 3TF (5 without a space or 6 with a space).
• So you could set a length check for postcode to accept data which has a minimum number of 5 characters
and a maximum number of 8.
Can you do this?
Answer
Accuracy - Verification
/ Cypher text
• TLS (Transport layer security – secures communication to website and personal data
Symmetric encryption
Encryption Encryption
Algorithm Algorithm
• Sets of data are processed all at one time without user interaction
• A table in a database containing information about one set of things, e.g.
employees
• Permanent
• Updated periodically
• Existing master files is used with a transaction file to produce new
updated master file
Master file
• Temporary files
• File that contains all ongoing transactions in a batch-processing system
• Has changes to be made to master file (additions, deletions and updates)
• Data that is used to update the master file
Transaction file
Master file Transaction file
• Single, automated process requiring little • Delay because data is not processed
human input – reduce costs until specific time period
• Can be scheduled when little demand on • Only data of the same type can be
computer recourses (at night) processed because identical automated
• No transcription and update errors process is being applied to all data
produced by humans – process are
• Errors cannot be corrected until the
automated
batch process is complete
• Fewer repetitive tasks for human
operators