Professional Documents
Culture Documents
Natureza da Informao
Introduction and some bits about the
History of Information Theory
Course Goals
Goal:
Prerequisites
No formal prerequisites
High school prerequisites
Set theory
Combinatorial analysis
Probability and statistics
Logarithms properties
Text interpretation
8,5 average = A
7,0 average < 8,5 = B
6,0 average < 7,0 = C
5,0 average < 6,0 = D
Average < 5,0 = F
Tidia
http://tidia-ae.ufabc.edu.br
Log in using your institutional
credentials. The professor will
register the students
Course material
Course material: Will be made available at
Tidia (http://tidia-ae.ufabc.edu.br) on the
section Repositrio (Resources).
Class slides
Exercise lists and answers.
Estimated chronogram
Content
22, 28/09
Cronograma estimado
09 e 10/11
Contedo
Efficient coding and data compression
17 e 23/11
24/11 e
01/12
07/12
08/12
15/12
Recommended reading:
Decoding the Universe. Charles Seife (2006) Penguin Books.
(Este livro, embora de carter divulgativo, o que captura melhor o foco multidisciplinar da
disciplina. Apresenta um roteiro que pode ser preenchido abordando cada tpico com uma
profundidade maior . Existe somente um exemplar na biblioteca. Existe verso em pdf)
Informational Axis
The advances of science and technology are
multiplying our capacity to collect, process, produce
and use information leading it to new levels never
reached before. It brings:
new opportunities
new social questions
more science and technology advances, fostering a
virtuous cycle
Informational Axis
Fundamentals and processes:
Information Nature: what is Information and how can
we represent or measure it?
Information Processing: manipulation and information
treatment, under both human and computer aspects
(processing)
Information communication: transmission and
distribution of the information and its impact
Natureza da
Informao
Transforma
o
da
Informao
Comunicao
da
Informao
Abstrata
Concreta
(conceitual)
Terica
(suporte)
Tecnologia
Social
(utilizao)
Humana
O Bit
Entropia
Analog. X Digital
Capac. Shannon
T. Informao
Smbolos e
Sinais
Rudo
T. Computao
Proc. Sinais
Criptografia
Complexidade
Programao
Minerao Dados
Traduo
Aprendizado
Crebro
Conhecimento
Razo/Emoo
Redes Sociais
Sist.
Comunicaes
Redes e Trfego
Eletrnica
Fotnica
Novas Tecnologias
Linguagem
Humana
Internet
Soc. Informao
Econ. Informao
Regulao/tica
Org. Computadores
Compresso Dados
T. Comunicaes
Capacidade canal
Canal gaussiano
Informao
gentica
Codificao
Proc. Estocsticos
Ordem e Desordem
Caos
Transformadas
Sentidos/percepo
Cognio e Ao
Inteligncia
Conscincia
Memria
Amostras
Data
Data: basic element that quantifies or
qualifies something
Do not carry any intrinsic meaning by
itself
Initial perception about the subject
Identified by symbolic characteristics
Data
A set of facts about the world;
They are usually quantifiable;
Can be easily captured and stored in
computational devices;
Do not carry meaning and neither
can be used in judgements;
So actions cant be made based on
just data.
Data
Types
Alphanumeric
Product code, price, amount, etc.
Images
Photographs
Audio
Video
etc...
Data
Basic facts
E.g.: shopping at the supermarket
Code
Description
Individual Price
Quantity
12314
Chocolate
R$ 3,50
86456
Milk
R$ 1,50
45675
Butter
R$ 2,00
54387
Juice
R$ 3,00
57871
Cheese
R$ 5,00
89452
Beer
R$ 1,50
Information
Information: interpreted and
contextualized data
Requires data interpretation
Is the result of processed data, its useful for
decision-making.
Information
A conjugated data set that has
relevance and purpose;
Can be transformed by humans and
be submitted to judgment
Semiotics (quality of information)
Information analysis to produce
knowledge
Information
Transformed data with aditional information
Code
Description
Individual Price
Quantity
12314
Chocolate
R$ 3,50
R$ 7,00
86456
Milk
R$ 1,50
R$ 12,00
45675
Butter
R$ 2,00
R$ 2,00
54387
Juice
R$ 3,00
R$ 12,00
57871
Cheese
R$ 5,00
R$ 5,00
89452
Beer
R$ 1,50
R$ 9,00
22
R$ 47,00
Total
Total Price
Information
Can help to increase the profit
Cdigo
Descrio
12314
Chocolate
R$ 3,50
R$ 7,00
86456
Leite
R$ 1,50
R$ 12,00
45675
Manteiga
R$ 2,00
R$ 2,00
54387
Suco
R$ 3,00
R$ 12,00
57871
Queijo
R$ 5,00
R$ 5,00
89452
Cerveja
R$ 1,50
R$ 9,00
22
R$ 47,00
Total
Preo Individual
Quantidade
Preo Total
Knowledge
Knowledge: ability to create a
model and suggest actions or
decisions to take
Comprehension, analysis and synthesis
procedures (models)
Information + processing = knowledge
(processing = experience, training, etc.)
Knowledge
A structured and organized set of
information;
Requires intelligent human
judgement
Semiotics: assignment of meaning to
information;
Knowledge
Ex. supermarket:
The items X and Y are frequently bought
together
Milk and chocolate
Hierarchy
Data x Information x
Knowledge
Data x Information x
Knowledge
What is information?
Latin etimology
from in- "into", + formare to form, shape
(into form, delineate)
Different perspectives
Biology
Linguistics
Physics
Computer Science
What is the relationship between information and
communication?
Communication: information exchange between the
actors
Information Theory
Information is inversely proportional to
the probability of occurrence of a fact
Higher probability facts are less informative
Examples:
The first time we listen a disc, it brings new musical
knowledge. But after listen to it many times, we can
predict the next accords, so this disc brings no more
information to us.
We all can predict the missing O at the word L_VE,
because its a common word. So it is unnecessary to
write that character at that position.
Shannon
Em 1948, Shannon published the paper A Mathematical
Theory of Communication, republished as book next
year
Before him, isolated works went step by step towards a
general theory of communication
Nowadays the Theory of Communication (or Information
Theory) is a huge research area with many books and
symposiums about the subject
Information Theory
Information Theory is a broad theory that involves a lot of
math.
Bit is the fundamental measure to quantify information
Information Theory
Information theory allows us to:
Say how many informative bits could be sent per second through
a certain communication channel
Measure the rate in which a source can produce information
Say how to represent efficiently, or encode, messages to be
transmitted through some channel.
Say how we can avoid transmission errors
Thermodynamic studies
Cryptography and computers built at
the World War II
Transmission technology, starting with
Morse code and telephony
Industrial Revolution
1769. Watt
invented the
steam
machine
Research to
improve the
machine
efficiency
Cold
source
Q2
Heat
Engine
Q1
Hot source
Q2
work
work
Heat
Source
Q1
Hot source
Refrigeration
Rudolf Claussius
It is impossible to
stop the tendency
to thermodynamic
equilibrium of the
universe (1860)
In other words, the
entropy always
tends to increase
Q2
Heat
Engine
Work
Q1
Heat
source
Q2
Heat
Engine
Q1
Work
Q1
Hot source
II Thermodynamics law
Entropy measures the
thermodynamics equilibrium
More entropy means more
equilibrium
The second thermodynamics law
says that the entropy amount of any
system tends to increase with time
until its maximum value
S=k log(W)
Entropy equation
inscribed at Boltzmann's
tomb
Atoms of a gas tend to
disperse itself uniformly.
The universe entropy
always increases
Later in the course we
will see the bridge
between the entropies of
Boltzmann and Shannon
Cryptography and
Computers at World War II
AF is short of water
These words
changed the
course of the
war at Pacific.
The
cryptographic
japanese code
(JN-25) was
decoded by
americans
Commander Rochefort
AF is out of water!
Admiral Yamamoto
End of War
Americans waited
for the japanese
arrival
Four Japanese
aircraft carriers
were destroyed.
The end of World
War II began
European U-boats
Enigma
The german Arthur
Scherbius invented
Enigma, a machine
to encrypt messages
3x10114 different
states possible
Brute force would
require: each atom
on universe
computing a trillion
of keys per second,
since the beginning
of universe
Cryptography
A simple cryptographic method is to
replace each letter (character) by the
letter placed N positions forward in
the alphabet
Caesar cipher
Redundancy
-s v-n-s d- c--nc-- e d- t-cn-l-g-- -stm-lt-pl-c-ndo -s n-ss-s c-p-c-d-d-s dc-l-t-r, tr-t-r, g-r-r - -t-l-z-r
-nf-rm--es.
Technologies for
Information Transmission
Telegraph origins
1838: Samuel B. Morse worked with Alfred Vail on a
code known today as the Morse code:
Alphabetic characters are represented by spaces, dots and
dashes
Electronic transmission was achieved representing spaces by
current absence, dots by short currents, and dashes by long
currents
Morse Code
Answer:
Using the modern Information Theory we found that the
transmission rate gain would be about 15% at most.
This suggests that Morse intuitively attacked on of the main
problems addressed by Information Theory.
Telegraph limitations
Limitations summary
Contributions to the
Information Theory
Many people contributed mathematically to the information
theory at XIX century:
Fouriers contribution
Fourier based his works on the sine function
Finally Shannon
So when Shannon published his works in 1947,
much was already done
In a certain way, he summarized and produced
new knowledge about all these problems
previously studied by other researchers
Shannons contribution
But we could say that his great contribution was to answer
the following questions:
How could we encode (with electric signals) a message from a
source to transmit it as fast as possible through a channel that
introduces noises with a certain pattern?
How fast could we transmit certain message through specific
channels without errors?