You are on page 1of 4

11/13/2019 Code - Wikipedia

Code
In communications and information processing, code is a system of rules to
convert information—such as a letter, word, sound, image, or gesture—into
another form or representation, sometimes shortened or secret, for
communication through a communication channel or storage in a storage
medium. An early example is the invention of language, which enabled a
person, through speech, to communicate what they saw, heard, felt, or
thought to others. But speech limits the range of communication to the
distance a voice can carry, and limits the audience to those present when the
speech is uttered. The invention of writing, which converted spoken
language into visual symbols, extended the range of communication across
space and time.

The process of encoding converts information from a source into symbols


for communication or storage. Decoding is the reverse process, converting
code symbols back into a form that the recipient understands, such as
English or Spanish.

One reason for coding is to enable communication in places where ordinary


plain language, spoken or written, is difficult or impossible. For example,
semaphore, where the configuration of flags held by a signaler or the arms of Morse code, a famous type of code
a semaphore tower encodes parts of the message, typically individual letters
and numbers. Another person standing a great distance away can interpret
the flags and reproduce the words sent.

Contents
Theory
Variable-length codes
Error-correcting codes
Examples
Codes in communication used for brevity
Character encodings
Genetic code
Gödel code
Other
Cryptography
Other examples
Codes and acronyms
See also
References

Theory
In information theory and computer science, a code is usually considered as an algorithm that uniquely represents symbols from
some source alphabet, by encoded strings, which may be in some other target alphabet. An extension of the code for representing
sequences of symbols over the source alphabet is obtained by concatenating the encoded strings.

Before giving a mathematically precise definition, this is a brief example. The mapping

https://en.wikipedia.org/wiki/Code 1/4
11/13/2019 Code - Wikipedia

is a code, whose source alphabet is the set and whose target alphabet is the set . Using the extension of the code, the
encoded string 0011001011 can be grouped into codewords as 0 011 0 01 011, and these in turn can be decoded to the sequence of
source symbols acabc.

Using terms from formal language theory, the precise mathematical definition of this concept is as follows: let S and T be two finite
sets, called the source and target alphabets, respectively. A code is a total function mapping each symbol from S to a
sequence of symbols over T, and the extension of to a homomorphism of into , which naturally maps each sequence of
source symbols to a sequence of target symbols, is referred to as its extension.

Variable-length codes
Codes that encode each source (clear text) character by a code word from some dictionary, and concatenation of such code words
give us an encoded string. Variable-length codes are especially useful when clear text characters have different probabilities; see also
entropy encoding.

A prefix code is a code with the "prefix property": there is no valid code word in the system that is a prefix (start) of any other valid
code word in the set. Huffman coding is the most known algorithm for deriving prefix codes. Prefix codes are widely referred to as
"Huffman codes" even when the code was not produced by a Huffman algorithm. Other examples of prefix codes are country calling
codes, the country and publisher parts of ISBNs, and the Secondary Synchronization Codes used in the UMTS WCDMA 3G Wireless
Standard.

Kraft's inequality characterizes the sets of codeword lengths that are possible in a prefix code. Virtually any uniquely decodable one-
to-many code, not necessary a prefix one, must satisfy Kraft's inequality.

Error-correcting codes
Codes may also be used to represent data in a way more resistant to errors in transmission or storage. This so-called error-
correcting code works by including carefully crafted redundancy with the stored (or transmitted) data. Examples include Hamming
codes, Reed–Solomon, Reed–Muller, Walsh–Hadamard, Bose–Chaudhuri–Hochquenghem, Turbo, Golay, Goppa, low-density
parity-check codes, and space–time codes. Error detecting codes can be optimised to detect burst errors, or random errors.

Examples

Codes in communication used for brevity


A cable code replaces words (e.g. ship or invoice) with shorter words, allowing the same information to be sent with fewer
characters, more quickly, and less expensively.

Codes can be used for brevity. When telegraph messages were the state of the art in rapid long distance communication, elaborate
systems of commercial codes that encoded complete phrases into single mouths (commonly five-minute groups) were developed, so
that telegraphers became conversant with such "words" as BYOXO ("Are you trying to weasel out of our deal?"), LIOUY ("Why do
you not answer my question?"), BMULD ("You're a skunk!"), or AYYLU ("Not clearly coded, repeat more clearly."). Code words were
chosen for various reasons: length, pronounceability, etc. Meanings were chosen to fit perceived needs: commercial negotiations,
military terms for military codes, diplomatic terms for diplomatic codes, any and all of the preceding for espionage codes.
Codebooks and codebook publishers proliferated, including one run as a front for the American Black Chamber run by Herbert
Yardley between the First and Second World Wars. The purpose of most of these codes was to save on cable costs. The use of data
coding for data compression predates the computer era; an early example is the telegraph Morse code where more-frequently used
characters have shorter representations. Techniques such as Huffman coding are now used by computer-based algorithms to
compress large data files into a more compact form for storage or transmission.

Character encodings
Character encodings are representations of textual data. A given character encoding may be associated with a specific character set
(the collection of characters which it can represent), though some character sets have multiple character encodings and vice versa.
Character encodings may be broadly grouped according to the number of bytes required to represent a single character: there are
single byte encodings, multibyte (also called wide) encodings, and variable-width (also called variable-length) encodings. The

https://en.wikipedia.org/wiki/Code 2/4
11/13/2019 Code - Wikipedia

earliest character encodings were single-byte, the best known example of which is ASCII. ASCII remains in use today, for example in
HTTP headers. However, single-byte encodings cannot model character sets with more than 256 characters. Scripts which require
large character sets such as Chinese, Japanese and Korean must be represented with multibyte encodings. Early multibyte
encodings were fixed-length, meaning that although each character was represented by more than one byte, all characters used the
same number of bytes ("word length"), making them suitable for decoding with a lookup table. The final group, variable-width
encodings, is a subset of multibyte encodings. These use more complex encoding and decoding logic to efficiently represent large
character sets while keeping the representations of more commonly used characters shorter or maintaining backwards compatibility
properties. This group includes UTF-8, an encoding of the Unicode character set; UTF-8 is the most common encoding of text
media on the Internet.

Genetic code
Biological organisms contain genetic material that is used to control their function and development. This is DNA, which contains
units named genes from which messenger RNA is derived. This in turn produces proteins through a genetic code in which a series of
triplets (codons) of four possible nucleotides can be translated into one of twenty possible amino acids. A sequence of codons results
in a corresponding sequence of amino acids that form a protein molecule; a type of codon called a stop codon signals the end of the
sequence.

Gödel code
In mathematics, a Gödel code was the basis for the proof of Gödel's incompleteness theorem. Here, the idea was to map
mathematical notation to a natural number (using a Gödel numbering).

Other
There are codes using colors, like traffic lights, the color code employed to mark the nominal value of the electrical resistors or that
of the trashcans devoted to specific types of garbage (paper, glass, organic, etc.).

In marketing, coupon codes can be used for a financial discount or rebate when purchasing a product from a (usually internet)
retailer.

In military environments, specific sounds with the cornet are used for different uses: to mark some moments of the day, to
command the infantry on the battlefield, etc.

Communication systems for sensory impairments, such as sign language for deaf people and braille for blind people, are based on
movement or tactile codes.

Musical scores are the most common way to encode music.

Specific games have their own code systems to record the matches, e.g. chess notation.

Cryptography
In the history of cryptography, codes were once common for ensuring the confidentiality of communications, although ciphers are
now used instead.

Secret codes intended to obscure the real messages, ranging from serious (mainly espionage in military, diplomacy, business, etc.)
to trivial (romance, games) can be any kind of imaginative encoding: flowers, game cards, clothes, fans, hats, melodies, birds, etc., in
which the sole requisite is the pre-agreement on the meaning by both the sender and the receiver.

Other examples
Other examples of encoding include:

Encoding (in cognition) - a basic perceptual process of interpreting incoming stimuli; technically speaking, it is a complex, multi-
stage process of converting relatively objective sensory input (e.g., light, sound) into subjectively meaningful experience.
A content format - a specific encoding format for converting a specific type of data to information.
Text encoding uses a markup language to tag the structure and other features of a text to facilitate processing by computers.
(See also Text Encoding Initiative.)

https://en.wikipedia.org/wiki/Code 3/4
11/13/2019 Code - Wikipedia

Semantics encoding of formal language A in formal language B is a method of representing all terms (e.g. programs or
descriptions) of language A using language B.
Electronic encoding transforms a signal into a code optimized for transmission or storage, generally done with a codec.
Neural encoding - the way in which information is represented in neurons.
Memory encoding - the process of converting sensations into memories.
Television encoding: NTSC, PAL and SECAM
Other examples of decoding include:

Decoding (computer science)


Decoding methods, methods in communication theory for decoding code words sent over a noisy channel
Digital signal processing, the study of signals in a digital representation and the processing methods of these signals
Digital-to-analog converter, the use of analog circuit for decoding operations
Word decoding, the use of phonics to decipher print patterns and translate them into the sounds of language

Codes and acronyms


Acronyms and abbreviations can be considered codes, and in a sense all languages and writing systems are codes for human
thought.

International Air Transport Association airport codes are three-letter codes used to designate airports and used for bag tags. Station
codes are similarly used on railways, but are usually national, so the same code can be used for different stations if they are in
different countries.

Occasionally, a code word achieves an independent existence (and meaning) while the original equivalent phrase is forgotten or at
least no longer has the precise meaning attributed to the code word. For example, '30' was widely used in journalism to mean "end
of story", and has been used in other contexts to signify "the end".[1] [2]

See also
Asemic writing
Cipher
Code (semiotics)
Equipment codes
Quantum error correction
Semiotics
Universal language

References
1. Kogan, Hadass "So Why Not 29" (http://www.ajr.org/Article.asp?id=4408) Archived (https://web.archive.org/web/201012121017
05/http://ajr.org/Article.asp?id=4408) 2010-12-12 at the Wayback Machine American Journalism Review. Retrieved 2012-07-03.
2. "WESTERN UNION "92 CODE" & WOOD'S "TELEGRAPHIC NUMERALS" " (http://www.civilwarsignals.org/pages/tele/wurules
1866/92code.html). Signal Corps Association. 1996. Archived (https://web.archive.org/web/20120509135118/http://www.civilwa
rsignals.org/pages/tele/wurules1866/92code.html) from the original on 2012-05-09. Retrieved 2012-07-03.

Chevance, Fabienne (2017). "Case for the genetic code as a triplet of triplets" (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5
422812). Proceedings of the National Academy of Sciences of the United States of America. 114 (18): 4745–4750.
doi:10.1073/pnas.1614896114 (https://doi.org/10.1073%2Fpnas.1614896114). PMC 5422812 (https://www.ncbi.nlm.nih.gov/pm
c/articles/PMC5422812). PMID 28416671 (https://www.ncbi.nlm.nih.gov/pubmed/28416671).

Retrieved from "https://en.wikipedia.org/w/index.php?title=Code&oldid=925305862"

This page was last edited on 9 November 2019, at 06:05 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License; additional terms may apply. By using this site, you
agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit
organization.

https://en.wikipedia.org/wiki/Code 4/4

You might also like