Professional Documents
Culture Documents
COMMUNICATIONS
© DIGITALVISION
he recent establishment of the 10/40 Gbps technology marily nonlinear (NL) effects but also chromatic-dispersion
IEEE CIRCUITS & DEVICES MAGAZINE ■ NOVEMBER/DECEMBER 2006 8755-3996/06/$20.00 ©2006 IEEE 79 ■
generation FEC methods have been demonstrated, yielding Outband Versus Inband
coding-gains in excess of 10 db [2]. Generally, to achieve strong, payload-agnostic FEC, the line-
A remarkable variety of FEC systems hold a share in today’s rate must be increased in accordance with the FEC-rate. This
market of optical communications. They differ in a number of is the usual mode of operation, called outband FEC. Alterna-
respects, such as transmission overhead (redundancy), imple- tively, if the client-signal contains superfluous overhead (OH),
mentation complexity, coding-gain, BER-performance, burst- parity code-symbols can be accommodated therein. Such
error correction ability, etc. Hybrid-schemes, combining inband modes of operation preserve the line-rate, rendering
inband/outband FEC with advanced line-coding formats and link-quality upgrades affordable. Notably, inband and outband
equalization-methods, are a particularly active research topic. modes may coexist in a link, complementing each other. The
In this article, we present and discuss the most representative SDH (SONET) standards define an abundance of OH that
architectures of 1/2/3-g outband and inband FEC schemes. We could be used for inband-coding and yet remain in compliance
also comment on FEC performance, we refer to actual chipsets with the Reduced SOH functionalities interface of the ITU-T
and examine future prospects. G.707 recommendation.
Register
Register
Register
Symbol
Symbol
Symbol
Symbol
Output Symbol
Input Symbol
termination. Resulting codewords have unequal lengths, even
Output-Bits Grouped
if input-data have the same length; shorter codewords are in Code-Symbols
implicitly prolonged to the maximum length by assuming (to FIFO)
Input-Bits Grouped
extra input-data that do not cause the encoder to depart from
in Code-Symbols
its initial state [5]. Convolutional decoding generally involves
direct computations with probability-estimates, the two most
2. Reed-Solomon decoding stages.
widely adopted algorithms being: 1) Viterbi (maximum-likeli-
hood codeword) [4] and 2) Bahl, Cocke, Jelinek and Raviv
(BCJR) [symbol by symbol maximum a posteriori (MAP)] [6]. Transmitter
Outer Inner
Serial Versus Parallel-Concatenation FEC Interleaver FEC
Concatenation of FEC codes is an intuitive method to increase Encoder Encoder
FEC effectiveness. With serial-concatenation, input-data are Lightwave
encoded in succession by an outer (h,k) code and next by an inner Channel
(n,h) code, to produce a concatenated (n,k) coding-scheme. It is
Iterative Decoding
the designer’s intention that the corrective properties of the outer (Optional)
code match the error-statistics on failure of the inner code. In Interleaver
particular, a very effective combination results from a convolu- Outer Inner
De-
tional inner code and a Reed-Solomon outer block-code; the FEC FEC
Decoder Interleaver Decoder
decoding-errors of the former have a natural tendency to mani-
fest themselves as error-bursts, which can be effectively dealt with Receiver
by the latter. On the assumption of burst-errors generated by the 3. Serial FFC concatenation (product codes).
inner code, it is standard practice to place an interleaver between
the two serially-concatenated FEC codes. Interleavers spread the nonbinary structure [3]. An extensive experience on the cor-
decoding-errors of the inner code, so that no single codeword of rective properties of RS codes (compact disks, wireless/satellite
the outer can be easily overwhelmed by these errors. links, deep space missions) made them a natural first choice
The advent of Turbo-coding was a major historical break- for the enhancement of optical links. More specifically, ITU-T
through in error-control coding (1993). The iterative-decod- study-group 15 issued recommendation G.975 (1996) on the
ing of two codes in parallel-concatenation was introduced for use of RS(255,239) for optical submarine communications.
the first time. With parallel-concatenation, the same input- Later, with the advent of Optical Transport Networks (OTN),
data are encoded twice (or more), once in the order of arrival the same code was proposed again for OTN Inter-Domain
and again in permuted order. The permutation is effected be Interfaces in ITU-T recommendation G.709 (2003).
means of an interleaver. The underlying Turbo-principle sug- By definition, the minimum (Hamming) distance dmin of
gests that iteratively decoding, once for each order-of-encod- any RS code is equal to n − k + 1 code-symbols, ensuring that
ing, converges to the Maximum-Likelihood (ML) codeword any distorted symbols E in a code-word are detectable up to
[7]; errors that might overwhelm decoding in one order could n − k and correctable up to (n − k)/2; in these formulas, if
become correctable in another order. The importance of n − k is odd, it must be decremented by 1. It is noteworthy
Turbo-codes was underlined by a performance demonstration that code-symbols, marked by the receiver as erasures e, dou-
within 0.5 db from the channel-capacity. ble the corrective ability of the RS code, making
2E + e ≤ n − k the condition for successful corrections.
FEC ARCHITECTURES RS codes are cyclic, when in full length (nmax ). Even when
Representative FEC architectures are outlined in this section, shortened however (n < nmax ), they still have the implementa-
with emphasis on the evolution from classic first generation tion-benefits of cyclic codes, assuming all truncated symbols as
methods to modern super-FEC schemes. zero. The circuit, used almost exclusively for encoding, is an
n − k taps Linear Feedback Shift Register (LFSR) (Figure 1). Its
First Generation Outband FEC (Single Code) purpose is to impose the n − k distinct algebraic roots (AR i ) of a
Reed-Solomon (RS) codes are Maximum Distance Separable constant polynomial g(x) to the output codewords, which are
(MDS) and suitable for burst-form errors, because of their also treated as polynomials. The polynomial g(x), called the
code-generator polynomial, identifies the code uniquely. Its Second Generation Outband FEC (Concatenated)
coefficients (CG i ) are related to the algebraic-roots (AR i ) Propelled by the increasing maturity of DWDM systems, diverse
through the formula: FEC-concatenation schemes made their appearance in the mar-
n−k
n−k ket. Some of them found their way to standardization, as of ITU-T
g(x) = (x − AR i ) = (GC i ⊗ x i ). recommendation G.975.1 (2004). The clear performance-advan-
i=1 i=0 tage over classic FEC and the common element of serial FEC
The decoding of RS codes has been inherited from their concatenation put these schemes under the umbrella of second
super-class, Bose-Chaudhuri-Hocquenghem (BCH) codes [8]. generation FEC systems for optical communications.
Its performance is suboptimum with regard to ML decoding: Serial-concatenation of FEC codes is outlined in Figure 3.
the decoder selects the codeword that is the closest match to Notice the presence of an interleaver between the outer and
the received word (bounded distance decoding). It is typically a the inner encoder; a corresponding de-interleaver is placed
3-stage procedure (Figure 2): In the first stage (syndrome-val- between the outer and the inner FEC decoder accordingly. The
ues calculator), it is verified, whether the roots (AR i ) of g(x) performance of the concatenated scheme is strongly related to
still hold in the received-codeword. A set of numbers, called the choice of the interleaving type and depth. The objective of
syndrome-values (SV), are computed and communicated to interleaver design is to minimize/eliminate the weak (low-
the next stage. When all SV are zero, the received word is rec- weight) codewords.
ognized as a codeword and decoding ceases, otherwise decod- Decoding separately and in succession the inner and the
ing proceeds with the second stage (key-equation solver). In outer code is the rule, but not ML; ML decoders for concate-
this stage, the error-locator λ and the error-corrector ω poly- nated schemes are of prohibitive complexity. On the contrary,
nomials are formulated with respect to the SV. The roots of λ iterative decoding is a practical method to construct very effi-
BER
instances of FEC decoders, which significantly increases the
10−4 6 Iterations 3 Iterations
cost of implementation.
A prominent representative of second generation FEC 10−5 10 Iterations
schemes can be found in [1] and also in clause I.3 of ITU-T rec-
ommendation G.975.1 (2004). This super-FEC is a concatenated 10−6 18 Iterations
scheme with BCH(3860,3824) as the outer and BCH(2040,1930)
10−7
as the inner code. The total redundancy amounts to 6.69%, keep- 0.5 1 1.5 2
ing rate-compatibility with G.709. It provides 7.98 dB net coding Eb/No in dB
gain at an output BER of 10−12 , by using 3 times iterative decod-
6. BER improvement versus decoding iterations
ing. Even higher coding-gains (up to 9 db) are claimed by prod- (Turbo code, rate 1⁄2, block size 65536).
ucts that employ FEC concatenation, such as: Intel IXF30007,
AMCC NIAGARA (10 Gbps) and CoreOptics CO40FEC10 the previous ones, called extrinsic information. Extrinsic is the
(40 Gbps). The extra coding gain of these products is nonetheless net contribution of a single decoding session computed as the
at the expense of a higher code- and line-rate (up to 30%). symbol-by-symbol difference APsP-APrP. The exchange of
extrinsic information between decoders takes place through
3rd Generation Outband FEC (Super-FEC) appropriate interleaving/de-interleaving. This repetitive process
Although a breakthrough in their own right, it was soon real- can render correctable channel-data that would otherwise
ized that second generation FEC schemes do not achieve the cause error-overflow. Under normal channel conditions, this
required system margin for some important ULH links (e.g., iterative algorithm converges towards the ML solution; returns
transpacific) [2], [9]. This conclusion directed the relative are however diminishing (Figure 6). The real-time character of
research towards even stronger FEC methods. Today’s state-of- optical networks imposes that the algorithm is terminated after
the-art methods are registered as 3rd generation FEC, mostly a fixed number of iterations, properly chosen so that more of
relying on Turbo and Low-Density Parity-Check (LDPC) coding them wouldn’t incur any significant improvement.
concepts; they also leverage iterative decoding to obtain realiz- No matter the excellent performance of Turbo, these codes
able receivers and achieve very high coding-gains. are not second to none. LDPC codes have in many cases
More specifically, the original Turbo architecture combines demonstrated superior characteristics [11]. LDPC are linear
two encoders in parallel-concatenation [7]; the rule is that they block codes and are consequently fully defined through their
implement the same code. The first encoder applies on the parity check matrix H. The latter must be characterized by
incoming data-symbols in their order of arrival, whereas the sparseness i.e., a small number of nonzero entries. In binary
second one applies on the same data but in permuted order LDPC, H is designed with a ones per column and b ones per
(Figure 4). An interleaver permutes the data-order. Noticeably, row and as a result, each information bit is involved in a parity
the input-data are transmitted only once, multiplexed with par- checks and each parity check involves b information bits. This
ity-data from both encoders. The constituent codes need to be fact gives rise to a special notation for LDPC as (a,b), in addi-
punctured, in order to control the code-rate, hence the MUX at tion to the usual (n,k).
the output. With regard to interleaving, the same considera- Parity checks are conveniently visualized in the form of
tions hold as with serially concatenated FEC. Tanner Graphs. The association between a parity-check matrix
Turbo decoders operate on the Turbo principle [7] of itera- and the corresponding Tanner Graph is presented in Figure 7.
tive decoding (Figure 5). Constituent decoders must be Soft- Variable nodes are initialized with the corresponding received
Input-Soft-Output (SISO) i.e., capable of co-processing symbols and check nodes stand for the applicable parity-
unequal a priori probabilities (APrP) of the incoming code- checks. LDPC can be decoded with complexity O(n) with Belief
symbols and generating a posteriori probabilities (APsP) in Propagation (BP) algorithms, employing Soft Message Passing
return. For trellis-based Turbo codes, the major SISO decod- (SMP) techniques [12]. Two major steps are iteratively execut-
ing algorithm is the MAP of BCJR [6], [10]. ed: i. Variable nodes communicate to check nodes their relia-
Both constituent decoders shown are fed with the same bility (log-likelihood) and ii. Check nodes decide which
input-data and the corresponding parity. However, input-data variables are not reliable and “suppress” the corresponding
enter the first decoder in the order of arrival and the second inputs. In implementing LDPC, the bottleneck appears to be
one in permuted order. The execution of the first iteration in encoding, rather than decoding. To reduce the complexity of
each decoder is on a zero-knowledge basis. On the contrary, LDPC encoding to O(n), this operation can be performed by an
next iterations take into account reliability information from iterative decoder instead, if properly initialized.