GLDPC-Staircase AL-FEC codes: a fundamental study and new results

Mattoussi, Ferdaouss; Roca, Vincent; Sayadi, Bessem

doi:10.1186/s13638-016-0660-y

Review
Open access
Published: 05 September 2016

GLDPC-Staircase AL-FEC codes: a fundamental study and new results

Ferdaouss Mattoussi¹,
Vincent Roca¹ &
Bessem Sayadi²

EURASIP Journal on Wireless Communications and Networking volume 2016, Article number: 206 (2016) Cite this article

1566 Accesses
1 Citations
Metrics details

Abstract

This paper provides fundamentals in the design and analysis of Generalized Low-Density Parity Check (GLDPC)-Staircase codes over the erasure channel. These codes are constructed by extending an LDPC-Staircase code (base code) using Reed-Solomon (RS) codes (outer codes) in order to benefit from more powerful decoders. The GLDPC-Staircase coding scheme adds, in addition to the LDPC-Staircase repair symbols, extra-repair symbols that can be produced on demand and in large quantities, which provides small rate capabilities. Therefore, these codes are extremely flexible as they can be tuned to behave either like predefined rate LDPC-Staircase codes at one extreme, or like a single RS code at another extreme, or like small rate codes. Concerning the code design, we show that RS codes with “quasi” Hankel matrix-based construction fulfill the desired structure properties, and that a hybrid (IT/RS/ML) decoding is feasible that achieves maximum likelihood (ML) correction capabilities at a lower complexity. Concerning performance analysis, we detail an asymptotic analysis method based on density evolution (DE), extrinsic information transfer (EXIT), and the area theorem. Based on several asymptotic and finite length results, after selecting the optimal internal parameters, we demonstrate that GLDPC-Staircase codes feature excellent erasure recovery capabilities, close to that of ideal codes, both with large and very small objects. From this point of view, they outperform LDPC-Staircase and Raptor codes and achieve correction capabilities close to those of RaptorQ codes. Therefore, all these results make GLDPC-Staircase codes a universal Application-Layer FEC (AL-FEC) solution for many situations that require erasure protection such as media streaming or file multicast transmission.

1 Review

Low-Density Parity Check (LDPC) codes have been intensively studied during the last decade due to their near-Shannon limit performance under iterative belief-propagation (BP) decoding [1–3]. A (N,K) LDPC code, where N is the code length and K is its dimension, can be graphically represented as a bipartite graph with N “variable nodes” (VNs) and M=N−K “check nodes” (CNs). Equivalently, LDPC codes can be represented through their H _L parity check matrix translating the connection between VNs and CNs. The degree of a VN or a CN is defined as the number of edges connected to it. A VN of degree n can be interpreted as a “length repetition code” (n,1), i.e., as a linear block code repeating n times its single information bit towards the CN set. Similarly, a CN of degree n can be interpreted as a Single Parity Check (SPC) code (n,n−1), i.e., as a linear block code associated with one parity equation. To improve error floor, minimal distance, and decoding complexity performances, a generalization of these codes was suggested by Tanner in [3], for which subsets of the variable nodes obey a more complex constraint than an SPC constraint. The SPC check nodes in a GLDPC structure are replaced with a generic linear block codes (n,k) referred to as sub-codes or component codes while the sparse graph representation is kept unchanged. More powerful decoders at the check nodes have been investigated by several researchers in recent years after the work of Boutros et al. [4] and Lentmaier and Zigangirov [5] where BCH codes and Hamming codes were proposed as component codes, respectively. Later several works, on several channels, have been carried out in order to afford very large minimum distance and exhibit performance approaching Shannon’s limit. Each construction differs from others by modifying the linear block codes (component codes) on the check nodes [6, 7, 7–11] or/and the distribution of the structure of GLDPC codes [6] to offer a good compromise between waterfall performance and error floor under iterative decoding.

A GLDPC-Staircase code is an LDPC-Staircase code [12] in which the constraint nodes of the code graph are Reed-Solomon (RS) codes (rather than SPCs) in order to benefit from more powerful decoders. The construction of these RS codes, with the desired properties, is omitted from the initial work [13]. Therefore in [14], we introduce RS code-based Hankel matrices to that purpose. GLDPC-Staircase codes differ from the GLDPC codes proposed by Tanner and their successive variants. In particular, the GLDPC-Staircase coding scheme allows each check node to produce a potentially large number of repair symbols in terms of RS codes, called extra-repair symbols, on demand. These extra-repair symbols extend the base LDPC-Staircase code and very small rates are easily achievable. This feature is well suited to situations where channel conditions can be worse than expected and to fountain-like content distribution applications. More generally, these codes can easily be tuned to behave either like predefined rate LDPC-Staircase codes at one extreme, or like a single RS code at another extreme, or like a small rate code.

From a decoding perspective, we propose a new hybrid (IT/RS/ML) decoding approach that achieves the optimal correction capabilities of ML decoding at a lower complexity [14]. Finally, in order to analyze their performance, we detail in [15] an asymptotic analysis method based on the density evolution (DE) and extrinsic information transfer (EXIT) tools and the area theorem. Then, using this theoretical analysis combined with a finite length analysis, we discuss the impacts of the code structure and its internal parameters on performance.

Asymptotic and finite length analyses show that these codes achieve excellent decoding performance (i.e., good average decoding overhead, good waterfall region, small error floor, and channel capacity approaching performance), close to that of ideal codes, both with very large and very small objects. This independence with respect to the code dimension is a key practical benefit (e.g., LDPC codes are known to be asymptotically good only). We show in this work that our codes outperform the Raptor codes as well as some GLDPC codes, while being close to RaptorQ codes. Their extreme flexibility makes it possible to tune them to perfectly match each use-case (like low bit-rate streaming applications or at the opposite large file multicast transmission). The purpose of this paper is to give the reader a detailed overview of GLDPC-Staircase codes and to provide new results.

This paper is organized as follows. Section 2 focuses on the design of GLDPC-Staircase codes based on RS codes. Then in Section 3, we explain the proposed asymptotic analysis method. Section 4 presents several analyses and optimizations of GLDPC-Staircase codes. Then, we analyze the achieved performance, compare these codes with other erasure codes, and provide preliminary decoding complexity results in Section 5. Finally, we conclude.

2 GLDPC-Staircase code design

2.1 Code description

As mentioned in introduction, GLDPC-Staircase codes are constructed from:

LDPC-Staircase code: this is the base code with length N _L and dimension K. Let M _L=N _L−K and let H _L=(H ₁|H ₂) be the associated parity check matrix¹. From the LDPC-Staircase viewpoint, each row of H _L defines the connections between the source and LDPC repair symbols. From the GLDPC-Staircase viewpoint, each row of H _L defines the connections between the RS repair symbols and the source and LDPC repair symbols. Consequently, each LDPC-Staircase CN is represented as a powerful CN, called generalized check node, with GLDPC-Staircase codes.
RS codes: they are the outer codes (or components codes). A generalized check node of index m can generate e(m) extra-repair symbols from the RS point of view (plus one LDPC repair symbol if we use scheme A as we will see below). This is done with an RS (n _m,k _m) encoding over G F(2^b) with 0≤e(m)≤E and m=1,...,M _L. Here, E, k _m, and n _m are respectively the maximum number of extra-repair symbols per generalized check node, the RS code dimension and length for the generalized check node m.

Figure 1 illustrates the bipartite graph of a GLDPC-Staircase $\left (N_{G}, K \right)$ code of length N _G and dimension K. It is composed of two sets of nodes:

the generalized check node that corresponds to RS codes;
Fig. 1
Bipartite graph. Figure showing the case of GLDPC-Staircase (13,4) code, e(m)=2 extra-repair symbols per generalized check node (i.e., regular distribution)
Full size image
the variable nodes (VN) further divided into three categories:
- source symbols;
- LDPC repair symbols;
- extra-repair symbols.

2.2 Schemes A and B

Let us now define two variants, schemes A and B, depending on the definition of n _m and k _m:

Scheme A

For row m>1, the source symbols (from the user viewpoint) involved in this row plus the LDPC repair symbol of row m−1 are considered as source symbols from the RS viewpoint. The new LDPC repair symbol plus the e(m) extra-repair symbols are considered as repair symbols from the RS viewpoint. Therefore, the LDPC repair symbol is also an RS repair symbol. For m=1, the only difference is that there is no previous repair symbol (beginning of the staircase).

No matter the row, we have
$$ n_{m} = k_{m} + 1 + e(m) ~ \text{and} ~ k_{m} = d_{r}(m) - 1, $$
(1)

where d _r(m) is the degree of row m of H _L. Due to the staircase structure of H ₂, it follows
$$ { d_{r}(m) = \left\lbrace \begin{array}{ccc} d_{r_{H_{1}}}+ 1 & \text{if} & m=1 \text{;} \left(d_{r_{H_{1}}} = \frac{N_{1}}{\frac{1}{r_{L}}-1}\right)\\ d_{r_{H_{1}}} + 2 & \text{if} & m>1 \text{;} \left(d_{r_{H_{1}}} = \frac{N_{1}}{\frac{1}{r_{L}}-1}\right). \end{array}\right. } $$
(2)

In order to fulfill the duality property of the LDPC repair symbols, we propose in [16] a specific construction of RS codes based on “quasi” Hankel matrix. The generator matrix G of these codes has the following form:
$$ G =\left[ \begin{array}{cccccccccc} 1&0&\ldots&\ldots&0&\boldsymbol{1}&1&\ldots&\ldots&1 \\ 0&1&\ddots&\ddots&\vdots&\boldsymbol{1}&b_{1}&b_{2}&\ldots&b_{n_{m}-k_{m}} \\ 0&0&\ddots&\ddots&\vdots&\boldsymbol{1}&b_{2} &{{\cdot}{\raisebox{3pt}{$\cdot$}}{\raisebox{6pt}{$\cdot$}}}& {{\cdot}{\raisebox{3pt}{$\cdot$}}{\raisebox{6pt}{$\cdot$}}} &b_{n_{m}-k_{m}+1}\\ \vdots&\vdots&\ddots&\ddots&0&\vdots&\vdots&{{\cdot}{\raisebox{3pt}{$\cdot$}}{\raisebox{6pt}{$\cdot$}}}&{{\cdot}{\raisebox{3pt}{$\cdot$}}{\raisebox{6pt}{$\cdot$}}}&\vdots \\ 0&0&\ldots&0&1&\boldsymbol{1}&b_{k_{m}}&{{\cdot}{\raisebox{3pt}{$\cdot$}}{\raisebox{6pt}{$\cdot$}}}&{{\cdot}{\raisebox{3pt}{$\cdot$}}{\raisebox{6pt}{$\cdot$}}}&b_{n_{m}-1} \\ \end{array} \right] $$
(3)

where $b_{i} = \frac {1} {1- {y}^{i}}$, for 1≤i≤q−1, y is an arbitrary primitive element of G F(q) and y ⁱ is computed over G F(q).

Thanks to the column full of “1” in G for the first RS repair symbol, this latter can also be considered as an LDPC-Staircase symbol (it is the XOR sum of source symbols from the RS viewpoint).
Scheme B

For each row m, the various source symbols (from the user viewpoint) involved in this row plus the LDPC repair symbol(s) are considered as source symbols from the RS viewpoint. The e(m) extra-repair symbols are the only repair symbols from the RS viewpoint. No matter the row, we have
$$ n_{m} = k_{m} + e(m)~\text{and}~k_{m} = d_{r}(m). $$
(4)

Here, any RS code (e.g., based on Hankel, Cauchy, or Vandermonde matrices) can be used.

2.3 Extra-repair symbol regular/irregular distributions

For a fixed code rate r _L of LDPC-Staircase code (N _L,K), the code rate of the GLDPC-Staircase code is given by

$$ r_{G}= \frac{K}{N_{L}+ M_{L}\bar{\,f}} = \frac{r_{L}}{1+ (1 - r_{L})\bar{\,f}}. $$

(5)

where $\bar {f}$ is the average number of extra-repair symbols per generalized check node:

$$ \bar{f}= \sum_{e=0}^{E} (f_{e}.e), $$

(6)

and f _e denotes the fraction of generalized check nodes with e extra-repair symbols:

$$ f_{e}= \frac{\text{card}\{m=1\dots M_{L} \mid e(m)=e \}}{M_{L}} $$

(7)

We can consider the following two distributions of extra-repair symbols on the various generalized check nodes:

Regular distribution: f _e=0 for $e \in \{0,1,\dots,E-1\}$ and f _E=1. Thus, each generalized check node m has the same number e(m)=E of extra-repair symbols and the rate of the extended code (GLDPC-Staircase code) is
$$ r_{G} =\frac{r_{L}}{1+ (1 - r_{L})*E} $$
(8)

Figure 1 shows such a regular variant.
Irregular distribution: the generalized check nodes can have a different number of extra-repair symbols.

Figure 2 shows such an irregular variant.
Fig. 2
Bipartite graph. Figure showing the case of GLDPC-Staircase (13,4) code with irregular distribution, e(m)={3,1,2}
Full size image

Cunche et al. [13] shows that there exists an irregular uniform distribution of extra-repair symbols which achieves performance close to the optimal irregular distribution. This irregular uniform distribution allows to allocate the extra-repair symbols with $\bar {f} =\frac {E}{2}$ and $f_{e}=\frac {1}{E+1}$ for $e \in \{0,1,\dots,E\}$.

Throughout this paper, we only consider the regular distribution and the irregular uniform distribution, and we assess in Section 4.4.1 their impacts on performance.

2.4 Encoding method

Encoding generates two types of repair symbols:

M _L LDPC-Staircase repair symbols, (p ₁, …$p_{M_{L}}$), and
$M_{L}\bar {f}$ extra-repair symbols, ((e _1,1, … e _1,e(1)), …($e_{M_{L},1}$, …$e_{M_{L},e(M_{L})}$)).

Let S=(S ₁,S ₂,…S _K) be the K source symbols. The $(p_{1}, \ldots p_{M_{L}})\phantom {\dot {i}\!}$ repair symbols are computed following the “stairs” of H _L: p _m is the XOR sum of the subset x of S of source symbols that have a “1” coefficient in row m, plus p _m−1 if m>1.

Then, the e(m) extra-repair symbols for row m are computed by multiplying the k _m LDPC symbols by the systematic generator matrix G _m of RS $\left (n_{m}, k_{m} \right)$ associated to this row².

Example 2.1.

Consider the GLDPC-Staircase code, scheme A, defined by the bipartite graph of Fig. 1. We have N _G=13, K=4 and exactly e(m)=2 extra-repair symbols per generalized check node (regular distribution). H _L and the various RS codes are as follows:

$$\begin{array}{*{20}l} &\begin{array}{lllllll} \quad S_{1}&S_{2} &S_{3} & S_{4}& P_{1}& P_{2}& P_{3}\\ \quad \downarrow & \downarrow & \downarrow & \downarrow & \downarrow & \downarrow & \downarrow \\ \end{array}\\ H_{L} =& \left[\begin{array}{cccc|ccc} 1& \ 1 & \ \ 0& \ \ 1 & \ \ \textbf{1}& \ \ 0& \ \ 0\\ 0& \, 1& \ \ 1& \ \ 0& \ \ \textbf{1}& \ \ \textbf{1}& \ \ 0\\ 1& \, 0& \ \ 0& \ \ 1 & \ \ 0& \ \ \textbf{1}& \ \ \textbf{1}\\ \end{array}\right] \begin{array}{l} RS_{1}=RS(6,3)\\ RS_{2}=RS(6,3)\\ RS_{3}=RS(6,3)\\ \end{array} \end{array} $$

(9)

We note that here (regular distribution and scheme A), the same RS code can be used for all the rows. Its generator matrix G _rs has the form

$$\begin{array}{*{20}l} &\hspace{80pt} \begin{array}{lll} \!G^{4}_{rs}&\,G^{5}_{rs}&\,\,\,G^{6}_{rs} \\ \downarrow&\,\,\downarrow &\,\,\,\,\, \downarrow\\ \end{array}\\ G_{rs} =& \left[\begin{array}{cccccc} 1&\quad 0&\quad 0&\quad 1&\quad 1&\quad 1 \\ 0&\quad 1&\quad 0&\quad 1&\quad b_{1}&\quad b_{2} \\ 0&\quad 0\quad &\quad 1&\quad 1&\quad b_{2}&\quad b_{2} \\ \end{array}\right] \end{array} $$

(10)

To summarize, encoding is as follows:

First row, using x=(S ₁,S ₂,S ₄), produces
$$ P_{1} = G^{4}_{rs}\times (S_{1}, S_{2}, S_{4}) $$

$$ e_{1,1} = G^{5}_{rs} \times (S_{1}, S_{2}, S_{4}) $$

$$ e_{1,2} = G^{6}_{rs} \times (S_{1}, S_{2}, S_{4}) $$
Second row, using x=(S ₂,S ₃), produces
$$ P_{2} = G^{4}_{rs}\times (S_{2}, S_{3}, P_{1}) $$

$$ e_{2,1} = G^{5}_{rs} \times (S_{2}, S_{3}, P_{1}) $$

$$ e_{2,2} = G^{6}_{rs} \times (S_{2}, S_{3}, S_{1}) $$
Third row, using x=(S ₁,S ₄), produces
$$ P_{3} = G^{4}_{rs}\times (S_{1}, S_{4}, P_{2}) $$

$$ e_{3,1} = G^{5}_{rs} \times (S_{1}, S_{4}, P_{2}) $$

$$ e_{3,2} = G^{6}_{rs} \times (S_{1}, S_{4}, P_{2}) $$

A key advantage is the fact that extra-repair symbols can be produced incrementally, on demand, rather than all at once (unlike LDPC-Staircase repair symbols for instance). Their number can also be rather high since it is only limited by the finite field size, usually GF(2⁸). Said differently, GLDPC-Staircase codes can easily and dynamically be turned into small rate codes.

2.5 Decoding method

To recover erased source symbols, in addition to the (IT + RS) decoding method, we proposed a new decoding approach called hybrid (IT/RS/ML) decoding.

Let us consider a GLDPC-Staircase (N _G,K) code, built from an LDPC-Staircase (N _L,K) base code.

2.5.1 (IT + RS) decoding

The (IT + RS) decoding, for both schemes A and B, consists of a joint use of

the IT decoder over the LDPC-Staircase graph. Extra-repair symbols are ignored at this step. This decoder features a linear complexity but also sub-optimal erasure recovery capabilities;
the RS decoder over a given generalized check node. This is a classic RS decoding that takes into account the three types of symbols. This decoder features a higher complexity but is MDS;

Example 2.2.

Figure 3 shows a simple example for GLDPC-Staircase code, scheme A, with N _G=12, K=4, N1=3, and $r_{L}=\frac {1}{2}$. Here we assume that only symbols {S ₁,P ₁,P ₂,P ₃,e ₁,e ₂} have been received. The receiving order for these symbols is {S ₁,P ₁,P ₂,e ₂,e ₁,P ₃} (i.e., symbol transmission order is random). After receiving the first four symbols, the RS decoder triggers on the second generalized check node. This node is associated with RS(6, 4) code which recovers the (S ₂, S ₃) erased symbols in step 2. Then these recovered symbols trigger the SPC decoding on the first generalized check node which recovers S ₄ in step 3. Decoding is successful.

Finally, Algorithm 1 details the (IT + RS) decoding that works symbol per symbol, in a recursive manner. This algorithm does not necessarily use all the received symbols: IT decoding is always preferred to RS decoding if both are possible, in order to reduce decoding complexity.

2.5.2 Hybrid (IT/RS/ML) decoding

We propose an hybrid (IT/RS/ML) decoding, generalization of the decoding approach proposed for LDPC codes in [17, 18]. Hybrid (IT/RS/ML) decoding consists of a joint use of IT, RS, and (binary/Non binary) ML decoding to achieve the performance of ML decoding at a lower complexity. It works as follows. It starts with (IT + RS) decoding. If (IT + RS) decoding succeeds, the hybrid decoding succeeds. Otherwise, the receiver switches to ML decoding, using the simplified linear system that results from the (IT + RS) decoding.

During ML decoding, we use the following decoders:

Binary ML decoder: extra-repair symbols are ignored at this step and instead it only considers binary equations, made of simple XOR sums, in order to reduce complexity. ML decoding can consist of simple Gaussian elimination (GE) on this sub-system.
Non-binary ML decoder: the full linear system is considered here and GE is performed on G F(2^b). As in binary ML decoding, this step also features a quadratic complexity but operations are now significantly more complex (performed on G F(2^b) instead of simple XOR). However, it allows reaching the maximum correction capabilities of the code.

The hybrid (IT/RS/ML) decoding algorithm is presented in Algorithm 2.

3 Asymptotic analysis method

3.1 Preliminaries

In the sequel, we denote by d _vmax and d _cmax respectively the maximum variable and check node degrees in the bipartite (Tanner) graph associated with LDPC-Staircase. Following [19], we define the edge-perspective DD polynomials by (λ(x), ρ(x)) and the node perspective DD polynomials by ($L(x)=\lambda (x) = \sum _{i=1}^{dvmax} \lambda _{i}.x^{i-1}$, $R(x)=\sum _{i=1}^{dcmax} \rho _{i}.x^{i-1}$).

Given a GLDPC-Staircase code, DD pair (λ, ρ) are defined by the underlying LDPC-Staircase code, defined by the bottom graph of Fig. 1 (that is, not containing the extra-repair nodes). Assume that transmission takes place over an erasure channel with parameter ε. We denote by ${\mathcal {E}}(\lambda, \rho, {f_{e}})$ the ensemble of GLDPC-Staircase with DD pair (λ, ρ) and with f _e the fraction of generalized check nodes with e extra-repair symbols as presented in Eq. (7).

3.2 Density evolution

3.2.1 Introduction

Over erasure channels, DE becomes one-dimensional, and it allows to analyze and even to construct capacity-achieving codes [20]. It works by recursively tracking the erasure probability messages passed around the edges of the graph during IT decoding. Roughly speaking, this means that it recursively computes the fraction of erased messages passed during the IT decoding. Using this technique, the decoding threshold of codes is defined as the supremum value of ε (that is, the worst channel condition) that allows transmission with an arbitrary small error probability assuming N goes to infinity [19].

Let us determine the DE equations of GLDPC-Staircase codes.

3.2.2 DE equations of GLDPC-Staircase codes

Assume that an arbitrary GLDPC-Staircase code from ${\mathcal {E}}(\lambda, \rho, {f_{e}})$, with length N _G goes to infinity.

We are interested in the erasure probability of messages exchanges by the (IT + RS) decoding along the messages of the LDPC-Staircase code using extra-repair variable nodes. We denote by

P _ℓ, the probability of an LDPC symbol (source or repair) node sending an erasure at iteration ℓ to the connected generalized check nodes. Clearly, P ₀ is equal to the channel erasure probability ε.
Q _ℓ, the probability of a generalized check node sending an erasure (to an LDPC symbol-node) at iteration ℓ.

The calculus of these probabilities depends on the coding scheme used to design the GLDPC-Staircase code (scheme A or B). Next, we give more details for each case. At iteration ℓ, the LDPC symbols are erased with probability P _ℓ, while extra-repair symbols are always erased with probability ε (the channel erasure probability).

Scheme A: The first repair symbol generated by any RS code is one of the repair symbols of the LDPC-Staircase code.

Consider a generalized check node c connected to symbol-nodes $(v_{1},\dots,v_{d},e_{1,c},\dots,e_{e(c),c})$ where v _i denotes an LDPC (source or repair) symbol node and e _i,c denotes the ith extra-repair symbol node. Since c corresponds to an RS code, it can recover the value of an LDPC symbol node, say v ₁, if and only if the number of erasures among the other symbol-nodes $(v_{2},\dots,e_{e(c),c})$ is less than or equal to e(c).

It follows that the probability of a generalized check node c recovering the value of an LDPC symbol at iteration ℓ+1, denoted by $\bar {Q}_{\ell +1,A}(d, e(c))$, is given by:

$$ {{} {\begin{aligned} \bar{Q}_{\ell+1,A}(d, e(c)) = \sum_{\stackrel{0\leq i < d, 0\leq j\leq e(c)}{i+j\leq e(c)}} {d-1\choose{}i} P_{\ell,A}^{i} (1-P_{\ell,A})^{d-1-i}\\ {e(c)\choose{}j} \varepsilon^{j} (1-\varepsilon)^{e(c)-j} \end{aligned}}} $$

(11)

Hence, the probability of a generalized check node c sending an erasure to an LDPC symbol at iteration ℓ+1 is $(1- \bar {Q}_{\ell +1,A}(d, e(c)))$. Averaging over all possible values of d and e(c), we get

$$ {Q}_{\ell+1,A} = 1 - \sum_{d=1}^{d_{cmax}}\rho_{d} \sum_{e=0}^{E} f_{e}\bar{Q}_{\ell+1,A}(d, e) $$

(12)

Scheme B: All the LDPC-Staircase repair symbols are source symbols for the RS codes.

Consider a constraint node c connected to symbol-nodes $(v_{1},\dots,v_{d},e_{1,c},\dots,e_{e(c),c})$ where v _i denotes an LDPC (source or repair) symbol node and e _i,c denotes the ith extra-repair symbol node. The node c corresponds both to a parity check constraint between LDPC symbol nodes $(v_{1},\dots,v_{d})$ and to an RS linear constraint between all the symbol-nodes $(v_{1},\dots,v_{d},e_{1,c},\dots,e_{e(c),c})$.

Thus, c can recover the value of an LDPC symbol node, say v ₁, if and only if one of the following (disjoint conditions) holds:

there are no erased symbols among $v_{2},\dots,v_{d}$ (i.e., LDPC decoding);
there is at least one erased symbol among $v_{2},\dots,v_{d}$, but the number of erasures among all the symbol-nodes $(v_{1},\dots,v_{d},e_{1,c},\dots,e_{e(c),c})$ is less than or equal to e(c)−1.

The second condition is also equivalent to the following one:

the number of erased symbols among $v_{2},\dots,v_{d}$ is equal to i and the number of erased symbols among $e_{1,c},\dots,e_{e(c),c}$ is equal to j, with $1 \leq i \leq \min (d-1,e(c)-1)$ and 0≤j≤e(c)−1−i.

It follows that the probability of a generalized check node c recovering the value of an LDPC symbol at iteration ℓ+1, denoted by $\bar {Q}_{\ell +1,B}(d, e(c))$, is given by

$$ {{} {\begin{aligned} \bar{Q}_{\ell+1,B}(d,e(c)) = (1-P_{\ell, B})^{d-1} + \sum_{i=1}^{\min(d-1,e(c)-1)}\,\,\sum_{j=0}^{e(c)-1-i}\\ {{d-1}\choose{i}} P_{\ell,B}^{i} (1-P_{\ell,B})^{d-1-i}{{e(c)}\choose{j}} \varepsilon^{j} (1-\varepsilon)^{e(c)-j}. \end{aligned}}} $$

(13)

Averaging over all possible values of d and e(c), we get

$$ {{Q}_{\ell+1,B} = 1 - \sum_{d=1}^{d_{cmax}}\rho_{d} \sum_{e=0}^{E} f_{e}\bar{Q}_{\ell+1,B}(d, e)} $$

(14)

Remark 3.1.

For both schemes with regular distribution of extra-repair symbols, all the generalized check nodes have E extra-repair symbols, the Eqs. (12) and (14) are reduced to:

$$ {Q}_{\ell+1} = 1 - \sum_{d=1}^{d_{cmax}}\rho_{d}\bar{Q}_{\ell+1}(d) $$

(15)

Conversely, for both schemes, an LDPC symbol node v of degree d, connected to generalized check nodes $c_{1},\dots, c_{d}$, sends an erasure to c ₁ iff it was erased by the channel, and it received erased messages from all generalized check nodes $c_{2},\dots, c_{d}$. Since this happens with probability $\varepsilon \cdot Q_{\ell +1}^{d-1}$, and averaging over all possible degrees d, we get

$$ {{P}_{\ell+1} = \varepsilon\sum_{d=1}^{d_{vmax}}\lambda_{d} Q_{\ell+1}^{d-1} = \varepsilon \lambda({Q}_{\ell+1})} $$

(16)

For both schemes, using Eqs. (11) or (13), (12) or (14), and (16), we can determine a recursive relation between P _ℓ and P _ℓ+1, with P ₀=ε.

The decoder can recover from a fraction of ε erased symbols iff ${\lim }_{\ell \rightarrow +\infty }P_{l} =0$. This means that, when $l \to +\infty $, the (IT + RS) decoding succeeds if the DE recursion converges to zero. Then, the (IT + RS) decoding threshold of an GLDPC-Staircase code over an erasure channel is defined as the supremum value of ε such that the DE recursion converges to zero. Therefore, the (IT + RS) decoding threshold can be computed by

$$ \varepsilon^{\text{(IT\,+\,RS)}}({\mathcal{E}}) = \max\left\{\varepsilon \mid {\lim}_{\ell \rightarrow +\infty}P_{l} =0\right\}. $$

(17)

If we transmit at ε≤ε ^(IT+RS), then all the erased LDPC symbols can be recovered. But if we transmit at ε>ε ^(IT+RS), then some or all the erased LDPC symbols remain erased after the decoding ends.

Additionally, using the DE recursion equation, we can plot the evolution of the (IT + RS) decoding process of an GLDPC-Staircase code for an erasure channel probability ε by tracing P _ℓ+1=f(P _ℓ) with $l \to +\infty $ as shown in the following example.

Example 3.1.

Let us consider a GLDPC-Staircase (scheme A) code with the following parameters:

Rate: $r_{G}=\frac {1}{2}$
Base code: r _L=0.8, N1=5

$$ DD: \left\lbrace \begin{array}{l} \lambda(x) = 0.0909.x^{1}+0.9091.x^{4}, \rho(x) = x^{21}\\ L(x) = 0.2.x^{2}+0.8.x^{5}, R(x) =x^{22} \\ \end{array}\right. $$
(18)
E=3 (regular distribution of extra-repair symbols).

Figure 4 provides the evolution of erasure probability during the (IT + RS) decoding of GLDPC-Staircase at ε=0.3. The initial fraction of erasure messages emitted by the LDPC variable nodes is P ₀=1. After an iteration (at the next output of the LDPC variable nodes), this fraction has evolved to P ₁=0.3. After second full iteration, i.e., at the output of the LDPC variable nodes, we see an erasure fraction of P ₂=0.2555. This process continues in the same fashion for each subsequent iteration, corresponding graphically to a staircase function which is bounded above by P _ℓ+1=P _ℓ and below by P _out.

3.3 EXIT functions of GLDPC-Staircase codes

3.3.1 Introduction

EXIT technique is a tool for predicting the convergence behavior of iterative processors for a variety of communication problems [21]. Over erasure channel, to visualize the convergence of iterative systems, rather than mutual information, the entropy information can be used (i.e., one minus mutual information). It is natural to use entropy in the setting of the erasure channel since the parameter ε itself represents the channel entropy. We focused in our work on EXIT based on entropy to evaluate the performance of GLDPC-Staircase codes under (IT + RS) and ML decoding. Therefore, we extended the method presented in [22]. These EXIT functions are based on DE equations derived in Section 3.2. The EXIT technique defined in this section relates to the asymptotic performance of the ensemble ${\mathcal {E}}(\lambda, \rho, {f_{e}})$ under the decoding.

3.3.2 (IT + RS) EXIT function: h ^(IT+RS) (ε)

The (IT + RS) EXIT function of GLDPC-Staircase code is denoted by h ^(IT+RS)(ε). It corresponds to running an (IT + RS) decoder on a very large LDPC-Staircase graph that is connected to the extra-repair variable nodes at ε until the decoder reaches a fixed point. This fixed point defines the stability of erasure probability improvement during decoding iterations. The extrinsic erasure probability of the LDPC-Staircase symbols at this fixed point gives the (IT + RS) EXIT function. Therefore, consider an ${\mathcal {E}}(\lambda, \rho, {f_{e}})$, the EXIT function of the GLDPC-Staircase codes under (IT + RS) decoding, over erasure channel (ε), is equal to the following equation:

$$ h^{(\text{IT+RS})} (\varepsilon) = \frac{1}{N_{L}} \sum_{i=1}^{N_{L}} h^{(\text{IT+RS})}_{i} (\varepsilon) $$

(19)

where, $h^{(\text {IT+RS})}_{i}$ is the extrinsic (IT + RS) erasure probability of LDPC-Staircase symbol “i” as shown in Fig. 5. h ^(IT+RS)(ε) is the asymptotic (average on all the LDPC variable nodes, $N_{L}\to +\infty $) extrinsic erasure probability at the output of an (IT + RS) decoding. This function value can be easily computed using the DE equations of GLDPC-Staircase codes. After an infinite number of iterations of the DE recursion (Eq. (16)), the (IT + RS) decoder reaches a fixed point (i.e., P _ℓ+1=P _ℓ, $\ell \to +\infty $).

Hence, we can also write

$$ h^{(\text{IT+RS})} (\varepsilon) = L({Q}_{+\infty}) $$

(20)

where ${Q}_{+\infty }$ is Q _ℓ, derived from the DE equations of GLDPC-Staircase codes in Section 3.2, when the number of iterations goes to infinity.

Next, we present how can visualize the evolution of extrinsic erasure probability during (IT + RS) decoding in a graph called EXIT curve.

3.3.3 (IT + RS) EXIT curve

The (IT + RS) EXIT curve of the GLDPC-Staircase code under (IT + RS) decoding can be derived, in terms of extrinsic erasure probability (at the output of the decoder) as a function of the a prior erasure probability (input of the decoder, ε).

Therefore, the asymptotic (IT + RS) EXIT curve, denoted by h ^(IT+RS), is given in a parametric form by

$$ h^{(\text{IT+RS})} (\varepsilon) = \left\lbrace \begin{array}{ccc} 0 & \text{if} & \varepsilon \in \left[0 \ \varepsilon^{(\text{IT+RS})}\right]\\ L({Q}_{+\infty}) & \text{if}& \varepsilon \in ]\varepsilon^{(\text{IT+RS})} \ 1] \end{array}\right. $$

(21)

Summarizing, the (IT + RS) EXIT curve is the trace of h ^(IT+RS)(ε) equation for ε starting from ε=ε ^(IT+RS) until ε=1. In other hand, it is zero up to the (IT + RS) decoding threshold ε ^(IT+RS). It then jumps to a non-zero value and also continues smoothly until it reaches one at ε=1. Therefore, by using this curve, ε ^(IT+RS) is given by the value of ε where h ^(IT+RS)(ε) drops down to zero.

Example 3.2.

Given a GLDPC-Staircase code with rate $r_{G}=\frac {1}{3}$, 2 extra-repair symbols per generalized check nodes (regular distribution) and base code with the following parameters:

r _L=0.6, N1=5
DD:
$$ \left\lbrace \begin{array}{l} \lambda(x) = 0.2105x^{1} + 0.7895x^{4}, \rho(x) = x^{9}, \\ L(x) = 0.4x^{2} + 0.6x^{5}, R(x) = x^{10} \\ \end{array}\right. $$
(22)

The (IT + RS) EXIT function h ^(IT+RS)(ε) is depicted in Fig. 6. The (IT + RS) decoding threshold, ε ^(IT+RS), is given by the point where h ^(IT+RS)(ε) drops down to zero. This gives ε ^(IT+RS)=0.5376. It can be seen that h ^(IT+RS)(ε)=0 for values ε≤ε ^(IT+RS), then it jumps to a non-zero value and continues to increase until it reaches a value of 1 for ε=1.

3.3.4 Upper bound on the ML decoding threshold

As for the (IT + RS) decoding, the EXIT curve of the ML decoding is also defined in terms of extrinsic erasure probability based on entropy. Precisely, in the limit of infinite code length, for a given channel erasure probability ε, h ^ML(ε) is the probability of a symbol node being erased after ML decoding, assuming that the received value (if any) of this particular symbol has not been submitted to the decoder. The asymptotic, average on all the LDPC variable nodes, extrinsic erasure probability at the output of an ML decoding (ML EXIT function) is obtained by

$$ h^{ML} (\varepsilon) = \frac{1}{N_{L}} \sum_{i=1}^{N_{L}} h^{ML}_{i} (\varepsilon) $$

(23)

where, $h^{ML}_{i}(\varepsilon)$ is the extrinsic erasure probability of LDPC symbol “i” after ML decoding as shown in Fig. 5.

Just like LDPC codes [22], the exact computation of the EXIT function for the ML decoding is difficult. However, using the area theorem [22, 23], we have:

$$ \int_{{\epsilon}^{ML}}^{1} h^{ML} (\varepsilon) = r_{G}, $$

(24)

where r _G is the designed coding rate of the given ensemble of GLDPC-Staircase codes. Moreover, since the (IT + RS) decoding is sub-optimal with respect to the ML decoding, we have h ^IT+RS(ε)≥h ^ML(ε). Hence, if for some $\bar {\epsilon }^{ML}$

$$ \int_{\bar{\epsilon}^{ML}}^{1} h^{(\text{IT+RS})} (\varepsilon) = r_{G}, $$

(25)

we necessarily have $\bar {\epsilon }^{ML} \geq \epsilon ^{ML}$. This gives an upper bound on the ML threshold, which is easily computed using h ^(IT+RS).

The ML EXIT curve of the GLDPC-Staircase codes, h ^ML(ε), can be constructed in the following manner:

Step 1: Plot the (IT + RS) EXIT curve as parametrized in Eq. (21).
Step 2: Determine the $\bar {\epsilon }^{ML}$ by integrate backwards from the right end of the curve where ε=1. The integration process stops at $\bar {\varepsilon }^{ML}$ where it assures Eq. (25). This gives the upper bound $\bar {\varepsilon }^{ML}$ of the GLDPC-Staircase codes.
Step 3: The ML EXIT curve is now the curve which is zero at the left of the upper bound on the ML decoding threshold and equals to the (IT + RS) EXIT curve to the right of this decoding threshold (i.e., the (IT + RS) EXIT and the ML EXIT curves coincide above $\bar {\varepsilon }^{ML}$).

Remark 3.2.

This upper bound is conjectured to be tight because the GLDPC-Staircase codes are based on LDPC-Staircase codes, which are binary codes and defined by quasi-regular graphs.

Example 3.3.

Consider the same code of Example 3.2.

Figure 7 shows the (IT + RS) EXIT curve (h ^(IT+RS)(ε)) and the integral bound on ε ^ML for GLDPC-Staircase code with the same distributions of Fig. 6.

The (IT + RS) decoding threshold value is ε ^(IT+RS)=0.5376.

The ML decoding threshold upper-bound is the unique point $\bar {\varepsilon }^{ML} \in \left [\varepsilon ^{(\text {IT+RS})} \ 1\right ]$ such that the red area below the (IT + RS) EXIT curve, delimited by $\varepsilon =\bar {\varepsilon }^{ML}$ at the left and by ε=1 at the right, is equal to the GLDPC code rate, r _G=1/3. In this case, we obtain $\bar {\varepsilon }^{ML} = 0.6664$.

4 Optimization of GLDPC-Staircase codes

4.1 Description

GLDPC-Staircase codes can be viewed as an extension of LDPC-Staircase code (base code) into generalized LDPC-Staircase code using RS codes. Moreover, GLDPC-Staircase codes can be constructed using two structures which differ in the type of the generated LDPC repair symbols that are either RS repair symbols or not, as follows:

Scheme A has the property that on each generalized check node, the repair symbol generated by the LDPC code is also an RS repair symbol.
On the opposite, with scheme B the generated LDPC repair symbol, on each generalized check node, is an RS source symbol.

In addition, the configuration of GLDPC-Staircase codes depends on the important internal parameters, namely

the extra-repair symbols distribution across the H _L rows: regular distribution or irregular uniform distribution,
the N1 parameter of the base code: degree of source variable nodes in H _L,
the base code rate r _L.

Therefore, in this section, we start by showing the impacts of the property that the generated LDPC repair symbols are at the same time RS repair symbols, on the decoding behavior (i.e., compare scheme A and scheme B). Then, the best configuration of these parameters for hybrid (IT/RS/ML) decoding³ will be discussed. To gauge the correction capabilities of decoding, we use the asymptotic analysis based on DE and EXIT techniques presented in Section 3, as well as the finite length analysis.

4.2 Experimental conditions

For the finite length analysis, we have developed a GLDPC-Staircase codec based on RS codes under (IT + RS) and ML decoding methods, in C language, using the OpenFEC.org project (http://openfec.org). All experiments are carried out by considering a memory-less erasure channel along with a transmission scheme where all the source and repair symbols are sent in a fully random order.

This has the benefit to make the performance results independent of the loss model⁴ and the target channel loss rate is the only parameter that needs to be considered.

Different LDPC-Staircase matrices are used (more precisely we change the PRNG seed used to create the matrix). Then, the results, averaged over the tests obtained by varying LDPC-Staircase matrix, show the average behavior of GLDPC-Staircase codes.

In the sequel, we evaluate the finite length performance based on the decoding overhead⁵, the decoding inefficiency ratio⁶ and the failure decoding probability⁷.

For the asymptotic analysis, we use commonly the following DD of LDPC-Staircase codes as presented in Tables 1 and 2.

Table 1 Used degree distribution. Table showing LDPC-Staircase DD for different values of N1 where $r_{L}=\frac {2}{3}$

GLDPC-Staircase AL-FEC codes: a fundamental study and new results

Abstract

1 Review

2 GLDPC-Staircase code design

2.1 Code description

2.2 Schemes A and B

2.3 Extra-repair symbol regular/irregular distributions

2.4 Encoding method

Example 2.1.

2.5 Decoding method

2.5.1 (IT + RS) decoding

Example 2.2.

2.5.2 Hybrid (IT/RS/ML) decoding

3 Asymptotic analysis method

3.1 Preliminaries

3.2 Density evolution

3.2.1 Introduction

3.2.2 DE equations of GLDPC-Staircase codes

Remark 3.1.

Example 3.1.

3.3 EXIT functions of GLDPC-Staircase codes

3.3.1 Introduction

3.3.2 (IT + RS) EXIT function: h (IT+RS) (ε)

3.3.3 (IT + RS) EXIT curve

Example 3.2.

3.3.4 Upper bound on the ML decoding threshold

Remark 3.2.

Example 3.3.

4 Optimization of GLDPC-Staircase codes

4.1 Description

4.2 Experimental conditions

4.3 Best coding scheme for GLDPC-Staircase codes

4.3.1 Asymptotic results

4.3.2 Finite length results

4.3.3 Conclusion of the analysis

4.4 Tuning internal parameters of GLDPC-Staircase codes

4.4.1 The extra-repair symbol distribution

4.4.2 N1 parameter

4.4.3 The base code rate r L

5 Performance evaluation

5.1 Achieved performance

5.1.1 (IT + RS) versus hybrid (IT/RS/ML) decoding

5.1.2 Detailed hybrid (IT/RS/ML) decoding inefficiency ratio results

5.1.3 Hybrid (IT/RS/ML) decoding failure probability results

5.2 Comparison with other erasure correcting codes

5.2.1 Comparison with LDPC-Staircase codes

5.2.2 Comparison with another GLDPC code [26]

5.2.3 Comparison with Raptor and RaptorQ codes

5.3 Hybrid (IT/RS/ML) decoding complexity

5.3.1 Experimental conditions

5.3.2 Results

6 Conclusions

7 Endnotes

References

Acknowledgements

Competing interests

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

3.3.2 (IT + RS) EXIT function: h ^(IT+RS) (ε)

4.4.3 The base code rate r _L