On The Automorphism Group of Polar Codes
On The Automorphism Group of Polar Codes
Marvin Geiselhart, Ahmed Elkelesh, Moustafa Ebada, Sebastian Cammerer and Stephan ten Brink
Institute of Telecommunications, Pfaffenwaldring 47, University of Stuttgart, 70569 Stuttgart, Germany
{geiselhart,elkelesh,ebada,cammerer,tenbrink}@inue.uni-stuttgart.de
Abstract—The automorphism group of a code is the set of (ensemble) decoding, as proven in [10, Theorem 2]. Therefore,
permutations of the codeword symbols that map the whole code it is crucial to find automorphisms outside the LTA to enable
onto itself. For polar codes, only a part of the automorphism efficient parallel ensemble decoding of polar codes. Further
group was known, namely the lower-triangular affine group
(LTA), which is solely based upon the partial order of the code’s potential applications include analysis of some post-quantum
synthetic channels. Depending on the design, however, polar codes cryptography schemes [12].
arXiv:2101.09679v2 [cs.IT] 27 Jan 2021
can have a richer set of automorphisms. In this paper, we extend The main contribution of this work is the introduction
the LTA to a larger subgroup of the general affine group (GA), of a larger automorphism group of decreasing monomial
namely the block lower-triangular affine group (BLTA) and show codes, namely the block lower-triangular affine group (BLTA).
that it is contained in the automorphism group of polar codes.
Furthermore, we provide a low complexity algorithm for finding We provide efficient algorithms for finding this group and
this group for a given information/frozen set and determining its sampling from it. The concept applies to polar codes, RM
size. Most importantly, we apply these findings in automorphism- codes and the recently proposed partially symmetric monomial
based decoding of polar codes and report a comparable error-rate codes (PSMCs) [13].
performance to that of successive cancellation list (SCL) decoding
with significantly lower complexity. II. P RELIMINARIES
| Aut(C )|
that LTA(2, n) and GA(2, n) are themselves special cases of
BLTA(s, n), with s = [1, . . . , 1] and s = [n], respectively.
Lemma 1: The join of the group of block-permutation 1011
transformations Ps (n) and the group lower triangular affine
|LTA(2,7)|
transformations LTA(2, n) is exactly BLTA(s, n), i.e., 108 ← “RM-like” “Polar-like” →
BLTA(s, n) = hPs (n), LTA(2, n)i. (17) 10−2 10−1
In other words, any composition of transformations from Design Erasure Probability ε
LTA(2, n) and Ps (n) is a transformation from BLTA(s, n) and Fig. 2: Number of automorphisms of (128,64) polar codes with
vice versa. Bhattacharyya-based construction versus Binary Erasure Channel (BEC) de-
sign erasure probability ε .
Proof: “⇒”: Obviously, Ps (n) ⊆ BLTA(s, n), as permutation
matrices are non-singular and the block structure is given.
1017 RM(3,7) Maximum
Similarly, LTA(2, n) ⊆ BLTA(s, n), where again we have a |GA(2,7)|
Average
special case of affine transformations. Also, as BLTA(s, n) is Minimum
1014
| Aut(C )|
closed, a composition of any transformations will not generate
any elements outside BLTA(s, n).
“⇐”: We can show this by observing that any 1011
block lower-triangular matrix A may be decomposed as Bhat. @1dB
|LTA(2,7)|
A = P1 · L1 · P2 · L2 · P3 , with Pi ∈ Ps (n) and Li ∈ LTA(2, n). 108 ← “RM-like” “Polar-like” →
For this, consider the LUP decomposition of A, i.e., PA =
LU [16]. The block lower-triangular structure of A ensures 1 2 3 4 5 6 7 8
that also U and P are block lower-triangular. One can now Number of Generators |Imin |
transform U into a conventional lower-triangular matrix by Fig. 3: Maximum, average and minimum number of automorphisms of all
reversing the order of the rows and columns within each (128,64) decreasing monomial codes versus their number of generators |Imin |.
block. This can be written as L2 = PBR (s) · U · PBR (s), with !
m−1 sk −1
PBR (s) = [pi, j ] and
i′
( | BLTA(s, n)| = NIBLT (s)·2n = 2n · ∏ 2γk ·sk ∏ 2 sk − 2 .
1 for j = 2γk(i) + sk(i) − 1 − i k=0 i′ =0
pi, j = . (18) (20)
0 else
Note that this equates to the sizes of LTA(2, n) and GA(2, n)
Finally, as PBR (s) = P−1 −1 for the special cases s = [1, . . . , 1] and s = [n], respectively.
BR (s), we have P1 = P , P2 = P3 =
PBR (s) and L1 = L. The additive term b may be included (i.e., Fig. 2 shows the sizes of the automorphism groups for
also properly permuted) in any of the LTA transformations. polar codes with N = 128 and K = 64 designed according to
Theorem 2 (Automorphisms of Polar Codes): Let C be the Bhattacharyya parameter of the synthetic channels. This
a decreasing monomial code in n variables with information construction assumes a BEC with erasure probability ε . It
set I. Then can be seen that for low erasure probability, this construction
BLTA(s, n) ⊆ Aut(C ) (19) generates the RM(3,7)-code. The larger the values of ε , the
fewer the automorphisms featured by the code. In Fig. 3, we
with s being the block structure of Stab(I). evaluate the influence of the number of generators of a code
Proof: The proof directly follows from Lemma 1, as both |Imin | on the size of the automorphism group, also for the case
LTA and Stab(I) correspond to automorphisms of the code. of N = 128 and K = 64. Since there exist usually many codes
We furthermore conjecture that Eq. (19) holds with equality, with the same number of generators, we plot the minimum,
which is also suggested by means of numerical results. How- average and maximum automorphism group sizes for each
ever, we were not yet able to find a rigorous proof. To prove it, value of |Imin |. To obtain these numbers, we enumerated all
one would need to show that for every nonzero element ai, j 1007 (128, 64) decreasing monomial codes using a tree search.
with i < j in an affine transformation (A, b) ∈ Aut(C ), the As just mentioned, it can be seen that a smaller size of Imin
variable permutation π = (i, j) must be contained in Stab(I). generally results in a larger number of automorphisms. We
find the RM code on the very left of the plot, while typical
A. Number of Automorphisms polar codes lie more towards the right edge. It is worth it
The number of automorphisms is (at least) the size of mentioning that, from a code design perspective, several code
BLTA(s, n) for a code with block structure s. Clearly, this is the constructions can be viewed as lying between polar and RM
number of non-singular block lower-triangular matrices times codes (e.g., [13], [17], [18] and [19]).
the number of affine translations b. Using Eq. (16), we have
Design s |Aut(C )| dmin a Admin b
Bhat. @1 dB [2,1,1,1,1,1,1] 2.06 · 1011 8 96
Imin = {31,99} [5,3] 1.41 · 1016 16 69936
10−1
Imin = {31,57} [3,5] 1.41 · 1016 16 69936
Table I: Properties of the compared (256,128) polar codes. a dmin : minimum
10−2 distance of the code. b Admin : number of minimum-weight codewords.
BLER
10−1
BLER
need the algorithm to be deterministic and stop after a finite 10−3
number of steps. Design SC SCL-8 Aut-8-SC OSD-4
To summarize, we can move any variable in f in any other Bhat. @1 dB
10−4 Imin = {27,56}
place in Ωπ , as we can move it to i0 using the partial order, Imin = {23,112}
rotate it to i1 using the cycle permutation π , and then place it RM(3,7)
in the target position as all indices are reachable by the partial 10−5
1 1.5 2 2.5 3 3.5 4
order from i1 . Due to the resemblance of the cycle π with
an irregular paternoster elevator running around a building Eb /N0 [dB]
with floors Ωπ , we call this algorithm the crazy paternoster Fig. 6: Comparison of (N = 128,K = 64) polar codes under SC, Aut-SC and
algorithm. SCL decoding; BI-AWGN channel.
The described method can be extended to the case where
Design s |Aut(C )| dmin Admin
π is a product of interlocked cycles. We can classify all
Bhat. @1 dB [2,2,1,1,1] 2.42 · 109 8 688
cycles as either transit cycles or parking cycles. A parking Imin = {27,56} [3,4] 1.78 · 1012 8 240
cycle σ is fully enclosed by another cycle, i.e., ∃σ ′ ∈ C(π ) Imin = {23,112} [4,3] 1.78 · 1012 8 16
with Ωσ ⊂ Ωσ ′ ; while transit cycles partially overlap with RM(3,7) [7] 2.10 · 1016 16 94488
another. It is easy to see that there exists a chain of transit Table II: Properties of the compared (128,64) polar codes.
cycles σ1 , · · · , σt with the properties min(σ1 ) = i0 , max(σ j ) >
min(σ j+1 ) and max(σt ) = i1 . Fig. 5 shows the interlocked
cycles for the example of π = (1, 5, 2)(3, 7)(6, 10)(8, 9). Here, decoding (OSD)-4 results serve as an upper bound on the
the cycles σ1 = (1, 5, 2), σ2 = (3, 7) and σ3 = (6, 10) form the maximum likelihood (ML) performance of each code [21].
chain of transit cycles, while (8, 9) fully overlaps with (6, 10) Again, the Bhattacharyya construction at design SNR of 1 dB
and therefore is classified as a parking cycle. Using this chain (Imin = {31, 45, 51, 71, 84, 97}) does not show any gains for
of transit cycles, we can again move any variable in f to any Aut-SC decoding. Also, note that the gains of SCL-8 when
other position within Ωπ , as the overlap of the cycles allows xi compared to SC are also smaller than 0.2 dB. Next, we
to “change” from one cycle to the next higher cycle. Note that designed codes by selecting two generators Imin = {27, 56} and
depending on the degrees of monomials, the order in which Imin = {23, 112} in order to have a large automorphism group,
the xi are moved, must be adjusted. The maximum number of under the constraint of K = 64. Both codes can be viewed as
steps remains upper bounded, however, by the same number, examples of PSMCs [13]. While the SC performance degrades,
namely now a significant performance gain is achieved by both Aut-SC
Q′ = 2d · |C(π )| · max {ord(σ )} . (29) and SCL decoding with comparable performance. In particular,
σ ∈C(π )
Aut-SC is within 0.1 dB to 0.2 dB of the SCL performance.
For the most general case, i.e., if π is a product of multiple For completeness, we also included performance results for the
interval disjoint permutations, we can use the upper bound RM code construction (Imin = {15}). As previously reported
in [10], in the RM case, automorphism-based decoding can
Q′′ = max 2d · |C(ρ )| · max {ord(σ )} , (30) even outperform SCL decoding.
ρ ∈T (π ) σ ∈C(ρ )