SlideShare a Scribd company logo
A High-Performance FIR Filter Architecture for Fixed
and Reconfigurable Applications
High-Performance NB-LDPC Decoder With
Reduction of Message Exchange
Abstract:
This paper presents a novel algorithm based on trellis min–max for decoding non-binary low-
density parity check (NB-LDPC) codes. This decoder reduces the number of messages
exchanged between check node and variable node processors, which decreases the storage
resources and the wiring congestion and, thus, increases the throughput of the decoder. Our
frame error rate performance simulations show that the proposed algorithm has a negligible
performance loss for high rate codes with GF(16) and GF(32), and a performance loss smaller
than 0.07 dB for high-rate codes over GF(64). The proposed architecture of this paper analysis
the logic size, area and power consumption using Xilinx 14.2.
Enhancement of the project:
Existing System:
The first algorithm proposed to decode NB-LDPC codes was the Q-ary sum-of-product
algorithm (QSPA), which was developed as a generalization of the SPA for binary LDPC codes.
Further improvements, such as fast Fourier transform SPA, log-SPA, and max-log-SPA, were
proposed to reduce the complexity of the CN processing equations without introducing any
performance loss. More recently, a trellisbased implementation for QPSA (T-max-log-QSPA)
was proposed, offering a solution that increases the throughput with respect to previous solutions
based on QPSA. Its main drawback is that the required area is prohibitive for real applications in
communications and storage systems. Extended min-sum (EMS) and min–max algorithms were
presented as the approximations of the QSPA, so that they reduce considerably the CN
complexity, which only requires additions and/or comparisons. In addition, EMS and min–max
algorithms utilize forward–backward metrics to derive the CN output messages. These metrics
involve serial computations that limit the throughput of the derived hardware architectures.
Trellis EMS (T-EMS) algorithm was proposed with the aim of enabling parallel processing of
the messages in the CN. The input messages are organized in a trellis structure, while the output
messages are generated in parallel by means of an extra column included in the trellis.
Trellis min–max (T-MM) algorithm in adapts the idea of T-EMS to min–max algorithm. One
minimum only T-MM (OMO T-MM) is an approximation of T-MM that reduces the complexity
of the CN by obtaining only one minimum and estimating the second one. All these algorithms
A High-Performance FIR Filter Architecture for Fixed
and Reconfigurable Applications
exchange q × dc reliability values between CN and variable node (VN) processors. This amount
of exchanged messages is large enough to cause wiring congestion, and this limits the maximum
throughput, especially for high-rate NB-LDPC codes and high-order GFs. In addition, in decoder
architectures with a layered schedule, the CN output messages are stored to be used in the next
iteration. So, the required memory, which is the main part of the area in NB-LDPC decoder
architectures, is too high.
Disadvantages:
 Area coverage is high
 Throughput is low
Proposed System:
MODIFIED TRELLIS MIN–MAX ALGORITHM:
Reformulation of Trellis Min–Max Algorithm:
We reformulate the T-MM algorithm as a first step to define our proposal. As can be seen in
Algorithm 2, Steps 4 and 5 are the ones reformulated. The function ψ’ in Step 4 obtains, which
path in the trellis was used to obtain ΔQ (a), that is, the most reliable path. Considering that a
maximum of two deviations is evaluated, the function returns the two GF symbols that define
this path, η∗ 1(a) and η∗ 2(a).
Algorithm 1 T-MM Algorithm
A High-Performance FIR Filter Architecture for Fixed
and Reconfigurable Applications
Fig. 1 includes an example of trellis with GF(4) and dc = 5. It shows the CN input messages
before (Qmn(a)) and after (ΔQmn(a)) delta domain transformation. The hard-decision symbols
are z = {α1, α0, 0, α0, 0}. After the normal-to-delta domain transformation, the reliabilities ΔQ
mn(a) in the first row of the trellis are equal to 0.
A High-Performance FIR Filter Architecture for Fixed
and Reconfigurable Applications
Fig. 1. Top: example of CN input messages in normal domain. Bottom: messages in delta domain and
organized in trellis way including the extra column Q(a) (bottom size). Example for GF(4) and dc = 5.
NB-LDPC DECODER IMPLEMENTATION
We describe the architecture designed to implement the proposed mT-MM algorithm.
CN Architecture for mT-MM Algorithm
The main characteristic of the proposed mT-MM Algorithm is to move part of the complexity of
the CN processor to the VN processor. In this way, the number of exchanged messages between
them and also the storage resources of the decoder are reduced. Therefore, the CN architecture
presented in this section requires less functional blocks than a conventional implementation of
the T-MM algorithm. Fig. 2 shows the block diagram for the top-level CN architecture, where
each block corresponds to a step in the mT-MM algorithm.
A High-Performance FIR Filter Architecture for Fixed
and Reconfigurable Applications
Fig. 2. Proposed CN block diagram.
The complete block diagram for the proposed decoder is presented in Fig. 3. As can be seen,
there is only one CN processor and one VN processor, which processes one row of H per clock
cycle. A layered schedule requires to store the CN output messages from one iteration to be used
in the next one. This is done by means of a shift register with M stages (SR in Fig. 3).
Fig. 3. Top-level proposed decoder architecture.
Advantages:
 Improve area and throughput.
Software implementation:
 Modelsim
 Xilinx ISE

More Related Content

DOCX
A high performance fir filter architecture for fixed and reconfigurable appli...
DOCX
Flexible dsp accelerator architecture exploiting carry save arithmetic
DOCX
Graph based transistor network generation method for supergate design
DOCX
High performance pipelined architecture of elliptic curve scalar multiplicati...
DOCX
Flexible dsp accelerator architecture exploiting carry save arithmetic
DOCX
Graph based transistor network generation method for supergate design
PDF
DUAL FIELD DUAL CORE SECURE CRYPTOPROCESSOR ON FPGA PLATFORM
PDF
A comparative study of different multiplier designs
A high performance fir filter architecture for fixed and reconfigurable appli...
Flexible dsp accelerator architecture exploiting carry save arithmetic
Graph based transistor network generation method for supergate design
High performance pipelined architecture of elliptic curve scalar multiplicati...
Flexible dsp accelerator architecture exploiting carry save arithmetic
Graph based transistor network generation method for supergate design
DUAL FIELD DUAL CORE SECURE CRYPTOPROCESSOR ON FPGA PLATFORM
A comparative study of different multiplier designs

What's hot (20)

PDF
Iaetsd pipelined parallel fft architecture through folding transformation
PDF
Transpose Form Fir Filter Design for Fixed and Reconfigurable Coefficients
PDF
High Performance MAC Unit for FFT Implementation
PDF
J0166875
PDF
Aw4102359364
PDF
Iaetsd vlsi architecture for exploiting carry save arithmetic using verilog hdl
PDF
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
PDF
B1030610
DOCX
A novel area efficient vlsi architecture for recursion computation in lte tur...
PDF
Design and Implementation of Low-Power and Area-Efficient 64 bit CSLA using VHDL
PDF
Gn3311521155
PDF
Modified montgomery modular multiplier for cryptosystems
PDF
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
PDF
HIGH SPEED MULTIPLE VALUED LOGIC FULL ADDER USING CARBON NANO TUBE FIELD EFFE...
PDF
Multiplier and Accumulator Using Csla
PDF
1.area efficient carry select adder
PDF
Eq36876880
PDF
International Journal of Engineering Research and Development (IJERD)
PDF
IRJET- Low Complexity Pipelined FFT Design for High Throughput and Low Densit...
Iaetsd pipelined parallel fft architecture through folding transformation
Transpose Form Fir Filter Design for Fixed and Reconfigurable Coefficients
High Performance MAC Unit for FFT Implementation
J0166875
Aw4102359364
Iaetsd vlsi architecture for exploiting carry save arithmetic using verilog hdl
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
B1030610
A novel area efficient vlsi architecture for recursion computation in lte tur...
Design and Implementation of Low-Power and Area-Efficient 64 bit CSLA using VHDL
Gn3311521155
Modified montgomery modular multiplier for cryptosystems
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
HIGH SPEED MULTIPLE VALUED LOGIC FULL ADDER USING CARBON NANO TUBE FIELD EFFE...
Multiplier and Accumulator Using Csla
1.area efficient carry select adder
Eq36876880
International Journal of Engineering Research and Development (IJERD)
IRJET- Low Complexity Pipelined FFT Design for High Throughput and Low Densit...
Ad

Similar to High performance nb-ldpc decoder with reduction of message exchange (20)

PDF
Reduced Complexity Maximum Likelihood Decoding Algorithm for LDPC Code Correc...
PDF
Reduced Energy Min-Max Decoding Algorithm for Ldpc Code with Adder Correction...
PDF
New low-density-parity-check decoding approach based on the hard and soft dec...
PDF
International Journal of Engineering Research and Development (IJERD)
DOCX
Novel design algorithm for low complexity programmable fir filters based on e...
PDF
Lecture summary: architectures for baseband signal processing of wireless com...
PDF
Analysis and design of cost effective, high-throughput ldpc decoders
PDF
GF(q) LDPC encoder and decoder FPGA implementation using group shuffled beli...
PDF
VLSI IEEE Transaction 2018 - IEEE Transaction
PDF
Performance comparison of eg ldpc codes
PDF
Matlab Dsp Hdl Toolbox Users Guide Unknown
PDF
Matlab Dsp Hdl Toolbox Users Guide Unknown
PDF
A reconfigurable ldpc decoder optimized applications
PDF
Performance analysis and implementation for nonbinary quasi cyclic ldpc decod...
PDF
FOLDED ARCHITECTURE FOR NON CANONICAL LEAST MEAN SQUARE ADAPTIVE DIGITAL FILT...
PDF
FOLDED ARCHITECTURE FOR NON CANONICAL LEAST MEAN SQUARE ADAPTIVE DIGITAL FILT...
PDF
Design of High Speed and Low Power Veterbi Decoder for Trellis Coded Modulati...
PDF
Ijecet 06 10_004
PDF
Digital signal processors architecture programming and applications 2nd Editi...
PDF
Digital signal processors architecture programming and applications 2nd Editi...
Reduced Complexity Maximum Likelihood Decoding Algorithm for LDPC Code Correc...
Reduced Energy Min-Max Decoding Algorithm for Ldpc Code with Adder Correction...
New low-density-parity-check decoding approach based on the hard and soft dec...
International Journal of Engineering Research and Development (IJERD)
Novel design algorithm for low complexity programmable fir filters based on e...
Lecture summary: architectures for baseband signal processing of wireless com...
Analysis and design of cost effective, high-throughput ldpc decoders
GF(q) LDPC encoder and decoder FPGA implementation using group shuffled beli...
VLSI IEEE Transaction 2018 - IEEE Transaction
Performance comparison of eg ldpc codes
Matlab Dsp Hdl Toolbox Users Guide Unknown
Matlab Dsp Hdl Toolbox Users Guide Unknown
A reconfigurable ldpc decoder optimized applications
Performance analysis and implementation for nonbinary quasi cyclic ldpc decod...
FOLDED ARCHITECTURE FOR NON CANONICAL LEAST MEAN SQUARE ADAPTIVE DIGITAL FILT...
FOLDED ARCHITECTURE FOR NON CANONICAL LEAST MEAN SQUARE ADAPTIVE DIGITAL FILT...
Design of High Speed and Low Power Veterbi Decoder for Trellis Coded Modulati...
Ijecet 06 10_004
Digital signal processors architecture programming and applications 2nd Editi...
Digital signal processors architecture programming and applications 2nd Editi...
Ad

Recently uploaded (20)

PDF
IFRS Notes in your pocket for study all the time
PDF
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
PDF
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
PPTX
Amazon (Business Studies) management studies
PPTX
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
PDF
COST SHEET- Tender and Quotation unit 2.pdf
PPT
Data mining for business intelligence ch04 sharda
DOCX
unit 1 COST ACCOUNTING AND COST SHEET
PDF
Traveri Digital Marketing Seminar 2025 by Corey and Jessica Perlman
PDF
Unit 1 Cost Accounting - Cost sheet
PDF
Nidhal Samdaie CV - International Business Consultant
PDF
Business model innovation report 2022.pdf
DOCX
unit 2 cost accounting- Tender and Quotation & Reconciliation Statement
PPTX
5 Stages of group development guide.pptx
PDF
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
PDF
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
PDF
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
PDF
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
PDF
MSPs in 10 Words - Created by US MSP Network
PPTX
HR Introduction Slide (1).pptx on hr intro
IFRS Notes in your pocket for study all the time
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
Amazon (Business Studies) management studies
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
COST SHEET- Tender and Quotation unit 2.pdf
Data mining for business intelligence ch04 sharda
unit 1 COST ACCOUNTING AND COST SHEET
Traveri Digital Marketing Seminar 2025 by Corey and Jessica Perlman
Unit 1 Cost Accounting - Cost sheet
Nidhal Samdaie CV - International Business Consultant
Business model innovation report 2022.pdf
unit 2 cost accounting- Tender and Quotation & Reconciliation Statement
5 Stages of group development guide.pptx
20250805_A. Stotz All Weather Strategy - Performance review July 2025.pdf
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
MSPs in 10 Words - Created by US MSP Network
HR Introduction Slide (1).pptx on hr intro

High performance nb-ldpc decoder with reduction of message exchange

  • 1. A High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications High-Performance NB-LDPC Decoder With Reduction of Message Exchange Abstract: This paper presents a novel algorithm based on trellis min–max for decoding non-binary low- density parity check (NB-LDPC) codes. This decoder reduces the number of messages exchanged between check node and variable node processors, which decreases the storage resources and the wiring congestion and, thus, increases the throughput of the decoder. Our frame error rate performance simulations show that the proposed algorithm has a negligible performance loss for high rate codes with GF(16) and GF(32), and a performance loss smaller than 0.07 dB for high-rate codes over GF(64). The proposed architecture of this paper analysis the logic size, area and power consumption using Xilinx 14.2. Enhancement of the project: Existing System: The first algorithm proposed to decode NB-LDPC codes was the Q-ary sum-of-product algorithm (QSPA), which was developed as a generalization of the SPA for binary LDPC codes. Further improvements, such as fast Fourier transform SPA, log-SPA, and max-log-SPA, were proposed to reduce the complexity of the CN processing equations without introducing any performance loss. More recently, a trellisbased implementation for QPSA (T-max-log-QSPA) was proposed, offering a solution that increases the throughput with respect to previous solutions based on QPSA. Its main drawback is that the required area is prohibitive for real applications in communications and storage systems. Extended min-sum (EMS) and min–max algorithms were presented as the approximations of the QSPA, so that they reduce considerably the CN complexity, which only requires additions and/or comparisons. In addition, EMS and min–max algorithms utilize forward–backward metrics to derive the CN output messages. These metrics involve serial computations that limit the throughput of the derived hardware architectures. Trellis EMS (T-EMS) algorithm was proposed with the aim of enabling parallel processing of the messages in the CN. The input messages are organized in a trellis structure, while the output messages are generated in parallel by means of an extra column included in the trellis. Trellis min–max (T-MM) algorithm in adapts the idea of T-EMS to min–max algorithm. One minimum only T-MM (OMO T-MM) is an approximation of T-MM that reduces the complexity of the CN by obtaining only one minimum and estimating the second one. All these algorithms
  • 2. A High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications exchange q × dc reliability values between CN and variable node (VN) processors. This amount of exchanged messages is large enough to cause wiring congestion, and this limits the maximum throughput, especially for high-rate NB-LDPC codes and high-order GFs. In addition, in decoder architectures with a layered schedule, the CN output messages are stored to be used in the next iteration. So, the required memory, which is the main part of the area in NB-LDPC decoder architectures, is too high. Disadvantages:  Area coverage is high  Throughput is low Proposed System: MODIFIED TRELLIS MIN–MAX ALGORITHM: Reformulation of Trellis Min–Max Algorithm: We reformulate the T-MM algorithm as a first step to define our proposal. As can be seen in Algorithm 2, Steps 4 and 5 are the ones reformulated. The function ψ’ in Step 4 obtains, which path in the trellis was used to obtain ΔQ (a), that is, the most reliable path. Considering that a maximum of two deviations is evaluated, the function returns the two GF symbols that define this path, η∗ 1(a) and η∗ 2(a). Algorithm 1 T-MM Algorithm
  • 3. A High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications Fig. 1 includes an example of trellis with GF(4) and dc = 5. It shows the CN input messages before (Qmn(a)) and after (ΔQmn(a)) delta domain transformation. The hard-decision symbols are z = {α1, α0, 0, α0, 0}. After the normal-to-delta domain transformation, the reliabilities ΔQ mn(a) in the first row of the trellis are equal to 0.
  • 4. A High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications Fig. 1. Top: example of CN input messages in normal domain. Bottom: messages in delta domain and organized in trellis way including the extra column Q(a) (bottom size). Example for GF(4) and dc = 5. NB-LDPC DECODER IMPLEMENTATION We describe the architecture designed to implement the proposed mT-MM algorithm. CN Architecture for mT-MM Algorithm The main characteristic of the proposed mT-MM Algorithm is to move part of the complexity of the CN processor to the VN processor. In this way, the number of exchanged messages between them and also the storage resources of the decoder are reduced. Therefore, the CN architecture presented in this section requires less functional blocks than a conventional implementation of the T-MM algorithm. Fig. 2 shows the block diagram for the top-level CN architecture, where each block corresponds to a step in the mT-MM algorithm.
  • 5. A High-Performance FIR Filter Architecture for Fixed and Reconfigurable Applications Fig. 2. Proposed CN block diagram. The complete block diagram for the proposed decoder is presented in Fig. 3. As can be seen, there is only one CN processor and one VN processor, which processes one row of H per clock cycle. A layered schedule requires to store the CN output messages from one iteration to be used in the next one. This is done by means of a shift register with M stages (SR in Fig. 3). Fig. 3. Top-level proposed decoder architecture. Advantages:  Improve area and throughput. Software implementation:  Modelsim  Xilinx ISE