SlideShare a Scribd company logo
Bulletin of Electrical Engineering and Informatics
Vol. 8, No. 2, June 2019, pp. 422~427
ISSN: 2302-9285, DOI: 10.11591/eei.v8i2.1483  422
Journal homepage: http://beei.org/index.php/EEI
Efficient FPGA implementation of high speed digital delay for
wideband beamforming using parallel architectures
Gian Carlo Cardarilli1
, Luca Di Nunzio2
, Rocco Fazzolari3
, Daniele Giardino4
, Marco Matta5
, Marco
Re6
, Sergio Spanò7
, Lorenzo Simone8
1,2,3,4,5,6,7
University of Rome Tor Vergata, Via del Politecnico 1, 00133 Roma, Italy
8
Thales Alenia Space Roma, Via Saccomuro 24-00131 Roma, Italy
Article Info ABSTRACT
Article history:
Received Jan 23, 2019
Revised Feb 2, 2019
Accepted Feb 25, 2019
In this paper, the authors present an FPGA implementation of a digital delay
for beamforming applications. The digital delay is based on a Parallel Farrow
Filter. Such architecture allows to reach a very high processing rate with
wideband signals and it is suitable to be used with Time-Interleaved Analog
to Digital Converters (TI-ADC). The proposed delay has been simulated in
MATLAB, implemented on FPGA and characterized in terms of amplitude
and phase response, maximum clock frequency and area.
Keywords:
Farrow filter
TI-ADC
Variable fractional delay
Wideband digital beamfoming
Copyright © 2019 Institute of Advanced Engineering and Science.
All rights reserved.
Corresponding Author:
Luca Di Nunzio,
University of Rome Tor Vergata
Via del Politecnico 1, 00133 Roma Italy
Email: di.nunzio@ing.uniroma2.it
1. INTRODUCTION
The beamforming technique [1-5] is based on the combination of M different signals coming from
M antennas. Such combination is obtained by delaying and summing the signals in order to produce additive
interferences in some directions and destructive interferences in others.
In this application, the delay blocks represent a crucial element. When the beamforming involves
narrowband signals, the delays can be realized with simple phase shifters, implemented with complex
multipliers. Vice versa in case of wide-band signals, delay blocks must be implemented using more complex
circuits [6-7]. A common solution consists in the use of fractional delay filters. These filters are able to
generate delays, which are a fraction of the system clock cycle. In wideband applications, as for example
wideband beamforming, the filters must comply with some specifications, namely:
a. Large bandwidth (ideally Nyquist frequency)
b. Reduced Magnitude ripple
c. Reduced Phase ripple
d. Ideally constant Group delay
These requirements can be easily met using the Weighted Least-Squares (WLS) method
implemented with the Farrow architecture [8-13]. However, as the processing rate increases, e.g. more than 1
GHz, the hardware implementation could present some complications, for example the high sample rate that
makes impossible the use of FPGAs. The reason is the impossibility of FPGAs to reach processing rates
beyond the GHz. In fact, although modern Time-Interleaved ADCs [14-16] (TI-ADC) are able to provide
wide-band signals with sample rates over the GHz, FPGAs are not able to process such signals without
decimation and, consequently, without reducing the bandwidth of signals involved in processing [17, 18].
Bulletin of Electr Eng and Inf ISSN: 2302-9285 
Efficient FPGA implementation of high speed digital delay for wideband… (Gian Carlo Cardarilli)
423
In this paper, the authors present an FPGA based digital delay for wideband digital beamforming.
The proposed solution is able to reach preocessing rates compatible with actual TI-ADC. Such digital delay
is based on Parallel FIR Filters which are used to compose a Parallel Farrow Filter.
2. RESEARCH METHOD
Farrow filters [19-21] represent the most common solution for the implementation of fractional
delay filters. They are widely discussed in the literature and a detailed analysis is provided in [20]. In Figure
1 the block diagram of a Farrow filter is provided. The filter is composed of delay blocks, adders, multipliers
(used for the selection of the delay entity) and M subfilters. Subfilters are represented in the figure with
blocks named Ci(z) with 0≤I≤M-1. As discussed in [3, 4], modern TI-ADCs for high speed/wide-band
applications are usually composed of 2 or 4 ADC cores. Each core provides the output on a separate bus. All
the cores work in parallel and the total sample rate of the ADC is the sum of the sample rates of the
single cores.
In other words, the total sample rate is L∙fsa where L is the number of cores and fsa is the sample rate
of each core. The idea of the proposed digital delay is to parallelize the Farrow architecture in order to
process the incoming parallel data from the TI-ADC cores as shown in Figure 2. This is possible by
parallelizing the sub-filters that compose the Farrow filter. The parallel architecture is based on polyphase
filters banks [22, 23]. This technique allows to reduce the operating frequency by parallelizing the filters and,
as consequence, could be used to reduce the system power consumption by acting on power supply [24].
To achieve such parallelization, we introduce a new polyphase architecture that we call Parallel-
Polyphase. This architecture, differently from traditional polyphase architectures, is able to process data
without any decimation and consequently without any bandwidth reduction.
Figure 1. Farrow filter architecture
Let's consider the impulse response ci[n] of a generic sub-filter Ci composing the Farrow filter. In
order to implement the architecture shown in Figure 2, each sub-filter must be parallelized by a factor equal
to the number of TI-ADC available cores (Figure 2 shows the case of a TI-ADC composed by 4 cores). In the
following, we get the equations for a generic parallel filter having L inputs and L outputs, where L is the
number of the TI-ADC cores. We transform the discrete convolution of a classic SISO (Single Input, Single
Output) FIR filter into a parallel convolution to model a MIMO (Multi-Input, Multi-Output) FIR Filter. A
FIR filer is described by the discrete convolution:
, - ∑ , - , - (1)
where N is the length of the filter output and c[n] is the impulse response of Ci(z).
 ISSN: 2302-9285
Bulletin of Electr Eng and Inf, Vol. 8, No. 2, June 2019 : 422 – 427
424
Figure 2. Parallel farrow filter with L=4
Applying a factor L polyphase decomposition [25] to the output y[n], we split the output in L
branches yk[n] with k= 0, 1, 2 to L-1:
, - , - ∑ , - , - (2)
Note that each output yk[n] depends on every sample of the input sequence x[n]. Because TI-ADCs
provide the inputs in parallel (each core has an independent bus), input x[n] in the equation must be also
parallelized. This is possible using a change of variables: i=Lm+l with
0 ≤ l ≤ L-1 and 0 ≤ m ≤ M-1 where M is the length of the subfilters obtained by the polyphase decomposition
of c[n] [6]. The new equation of the model is shown in (3).
, - ∑ (∑ , ( ) - , -) (3)
(3) describes the L outputs yk[n] in function of the L inputs. In terms of Z transform we have:
( ) ∑ ( ) ( ) (4)
where * , ( ) -+ ( ) are the L parallel inputs provided by the TI-ADC and * ,
-+ ( ) is the subfilter of c[n]. The term may have a negative subscript. For this reason, we
consider β=k-l and we rewrite Xk-l with subscript between 0 and L-1:
( ) * , -+ * , ( )- ( )+ ( ) (5)
Consequently:
( ) ∑ ( ) ( ) ∑ ( )( ) ( )
( ) ∑ ( ) ( )
(6)
Bulletin of Electr Eng and Inf ISSN: 2302-9285 
Efficient FPGA implementation of high speed digital delay for wideband… (Gian Carlo Cardarilli)
425
For example, choosing L=4, a generic filter Ci(z) can be parallelized as shown in Figure 3. The
polyphase sub-filters Hi(zL
) are computed as shown in [6]. Note that the regularity of such architecture allows
an easy scalability for any value of L.
Figure 3. Parallel polyphase decomposition for L=4
3. RESULTS AND DISCUSSION
The architecture of Figure 2 has been coded in VHDL and implemented on a XILINX XCVU9P-
L2FLGA2104E FPGA [26]. After the hardware implementation, the digital delay has been tested injecting
sinusoids at the input. Keeping a 16-bit resolution in the entire data-path (also inputs are represented with 16
bits) we obtain the following results:
a. Magnitude ripple < 0.2 dB.
b. Phase ripple < 2°.
c. Minimum delay 10 ps.
d. Maximum clock frequency (500 MHz)
The Farrow filter is composed by M=5 subfilters with a length of N=11. The maximum clock
frequency of 500 MHz allows the reaching of 2 GSPS using 4 cores. In Figure 4 the magnitude and the phase
error response in function of the delay are provided. Figure 5 shows examples of delay in the time domain.
Table 1. Resources utilization
Resources Utilization Available Utilization%
LUT 4,449 1,182,240 0.38%
LUT RAM 784 591,840 0.13%
FF 10,735 2,364,480 0.45%
DSP 80 6,840 1.17%
 ISSN: 2302-9285
Bulletin of Electr Eng and Inf, Vol. 8, No. 2, June 2019 : 422 – 427
426
Figure 4. Bode diagrams of the proposed digital delay
Figure 5. Time response with delay 31.25 ps and 62.5 ps
4. CONCLUSION
In this paper, a digital delay for beamforming has been presented. The proposed digital delay is
based on a parallel polyphase decomposition that can be easily implemented on FPGA or ASIC. Thanks to its
regularity, this parallel polyphase decomposition can be easily generalized for any value of L. The introduced
digital delay allows FPGAs to process wide-band signals without any decimation and consequently without
any bandwidth reduction. We have shown an implementation’s example able to process a 2 GSPS signal
using a TI-ADC with 4 cores and a 500 MHz clock frequency. The signals can be delayed of small quantities
up to 10ps with Magnitude Ripple and Frequency Ripple respectively less than 0.2 dB and 2°.
REFERENCES
[1] Steyskal, Hans. “Digital Beamforming Antennas: An Introduction”. Microwave journal, 1987; 30 (1): pp. 10p
between p 107 and 124
[2] R. Mucci, "A comparison of efficient beamforming algorithms," in IEEE Transactions on Acoustics, Speech, and
Signal Processing, vol. 32, no. 3, pp. 548-558, June 1984.
[3] Malek, N.A., Khalifa, O.O., Abidin, Z.Z., Mohamad, S.Y., Abdul Rahman, N.A. Beam steering using the active
element pattern of antenna array. Telkomnika (Telecommunication Computing Electronics and Control), 2018;
16(4): 1542-1550.
[4] M. Younis, C. Fischer and W. Wiesbeck, "Digital beamforming in SAR systems," in IEEE Transactions on
Geoscience and Remote Sensing, vol. 41, no. 7, pp. 1735-1739, July 2003.
[5] D. E. Dudgeon, "Fundamentals of digital array processing," in Proceedings of the IEEE, vol. 65, no. 6, pp. 898-904,
June 1977.
Bulletin of Electr Eng and Inf ISSN: 2302-9285 
Efficient FPGA implementation of high speed digital delay for wideband… (Gian Carlo Cardarilli)
427
[6] Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Rufolo G., Bernocchi G. Analog chain calibration in Digital
Beam-Forming applications. ARPN Journal of Engineering and Applied Sciences, 2018 13 (2), pp. 752-760.
[7] C. Cheung, R. Shah and M. Parker, "Time delay digital beamforming for wideband pulsed radar
implementation," 2013 IEEE International Symposium on Phased Array Systems and Technology, Waltham, MA,
2013, pp. 448-455.
[8] C. K. Chu and Yee-Hong Leung, "Further results on the WLS design of variable fractional delay filters," 2012 6th
International Conference on Signal Processing and Communication Systems, Gold Coast, QLD, 2012, pp. 1-7.
[9] T. Deng, "Symmetric Structures for Odd-Order Maximally Flat and Weighted-Least-Squares Variable Fractional-
Delay Filters," in IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 54, no. 12, pp. 2718-2732,
Dec. 2007.
[10] Wu-Sheng Lu and Tian-Bo Deng, "An improved weighted least-squares design for variable fractional delay FIR
filters," in IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol. 46, no. 8, pp.
1035-1040, Aug. 1999.
[11] Chien-Cheng Tseng, "Design of variable fractional delay allpass filter using weighted least squares method," 2002
IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353), Phoenix-Scottsdale,
AZ, USA, 2002, pp. V-V.
[12] S. Tahir, M. Elnamaky, M. A. Ashraf and K. Jamil, "Hardware implementation of digital beamforming network for
Ultra Wide band signals using uniform linear arrays," The 2nd Middle East Conference on Antennas and
Propagation, Cairo, 2012, pp. 1-4.
[13] Cardarilli G.C., Giardino D., Matta M., Re M., Silvestri F., Simone L., Spanó S. “Comparison and Implementation
of Variable Fractional Delay Filters for Wideband Digital Beamforming”. Lecture Notes in Electrical Engineering,
Article In Press 2019
[14] F. Harris, Xiaofei Chen, E. Venosa and F. A. N. Palmieri, "Two channel TI-ADC for communication signals," 2011
IEEE 12th International Workshop on Signal Processing Advances in Wireless Communications, San Francisco,
CA, 2011, pp. 576-580.
[15] G. Manganaro and D. Robertson Interleaving ADCs: Unraveling the Mysteries https://www.analog.com/en/analog-
dialogue/articles/interleaving-adcs
[16] Interleaving ADCs for Higher Sample Rates Literature Number: SNAA111
http://www.ti.com/lit/wp/snaa111/snaa111.pdf
[17] Cappello S., Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Albicocco P. “Flexible channel extractor for
wideband systems based on polyphase filter bank”. Journal of Theoretical and Applied Information Technology,
2017; 95(16): 3841-3850.
[18] Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Nannarelli A. “Power Efficient Digital Front-End for Cognitive
Radio Systems”. Conference Record of the IEEE Asilomar Conference on Signals, Systems and Computers, 2018
[19] V. Valimaki and T. I. Laakso, "Principles of fractional delay filters," 2000 IEEE International Conference on
Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), Istanbul, Turkey, 2000, pp. 3870-
3873 vol.6.
[20] Laakso T.I., Välimäki V., Karjalainen M., Laine U.K.:”Splitting the unit: Delay tools for fractional delay filter
design”. IEEE Signal Processing Magazine, 1996; 13(1), pp. 30-60.
[21] C. K. S. Pun, Y. C. Wu, S. C. Chan and K. L. Ho, "An efficient design or fractional-delay digital FIR filters using
the Farrow structure," Proceedings of the 11th IEEE Signal Processing Workshop on Statistical Signal Processing
(Cat. No.01TH8563), Singapore, 2001, pp. 595-598.
[22] P. P. Vaidyanathan, "Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial,"
in Proceedings of the IEEE, vol. 78, no. 1, pp. 56-93, Jan. 1990.
[23] Cardarilli, G.C., Di Nunzio, L., Fazzolari, R., Fereidountabar, A., Giuliani, F., Re, M., Simone, L. “Comparison of
jamming excision methods for direct sequence/spread spectrum (DS/SS) modulated signal”, Journal of Theoretical
and Applied Information Technology, 2017; 95(13): 2878-2888.
[24] Cardarilli, G.C., Di Nunzio, L., Fazzolari, R., Re, M., Silvestri, F., Spanò, S. “Energy consumption saving in
embedded microprocessors using hardware accelerators”. TELKOMNIKA (Telecommunication, Computing,
Electronics and Control). (2018); 16(3): 1019-1026.
[25] Mitra S.K., and Yonghong K. Digital signal processing: a computer-based approach. Vol. 2. New York: McGraw-
Hill, 2006.
[26] UltraScale Architecture Staying a Generation Ahead with an Extra Node of Value.
https://www.xilinx.com/products/technology/ultrascale.html

More Related Content

PDF
Design and Implementation of an Embedded System for Software Defined Radio
PDF
IRJET- Implementation of Reversible Radix-2 FFT VLSI Architecture using P...
PDF
20120130406009 2
PDF
PDF
IRJET- Synchronization Scheme of MIMO-OFDM using Monte Carlo Method
PDF
Survey of Optimization of FFT processor for OFDM Receivers
PDF
Design of Multiplier Less 32 Tap FIR Filter using VHDL
PDF
Implementationof Reconfigurable Architecture For Generic OFDM Modulator (Hybr...
Design and Implementation of an Embedded System for Software Defined Radio
IRJET- Implementation of Reversible Radix-2 FFT VLSI Architecture using P...
20120130406009 2
IRJET- Synchronization Scheme of MIMO-OFDM using Monte Carlo Method
Survey of Optimization of FFT processor for OFDM Receivers
Design of Multiplier Less 32 Tap FIR Filter using VHDL
Implementationof Reconfigurable Architecture For Generic OFDM Modulator (Hybr...

What's hot (18)

PDF
Capsulization of Existing Space Time Techniques
PDF
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
PDF
PERFORMANCE ANALYSIS OF QOS PARAMETERS LIKE PSNR, MAE & RMSE USED IN IMAGE TR...
PDF
Analysis of different FIR Filter Design Method in terms of Resource Utilizati...
PDF
DESIGN AND IMPLEMENTATION OF COMBINED PIPELINING AND PARALLEL PROCESSING ARCH...
PDF
Performance evaluation on the basis of bit error rate for different order of ...
PDF
Channel Coding and Clipping in OFDM for WiMAX using SDR
PDF
Performance Analysis of MIMO-OFDM System Using QOSTBC Code Structure for M-PSK
PDF
Simulation of ofdm modulation adapted to the transmission of a fixed image
PDF
FPGA Implementation of Efficient Viterbi Decoder for Multi-Carrier Systems
PDF
MartinDickThesis
PDF
Reduction of Outage Probability in Fast Rayleigh Fading MIMO Channels Using OFDM
PDF
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
PDF
Pilot induced cyclostationarity based method for dvb system identification
PDF
Study of timing synchronization in mimoofdm systems using dvb t
PPTX
HIGH-SPEED LOW-POWER VITERBI DECODER DESIGN FOR TCM DECODERS
PDF
Performance analysis of DWT based OFDM over FFT based OFDM and implementing o...
Capsulization of Existing Space Time Techniques
IRJET- VLSI Architecture for Reversible Radix-2 FFT Algorithm using Programma...
PERFORMANCE ANALYSIS OF QOS PARAMETERS LIKE PSNR, MAE & RMSE USED IN IMAGE TR...
Analysis of different FIR Filter Design Method in terms of Resource Utilizati...
DESIGN AND IMPLEMENTATION OF COMBINED PIPELINING AND PARALLEL PROCESSING ARCH...
Performance evaluation on the basis of bit error rate for different order of ...
Channel Coding and Clipping in OFDM for WiMAX using SDR
Performance Analysis of MIMO-OFDM System Using QOSTBC Code Structure for M-PSK
Simulation of ofdm modulation adapted to the transmission of a fixed image
FPGA Implementation of Efficient Viterbi Decoder for Multi-Carrier Systems
MartinDickThesis
Reduction of Outage Probability in Fast Rayleigh Fading MIMO Channels Using OFDM
FPGA Implementation of FIR Filter using Various Algorithms: A Retrospective
Pilot induced cyclostationarity based method for dvb system identification
Study of timing synchronization in mimoofdm systems using dvb t
HIGH-SPEED LOW-POWER VITERBI DECODER DESIGN FOR TCM DECODERS
Performance analysis of DWT based OFDM over FFT based OFDM and implementing o...
Ad

Similar to Efficient FPGA implementation of high speed digital delay for wideband beamforming using parallel architectures (20)

PDF
A Novel Architecture for Different DSP Applications Using Field Programmable ...
PDF
FPGA Implementation of High Speed FIR Filters and less power consumption stru...
PDF
International Journal of Engineering Research and Development (IJERD)
PDF
Digital Implementation of Costas Loop with Carrier Recovery
PDF
First order sigma delta modulator with low-power consumption implemented in a...
PDF
First order sigma delta modulator with low-power
PDF
A Simulation of Wideband CDMA System on Digital Up/Down Converters
PDF
Design of Optimized FIR Filter Using FCSD Representation
PDF
Optimized implementation of an innovative digital audio equalizer
PDF
Design Radix-4 64-Point Pipeline FFT/IFFT Processor for Wireless Application
PDF
F0213137
PDF
RTB: BIDIRECTIONAL TRANSCEIVER (ESSCIRC85)
PDF
FPGA Implementation of Optimized CIC Filter for Sample Rate Conversion in Sof...
PDF
IRJET- FPGA based Processor for Feature Detection in Ultra-Wide Band Radar
PDF
DESIGN AND IMPLEMENTATION OF COMBINED PIPELINING AND PARALLEL PROCESSING ARCH...
PDF
Design And Implementation of Combined Pipelining and Parallel Processing Arch...
PDF
DESIGN AND IMPLEMENTATION OF COMBINED PIPELINING AND PARALLEL PROCESSING ARCH...
PDF
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
PDF
Design of Low Power Sigma Delta ADC
A Novel Architecture for Different DSP Applications Using Field Programmable ...
FPGA Implementation of High Speed FIR Filters and less power consumption stru...
International Journal of Engineering Research and Development (IJERD)
Digital Implementation of Costas Loop with Carrier Recovery
First order sigma delta modulator with low-power consumption implemented in a...
First order sigma delta modulator with low-power
A Simulation of Wideband CDMA System on Digital Up/Down Converters
Design of Optimized FIR Filter Using FCSD Representation
Optimized implementation of an innovative digital audio equalizer
Design Radix-4 64-Point Pipeline FFT/IFFT Processor for Wireless Application
F0213137
RTB: BIDIRECTIONAL TRANSCEIVER (ESSCIRC85)
FPGA Implementation of Optimized CIC Filter for Sample Rate Conversion in Sof...
IRJET- FPGA based Processor for Feature Detection in Ultra-Wide Band Radar
DESIGN AND IMPLEMENTATION OF COMBINED PIPELINING AND PARALLEL PROCESSING ARCH...
Design And Implementation of Combined Pipelining and Parallel Processing Arch...
DESIGN AND IMPLEMENTATION OF COMBINED PIPELINING AND PARALLEL PROCESSING ARCH...
Performance Analysis of OFDM Transceiver with Folded FFT and LMS Filter
Design of Low Power Sigma Delta ADC
Ad

More from journalBEEI (20)

PDF
Square transposition: an approach to the transposition process in block cipher
PDF
Hyper-parameter optimization of convolutional neural network based on particl...
PDF
Supervised machine learning based liver disease prediction approach with LASS...
PDF
A secure and energy saving protocol for wireless sensor networks
PDF
Plant leaf identification system using convolutional neural network
PDF
Customized moodle-based learning management system for socially disadvantaged...
PDF
Understanding the role of individual learner in adaptive and personalized e-l...
PDF
Prototype mobile contactless transaction system in traditional markets to sup...
PDF
Wireless HART stack using multiprocessor technique with laxity algorithm
PDF
Implementation of double-layer loaded on octagon microstrip yagi antenna
PDF
The calculation of the field of an antenna located near the human head
PDF
Exact secure outage probability performance of uplinkdownlink multiple access...
PDF
Design of a dual-band antenna for energy harvesting application
PDF
Transforming data-centric eXtensible markup language into relational database...
PDF
Key performance requirement of future next wireless networks (6G)
PDF
Noise resistance territorial intensity-based optical flow using inverse confi...
PDF
Modeling climate phenomenon with software grids analysis and display system i...
PDF
An approach of re-organizing input dataset to enhance the quality of emotion ...
PDF
Parking detection system using background subtraction and HSV color segmentation
PDF
Quality of service performances of video and voice transmission in universal ...
Square transposition: an approach to the transposition process in block cipher
Hyper-parameter optimization of convolutional neural network based on particl...
Supervised machine learning based liver disease prediction approach with LASS...
A secure and energy saving protocol for wireless sensor networks
Plant leaf identification system using convolutional neural network
Customized moodle-based learning management system for socially disadvantaged...
Understanding the role of individual learner in adaptive and personalized e-l...
Prototype mobile contactless transaction system in traditional markets to sup...
Wireless HART stack using multiprocessor technique with laxity algorithm
Implementation of double-layer loaded on octagon microstrip yagi antenna
The calculation of the field of an antenna located near the human head
Exact secure outage probability performance of uplinkdownlink multiple access...
Design of a dual-band antenna for energy harvesting application
Transforming data-centric eXtensible markup language into relational database...
Key performance requirement of future next wireless networks (6G)
Noise resistance territorial intensity-based optical flow using inverse confi...
Modeling climate phenomenon with software grids analysis and display system i...
An approach of re-organizing input dataset to enhance the quality of emotion ...
Parking detection system using background subtraction and HSV color segmentation
Quality of service performances of video and voice transmission in universal ...

Recently uploaded (20)

PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
composite construction of structures.pdf
PPTX
Geodesy 1.pptx...............................................
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPT
Project quality management in manufacturing
PDF
PPT on Performance Review to get promotions
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
DOCX
573137875-Attendance-Management-System-original
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
Digital Logic Computer Design lecture notes
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
R24 SURVEYING LAB MANUAL for civil enggi
composite construction of structures.pdf
Geodesy 1.pptx...............................................
Operating System & Kernel Study Guide-1 - converted.pdf
Embodied AI: Ushering in the Next Era of Intelligent Systems
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Project quality management in manufacturing
PPT on Performance Review to get promotions
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
573137875-Attendance-Management-System-original
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
CH1 Production IntroductoryConcepts.pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Model Code of Practice - Construction Work - 21102022 .pdf
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Internet of Things (IOT) - A guide to understanding
Digital Logic Computer Design lecture notes

Efficient FPGA implementation of high speed digital delay for wideband beamforming using parallel architectures

  • 1. Bulletin of Electrical Engineering and Informatics Vol. 8, No. 2, June 2019, pp. 422~427 ISSN: 2302-9285, DOI: 10.11591/eei.v8i2.1483  422 Journal homepage: http://beei.org/index.php/EEI Efficient FPGA implementation of high speed digital delay for wideband beamforming using parallel architectures Gian Carlo Cardarilli1 , Luca Di Nunzio2 , Rocco Fazzolari3 , Daniele Giardino4 , Marco Matta5 , Marco Re6 , Sergio Spanò7 , Lorenzo Simone8 1,2,3,4,5,6,7 University of Rome Tor Vergata, Via del Politecnico 1, 00133 Roma, Italy 8 Thales Alenia Space Roma, Via Saccomuro 24-00131 Roma, Italy Article Info ABSTRACT Article history: Received Jan 23, 2019 Revised Feb 2, 2019 Accepted Feb 25, 2019 In this paper, the authors present an FPGA implementation of a digital delay for beamforming applications. The digital delay is based on a Parallel Farrow Filter. Such architecture allows to reach a very high processing rate with wideband signals and it is suitable to be used with Time-Interleaved Analog to Digital Converters (TI-ADC). The proposed delay has been simulated in MATLAB, implemented on FPGA and characterized in terms of amplitude and phase response, maximum clock frequency and area. Keywords: Farrow filter TI-ADC Variable fractional delay Wideband digital beamfoming Copyright © 2019 Institute of Advanced Engineering and Science. All rights reserved. Corresponding Author: Luca Di Nunzio, University of Rome Tor Vergata Via del Politecnico 1, 00133 Roma Italy Email: di.nunzio@ing.uniroma2.it 1. INTRODUCTION The beamforming technique [1-5] is based on the combination of M different signals coming from M antennas. Such combination is obtained by delaying and summing the signals in order to produce additive interferences in some directions and destructive interferences in others. In this application, the delay blocks represent a crucial element. When the beamforming involves narrowband signals, the delays can be realized with simple phase shifters, implemented with complex multipliers. Vice versa in case of wide-band signals, delay blocks must be implemented using more complex circuits [6-7]. A common solution consists in the use of fractional delay filters. These filters are able to generate delays, which are a fraction of the system clock cycle. In wideband applications, as for example wideband beamforming, the filters must comply with some specifications, namely: a. Large bandwidth (ideally Nyquist frequency) b. Reduced Magnitude ripple c. Reduced Phase ripple d. Ideally constant Group delay These requirements can be easily met using the Weighted Least-Squares (WLS) method implemented with the Farrow architecture [8-13]. However, as the processing rate increases, e.g. more than 1 GHz, the hardware implementation could present some complications, for example the high sample rate that makes impossible the use of FPGAs. The reason is the impossibility of FPGAs to reach processing rates beyond the GHz. In fact, although modern Time-Interleaved ADCs [14-16] (TI-ADC) are able to provide wide-band signals with sample rates over the GHz, FPGAs are not able to process such signals without decimation and, consequently, without reducing the bandwidth of signals involved in processing [17, 18].
  • 2. Bulletin of Electr Eng and Inf ISSN: 2302-9285  Efficient FPGA implementation of high speed digital delay for wideband… (Gian Carlo Cardarilli) 423 In this paper, the authors present an FPGA based digital delay for wideband digital beamforming. The proposed solution is able to reach preocessing rates compatible with actual TI-ADC. Such digital delay is based on Parallel FIR Filters which are used to compose a Parallel Farrow Filter. 2. RESEARCH METHOD Farrow filters [19-21] represent the most common solution for the implementation of fractional delay filters. They are widely discussed in the literature and a detailed analysis is provided in [20]. In Figure 1 the block diagram of a Farrow filter is provided. The filter is composed of delay blocks, adders, multipliers (used for the selection of the delay entity) and M subfilters. Subfilters are represented in the figure with blocks named Ci(z) with 0≤I≤M-1. As discussed in [3, 4], modern TI-ADCs for high speed/wide-band applications are usually composed of 2 or 4 ADC cores. Each core provides the output on a separate bus. All the cores work in parallel and the total sample rate of the ADC is the sum of the sample rates of the single cores. In other words, the total sample rate is L∙fsa where L is the number of cores and fsa is the sample rate of each core. The idea of the proposed digital delay is to parallelize the Farrow architecture in order to process the incoming parallel data from the TI-ADC cores as shown in Figure 2. This is possible by parallelizing the sub-filters that compose the Farrow filter. The parallel architecture is based on polyphase filters banks [22, 23]. This technique allows to reduce the operating frequency by parallelizing the filters and, as consequence, could be used to reduce the system power consumption by acting on power supply [24]. To achieve such parallelization, we introduce a new polyphase architecture that we call Parallel- Polyphase. This architecture, differently from traditional polyphase architectures, is able to process data without any decimation and consequently without any bandwidth reduction. Figure 1. Farrow filter architecture Let's consider the impulse response ci[n] of a generic sub-filter Ci composing the Farrow filter. In order to implement the architecture shown in Figure 2, each sub-filter must be parallelized by a factor equal to the number of TI-ADC available cores (Figure 2 shows the case of a TI-ADC composed by 4 cores). In the following, we get the equations for a generic parallel filter having L inputs and L outputs, where L is the number of the TI-ADC cores. We transform the discrete convolution of a classic SISO (Single Input, Single Output) FIR filter into a parallel convolution to model a MIMO (Multi-Input, Multi-Output) FIR Filter. A FIR filer is described by the discrete convolution: , - ∑ , - , - (1) where N is the length of the filter output and c[n] is the impulse response of Ci(z).
  • 3.  ISSN: 2302-9285 Bulletin of Electr Eng and Inf, Vol. 8, No. 2, June 2019 : 422 – 427 424 Figure 2. Parallel farrow filter with L=4 Applying a factor L polyphase decomposition [25] to the output y[n], we split the output in L branches yk[n] with k= 0, 1, 2 to L-1: , - , - ∑ , - , - (2) Note that each output yk[n] depends on every sample of the input sequence x[n]. Because TI-ADCs provide the inputs in parallel (each core has an independent bus), input x[n] in the equation must be also parallelized. This is possible using a change of variables: i=Lm+l with 0 ≤ l ≤ L-1 and 0 ≤ m ≤ M-1 where M is the length of the subfilters obtained by the polyphase decomposition of c[n] [6]. The new equation of the model is shown in (3). , - ∑ (∑ , ( ) - , -) (3) (3) describes the L outputs yk[n] in function of the L inputs. In terms of Z transform we have: ( ) ∑ ( ) ( ) (4) where * , ( ) -+ ( ) are the L parallel inputs provided by the TI-ADC and * , -+ ( ) is the subfilter of c[n]. The term may have a negative subscript. For this reason, we consider β=k-l and we rewrite Xk-l with subscript between 0 and L-1: ( ) * , -+ * , ( )- ( )+ ( ) (5) Consequently: ( ) ∑ ( ) ( ) ∑ ( )( ) ( ) ( ) ∑ ( ) ( ) (6)
  • 4. Bulletin of Electr Eng and Inf ISSN: 2302-9285  Efficient FPGA implementation of high speed digital delay for wideband… (Gian Carlo Cardarilli) 425 For example, choosing L=4, a generic filter Ci(z) can be parallelized as shown in Figure 3. The polyphase sub-filters Hi(zL ) are computed as shown in [6]. Note that the regularity of such architecture allows an easy scalability for any value of L. Figure 3. Parallel polyphase decomposition for L=4 3. RESULTS AND DISCUSSION The architecture of Figure 2 has been coded in VHDL and implemented on a XILINX XCVU9P- L2FLGA2104E FPGA [26]. After the hardware implementation, the digital delay has been tested injecting sinusoids at the input. Keeping a 16-bit resolution in the entire data-path (also inputs are represented with 16 bits) we obtain the following results: a. Magnitude ripple < 0.2 dB. b. Phase ripple < 2°. c. Minimum delay 10 ps. d. Maximum clock frequency (500 MHz) The Farrow filter is composed by M=5 subfilters with a length of N=11. The maximum clock frequency of 500 MHz allows the reaching of 2 GSPS using 4 cores. In Figure 4 the magnitude and the phase error response in function of the delay are provided. Figure 5 shows examples of delay in the time domain. Table 1. Resources utilization Resources Utilization Available Utilization% LUT 4,449 1,182,240 0.38% LUT RAM 784 591,840 0.13% FF 10,735 2,364,480 0.45% DSP 80 6,840 1.17%
  • 5.  ISSN: 2302-9285 Bulletin of Electr Eng and Inf, Vol. 8, No. 2, June 2019 : 422 – 427 426 Figure 4. Bode diagrams of the proposed digital delay Figure 5. Time response with delay 31.25 ps and 62.5 ps 4. CONCLUSION In this paper, a digital delay for beamforming has been presented. The proposed digital delay is based on a parallel polyphase decomposition that can be easily implemented on FPGA or ASIC. Thanks to its regularity, this parallel polyphase decomposition can be easily generalized for any value of L. The introduced digital delay allows FPGAs to process wide-band signals without any decimation and consequently without any bandwidth reduction. We have shown an implementation’s example able to process a 2 GSPS signal using a TI-ADC with 4 cores and a 500 MHz clock frequency. The signals can be delayed of small quantities up to 10ps with Magnitude Ripple and Frequency Ripple respectively less than 0.2 dB and 2°. REFERENCES [1] Steyskal, Hans. “Digital Beamforming Antennas: An Introduction”. Microwave journal, 1987; 30 (1): pp. 10p between p 107 and 124 [2] R. Mucci, "A comparison of efficient beamforming algorithms," in IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 32, no. 3, pp. 548-558, June 1984. [3] Malek, N.A., Khalifa, O.O., Abidin, Z.Z., Mohamad, S.Y., Abdul Rahman, N.A. Beam steering using the active element pattern of antenna array. Telkomnika (Telecommunication Computing Electronics and Control), 2018; 16(4): 1542-1550. [4] M. Younis, C. Fischer and W. Wiesbeck, "Digital beamforming in SAR systems," in IEEE Transactions on Geoscience and Remote Sensing, vol. 41, no. 7, pp. 1735-1739, July 2003. [5] D. E. Dudgeon, "Fundamentals of digital array processing," in Proceedings of the IEEE, vol. 65, no. 6, pp. 898-904, June 1977.
  • 6. Bulletin of Electr Eng and Inf ISSN: 2302-9285  Efficient FPGA implementation of high speed digital delay for wideband… (Gian Carlo Cardarilli) 427 [6] Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Rufolo G., Bernocchi G. Analog chain calibration in Digital Beam-Forming applications. ARPN Journal of Engineering and Applied Sciences, 2018 13 (2), pp. 752-760. [7] C. Cheung, R. Shah and M. Parker, "Time delay digital beamforming for wideband pulsed radar implementation," 2013 IEEE International Symposium on Phased Array Systems and Technology, Waltham, MA, 2013, pp. 448-455. [8] C. K. Chu and Yee-Hong Leung, "Further results on the WLS design of variable fractional delay filters," 2012 6th International Conference on Signal Processing and Communication Systems, Gold Coast, QLD, 2012, pp. 1-7. [9] T. Deng, "Symmetric Structures for Odd-Order Maximally Flat and Weighted-Least-Squares Variable Fractional- Delay Filters," in IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 54, no. 12, pp. 2718-2732, Dec. 2007. [10] Wu-Sheng Lu and Tian-Bo Deng, "An improved weighted least-squares design for variable fractional delay FIR filters," in IEEE Transactions on Circuits and Systems II: Analog and Digital Signal Processing, vol. 46, no. 8, pp. 1035-1040, Aug. 1999. [11] Chien-Cheng Tseng, "Design of variable fractional delay allpass filter using weighted least squares method," 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353), Phoenix-Scottsdale, AZ, USA, 2002, pp. V-V. [12] S. Tahir, M. Elnamaky, M. A. Ashraf and K. Jamil, "Hardware implementation of digital beamforming network for Ultra Wide band signals using uniform linear arrays," The 2nd Middle East Conference on Antennas and Propagation, Cairo, 2012, pp. 1-4. [13] Cardarilli G.C., Giardino D., Matta M., Re M., Silvestri F., Simone L., Spanó S. “Comparison and Implementation of Variable Fractional Delay Filters for Wideband Digital Beamforming”. Lecture Notes in Electrical Engineering, Article In Press 2019 [14] F. Harris, Xiaofei Chen, E. Venosa and F. A. N. Palmieri, "Two channel TI-ADC for communication signals," 2011 IEEE 12th International Workshop on Signal Processing Advances in Wireless Communications, San Francisco, CA, 2011, pp. 576-580. [15] G. Manganaro and D. Robertson Interleaving ADCs: Unraveling the Mysteries https://www.analog.com/en/analog- dialogue/articles/interleaving-adcs [16] Interleaving ADCs for Higher Sample Rates Literature Number: SNAA111 http://www.ti.com/lit/wp/snaa111/snaa111.pdf [17] Cappello S., Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Albicocco P. “Flexible channel extractor for wideband systems based on polyphase filter bank”. Journal of Theoretical and Applied Information Technology, 2017; 95(16): 3841-3850. [18] Cardarilli G.C., Di Nunzio L., Fazzolari R., Re M., Nannarelli A. “Power Efficient Digital Front-End for Cognitive Radio Systems”. Conference Record of the IEEE Asilomar Conference on Signals, Systems and Computers, 2018 [19] V. Valimaki and T. I. Laakso, "Principles of fractional delay filters," 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), Istanbul, Turkey, 2000, pp. 3870- 3873 vol.6. [20] Laakso T.I., Välimäki V., Karjalainen M., Laine U.K.:”Splitting the unit: Delay tools for fractional delay filter design”. IEEE Signal Processing Magazine, 1996; 13(1), pp. 30-60. [21] C. K. S. Pun, Y. C. Wu, S. C. Chan and K. L. Ho, "An efficient design or fractional-delay digital FIR filters using the Farrow structure," Proceedings of the 11th IEEE Signal Processing Workshop on Statistical Signal Processing (Cat. No.01TH8563), Singapore, 2001, pp. 595-598. [22] P. P. Vaidyanathan, "Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial," in Proceedings of the IEEE, vol. 78, no. 1, pp. 56-93, Jan. 1990. [23] Cardarilli, G.C., Di Nunzio, L., Fazzolari, R., Fereidountabar, A., Giuliani, F., Re, M., Simone, L. “Comparison of jamming excision methods for direct sequence/spread spectrum (DS/SS) modulated signal”, Journal of Theoretical and Applied Information Technology, 2017; 95(13): 2878-2888. [24] Cardarilli, G.C., Di Nunzio, L., Fazzolari, R., Re, M., Silvestri, F., Spanò, S. “Energy consumption saving in embedded microprocessors using hardware accelerators”. TELKOMNIKA (Telecommunication, Computing, Electronics and Control). (2018); 16(3): 1019-1026. [25] Mitra S.K., and Yonghong K. Digital signal processing: a computer-based approach. Vol. 2. New York: McGraw- Hill, 2006. [26] UltraScale Architecture Staying a Generation Ahead with an Extra Node of Value. https://www.xilinx.com/products/technology/ultrascale.html