High-Speed Equalizer Design in 28-nm CMOS
High-Speed Equalizer Design in 28-nm CMOS
Behzad Razavi
E
Equalizers are widely used in broad-
band wireline systems. At high data
rates, the imperfections of the medium
through which the signal travels (the
“channel”) become more critical, mak-
therefore assume certain character-
istics for the channel. We employ a
“scalable” model that reflects the
physical loss mechanisms in cop-
per media [1]. Shown in Figure 1(a)
those computed by electromagnetic
field simulations of a 50-Ω channel.
Cascading 12 such sections yields
the loss profile depicted in Fig-
ure 1(b) and a value of 21 dB at
ing equalization an essential function is one section of the model, with 28 GHz. Note that the line is driven
in receivers (Rxs). its horizontal and vertical branches by a 50-Ω source and is terminated
In this two-part article, we study the representing frequency-dependent with a 50-Ω load.
transistor-level design of a high-speed copper and dielectric losses, respec- In addition to loss, copper media
equalizer in 28-nm CMOS technology. tively [1]. The component values can also suffer from impedance dis-
The first part describes channel mod- are obtained by fitting the circuit’s continuities. For example, a connec-
eling and linear equalizer design. The magnitude and phase responses to tor attaching a cable to a line card
second part deals with decision-feed-
back equalizers (DFEs). We target the
470 pH
following performance: 77.3 pH
■■ Data format: nonreturn to zero
■■ Power consumption = 10 mW
(a)
■■ V DD = 1 V.
0
It is customary to specify the chan-
nel loss at the “Nyquist frequency,” –5
fNyq, i.e., the frequency equal to half
Channel Response (dB)
Digital Object Identifier 10.1109/MSSC.2021.3111426 FIGURE 1: (a) One section of a scalable channel model and (b) the loss profile for 12
Date of current version: 17 November 2021 cascaded sections.
Channel Din
Continuous-Time Decision−Feedback
Dout
Linear Equalizer Equalizer
CK
(a)
f f fNyq f
(b)
FIGURE 2: (a) A typical equalizer architecture and (b) an illustration of its overall frequency response.
0.9 0.85
0.85
0.8
0.8
0.75
0.75
Voltage (V)
Voltage (V)
0.7 0.7
0.65
0.65
0.6
0.6
0.55
FIGURE 3: (a) The input and (b) output waveforms of a lossy channel.
Authorized licensed use limited to: UCLA Library. Downloaded on November 30,2021 at 21:06:34 UTC from IEEE Xplore. Restrictions apply.
pair, as shown in Figure 4(a). It can a greater C1 and hence a lower ~ p ity, however, the situation is more com-
be readily proved that the circuit’s reduce the frequency at which the plex. With the low available boost factor
transfer function, H (s) = Vout /Vin, is peak occurs. These points encour- per stage, the Bode plot of | H (j~) | in
given by age us to apply inductive peaking Figure 4(b) proves inaccurate because
so as to approach the first case. ~ z and ~ p are separated by a factor
- g m R D (R S C S s + 1)
H (s) = , (1) To raise the boost factor, we can of only 2 to 3. The actual behavior is
R S C S s + 1 + g m R S /2
cascade multiple CTLE stages, but depicted in Figure 6 and can be quan-
where gm denotes the transconduc- at the cost of reduced small-signal tified as follows. We express | H (j~) | 2
tance of M 1 and M 2 and channel- bandwidth and greater power con- from (1) as
length modulation is neglected. The sumption. If each stage’s bandwidth 2 2
stage exhibits a zero and a pole, limitation is approximated by a single | H (j~) | 2 = K 2 ~ 2 + ~ 2z , (5)
~ + ~p
respectively: pole at 2rf0, then n identical stages
yield a total bandwidth of where K = g m R D, and evaluate it at
| ~z | = 1 (2) ~ = ~ p = (1 + g m R S /2) ~ z:
RS CS
BW tot = 2 1/n - 1 f0 .(4)
1 + g m R S /2 2
1
| ~p | = , (3) | H (j~ p) | 2 = K 1 +
2 > gm RS 2 H
RS CS .
For example, two stages lower the (6) c1 + m
2
and the magnitude of its response bandwidth by 35%. These constraints
varies from g m R D / (1 + g m R S /2) at imply that it is difficult to use more
low frequencies to g m R D at high than two CTLE stages, and that the That is,
frequencies [Figure 4(b)]. The cir- DFE in Figure 2(a) must shoulder the
cuit thus provides a boost factor of remainder of the equalization. | H (j~ p) | = K 1+ 1 .
c1 + m
2 gm RS 2
1 + g m R S /2. Note that the zero and (7) 2
pole frequencies are separated by Pole-Placement Considerations
the same factor. In the CTLE stage studied previously, If the boost factor, 1 + g m R S /2, ranges
The CTLE stage of Figure 4(a) merits we may naturally conclude that the from 2 to 3, then | H (j~ p) | falls between
two remarks. First, the boost factor trades degeneration pole, ~ p , should be placed 0.79K and 0.74K, i.e., roughly 2 to 2.5 dB
with the low-frequency gain (also called around the Nyquist frequency. In real- below the maximum value. The key
the dc gain), g m R D / (1 + g m R S /2) . We
wish to maintain this gain around
unity so that the received data swings VDD
are not attenuated. With low supply RD RD
|H(jω)|
voltages and channel-length modu-
lation, the boost factor typically Vout
gmRD
does not exceed 6 dB. Second, we Vin M1 M2
surmise that ~ p should be placed
roughly around the Nyquist frequency, RS gmRD
but as indicated by the red plot in
gmRS
Figure 4(b), the limited bandwidth 1+
CS 2
at the output makes it difficult to do ωz ωp ω
so. For this reason, high-speed CTLEs
often employ inductive peaking. (a) (b)
The output pole, ~ 0 , in Figure 4(a)
results from the load resistance FIGURE 4: (a) A basic CTLE stage and (b) its approximate response.
and capacitance, presenting sig-
nificant challenges in high-speed
CTLE design. The relative magni- |H(jω)| |H(jω)|
tudes of this pole and that due to
gmRD gmRD
source degeneration lead to differ-
ent responses and dependencies CS CS
gmRD gmRD
upon CS . As shown in Figure 5(a),
if ~ 0 2 ~ p, then increasing CS sim- gmRS gmRS
1+ 1+
ply shifts the high-pass response 2 2
ωz ω p ω0 ω ωz ω0 ω p ω
to the left. On the other hand, if
~ 0 1 ~ p [Figure 5(b)], then two effects (a) (b)
emerge. First, | H | fails to reach its
maximum value of g m R D . Second, FIGURE 5: The CTLE response for (a) ~ 0 2 ~ p and (b) ~ 0 1 ~ p .
6
5 LD = 0
VDD LD = 600 pH
4
LD LD
Magnitude (dB)
3
400 Ω 400 Ω
2
1
Vin M1 M2
0
400 Ω W 10 µm
= –1
L 30 nm
–2
1.25 mA 150 fF 1.25 mA 108 109 1010 1011
Frequency (Hz)
(a) (b)
FIGURE 7: (a) The design of the first CTLE stage and (b) its frequency response.
Authorized licensed use limited to: UCLA Library. Downloaded on November 30,2021 at 21:06:34 UTC from IEEE Xplore. Restrictions apply.
VDD
Channel
Stage 1 Stage 2
VDD
(a)
14 5
12 0
10
–5
Magnitude (dB)
Magnitude (dB)
8
–10
6
–15
4
–20
2
0 –25
–2 –30
108 109 1010 1011 108 109 1010 1011
Frequency Frequency
(b) (c)
FIGURE 8: (a) The two-stage CTLE, (b) its frequency response, and (c) the channel-CTLE cascade response.
0.4 0.5
0.3 0.4
0.3
0.2
0.2
0.1 0.1
VCTLE (V)
VCTLE (V)
0 0
–0.1 –0.1
–0.2
–0.2
–0.3
–0.3 –0.4
–0.4 –0.5
0 5 10 15 20 25 30 35 0 5 10 15 20 25 30 35
Time (ps) Time (ps)
(a) (b)
FIGURE 9: The output eyes of (a) the channel and (b) the CTLE.
reveal that the eye suffers further assumption at this point is that the data if the loss is lower. As a result,
closure in this case. Conversely, DFE will further improve the eye. significant intersymbol interference
R S = 600 X and C S = 100 fF increase appears at the CTLE output. For this
the boost factor but do not improve The Need for Programmable Boost reason, the boost must be variable.
the output eye. Broadband Rxs must operate proper This is typically accomplished by
Whether the eye opening depict ly with different channel responses. implementing the degeneration capaci-
ed in Figure 9(b) is adequate for The CTLE developed thus far is opti- tors as programmable units. A lower
a BER < 10 -12 or not is ultimately mized for a loss of approximately
determined by the DFE design. Our 20 dB at fNyq, and “overequalizes” the (continued on p. 160)
online contents and industry sys- Society of Applied Physics in coopera- the Japan Society of Applied Physics in
tems session. For further information, tion with the Institute of Electronics, cooperation with SSCS.
please visit www.vlsisymposium.org. Information, and Communication Engi- —Sugako Otani
The Symposium on VLSI Circuits is neers and the IEEE Electron Devices Publicity Chair, Symposium
sponsored by the IEEE Solid-State Society (EDS). The Symposium on VLSI on VLSI Circuits
Circuits Society (SSCS) and the Japan Technology is sponsored by EDS and
value pushes the pole to higher fre- Conf. (ISSCC), Feb. 2019, pp. 118–120. doi: [12] J. Han, N. Sutardja, Y. Lu, and E. Alon, “De-
10.1109/ISSCC.2019.8662523. sign techniques for a 60-Gb/s 288-mW NRZ
quencies, thereby providing less [6] J. Im et al., “A 112-Gb/s PAM-4 long-reach transceiver with adaptive equalization and
equalization at fNyq . wireline transceiver using a 36-way time- baud-rate clock and data recovery in 65-nm
interleaved SAR ADC and inverter-based CMOS technology,” IEEE J. Solid-State Cir-
In the second part of this article, RX analog front-end in 7nm FinFET,” IEEE J. cuits, vol. 52, no. 12, pp. 3474–3485, Dec.
we design a DFE and cascade it with Solid-State Circuits, vol. 56, no. 1, pp. 7–18, 2017. doi: 10.1109/JSSC.2017.2740268.
Jan. 2021. doi: 10.1109/JSSC.2020.3024261. [13] E. Depaoli et al., “A 4.9pJ/b 16-to-64Gb/s
the CTLE. [7] T. Ali et al., “A 460mW 112Gb/s DSP-based PAM-4 VSR transceiver in 28nm FDSOI
transceiver with 38dB loss compensa- CMOS,” in Proc. IEEE Int. Solid-State Cir-
tion for next-generation data centers in cuits Conf. (ISSCC), pp. 112–114. doi:
References 7nm FinFET technology,” in Proc. IEEE 10.1109/ISSCC.2018.8310209.
[1] S. Gondi and B. Razavi, “Equalization and Int. Solid-State Circuits Conf. (ISSCC), pp. [14] A. Cevrero et al., “A 100Gb/s 1.1pJ/b PAM-4
clock and data recovery techniques for 10- 118–120. doi: 10.1109/ISSCC19947.2020. RX with dual-mode 1-tap PAM-4 / 3-tap NRZ
Gb/s CMOS serial-link receivers,” IEEE J. Solid- 9062925. speculative DFE in 14nm CMOS FinFET,”
State Circuits, vol. 42, no. 9, pp. 1999–2011, [8] A. Atharav and B. Razavi, “A 56Gb/s 50mW in Proc. IEEE Int. Solid-State Circuits Conf.
Sept. 2007. doi: 10.1109/JSSC.2007.903076. NRZ receiver in 28nm CMOS,” in Proc. IEEE (ISSCC), pp. 112–114. doi: 10.1109/ISSCC.
[2] A. Sheikholeslami, “Equalizer circuit [Cir- Int. Solid-State Circuits Conf. (ISSCC), pp. 2019.8662495.
cuit Intuitions],” IEEE Solid-State Circuits 192–194. doi: 10.1109/ISSCC42613.2021. [15] A. Roshan-Zamir et al., “A 56-Gb/s PAM4
Mag., vol. 12, no. 1, pp. 6–7, Jan. 2020. 9365997. receiver with low-overhead techniques for
doi: 10.1109/MSSC.2019.2952233. [9] T. Shibasaki et al., “A 56Gb/s NRZ-electrical threshold and edge-based DFE FIR- and IIR-
[3] B. Razavi, “The decision-feedback equalizer 247mW/lane serial-link transceiver in 28nm tap adaptation in 65-nm CMOS,” IEEE J. Sol-
[A Circuit for All Seasons],” IEEE Solid-State CMOS,” in Proc. IEEE Int. Solid-State Circuits id-State Circuits, vol. 54, no. 3, pp. 672–684,
Circuits Mag., vol. 9, no. 4, pp. 13–16, Nov. Conf. (ISSCC), pp. 64–66. doi: 10.1109/ Mar. 2019. doi: 10.1109/JSSC.2018.2881278.
2017. doi: 10.1109/MSSC.2017.2745939. ISSCC.2016.7417908. [16] A. Manian and B. Razavi, “A 40-Gb/s 14-mW
[4] P. Upadhyaya et al., “A fully adaptive 19-to- [10] P. J. Peng et al., “A 56Gb/s PAM-4/NRZ CMOS wireline receiver,” IEEE J. Solid-State
56Gb/s PAM-4 wireline transceiver with transceiver in 40nm CMOS,” in Proc. IEEE Circuits, vol. 52, no. 9, pp. 2407–2421, Sept.
a configurable ADC in 16nm FinFET,” in Int. Solid-State Circuits Conf. (ISSCC), pp. 2017. doi: 10.1109/JSSC.2017.2705913.
Proc. IEEE Int. Solid-State Circuits Conf. (ISS- 110–111. doi: 10.1109/ISSCC.2017.7870285. [17] P. Mishra et al., “A 112Gb/s ADC-DSP-based
CC), pp. 108–110. doi: 10.1109/ISSCC.2018 [11] B. Dehlaghi et al., “A 1.41-pJ/b 56-Gb/s PAM-4 transceiver for long-reach applica-
.8310207. PAM-4 receiver using enhanced transition tions with >40dB channel loss in 7nm Fin-
[5] T. Ali et al., “A 180mW 56Gb/s DSP-based utilization CDR and genetic adaptation al- FET,” in Proc. IEEE Int. Solid-State Circuits
transceiver for high density IOs in data gorithms in 7-nm CMOS,” IEEE Solid-State Conf. (ISSCC), pp. 138–139. doi: 10.1109/
center switches in 7nm FinFET technol- Circuits Lett., vol. 2, no. 11, pp. 248–251, Nov. ISSCC42613.2021.9365929.
ogy,” in Proc. IEEE Int. Solid-State Circuits 2019. doi: 10.1109/LSSC.2019.2938677.
Authorized licensed use limited to: UCLA Library. Downloaded on November 30,2021 at 21:06:34 UTC from IEEE Xplore. Restrictions apply.