A 6-Gb/s MIMO Crosstalk Cancellation Scheme

IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
1843
A 6-Gb/s MIMO Crosstalk Cancellation Scheme
for High-Speed I/Os
Taehyoun Oh, Student Member, IEEE, and Ramesh Harjani, Fellow, IEEE
Abstract—A continuous-time multiple-input multiple-output
crosstalk cancellation (MIMO-XTC) architecture operating at 2–6
Gb/s has been proposed. The performance of the XTC equalizer
has been measured with various spacings of FR4 channels and data
rates. The crosstalk energy reutilization technique efficiently handles crosstalk and achieves high signal integrity in severe crosstalk
environments where crosstalk had completely closed the data eye.
Measurement results show improvement in
and vertical
opening of the eye diagram by 67% UI and 58.2%, respectively,
which is the best known improvement to date. The MIMO-XTC
portion occupies 0.03 mm and consumes 2.8 mW/Gbps/lane,
which is two times lower than previously proposed XTC schemes.
Index Terms—Bandwidth, equalization, high-speed links, intersymbol interference (ISI), jitter, multiple-input multiple-output
crosstalk cancellation (MIMO-XTC), parallel interfaces, pre-emphasis, pulse response, single-ended I/Os, vertical eye-opening.
I. INTRODUCTION
R
APID advances in CMOS technology continue to
increase the on-chip clock speeds exponentially, while
high-speed I/Os that are used to connect between chips continue
to be a performance bottleneck for the system. Finite-channel
bandwidths generate ISI and reduce the amplitude of the received signal and thus degrade SNR. Techniques to mitigate
ISI in single wireline channels have been used for more than
a decade, and systems operating at 20 Gb/s have recently
been published [1]–[6]. With increased speeds, I/Os become
more vulnerable to electromagnetic interference (crosstalk)
from adjacent channels. Channel equalization, in the form of
pre-emphasis, that is often used to tackle ISI, unintentionally
increases crosstalk by boosting the high frequency signal
component. A number of XTC techniques have been proposed
to remove the effects of crosstalk [7]–[13]. Jitter equalization
[7], [8], staggered I/Os [9], [10] and amplitude XTC with
a finite-impulse response (FIR) filter at the transmitter [11]
can be used to reduce the effects of crosstalk to a limited
extent. However, these schemes result in increased current
consumption and have limited or no impact on channel spacing.
Current crosstalk cancellation techniques do not adapt well to
I/O systems where power efficiency is critical [14]. In most
high-speed links, crosstalk is avoided by maintaining sufficient
Manuscript received November 30, 2010; revised February 03, 2011; accepted March 28, 2011. Date of publication June 09, 2011; date of current version July 22, 2011. This paper was approved by Guest Editor Pavan Kumar
Hanumolu. This work was supported in part by a grant from the Semiconductor
Research Corporation.
The authors are with the University of Minnesota, Minneapolis, MN 55455
USA (e-mail: [email protected]).
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org.
Digital Object Identifier 10.1109/JSSC.2011.2151410
distance between channels and/or shielding the channels at
a higher cost, choosing differential I/Os at the expense of
doubled power, I/O pins and board area, or running the system
at lower data rates with closely spaced parallel channels. The
techniques developed in this paper rely on two distinct characteristics. First, we regard the crosstalk as a signal component
that can be reutilized to increase SNR which contrasts with
other XTC schemes where the crosstalk is simply considered as
a noise component to be removed. Second, the continuous-time
MIMO-XTC, introduced here, relies on cancelling a continuous-time phenomenon using a continuous-time technique.
This results in very efficient handling of the high-frequency
crosstalk signal and its cancellation.
The remainder of this paper is organized as follows. Section II
introduces the far-end crosstalk (FEXT) channel model for both
single-ended and differential I/Os and develops an intuitive
technique for pulse response analysis in terms of eye-diagram
properties. Section III proposes a new 2 2 MIMO-XTC
algorithm. A CMOS prototype implementation of the 2 2
MIMO-XTC for single-ended I/Os is presented in Section IV.
Measurement results are shown for the prototype circuit in
Section V. Section VI summarizes this work.
II. CHANNEL MODEL
A. Low-Impedance Microstrip-Line FEXT Model
FEXT in transmission lines is the signal energy that is
coupled between two closely spaced channels. When an active
signal is transmitted on one of the channels, as illustrated in
Fig. 1(a) and (b), the end of the adjacent channel receives the
coupled FEXT signal. If the adjacent channel transmits another
independent signal in the same direction, it will receive both
its own original signal and FEXT coupled from the adjacent
channel. Since these two signals are uncorrelated, the FEXT
degrades the horizontal and vertical eye-opening of the original
signal and is normally considered as noise. In a homogeneous
channel, i.e., strip-line, the inductive and capacitive coupling
is well-balanced and the FEXT becomes negligible [15]. On
the other hand, in an inhomogeneous channel like a micro-strip
line, significant crosstalk energy couples through the asymmetrical field [7]. However, micro-strip lines have a cost advantage
due to its convenient implementation so most interfaces are
manufactured using micro-strip lines. As PCBs are required
to have more and more channels in a limited board area for
higher data throughput, the physical spacing between channels
is reduced and crosstalk is rapidly becoming the dominant
factor affecting signal integrity.
Longer channel lengths and reduced channel spacings result
in a larger coupling coefficient and more crosstalk transfers
0018-9200/$26.00 © 2011 IEEE
1844
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
Fig. 1. Pulse responses of NRZ signal and FEXT in single-ended I/Os. (a) Closely spaced single-ended I/Os. (b) Pulse responses. (c) Vertical and horizontal eye
margin. (d) Eye patterns.
onto the adjacent channel [7], [15]. Transmitted signals with
sharper transitions are more easily coupled because of the
high-pass filter characteristic of the crosstalk channel. Fig. 1(a)
presents the physical parameters used to formulate the crosstalk
model in single-ended I/Os.
is the channel length,
is
the channel width and
is the center-line distance between
channels. When the aggressor signal
is transmitted on
closely spaced channels, the FEXT signal
occurs at
the adjacent channel output as
(1)
where
(
) is the forward coupling strength and
is a function of channel length
and channel height. For
simplicity of analysis, we have assumed a fixed channel length,
channel height, and a constant transition time for the transmitted
signal. The crosstalk energy diminishes approximately by a
factor of , where, for single-ended I/Os, the nominal value
for
is between 1-2 depending on channel conditions [15],
[16]. In most low-impedance micro-strip lines used in portable
electronics, the inductive coupling component is dominant
and the FEXT pulse response is approximately the negative
derivative of the channel pulse response
[7], [11], [15].
B. Predicting Eye-Diagram Properties From the Pulse
Response
Here, we discuss the limitations of the previously proposed
FIR XTC filter in [11]. The channel ISI tails at a bit period of
and FEXT pulse response tails sampled at half a period
, as
shown in Fig. 1(b), are immediately related to the eye opening
level and jitter performance of eye-diagram. An algorithm to
determine the crosstalk cancellation tap weights has been suggested in [11]. However, the FEXT tails half a bit period away
from the eye-center timing,
significantly limit the
efficacy of XTC schemes with integer FIR taps.
The eye-diagram performance can be directly evaluated
from the shape of a single pulse response analytically. Experimentally, this single pulse response can be conveniently
measured via a digital sampling oscilloscope and a pulse generator. We propose a method to calculate the
and the
vertical eye-opening from a single output pulse. As shown in
Fig. 1(a) and (b), when a single pulse at the appropriate data
rate is transmitted, we can obtain the pulse responses of the
data and the crosstalk at the end of the channels, which are
expressed as
and
, respectively. The data eye-diagram
is a convolution of the pulse response
and the impulse
train sequence
and can be obtained by folding
it repeatedly with
integer multiple symbol periods. If we
assume that
is the response for a causal channel, then
,
. Further, for simplification in this analysis, we shall assume that the ISI tail is
limited to
and is zero for all positive values of greater
than 2. Then, the maximum data eye-opening can be directly
calculated as
in the worst case scenario. The
maximum magnitude of the vertical signal is
,
which is
larger than the maximum pulse amplitude
( ) due to the randomness of data. The time duration of the
maximum horizontal eye,
is the time interval between
the two points where the voltage level of the data pulse is
. As a result, the
becomes
.
In a highly lossy channel or higher data rates,
decreases
and the number of ISI tails and their magnitude increases and
therefore the eye-opening,
reduces. At the
same time, the horizontal eye interval reduces as the vertical
eye-opening
decreases and the two points are
brought closer together.
In the case of the crosstalk pulse response, shown in Fig. 1(b),
the maximum amplitude occurs at approximately half of the
two data eye centers. The crosstalk has its maximum amplitude during the data transition, typically
away from the
OH AND HARJANI: A 6-GB/S MIMO CROSSTALK CANCELLATION SCHEME FOR HIGH-SPEED I/OS
1845
Fig. 2. Skewed FEXT signal in a transmitter per-pin de-skew scheme.
data eye-center and crosses the zero when the data eye is at its
peak (i.e.,
). The maximum crosstalk amplitude
in the FEXT eye-diagram occurs at
away from the center of data eye, as shown in Fig. 1(d). On
the other hand, the integer crosstalk tails, and , align with
the center of data eye. If uncorrelated data and FEXT are combined, representing the case of crosstalk coupling to an adjacent
channel, the vertical eye-opening reduces by
in the
worst case and becomes
. Here,
we assume the delays between two channels are matched. The
maximum vertical eye-opening in the channel with both ISI and
FEXT can be more generally written as
(2)
The FEXT tails,
and
take place during the data
transitions and disturb the timing. The absolute magnitude of
and
are larger than
and , and, therefore, the
majority of the crosstalk energy is normally transformed into
jitter. The detailed behavior of how crosstalk affects timing
variation is described in [7], [8]. Additionally, in situations
where per-pin de-skew scheme may be applied, as shown in
Fig. 2, the
will further reduce because the timing of
FEXT center ( ) does not align with the timing of data center
( ). A simulation of the performance degradation in situations
where a per-pin de-skew scheme may be required is presented
in Section V.
Similar degradation by the
and
crosstalk occurs
even at lower speed signals. Here, the lower speed applies to
the overall data rate. We note that ISI tails are limited at this
lower rate for the limited loss cases. For example, as illustrated
in the low speed signal pulse response in Fig. 3, the integer
ISI tails (
) are zero and do not influence the following
symbol amplitude at the data center. In this example, the integer
crosstalk tails (
) are zero and the crosstalk tails at
and
only degrade the timing variation. For both lowspeed signal and high-speed signal, the large amplitude of
and
limits the benefits of crosstalk cancellation schemes
with integer FIR filters that have maximum effect only at the
center of the eye. Fractional taps are required to remove the
noninteger crosstalk tails, which increases the necessary clock
speed and power. Reference [11] proposes a zero-forcing algorithm to optimize the tap weights for the FIR filter for maximum
crosstalk cancellation. Here, 25-mW/Gbps/lane power is consumed to drive the digital taps with sufficient speed.
Fig. 3. NRZ signal pulse response
data rate versus high data rate.
and FEXT pulse response
(low
Moreover, XTC with a FIR filter is preferentially implemented on the transmitter-side due to availability of the timing
information coincidentally with the crosstalk signal. A typical
FIR filter with discrete taps at the receiver samples the signal at
the timing of
for maximum SNR. However, we can notice
that, from the crosstalk pulse response in Fig. 1(b), the
is received before
. Therefore, the sampled signal is not
available when the
disturbs the transition timing in the
adjacent channel and a XTC implementation with FIR taps in
the receiver becomes ineffective. A transversal FIR filter that
uses continuous-time analog delay, may be used instead of a
sample-data FIR filter to avoid the issues of the availability of
sampled signals from adjacent channels and fractional taps.
Besides, adding additional taps at the transmitter to generate
the crosstalk cancellation signal in addition to the pre-emphasis, reduces the overall output swing of the transmitter and
decreases the absolute value of the vertical eye-opening of the
received signal [11].
C. Single-Ended Versus Differential Signaling
The power spectrum of a NRZ signal coupled to an adjacent
channel can be expressed as
(3)
which is the product of the FEXT channel transfer
, and the NRZ data spectrum,
function,
. The power spectra of a 2-Gb/s
and a 5-Gb/s NRZ signal and the FEXT channel transfer
functions when
is
, have been measured and are shown
1846
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
Fig. 4. (a) 2-5 Gb/s NRZ spectrum and FEXT transfer function. (b) Coupled NRZ spectrum.
Fig. 5. Differential I/Os.
in Fig. 4(a). A PCB trace with a length of 16 inches, channel
width of 120 mils and channel height of 62.5 mils was used for
this set of measurements. The left-hand axis shows the units
for the NRZ spectrum in dBm, while the right-hand sides shows
the scattering parameters ( ) for the FEXT transfer function.
Notice that, as the data rates increase, a larger portion of the
signal energy moves towards frequencies where crosstalk has a
higher gain, allowing more NRZ signal energy to be coupled
to the adjacent channel as crosstalk. The power spectrum of
the NRZ signal coupled to an adjacent channel is shown in the
Fig. 4(b), i.e., this is the FEXT signal we wish to cancel.
These measurements suggest that to maximize the throughput
and at the same time to avoid crosstalk degradation, data should
be distributed over a large number of parallel lines operating
at lower speeds ( 2 Gb/s) where the crosstalk strength is minimal. Traditionally, most CPU-to-memory interfaces follow this
guidance. However, the overall throughput in this case is limited
by the number of channels, I/O pins, lower speed, and available
PCB area. An alternate choice is to run the system at a higher
speed utilizing differential signaling, which is less susceptible
to crosstalk. Nevertheless, low signal gain and long ISI tails of
the received NRZ signals place challenging design constraints
on the clock and data recovery circuits. Also, the additional
power necessary for the I/O makes the differential solution less
attractive.
It is instructive to derive the crosstalk equations for differential I/Os as well. Fig. 5 shows all the necessary physical parameters we will use to describe the coupled crosstalk signal. Here,
is the ratio of the distance between the differential pair to the
channel width. The bandwidth of the differential channel and
the characteristic impedance reduces for a small , while the
shielding effect provided by the wave guide declines and the
channel has a reduced common-mode rejection ratio (CMRR)
benefit for a large . A typical range for
is 1-1.5 [17]. Here,
is the distance between the two differential pairs. The aggressor signal
is applied at the differential input nodes
and the output signal at the adjacent differential channels receives the FEXT signals,
and
. As discussed earlier, the inductive crosstalk coupling component is
dominant in most micro-strip lines. Each differential output contains a superposition of the inductively coupled signals originated from
and
. Using (1),
and
can be expressed as
(4)
(5)
is usually larger than
for differential I/Os for better
crosstalk rejection. Expanding
and
, for
, the FEXT
signals seen at the differential output node is
(6)
This equation illustrates three important features of differential signaling. First, contrary to single-ended I/Os, the crosstalk
signal in differential I/Os has a positive polarity even though
each single channel is inductively coupled. Second, as the distance between the single differential pair
decreases, the
OH AND HARJANI: A 6-GB/S MIMO CROSSTALK CANCELLATION SCHEME FOR HIGH-SPEED I/OS
Fig. 6. 2
1847
2 MIMO-XTC architecture.
FEXT diminishes due to the increased shielding effect. However, this can only be achieved at the cost of reduced channel
bandwidth and reduced characteristic impedance. Finally, assuming that
in (1) equals
in (6), the modulus for differential crosstalk is lower than for the single-ended case by a
factor of
because of the shielding effect just described. Intuitively, the coupled signals of two differential aggressors with slightly skewed distances cancel each other in the
adjacent channel. The value of
depends on the board type
[see (1)]. In case of the channels presented in [11] and [16],
equals 1 and the crosstalk strength in differential I/Os decreases
proportional to
, whereas the crosstalk in a single-ended
channel decreases by
. In other words, differential I/Os
are more immune to crosstalk degradation than single-ended
I/Os at higher data rates. In addition, differential I/O circuits
have a better power supply rejection ratio (PSRR). However,
the MIMO-XTC scheme presented here minimizes the crosstalk
in single-ended I/Os allowing reductions in transmitter driver
power, I/O pin count and channel PCB area, provided the other
system constraints, such as PSRR, allow for this change.
Fig. 7. Bandwidth improvement by MIMO signals.
the gain is correctly adjusted to
term in the resultant matrix of
, it forces the nondiagonal
III. MIMO-XTC ARCHITECTURE
Our approach utilizes a continuous-time receiver-side
crosstalk cancellation scheme that is applicable to single-ended
I/Os. The proposed algorithm is shown in Fig. 6.
and
are the frequency-domain representations of the transmitted signals
on channels 1 and 2.
is
the channel transfer function and
is the FEXT. As
stated earlier, the FEXT is the negative derivative of channel
transfer function multiplied by
[see (1)]. The received
signals
and
contain both the primary signal with
the channel ISI and the FEXT of the adjacent channel signal.
The behavior of this two-channel system can be expressed in
matrix form as
(8)
to become zero and the crosstalk is cancelled out. At this point,
the equalized signals does not include the adjacent channel’s
signal component, and hence the two channels become independent as shown in
(7)
(9)
,
are added
In our algorithm, the received signals
to the adjacent channel’s received signal after differentiation
and with an appropriate value for the gain term . When
, where
is a primary signal path gain. An addiwhen
tional term,
, is now present in the diagonal terms.
This second derivative of the signal boosts the high frequency
1848
Fig. 8. 2
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
2 MIMO crosstalk cancellation circuit implementation for single-ended I/Os.
signal content by
and works against the channel loss in
. This additional signal (MIMO signal) is normally lost
in previous schemes since the crosstalk is only considered as a
noise component to be cancelled out. In our MIMO algorithm,
the crosstalk signal energy is reutilized to reduce the jitter and
increase the vertical eye opening while mitigating the ISI tails.
When we differentiate the received signal, both the crosstalk
cancellation signal and a MIMO signal for the adjacent channel
are obtained. For clarity purposes, the signal path without differentiation is designated as the primary FEXT signal path and
the path with differentiation as the MIMO-XTC signal path, as
indicated in Fig. 6.
A continuous-time high-frequency boosting analog equalizer
stage follows the architecture to compensate for any remaining
ISI. In our test setup, the analog equalizer only compensates for
a small part of the channel ISI. In channels with large crosstalk
the MIMO boosting can significantly ease the burden on any
follow-on channel equalizer. It reduces both the number of
stages used in analog equalizers and the number of taps that may
be used in a decision feedback equalizer. Fig. 7 shows the frequency-domain impact of MIMO-XTC for various strengths of
the MIMO signal (
15 ps,
23 ps). We note that for channels with increased crosstalk degradation, the MIMO crosstalk
cancellation scheme results in a larger bandwidth improvement.
This implies that high-quality signal integrity can be achieved
even in severe crosstalk environments where the crosstalk completely closes the horizontal and vertical eye-opening. However,
it is important to discuss three fine points here. First, an increased
cannot be used to substitute for channel equalization for ISI
as they operate on different signals. Second, wireline channels
tend to be crosstalk and ISI dominant (by over 20 dB); thus,
increasing the high-frequency response does not degrade SNR
significantly but improves signal-to-distortion ratios. Third,
in the presence of per-pin de-skew scheme, the primary signal
and coupled FEXT signal will not align perfectly, as depicted
in Fig. 2. The FEXT and its cancellation signal or primary and
MIMO signal will be skewed even with an optimal value and
degrades some of the benefits of both crosstalk cancellation and
MIMO improvement discussed here. However, the degradation
is limited and is affected by where the majority of the skew is
accumulated in the channel. Further details about per-pin skew
are discussed in Section V.
IV. 2
2 MIMO-XTC PROTOTYPE IMPLEMENTATION
A 5-Gb/s prototype circuit for the proposed 2 2
MIMO-XTC scheme for single-ended I/Os has been
implemented in a 130 nm CMOS technology using passive differentiators, 180 phase shifters, pseudo-differential
pairs and analog equalizers as shown in Fig. 8. Replica circuits
OH AND HARJANI: A 6-GB/S MIMO CROSSTALK CANCELLATION SCHEME FOR HIGH-SPEED I/OS
1849
Fig. 9. Input network performance in the frequency domain. (a) Input network for primary FEXT and MIMO XTC signal path. (b) Frequency responses. (c)
amplitude from IC prototype.
Measured
for the differentiators and the 180 phase shifters are added to
corresponding stages of each path to equalize phase delays.
Fig. 9(a) and (b) shows the passive input network and its magnitude and phase response. The continuous-time capacitor-resistor (CR) filter for implementing the differentiator enables a
low power and low area design. The transfer function of the CR
, now differs from the
passive differentiator,
, by a factor of exactly
and
replica circuit’s,
the relation between two paths is only a differentiator. As shown
in Fig. 4(a) and (b), a 5-Gb/s NRZ signal and its FEXT frequency corner are around 2.5 GHz, so the low-pass RC replica
with a pole at 10 GHz does not affect the gain of the primary
value needs to be sufficiently large to
signal path. The
ensure that the pole of the replica does not reduce the signal
of interest. However, the MIMO-XTC signal gain is directly
proportional to the RC value to the first order, therefore
cannot be too large. In this design, the magnitude loss due to
the RC filter at 10 GHz on the signal frequency at 2.5 GHz
dB. The location of this
is
pole frequency is chosen, considering the trade-off between the
MIMO-XTC signal gain and the replica circuit’s pole. Tuning
resistors can adjust the pole frequency and reduce any mismatch
due to process variation of the RC constant between the differentiator and its replica circuit.
Circuit phase delays of the primary FEXT signal path
and MIMO-XTC signal path from two receiver inputs to an
analog equalizer input, are matched to 90 difference in our
MIMO-XTC scheme as stated earlier. Any per-pin skew would
detract from this matched case. However, for the prototype we
assume matched channel paths. Results for per-pin de-skew
degradation are presented in Section V. For a broadband
50- receiver input matching, the passive RC-CR network
impedance ( ) should have a large value, as indicated in
Fig. 9(a). In our case, the impedance
is approximately
and the measured input
remains below 15 dB
for the signals of interest up to 2.5 GHz as can be seen in
Fig. 9(c).
The primary FEXT and MIMO-XTC path’s signals are combined after differentiation and the remaining ISI can be removed
by the follow-on analog equalizer, as shown in Fig. 6. The combination of the adder and the analog equalizer is required to
have two features. First, since the original signal from the lossy
channel and the FEXT signal coupled from the adjacent channel
passes through the primary FEXT signal path and are random
and uncorrelated, the adder should have a large input swing,
500 mV . Second, the received single-ended signal needs
to be converted to a differential signal for improved PSRR at
some point.
Fig. 10 shows a couple of potential circuit combinations for
the high-speed signal adder plus analog equalizer. In the circuit
of Fig. 10(a), a current domain adder with a skewed gain control
combines the signals from the two paths and single-to-differential conversion is achieved by a passive low-pass filter at
the analog equalizer input with the penalty of 6-dB gain loss.
A difference between the primary signal-side amplifier and
XTC signal-side amplifier gain of 10 dB can cover a wide
distance. The primary FEXT
range of FEXT strengths for
signal-side amplifier’s gain only needs to avoid signal loss. The
output swing at the summing node of the first stage is dropped
to satisfy these demands, narrowing the output swing and limiting the dynamic range through the gain control. The circuit in
Fig. 10(b) circumvents this issue by adding pseudo-differential
circuits. A 180 phase shifter flips the sign of either the primary
FEXT signal path or the MIMO XTC signal path. The FEXT
signal and its cancellation signal appear as a common-mode
signal and is suppressed by the CMRR of the differential analog
1850
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
Fig. 10. Signal adder plus analog equalizer circuit design options. (a) Current domain adder + Analog equalizer. (b) Pseudo-differential pair + Analog equalizer.
Fig. 12. Die photograph.
Fig. 11. Simulated analog equalizer frequency response.
equalizers that follow. Although the 180 phase shifter and
its replica circuit result in approximately 6-dB loss, the gain
control amplifiers of the two paths now drive separate loads and
dynamic range need not be traded off. The ordering of the phase
shifter stage and pseudo-differential gain control stage can be
switched. With the arrangement in Fig. 10(b), the gain control
amplifier has lower input signal swing and operates in a more
linear region. The analog equalizers that follow both eliminate
the common-mode FEXT signal and boost the high-frequency
signals to remove the residual ISI. The frequency-dependent
gain equation is shown in [18]
(10)
The frequency for the zero,
is calibrated by
and
depending on both the channel loss and the
changing
distance,
necessary MIMO high-frequency boosting. For
the majority of the high-frequency loss is compensated by the
MIMO signals and the burden of high-frequency boosting in the
following stages can be significantly reduced. Fig. 11 shows the
simulated frequency response of the analog equalizer including
variations of
and
[19]. Instead of a varactor,
can
be implemented by a capacitor with a fixed minimum value or
a parasitic capacitance of the source for a higher zero at the
expense of a degree of freedom for calibration, making both
the zero frequency and the dc gain dependent only on the
value [20].
The prototype 2 2 MIMO-XTC circuit occupies 0.03 mm
and the die photograph is shown in Fig. 12. The physical conditions of channels 1 and 2 are expected to be symmetric, and,
therefore, the output results of channels 1 and 2 are normally
identical. In our prototype implementation, we have added two
additional analog equalizer stages in channel 2 to increase the
flexibility of our prototype so that we are able to handle both
low-loss and high-loss channels optimally. For our particular
test conditions, a single stage of the analog equalizer in channel
1 was sufficient to compensate for the residual ISI after the
high-frequency boosting due to the MIMO signal was included.
distance and
The MIMO-XTC circuit draws 11.3 mA at
distance from 1.2 V supply regardless of signal
14.3 mA at
speed. The magnitude of the FEXT depends mostly on the distance rather than on the data rate and the gain of the pseudodifferential amplifier need only be adjusted according to the
channel distance. The analog equalizer consumes 5.3 mA.
OH AND HARJANI: A 6-GB/S MIMO CROSSTALK CANCELLATION SCHEME FOR HIGH-SPEED I/OS
1851
Fig. 13. Calibrating FEXT cancellation gain and MIMO improvement (100 mV,100 ps/division).
V. MEASUREMENT RESULTS
A. 2
2 MIMO-XTC Gain Calibration – Single Input Signal
The receiver proposed in Fig. 8 has been implemented and
measured in a single-ended I/O environment. A 16-in FR4 board
with a channel loss of 9 dB at 2.5 GHz is used for this test. A
Centellax TG2P1A PRBS of pattern length
generates a
random NRZ signal on a single-ended channel, and the signal
at the receiver output is measured by an Infiniium DSO81204A
oscilloscope. The FEXT cancellation and MIMO improvement
were directly observed in our test setup after proper termination
and equalization. By transmitting a single NRZ signal onto either the channel 2 or the channel 1 input, FEXT cancellation or
MIMO improvement operation can be seen separately, as shown
in the test setup in Fig. 13 and only the measured output at the
channel 1 was used for both cases. A 5-Gb/s NRZ signal and
FR4 channels with
distance were used in this calibration
example.
When the input to channel 1 is off and only channel 2 is on,
then path A contains the crosstalk coupled to the channel 1 receiver. Path B is the crosstalk cancellation signal derived from
the received signals from channel 2. When the gain of the two
paths is adjusted correctly, the output of path A plus path B will
ideally have no signal. Their eye-diagrams are presented in the
Fig. 13. The gain of the pseudo-differential pair has a 3-bit resolution and was manually optimized to result in a minimum
amplitude for the sum of these two signals (path A and path
B). Fig. 14 shows the residual FEXT energy depending on the
gain control value. Both over cancellation and under cancellation increase the residual FEXT level and degrades SNR. In this
test the data rate is 5 Gb/s and channel distance is
. For this
channel condition, the pseudo-differential gain control of
results in the minimum achievable value for the residual error
mV . The value of the residual FEXT has a maximum
at the data transition and minimum at the center of the data eye.
The residual FEXT normally occurs at the data transition timing
and detracts minimally from the data eye center.
With a gain value that minimizes the residual FEXT, the
MIMO bandwidth improvement was observed in the path C
and path D in Fig. 13. The path C includes the original signal
with channel ISI. Path D has the second derivative signal that is
coupled to the adjacent channel as crosstalk and returns through
the differentiation path in our algorithm. When the signals from
these two paths are added, the MIMO signal reduces the jitter
1852
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
Fig. 14. Pulse response of residual FEXT depending on gain control ( ). (a) Gain control and residual FEXT. (b) Optimal gain control point.
B. 2 2 MIMO XTC Measurement Results—Two Independent
Input Signals
Fig. 15. Measurement results for 2,5,6 Gb/s NRZ signals in
and
distances (left-side graphs: 200 mV, 100 ps/division, right-side graphs: 100 mV,
) diatance. (b) 2 Gbps 240 mil (
)
100 ps/division). (a) 2 Gbps 360 mil (
) distance. (d) 5 Gbps 240 mil (
) distance.
distance. (c) 5 Gbps 360 mil (
) distance.
(e) 6 Gbps 360 mil (
horizontally and increases the data eye-opening vertically by
mitigating channel ISI, as shown in the eye-diagram of path C
plus path D in Fig. 13. The parameters for the follow on analog
equalizer was set to remove any residual ISI. This calibration
procedure could be done in parallel for each pair of channels.
During the calibration phase discussed above, only one
signal was used. In the measurements of this section, two
independent NRZ signals on channel 1 and channel 2 are
transmitted and the eye-diagrams of the received signals with
crosstalk (before MIMO-XTC) and the equalized signals (after
MIMO-XTC) have been measured for the channel distances of
and
for different data rates, as shown in Fig. 15. To
observe the FEXT degradation by varying the degree of ISI,
eye-diagrams with various data rates (2,5,6 Gb/s) are measured.
As the channel distance reduces and the signal frequency
increases, the FEXT impact becomes more severe and closes
the eye completely, as shown in Fig. 15(d) and (e). In the pulse
response of NRZ signals with higher data rates, the main tap
( ) is lower and ISI tails (
) have longer duration
and higher amplitude as explained in Section II-B. The data
eyes are more susceptible to FEXT degradation because of the
reduced vertical data eye-opening. For example, the vertical
and horizontal eye openings with the 5 Gb/s NRZ signal for
distance is 60 mV/821 mV (7.3%) and 30 ps/200 ps (15%UI).
For the same condition, by using our MIMO crosstalk cancellation scheme, the
reduces by 67% and the vertical
eye-opening increases by 58.2%.
Table I shows a summary of the measurement results before
and after MIMO-XTC for various channel distances and data
rates. Here,
is the jitter,
is the vertical eye-opening at
the center (
) and
is
the overall height of data eye (
). These are
indicated in Fig. 15(c). The case of a 6-Gb/s NRZ signal with
distance is not presented because the gain of the pseudodifferential pair in our scheme was not sufficient to generate
an adequate crosstalk cancellation signal at this frequency. At
2 Gb/s, MIMO-XTC has more impact on jitter reduction rather
than increasing the eye-opening because the crosstalk degrades
only the data transition timing and not the vertical eye-opening
at this low frequency.
An issue which needs discussion with any FEXT cancellation scheme is transmitter per-pin de-skew that adjusts timing
OH AND HARJANI: A 6-GB/S MIMO CROSSTALK CANCELLATION SCHEME FOR HIGH-SPEED I/OS
TABLE I
MEASUREMENT RESULTS (BEFORE AND AFTER MIMO-XTC)
of signal in channels with unequal lengths. A length mismatch
physically close to the transmitter outputs has a limited effect
because the coupled signal will align with the original signal
at the end of the channel. However, a length mismatch near
the receiver input directly skews timing of the crosstalks, reducing both jitter margin and vertical eye-opening even after
XTC. In general, skew mismatch is likely to be distributed over
the channel and its effect will lie somewhere between being
placed at the transmitter end or at the receiver end. In Table II,
we have assumed that all the per-pin skew, due to length mismatch, is lumped at the receiver input (i.e., worst case that is unlikely), and show the simulation results of performance degradation for the proposed MIMO-XTC scheme. In Fig. 2,
is
the amount of timing skew. As an approximate rule of thumb, a
quarter inch mismatch at the receiver input is expected to skew
the timing between the received signals by 40 ps. As we can see
from the table, that even though there is progressively larger
impact due to per-pin skew, the proposed MIMO-XTC scheme
still performs relatively well even at a quarter inch mismatch.
Prior crosstalk cancellation techniques have been implemented at the receiver end or at the transmitter end for various
channel conditions and data rates. Table III shows a comparison
of the MIMO-XTC technique to prior schemes. Differential
signaling is used for [7] and [11]. In [10] staggered I/O cancellation is realized using both the transmitter and receiver. The
MIMO-XTC technique achieves the best eye-performance improvement with the lowest power consumption. As technology
scaling advances, the corner bandwidth of the MIMO-XTC
schemes will need to increase further but should be equally
effective for higher frequency.
The proposed continuous-time MIMO FEXT cancellation
scheme has three advantages over the previous discrete time
digital schemes [7], [10], [11]. First, the simple CR differentiator and wideband pseudo-differential amplifiers generate the
crosstalk cancellation signal with low power, while discrete
time schemes normally require several high-speed fractional
taps that consume significant power. Second, the simple circuit
implementation allows for low chip area compared to prior
architectures [10], [11]. The MIMO FEXT cancellation block
1853
TABLE II
SIMULATED PERFORMANCE DEGRADATION OF THE PROPOSED MIMO-XTC
SCHEME DUE TO PER-PIN SKEW (LINE LENGTH 16 IN, WIDTH 120 MIL, HEIGHT
62.5 MIL, DISTANCE
, 5 GB/S NRZ SIGNAL)
spans only two stages for single-ended I/Os and one stage
for differential I/Os. Third, as the spacing between channels
reduces, the signal integrity improves due to the additional
high-frequency boosting from the MIMO signal energy that is
available for reuse. Operation with extremely narrow channel
distances, where the FEXT completely closes the eye-opening
as shown in Fig. 15(d) and (e), can be achieved. The additional
bandwidth improvement eases the burden on the following
ISI equalizer. In conclusion, we can both save board area
and increase signal integrity at higher data rates by using
MIMO-XTC.
VI. CONCLUSION
This paper presents an efficient continuous-time architecture
to cancel and reutilize the crosstalk signal energy to improve
SNR of NRZ signals. Slight modifications of this scheme are
applicable to environments where the unwanted crosstalk signal
is proportional to the first order derivative i.e., pad-to-pad and
pin-to-pin coupling [21]. The measurement results show the validity of the MIMO-XTC algorithm and efficiency of the continuous-time XTC scheme. The MIMO-XTC portion consumes
low power (2.8 mW/Gbps/lane) and low area (0.03 mm ). For
the prototype 2 2 MIMO-XTC scheme, the
reduces
by 67% UI and the vertical eye-opening improves by 58.2% at 5
Gb/s, which to the best of our knowledge is the highest performance improvement for any crosstalk cancellation technique.
Due to the crosstalk, single-ended I/Os have slowly been loosing
ground to differential I/Os as the go-to technology for high performance. However, single-ended I/Os with MIMO-XTC maintains the I/O pin, pads, and power advantages of single-ended
designs but allows for speeds that are comparable to differential I/Os although PSRR considerations may require converting
the single-ended signals to differential signals within the chip
as early as possible. In comparison to low-speed single-ended
I/Os, MIMO-XTC enhanced single-ended I/Os allow for closer
channel spacing resulting in further board area savings. Finally,
in addition to the crosstalk cancellation aspects of MIMO-XTC,
the MIMO signal improves the SNR and reduces the burden on
follow on equalization circuits.
1854
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
COMPARISON
OF
TABLE III
FEXT CANCELLATION SCHEMES.
Fig. 16. Noise simulation setup in a severe crosstalk environment. (a) A conventional analog equalizer scheme. (b) A MIMO-XTC scheme.
APPENDIX
Noise Analysis: Here, we compare the noise performance
of a conventional analog equalizer and the MIMO-XTC scheme.
Fig. 16 shows the noise simulation setup for the two schemes.
For this simulation we assume that four independent random
,
,
,
) at the transmitter outputs
noise sources (
and at the receiver inputs degrade SNR. The signal, crosstalk
and noise energy at points A, B, C and D are summarized in
Fig. 17. The simulation test setup used for these results are:
800 mV
NRZ signal operating at 5 Gb/s at the transmitter
output, 16-in channel length, 120-mils width, 62.5-mils height,
and 240-mils separation between channels. These are similar to
the measured responses shown in Fig. 15(d).
In this crosstalk dominant channels, a conventional analog
equalizer scheme is designed for comparison purposes, as
OH AND HARJANI: A 6-GB/S MIMO CROSSTALK CANCELLATION SCHEME FOR HIGH-SPEED I/OS
Fig. 17. Signal, crosstalk and noise energy (mV
) and BER performance.
shown in Fig. 16(a). The gain and pole location in
are set to result in an equivalent signal swing as the MIMO-XTC
scheme in Fig. 16(b) and thus
Note that the signal energy at points C and D in Fig. 17 is
matched as 311 mV . The corner frequency used for simulation purposes ( ) is 1 THz. The output signal and crosstalk
are
for channel 1 and
for channel 2. The crosstalk energy is
and increases from 163 mV
to 182
amplified by
mV
as shown in Fig. 17.
The noise at the receiver input [point B in Fig. 16(a)],
considering coupling, can be expressed as
for channel 1 and
for channel 2. When the noise passes through
a conventional analog equalizer system, the output noise
(
) is
We can see that the noise at the transmitter outputs does not
appear in the adjacent channel though the channels are coupled. The MIMO-XTC scheme cancels the coupled noise between channels, which looks similar to crosstalk cancellation
process. However, the noise component from the receiver input
in an adjacent channel is coupled through the differentiator and
increases the overall noise level. In Fig. 17, the noise energy
at point D increases from 5.5 mV
to 16 mV
due to the
additional noise from the adjacent channel receiver input noise.
As expected, the MIMO-XTC scheme has a noise penalty in
comparison to the conventional analog equalizer. However, the
residual crosstalk in the conventional analog equalizer scheme
significantly degrades SNDR in this severe crosstalk environment and therefore, the overall noise plus crosstalk is considerably smaller for MIMO-XTC. The BER can be evaluated by
considering the eye-opening [E,H in Fig. 15(c)] and noise standard deviation (
) at each point, as analyzed in [22]
(13)
,
The BERs at point B, C and D are approximately
and
. For this simulation, the noise bandwidth for
the sources
,
,
and
was limited to 100 GHz
[22] and a standard deviation of 5 mV
was used [23]. Fig. 17
shows the noise and signal energies at A, B, C and D in Fig. 16.
The noise and signal powers at each point has been estimated in
mV
by numerically evaluating the following:
• Point A
(11)
The output signal for MIMO-XTC, which can be obtained by
appending an analog equalizer stage to (9), is
for both channels. Here,
is the transmitted signal. The crosstalk component is cancelled
and reused as a MIMO signal in our MIMO-XTC. The signal
energy at point D in Fig. 16(b) includes this beneficial MIMO
energy. On the other hand, the noise component
at point D
can be expressed as
(12)
1855
• Point B
• Point C
1856
IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 46, NO. 8, AUGUST 2011
• Point D
ACKNOWLEDGMENT
The authors would like to thank F. O’Mahony, B. Casper, and
others at Intel Circuits Research Laboratory, M. R. Ahmadi at
AMD, and B. Hardy at LSI for technical help with this project.
The authors would also like to thank the anonymous reviewers
for their constructive comments and feedback.
REFERENCES
-spaced FFE in 65 nm
[1] A. Momtaz et al., “An 80 mW 40 Gb/s 7-tap
CMOS,” in IEEE ISSCC Dig. Tech. Papers, Feb. 2009, pp. 364–365.
[2] H. Wang et al., “A 21-Gb/s 87-mW transceiver with FFE/DFE/analog
equalizer in 65-nm CMOS technology,” IEEE J. Solid-State Circuits,
vol. 45, no. 4, pp. 909–920, Apr. 2010.
[3] J. Lee et al., “A 20 Gb/s full-rate linear CDR circuit with automatic
frequency acquisition,” in IEEE ISSCC Dig. Tech. Papers, Feb. 2009,
pp. 366–367.
[4] K.-C. Wu et al., “A 2 25 Gb/s deserializer with 2:5 DMUX for 100
Gb/s Ethernet applications,” in IEEE ISSCC Dig. Tech. Papers, Feb.
2010, pp. 374–375.
[5] S. A. Ibrahim et al., “A 20 Gb/s 40 mW equalizer in 90 nm CMOS technology,” in IEEE ISSCC Dig. Tech. Papers, Feb. 2010, pp. 170–171.
[6] C.-F. Liao et al., “A 40 Gb/s CMOS serial-link receiver with adaptive
equalization and clock/data recovery,” IEEE J. Solid-State Circuits,
vol. 43, no. 11, pp. 2492–2502, Nov. 2008.
[7] J. F. Buckwalter et al., “Cancellation of crosstalk-induced jitter,” IEEE
J. Solid-State Circuits, vol. 41, no. 3, pp. 621–631, Mar. 2006.
[8] H.-K. Jung et al., “A 4 Gb/s 3-bit parallel transmitter with the crosstalkinduced jitter compensation using Tx data timing control,” IEEE J.
Solid-State Circuits, vol. 44, no. 11, pp. 2891–2900, Nov. 2009.
[9] K.-J. Sham et al., “I/O staggering for low-power jitter reduction,” in
Proc. Eur. Microwave Conf., Aug. 2008, pp. 1226–1229.
[10] K.-I. Oh et al., “A 5-Gb/s/pin transceiver for DDR memory interface
with a crosstalk suppression scheme,” in Proc. IEEE CICC, Sep. 2008,
pp. 639–642.
[11] K.-J. Sham et al., “FEXT crosstalk cancellation for high-speed serial
link design,” in Proc. IEEE CICC, Sep. 2006, pp. 405–408.
[12] C. Pelard et al., “Realization of multigigabit channel equalization and
crosstalk cancellation integrated circuits,” IEEE J. Solid-State Circuits,
vol. 39, no. 10, pp. 1659–1669, Oct. 2004.
[13] Y. Hur et al., “Equalization and near-end crosstalk (NEXT) noise cancellation for 20-Gb/s 4-PAM backplane serial I/O interconnections,”
IEEE J. Solid-State Circuits, vol. 40, no. 1, pp. 246–255, Jan. 2005.
[14] G. Balamurugan et al., “A scalable 5–15 Gbps, 14–75 mW low-power
I/O transceiver in 65 nm CMOS,” IEEE J. Solid-State Circuits, vol. 43,
no. 4, pp. 1010–1019, Apr. 2008.
[15] F. D. Mbairi et al., “High-frequency transmission lines crosstalk reduction using spacing rules,” IEEE Trans. Compon. Pacakg. Technol.,
pp. 601–610, Sep. 2008.
[16] W. R. Eisenstadt et al., “Common and differential crosstalk characterization on the silicon substrate,” IEEE Microw. Guided Wave Lett., vol.
9, no. 1, pp. 25–27, Jan. 1999.
[17] “DDR2-533 memory design guide for two-dimm unbuffered systems,”
Micron, Tech. Note, 2003, p. 14.
[18] M. R. Ahmadi et al., “A 5 Gbps 0.13 m CMOS pilot-based clock
and data recovery scheme for high-speed links,” IEEE J. Solid-State
Circuits, vol. 45, no. 8, pp. 1533–1541, Aug. 2010.
[19] J.-S. Choi et al., “A 0.18- m CMOS 3.5-Gb/s continuous-time
adaptive cable equalizer using enhanced low-frequency gain control
method,” IEEE J. Solid-State Circuits, vol. 39, no. 3, pp. 419–425,
Mar. 2004.
[20] K. Fukuda et al., “A 12.3 mW 12.5 Gb/s complete transceiver in 65 nm
CMOS,” in IEEE ISSCC Dig. Tech. Papers, Feb. 2010, pp. 368–369.
[21] C. Pfeil, “BGA breakout challenges,” PCB Fabrication Mag., pp.
10–13, Oct. 2007.
[22] S. Gondi et al., “Equalization and clock and data recovery techniques
for 10-Gb/s CMOS serial-link receivers,” IEEE J. Solid-State Circuits,
vol. 42, no. 9, pp. 1999–2011, Sep. 2007.
[23] B. K. Casper et al., “An accurate and efficient analysis method for
multi-Gb/s chip-to-chip signaling schemes,” in VLSI Circuits Dig.
Tech. Papers, Jan. 2002, pp. 54–57.
Taehyoun Oh (S’05) received the B.S. and M.S. degrees in electrical engineering from Seoul National
University, Seoul, Korea, in 2005 and 2007, respectively. He is currently working toward the Ph.D. degree in electrical engineering from the University of
Minnesota, Minneapolis, under the supervision of Dr.
R. Harjani.
His research is focused on high-speed I/O circuits
and architectures. During the summer of 2010, he
worked on I/O channel modeling and transmitter
equalization at AMD Boston Design Center, MA.
Ramesh Harjani (S’87–M’89–SM’00–F05) received the B.S. degree from the Birla Institute of
Technology and Science, Pilani, in 1982, the M.S.
degree from the Indian Institute of Technology, New
Delhi, in 1984, and the Ph.D. degree from Carnegie
Mellon University, Pittsburgh, PA, in 1989, all in
electrical engineering.
He is the Edgar F. Johnson Professor with the
Department of Electrical and Computer Engineering
and a Member of the graduate faculty of the Department of Biomedical Engineering, University of
Minnesota, Minneapolis. Prior to joining the University of Minnesota, he was
with Mentor Graphics Corporation, San Jose, CA. He cofounded Bermai, Inc., a
startup company developing CMOS chips for wireless multimedia applications
in 2001. He has been a Visiting Professor with Lucent Bell Labs, Allentown,
PA, and the Army Research Labs, Adelphi, MD. He is an author/editor of four
books. His research interests include analog/RF circuits for wired and wireless
communications.
Dr. Harjani was the recipient of the National Science Foundation Research
Initiation Award in 1991 and Best Paper Awards at the 1987 IEEE/ACM Design Automation Conference, the 1989 International Conference on ComputerAided Design, the 1998 GOMAC and the 2007 TECHCON. His research group
was the winner of the SRC Copper Design Challenge in 2000 and the winner
of the SRC SiGe challenge in 2003. He was an associate editor for the IEEE
TRANSACTIONS ON CIRCUITS AND SYSTEMS II from 1995 to 1997 and guest editor for the International Journal of High-Speed Electronics and Systems and
Analog Integrated Circuits and Signal Processing in 2004 and is currently a
guest editor for the IEEE JOURNAL OF SOLID-STATE CIRCUITS. He was the Chair
of the IEEE Circuits and Systems Society technical committee on Analog Signal
Processing from 1999 to 2000 and a Distinguished Lecturer of the IEEE Circuits
and Systems Society for 2001-2002.