### 11.9 A 105Gb/s Dielectric-Waveguide Link in 130nm BiCMOS Using Channelized 220-to-335GHz Signal and Integrated Waveguide Coupler

Jack W. Holloway<sup>1,2</sup>, Georgios C. Dogiamis <sup>3</sup>, Ruonan Han<sup>1</sup>

<sup>1</sup>Massachusetts Institute of Technology, Cambridge, MA <sup>2</sup>Raytheon, Tewksbury, MA <sup>3</sup>Intel, Chandler, AZ

The rapid surge of data transmission within computation, storage and communication infrastructures is pushing the speed boundary of traditional copper-based electrical links. Recent realizations of 100Gb/s wired links require advanced FinFET technologies, highcost packaging/cables and power-consuming equalization. High-frequency waves over dielectric waveguides have been considered as an alternative solution that exploits the low-loss, broadband medium while maintaining compatibility with existing silicon IC platforms. However, since its debut in 2011 [1], this scheme, previously using <140GHz carriers, has only achieved data rates of up to 36Gb/s [2]. It is expected that higher carrier frequencies (e.g. >200GHz) and multi-channel aggregation would further increase the data rate while shrinking the interconnect size; but that scheme has been hindered by challenges related to the required high-order multiplexer and ultra-broadband waveguide coupler operating efficiently at sub terahertz (sub-THz) frequencies. In this paper, using a 130nm SiGe BiCMOS technology, we present a multi-channel, multiplexer/coupler-integrated transmitter (Tx) that delivers a data rate of 105Gb/s (3×35Gb/s). To demodulate each channel, a 35Gb/s coupler-integrated receiver (Rx) is also developed. Our link, including the chipset and a 0.4mm-wide, 30cm-long dielectric ribbon, experimentally demonstrates the potential speed, efficiency, size and cost advantages of THz fiber links in high-speed inter-server and backplane fabrics.

The architecture of the Tx is shown in Fig. 11.9.1. It divides a band ranging from 220 to 335GHz into three 35GHz-wide channels. Two 5GHz-wide guard bands are inserted to ensure isolation between adjacent channels. The Tx consists of three sub-harmonic LO paths generating signals at  $f_{L0}$ =110, 130 and 150GHz. The 130GHz signal is obtained from a frequency tripler with a 43.3GHz input, and is then amplified by three cascode power amplifier stages. Part of the 130GHz power is diverted to the other two LO paths, which utilize single-sideband mixers to shift the input tones by ±20GHz to generate signals at 110 and 150GHz. To reduce the circuit size and power consumption, the frequency doubling for the link-carrier generation (at 220, 260 and 300GHz) and the data modulation are realized within a single component (Fig. 11.9.2). A push-push structure  $(T_7-T_8)$  is adopted for the second-harmonic generation. The tail current of the transistor pair is switched by the incoming data stream via T<sub>9</sub>, hence realizing OOK modulation of the THz carrier. The capacitor  $C_1$  is chosen to be sufficiently large so that when  $T_7$  and  $T_8$  are active, it acts as a current path at  $2f_{L0}$  in order to increase the frequency conversion efficiency; meanwhile it is small enough to still allow for up to 35GHz modulation of the tail current. The NPN devices  $T_{7}$ - $T_{9}$  are biased by a PMOS  $T_{10}$ through a dumbbell broadband stop structure, which isolates the large parasitic capacitance of T<sub>10</sub> from the THz output path. The doubler-modulator module consumes 22mW of power, and with ~0dBm input LO, its output power is around -12dBm with ~3dB fluctuation across the 35GHz modulation frequency range (Fig. 11.9.2).

A high-Q triplexer, based on three hairpin filters, is implemented on the chip (Fig. 11.9.2). It not only prevents the injection of THz carrier power into the closely-spaced channels, but also suppresses the lower sideband spectrum of the modulated output for improved spectral efficiency. Traditional high-order filter topologies with rapid roll-off adopt cascaded resonators, which cause excessive loss at THz frequencies. Our hairpin filter utilizes four quarter-wave, twist-modified split-ring resonators that are placed adjacent to each other. The resultant one-to-all electric and magnetic couplings, through interresonator capacitance and inductance respectively, reduce the filter size and loss. A quasi-elliptical filter response is implemented with three passband poles and four stopband zeros. For verification, three standalone triplexer test structures are fabricated, allowing for separate tests of the three channel filters. The results (Fig. 11.9.2) agree well with the simulation, showing a 3dB pass bandwidth of 30~35GHz and 10~30dB stop-band rejection. Finally, the multi-channel-aggregated output is coupled into a dielectric ribbon (Rogers R3006, 0.4×0.25mm<sup>2</sup> of cross-section) through an integrated waveguide coupler (Fig. 11.9.3) [4]. The dielectric ribbon is sized to operate in a singlemode regime in the excited horizontal polarization. The coupler is based on a differential substrate-integrated-waveguide (SIW) structure, which turns a microstrip input signal into a leaky travelling wave that is exposed to the dielectric ribbon through the slot of the SIW. As the wave propagates in the longitudinal direction, it gradually couples to the EH1 hybrid mode of the dielectric waveguide. The SIW cross section is tapered (from  $W_{c1}$ =364µm to  $W_{c2}$ =204µm in Fig. 11.9.3) to present an increasing characteristic

impedance; this further boosts the coupling and decreases the length of the coupler to 750 $\mu$ m. Compared to prior works [1,2,5,6], this coupling scheme eliminates the lossy THz-signal routing to off-chip couplers/radiators, and enables a planar waveguide placement with a direct contact to the chip edge. In the simulation (Fig. 11.9.3), the coupler exhibits a low insertion loss of 2.5~6dB across the entire 220~340GHz band.

To form a complete all-silicon link with the Tx, a single-channel Rx integrating the same waveguide coupler is implemented using the 130nm BiCMOS process. Shown in Fig. 11.9.3,  $T_{15}$ - $T_{17}$  form an active balun, with full-band matching, that converts the single-ended THz input from the coupler into a pair of differential currents. To downconvert the sub-THz energy to baseband, these currents are injected into four resistively loaded switching transistors  $T_{11}$ - $T_{14}$ , which are driven by an externally applied LO signal. Through a three-stage buffer pair, the recovered data is amplified and extracted at the output. The above THz down-conversion mixer operates across a broad band. In the experiment, the applied LO is selected to be at 220, 260 and 300GHz and manually tuned for phase coherence with the Tx, so that each transmitted channel is tested. Future multi-channel receivers to be developed will include the same triplexer as that in Fig. 11.9.2, as well as separate Rx circuits shown in Fig. 11.9.3 for each channel.

The fabricated Tx and Rx chips are 2.4×3.9mm<sup>2</sup> and 0.9×0.9mm<sup>2</sup> in size. The assembly of the link is shown in Fig. 11.9.4, where a dielectric ribbon is directly bonded with low permittivity epoxy on top of the two chips. Unlike optical links, no precise waveguidechip alignment is required in this assembly. Link lengths of 5cm and 30cm are chosen, which are ultimately limited by the available space of the assembly and fragility of the thin ribbon, rather than the link budget. The test setup is also shown in Fig. 11.9.4. To include the impact of inter-channel interference, two PRBS-7 generators simultaneously drive both the channel under test and an adjacent channel of the Tx chip. On the Rx side, a VDI WR3.4GX-M source provides the THz LO signal through a Cascade probe. LO frequencies of 220, 260 and 300GHz are applied respectively to test the transmission in Channel 1~Channel 3. Only one of the differential outputs of the Rx (hence 3dB lower SNR) is measured by a single-ended high-speed oscilloscope. In Fig. 11.9.5, the eye diagrams of the three channels running at 35Gb/s are shown. Without any equalization, the three-channel bit-error rate (BER) is  $\leq 5 \times 10^{-9}$  for the 5cm distance, and  $\leq 5 \times 10^{-8}$  for the 30cm distance. With a reduced channel data rate of 30Gb/s, the BER is improved to <10<sup>-12</sup>. Lastly, by extending one end of the fiber into a WR-3 horn antenna feeding an Erickson PM5 power meter, the peak output power of the Tx channels is measured to be -24dBm. The DC power of the Tx and Rx are 256mW and 73mW, respectively. The energy efficiency of the Tx is 2.4pJ/b. A future inclusion of a 40GHz/20GHz PLL (for the carrier generator) with typically tens of mW of DC power [7] would increase the above value to ~3pJ/b. Shown in Fig. 11.9.6, this link has 3× higher data rate than prior art and has the smallest interconnect size due to its integrated coupler and the small dimension of the THz dielectric waveguide. Future applications of the work include medium-reach (meter-level) datacentric interconnects with high speed and low latency, as well as lightweight communications within autonomous cars, drones and other aeronautical vehicles.

#### Acknowledgement:

This work is funded by Intel through a Semiconductor Research Corporation Member Specific Research Grant to MIT (Grant # 2017-IN-2752), the Raytheon Advanced Studies Program, MIT Lincoln Labs., the Naval Research Lab., the Office of Naval Research, and MIT Center of Integrated Circuits and Systems.

#### References:

[1] S. Fukuda et. al., "A 12.5+12.5 Gb/s Full-Duplex Plastic Waveguide Interconnect," *IEEE JSSC*, vol. 46, no. 12, pp. 3113-3125, Dec. 2011.

[2] M. Sawaby et. al., "A Fully Packaged 130-GHz QPSK Transmitter With an Integrated PRBS Generator," *IEEE SSC-L*, pp. 166-169, July 2018.

[3] Q. Zhong, et. al., "300-GHz CMOS QPSK Transmitter for 30-Gbps Dielectric Waveguide Communication," *IEEE CICC*, Apr. 2018.

[4] J. W. Holloway et. al., "A Fully Integrated Broadband Sub-mmWave Chip-to-Chip Interconnect," *IEEE TMTT*, vol. 65, no. 7, pp. 2373-2386, July 2017.

[5] N. van Thienen et. al., "An 18Gbps Polymer Microwave Fiber (PMF) Communication Link in 40nm CMOS," *IEEE ESSCIRC*, Sept. 2016.

[6] M. de Wit et. al., "Analysis and Design of a Foam-Cladded PMF Link with Phase Tuning in 28-nm CMOS," *IEEE JSSC*, vol. 54, no. 7, July 2019.

[7] H. Wang et al., "A 9.6 mW Low-Noise Millimeter-Wave Sub-Sampling PLL with a Divider-less Sub-Sampling Lock Detector in 65 nm CMOS," *IEEE RFIC*, May 2019.

## ISSCC 2021 / February 17, 2021 / 7:56 AM



Figure 11.9.1: (Top) system diagram and (bottom) the multiplier and amplifier chain for LO generation.



Figure 11.9.3: (Top) the integrated waveguide coupler and simulation results. (Bottom) schematic of the coupler-integrated, single-channel link receiver.



Figure 11.9.5: (Left) measured eye diagram of the 30cm link with an aggregated data rate of 3x35Gb/s, (top right) the measured BER, and (bottom right) the measured output power of each channel.



Figure 11.9.2: (Top) design of the 260GHz doubler-modulator, and (bottom) the design, simulation and measurement of the 220~335GHz hairpin triplexer.



Figure 11.9.4: (Top) photo of the link with a 30cm dielectric waveguide connecting the Tx and Rx chips. (Bottom) the block diagram of the link testing setup.





|                     | Technology      | Carrier<br>Frequency<br>(GHz) | Data Rate<br>(Gbps) | BER                          | Fiber<br>Coupler                   | Fiber Size<br>(W×H, mm) | Demonstrated<br>Link Length<br>(cm) | TX DC Power<br>& Efficiency        | RX DC Powe<br>& Efficiency |
|---------------------|-----------------|-------------------------------|---------------------|------------------------------|------------------------------------|-------------------------|-------------------------------------|------------------------------------|----------------------------|
| JSSC<br>2011 [1]    | 40nm CMOS       | 57, 80                        | 12.5+12.5†          | <10-12                       | QuasiYagi<br>(Off-Chip)            | 8×1.1                   | 120                                 | 56mW<br>2.2pJ/bit                  | 87mW<br>3.5pJ/bit          |
| SSC-L<br>2018 [2]   | 55nm SiGe       | 130                           | 36                  | <10 <sup>-8</sup><br>@25Gbps | Vivaldi<br>(Off-Chip)              | 1.3×1.3                 | 100                                 | 216mW <sup>t†</sup><br>6pJ/bit     | No RX                      |
| ESSCIRC<br>2016 [5] | 40nm CMOS       | 120                           | 17.7                | < <b>10</b> <sup>-12</sup>   | Tapered Slot<br>(Off-Chip)         | 2<br>(Circular)         | 100                                 | 11.1mW<br>0.63pJ/bit               | 59.6mW<br>3.4pJ/bit        |
| JSSC<br>2019 [6]    | 28nm CMOS       | 140                           | 12                  | <10-12                       | CPW-WG<br>Transition<br>(Off-Chip) | 1.9×1.0                 | 100                                 | 65mW<br>5.4pJ/bit                  | 165mW<br>13.8pJ/bit        |
| This Work           | 130nm<br>BiCMOS | 220, 260 &<br>300             | 35×3‡               | 5×10-8                       | Leaky SIW<br>(Integrated)          | 0.4×0.25                | 30                                  | 256mW <sup>++</sup> ,<br>2.4pJ/bit | 73mW#,<br>2.1pJ/bit        |
|                     |                 |                               | 30×3‡               | <10-12                       |                                    |                         |                                     |                                    |                            |

Full-duplex transmission (12.5Cbps each way)

Tinput signal sources (16.25CHz in [2], 43.3 & 20CHz in this work) not included

The link is demonstrated with a three-channel TX and one-channel RX

To gure 11.9.6: (Top) fiber connection and power consumption breakdown of the Tx

rigure 11.9.6: (10p) tiber connection and power consumption breakdown of the 1x chip, and (bottom) a comparison with other dielectric-waveguide links.

# **ISSCC 2021 PAPER CONTINUATIONS**

