## Burst-Mode Clock and Data Recovery for Optically Interconnected Data Centers

Bhavin J. Shastri, and David V. Plant

Photonic Systems Group, Dept. of Electrical and Computer Eng., McGill University, Montreal, QC H3A 2A7, Canada shastri@ieee.org

Abstract—We propose a novel burst-mode clock/data recovery (BM-CDR) architecture for optical data center applications. Our design is based on a hybrid topology of a CDR (feedback) and clock phase aligner (feed-forward) utilizing multi-phase clocks.

## I. INTRODUCTION

Data centers or large clusters of servers are currently being aggressively deployed in a number of institutions to harness petaflops of computational power and petabytes of storage in a cost-efficient manner [1]. Consequently, there exists a world-wide research interest in designing such large data centers for optimally supporting various applications including scientific computing, financial analysis, data analysis and warehousing, and large-scale network services.

Data centers in general follow a tiered architecture in which network devices (switches or routers) are organized into two or three layers. The highest layer—core tier—is at the root of the tree, whereas the lowest layer—edge tier—is at the leaves of the tree. Between these layers, an aggregation tier may exist when the number of devices is large. The need for highly-specialized ASICs is undeniable [2], with clock and data recovery (CDR) being a critical function in backplane routing and chip-to-chip interconnects. The data received on the aggregation and edge node links is inherently bursty [3] with asynchronous phase steps / ' / 2 rad, that exist between the consecutive  $k^{th}$  and  $(k + 1)^{th}$  packet. This inevitably causes conventional CDR circuits to lose pattern synchronization leading to packet loss. Preamble bits can be inserted at the beginning of each packet to allow the CDR feedback loop enough time to settle down and thus acquire lock. However, the use of a preamble introduces overhead, reducing the effective throughput and increasing delay. Consequently, to deal with bursty data, these nodes require a burst-mode CDR (BM-CDR). The most important characteristic of the BM-CDR is its phase acquisition time which must be as short as possible. In this paper, we present a novel BM-CDR architecture based on a hybrid topology; that is, a combination of feedback and feed-forward.

## II. Novel BM-CDR Architecture

A block diagram of the proposed BM-CDR is shown in Fig. 1. The BM-CDR is composed of a phase-tracking CDR and a clock phase aligner (CPA). The CDR senses data  $D_{\rm in}$ , and generates a synchronized clock CK, with a voltage-controlled oscillator (VCO) in a phase-locked (feedback) loop (PLL). The phase and frequency of CK is compared to  $D_{\rm in}$  in



Fig. 1.at the frequency of interest.

Burst-mode functionality is obtained with the CPA which utilizes multi-phase clocks and a phase picking algorithm based on an "early-late" detection principle. This CPA is based on a feed-forward topology, and comprises of phase ( -) shifters, an Alexander PD, a -picker, and a Eflop (D-FF). The -shifters utilize the clock recover the CDR CK, to provide multiple clocks:  $CK_0$ , CK and  $CK_{+}$  =2, with low skew and different phases:



Fig. 3. CPA phase picking algorithm.

 $CK_o$  early-late information (X and Y) together with the two multi-phase clocks, CK = 2 and  $CK_{+} = 2$ , is provided to the -picker. The idea then behind the phase picking algorithm is depicted with the aid of eye diagrams in Fig. 3. When there is no phase difference between the consecutive packets, '=0 rad, either of the clocks,  $CK_{=2}$  and  $CK_{=2}$ , will correctly sample the data bits of the phase shifted  $(k + 1)^{th}$ packet [see Fig. 3(a)]. This is also true for an antiphase step rad—not shown as this is a modulo- process. For < ' < 0 rad, clock  $CK_{+}$  =2 will sample a phase step the bits on or close to the transitions of the data eye, whereas clock CK =2 will correctly sample the data [see Fig. 3(b)]. Similarly for a phase step 0 < ' < + rad, clock CK  $_{=2}$ will sample the bits on or close to the transitions, whereas clock  $CK_{+}$  =2 will correctly sample the data [see Fig. 3(c)]. That is, regardless of any phase step, there will be at least one clock, either  $CK_{=2}$  or  $CK_{+=2}$ , that will yield an accurate sample. The -picker then selects the most accurate clock  $CK_{out}$ , from these two possibilities for driving the D-FF to retime the data; that is, sample the noisy data, yielding an output  $D_{out}$  with less jitter. The foregoing concepts on the Alexander PD and the -picker are summarized in Table I, leading to the circuit topology in Fig. 4.

## III. HARDWARE IMPLEMENTATION

The BM-CDR is being implemented for operation at 10 Gb/s. The main building blocks include a CDR from Centellax (Part #TR1C1-A) and a CPA built by integrating individual chips from Hittite Microwave on a custom designed printed circuit board (PCB). More specifically, the PCB is populated with three 4-bit digital -shifters (Part #HMC543), an Alexander PD comprised of four D-FFs (Part #HMC673LC3C) and two XOR gates (Part #HMC671LC3C), and a -picker comprised of an AND gate (Part #HMC672LC3C) and a 2:1