# Probabilistic Theory for Semi-Blind Oversampling Burst-Mode Clock and Data Recovery Circuits

Bhavin J. Shastri, and David V. Plant

Photonic Systems Group, Dept. of Electrical and Computer Eng., McGill University, Montreal, QC H3A 2A7, Canada shastri@ieee.org

*Abstract*—We develop a unified probabilistic theory for phaselocked clock and data recovery circuits (CDRs), CDRs based on *N* -oversampling techniques in either the time- or space domain, and burst-mode CDRs built from oversampling CDRs. This theory quantitatively explains the performance of these circuits in terms of the bit error rate.

## I. INTRODUCTION

Passive optical networks (PONs) are recognized as an economic solution to alleviate the bandwidth bottleneck in access networks by deploying fiber-to-the-home [1]. The challenge in the design of a chip set for PONs arises from the upstream data path as the network is point-to-multipoint. Using time division multiple access, multiple optical network units transmit data to the optical line terminal (OLT) in the central office. Due to optical path differences, packets can vary in amplitude and phase—bursty data. Consequently, this necessitates burstmode receivers (BMRx) at the OLT. The BMRx front-end is responsible for amplitude recovery, whereas clock and data recovery (CDR) together with fast phase acquisition is performed by a burst-mode CDR (BM-CDR) circuit.

Random noise which is always present at the BMRx frontend affects the determination of the decision threshold and introduces sensitivity penalty. A sensitivity penalty using Gaussian noise statistics for BMRx was first addressed in [2]. A more accurate model is provided in [3], while a unified theory which includes the interaction of Gaussian noise with the finite charging/discharging time of the adaptive threshold detection circuitry is derived in [4]. In [5], the influence of random direct current offsets on the sensitivity of BMRx is analyzed. While there has been an appreciable amount of research on the theory of BMRx front-end circuits in literature, virtually no attention has been paid to the mathematical modeling of BM-CDRs. In this paper, we develop a probabilistic theory for BM-CDRs.

# II. BURST-MODE CLOCK AND DATA RECOVERY

*Problem:* A CDR circuit that is in phase-lock samples (with recovered lock) the incoming data in the center of the data eye. Fig. 1 depicts the nature of bursty traffic in a PON upstream with asynchronous phase steps  $j'$  ' $j'$  2 rad, between the consecutive  $k^{\text{th}}$  and  $(k + 1)^{\text{th}}$  packet. This phase step will result in the sampling clock  $t_{inst}$  = 0 rad; and (b) '  $\neq$  0 rad.

to allow the CDR feedback loop enough time to settle down and thus acquire lock; that is, align the instantaneous clock  $t_{\text{inst}}$ , to the lock state  $t_{\text{lock}}$ , so as to sample in the middle of the data bit. However, the use of a preamble introduces overhead, reducing the effective throughput and increasing delay.

*Solution:* The most important characteristic of a BM-CDR is its phase acquisition time which must be as short as possible. We define the lock acquisition time as the number of preamble hits I needed to achieve error-free operation. Fig. 2 shows a



Fig. 2. BM-CDR architecture based on an oversampling CDR and CPA.

of the data eye. This technique requires low skew between multiple phases of the clock. In either case, the phase picking algorithm guarantees at least one data sample or clock edge that will yield uncorrupted data regardless of any phase step  $j'$   $'j$  2 rad between the consecutive packets. The phase picker then uses a feedback mechanism to select the correct sample from the  $N$  possibilities. It has been experimentally demonstrated in [1], [6], [7] that this BM-CDR architecture achieves instantaneous phase acquisition  $(l = 0$  preamble bits) with error-free operation for any phase step  $j'$  ' $j$  2 rad. In this paper, we develop a unified probabilistic theory for BM-CDRs built from N oversampling CDRs in either the time- or space domain.

### III. THEORETICAL MODELING

The probabilistic theory developed here is for data transmitted in the non-return-to-zero format, and it is independent of the bit rate and pulse shape, as long as the intersymbol interference (ISI) at the sampling point is negligible. This will remain valid at high bit rates, as long as the channel remains limited by Gaussian noise [2].

Jitter can be interpreted as the perturbations of the thresholdcrossing time of data transitions from their ideal position in time. A part of the jitter of the data is inherited as phase uncertainty of the recovered sampling clock in the clock recovery circuit. As a result, the regenerated (retimed) data sequence by the CDR may be erroneous, degrading the bit error rate (BER) performance. Jitter is in general classified as being either random or deterministic. Random jitter (RJ) is unpredictable, unbounded, and results from physical noise sources based on random processes. RJ is attributed to thermal noise, shot noise, and flicker noise. The generation of RJ is approximated to a Gaussian probability distribution. This follows from the central limit theorem which states that the composite effect of many uncorrelated noise sources, regardless of the distributions, approaches a Gaussian distribution. The Gaussian approximation is sufficiently accurate for design purposes and is far easier to evaluate than the more exact probability distribution within the receiver [8]. RJ is characterized by the root-mean-square (RMS) value of the Gaussian probability distribution. Deterministic jitter (DJ) is predictable, bounded, and is attributed to duty cycle distortions. DJ is classified as ISI and data-dependent jitter, pulse-width-distortion jitter, sinusoidal jitter, and uncorrelated bounded jitter. The effect of DJ is to shrink the data eye by a finite amount and will only further deteriorate a device under test's performance. Thus, in order to simplify the mathematical modeling, DJ is ignored.

In deriving the theoretical probabilistic model, we make use of continuous random variables  $\alpha$ , that follow a Gaussian distribution denoted as  $\alpha$  $N(1^2)$ , where is the mean, > 0 is the standard deviation, and the probability density function (PDF) of  $\alpha_i$  is given by  $f(x) = 1 = 2$ exp h  $(x)$  $2^{2}$  = 2  $^{2}$ , x 2 R, with the following characteristics:  $f(x) > 0$ , for all x and  $\int_{1}^{R_+} f(x) dx = 1$ . In the context of clock and data recovery, we define the following continuous random variables with a Gaussian distribution:

 $e$   $N(0, 2)$ , with PDF  $f( )$ , is the jitter on the edges of the data bits with a zero mean, where  $t_s$  corresponds to the RMS jitter on the sampling clock signal;

 $t_s^2$   $N(t_s^2, t_s^2)$ , with PDF  $f(t_s)$ , is the *actual* clock sampling point in the presence of random jitter; and  $f_0$   $N(t_o^{\text{ideal}} / t_o)$ , with PDF  $f(t_o)$ , is the clock sampling point *determined* by the CDR, where  $t_o^{\text{ideal}}$  is the  $\frac{d}{d}$  ideal  $\frac{2}{t_o}$ , with PDF  $f(t_o)$ , is the clock sam*ideal* clock sampling point in the middle of the data bit, and  $\frac{2}{t_o}$  =  $\frac{2}{t_s}$  $\frac{2}{t_s}$ , with being a constant of proportionality.

For convenience, the left and right edges of the data eye are located at  $T_b=2$  and  $+T_b=2$ , respectively [see Fig. 1(a)]. Thus, the expectation of the clock sampling point is given by

$$
E \tfrac{e}{b} \tfrac{1}{t} \tfrac{1}{t_0} \tfrac{f(t_0)}{f(t_0)} \tfrac{dt_0}{dt_0} = \tfrac{t_0}{t_0} = 0; \t\t(1)
$$

as the ideal clock sampling point is in the center of the data bit. Let  $\frac{e}{f}$  left and  $\frac{e}{f}$  right be the jitter on the left edge and right edge of the  $j^{\text{th}}$  bit of an *I*-bit preamble. We assume that ej  $T_{\text{end}}$  and  $\frac{e^{\text{right}}}{T}$  are independent with common RMS jitter  $_{t_{\rm s}}$ . Then the mid-point of the  $j^{\text{th}}$  bit  $\Theta$ , is expressed as

$$
\Theta = \frac{\Theta^{\text{left}} + \Theta^{\text{right}}}{2}.
$$
 (2)

After the *l*-bit preamble, the clock sampling point determined by the CDR  $f<sub>o</sub>$ , at the first bit where the decision circuit will start sampling the data bits, is given by the average of the individual mid-points  $\Theta$ , in (2) as

$$
\mathbf{f}_{0} = \frac{1}{(l+1)} \sum_{j=1}^{\mathcal{H}^{1}} \Theta \tag{3}
$$

Thus,  $t_o$  can be related to sampling clock RMS jitter  $t_{s}$ , as

$$
\frac{2}{t_o}, \quad E \quad \mathcal{E}_o \quad \underset{=0}{\underset{=0}{\bigcap}} \left( \frac{\mathcal{E}_o}{\mathcal{E}_o} \right)^2 = \frac{1}{\underset{=0}{\underbrace{2(I_1+1)}}} \quad \frac{2}{t_s}. \tag{4}
$$

Hence, the PDFs  $f(t_s)$  (actual sampling point) and  $f(t_o)$ (sampling point determined by CDR), can be expressed as:

$$
f(t_s) = \frac{1}{2 \pi r_s} \exp \frac{t_s}{2 \frac{2}{t_s}} \left( 5 \right)
$$
 (5)

$$
f(t_0) = \frac{1}{t_s} \frac{(l+1)}{(l+1)} \exp \frac{(l+1) \frac{t_0^2}{t_s}}{t_s} \qquad (6)
$$



Fig. 3. Probability of the clock sampling point determined by the CDR  $\epsilon_0$ , to be within the data bit after an *I*-bit preamble.

The probability that the sampling point determined by CDR  $t_{0}$ , will be within the data bit after *l* preamble bits is given by

$$
\begin{array}{rcl}\n\Pr & \mathbf{f}_0 < \frac{T_b}{2} & = & \frac{1}{T_b = 2} \mathbf{f}(t_o) \, dt_o \\
& = & 1 \quad 2Q & \frac{1}{t_s \text{[UI]}} \quad \frac{(I+1)}{2} \quad (7)\n\end{array}
$$

where  $Q(x)$ , p  $\overline{2}$   $\frac{N}{x}$  $2=2$  d is the normalized Gaussian tail probability. Note that (7) has been made independent of the data rate; thus, the RMS jitter  $t_s$ , is expressed in terms of the unit interval (UI). In Fig. 3 we plot (7) as a function of  $t_s$  for different *l*. The probability Pr  $\epsilon_0$  <  $T_b$  = 2, decreases with increasing jitter but can be compensated by increasing the preamble length. Also, for  $t_s$  0.25 UI, Pr  $t_b < T_b = 2$  1 with no preamble bits.

When there is no phase difference  $' = 0$  rad, between two consecutive packets in a PON uplink [see Fig. 1(a)], the CDR's sampling error probability is equivalent to the probability that the clock transition occurs either before the leading data transition or after the trailing data transition, Pr  $f_s > T_b=2$ , given that the sampling point determined by the CDR  $f_{\theta}$ , is within the data eye. Assuming uncorrelated data with equiprobable ONEs and ZEROs, the sampling error probability  $P_{s}$ , of the CDR can be expressed as

$$
P_s = \frac{1}{2} \text{ Pr } f_b < \frac{T_b}{2} \text{ Pr } f_s \frac{T_b}{2} \text{ and } (8)
$$

$$
\Pr \quad \mathbf{f}_{S} \quad \frac{T_{b}}{2} = \int_{1}^{Z} \frac{T_{b}=2}{f(t_{S})} \, dt_{S} + \int_{1}^{Z} \frac{1}{f(t_{S})} \, dt_{S}. (9)
$$

Ideally, the sampling clock must bear a well-defined phase relationship with respect to the received data so that the decision circuit samples each bit at the mid-point of the data eye. Thus, it is desirable that the CDR sampling point be as close as possible to the ideal sampling point,  $f<sub>b</sub>$  $t_o^{\text{ideal}}$  = 0. Also, since the PDF  $f(t_s)$ , is even-symmetric, then Pr  $f_s < T_b=2$  = Pr  $f_s > T_b=2$ , and the sampling error probability is given as

$$
P_s = Q \frac{T_b}{2 t_s} \qquad (10)
$$

With a finite phase difference  $\neq$  6 0 rad, between the consecutive packets [see Fig. 1(b)], the phase step has the effect of displacing the instantaneous CDR sampling clock  $t_{inst}$ , by  $j'$  '  $j'$  ( $T_b=2$ ). By inserting preamble bits, the CDR feedback loop will have time to settle down. Specifically, after an l-bit preamble, the sampling point determined by the CDR  $t_{0}$ , will be displaced by  $t_{i}$ ,  $i = j$ ,  $j = 1$  (i)  $(T_b=2)$ , where  $(1)$ , is the CDR feedback loop function analytically derived for a second-order PLL to be [9]

$$
I(t) = 1 \quad \exp\left(-\frac{I}{I} - \frac{I}{I}t\right) \quad \cosh\left(-\frac{I}{I} - \frac{I}{I}t\right) \quad \text{cosh}\left(-\frac{I}{I} - \frac{I}{I}t\right) \quad \text
$$

where is the "damping ratio" and  $\ell_n$  in [rad/s] is the "natural frequency", both dependent on CDR circuit parameters. Note that the expression for  $t_{j}\rightarrow j$  is only valid for phase steps j 'j rad, and does not account for  $\langle j \rangle$  'j 2 rad. Thus, a correcting factor , must be introduced to account for the symmetrical performance about the edges of the data bit such that  $($  ;  $'$   $)$   $2$   $0$ ;  $[$   $+$   $]$   $\neq$   $2$   $[$   $]$   $\neq$   $2$   $]$   $\Rightarrow$ hence,

$$
t_j
$$
  $t_j = \begin{vmatrix} h & h \\ h & h \end{vmatrix}$   $t_j$   $t_j$ 

It follows from (12), that the PDF  $f(t_s)$  in (5), can therefore be modified to account for this phase step as

$$
f(t_s) = \frac{1}{2 \cdot 2 \cdot 5} \exp \frac{t_s - t_0 - t_j}{2 \cdot 2 \cdot 2 \cdot 5} \qquad (13)
$$

Subsequently, the probability that the clock transition occurs either before the leading data transition or after the trailing data transition can then be expressed as  $\overline{1}$ 

Pr 
$$
f_s
$$
  $\frac{T_b}{2} = \frac{1}{2} \begin{bmatrix} \frac{f_x}{f} & \frac{f_{j-1}}{f_s} \\ 0 & \frac{f_x}{f_s} \end{bmatrix} + Q \frac{f_x + t_{j-1}}{t_s}$  (14)

where  $f_x = T_b = 2$   $f_b$ . Before we proceed, we make two assumptions: (1) the sampling point determined by the CDR is ideally located at the center of the data eye  $(\ell_0 = 0)$  *before* a phase step  $j'$  '; and (2) the RMS jitter on the clock signal  $t_s$  0.25 UI, implying the probability that the CDR clock sampling point is within the data eye *after* the phase step is Pr  $\epsilon_0$  <  $T_b = 2$  1, for any number of preamble bits *l*. Consequently, for a given phase step  $j'$   $'j$  2 rad, the sampling error probability  $P_{s_i}$  in (8) can be expressed as

$$
P_{s} j' j = \frac{1}{2} \begin{bmatrix} 0 & j' j & 1 & (j) \\ 0 & \frac{j' j}{2} & t_{s}[U] \end{bmatrix} + Q \frac{+ j' j}{2} \frac{1}{t_{s}[U]} \frac{1}{2} \begin{bmatrix} 0 \\ 1 \end{bmatrix}
$$
 (15)

For a CDR that is based on an  $N$  -oversampling architecture in either time or space, the absolute value of the maximum phase difference between the ideal sampling point and the sampling point determined by the CDR, is max  $jt_0^{\text{ideal}}$   $\epsilon_b$  = Sampling point determined by the CDR, is max  $f_0$   $f_0$   $f_0$  =  $T_b$ =2N =N [rad]. For  $t_0^{\text{ideal}} = 0$ , the N-clock sampling points determined by the CDR  $t_{o/N}^{\eta}$ , are located at:

$$
f_{\partial}^{1} 2 \frac{n_{0}}{t_{0}^{n}} = \frac{0}{N}(2n + 1) \quad N) \quad ; n = 0; 1; \dots; N \quad 1: (16)
$$

For each of the  $N$  data samples, the sampling error probabilities  $P_s^n j_N$ , can be calculated by convolving  $P_s$   $j'$  '  $j'$  in (15), with the N-sampling points  $t_{o}^{\prime\prime}$  in (16), as

$$
P_s^n j_N = P_s j' j' j' j' j' j' j_0 j_0 j \quad (17)
$$

*j 'j* 
$$
t_{o}^{n}j_{N}
$$
   
 1 if  $j'j = t_{o}^{n}j_{N}$  (18)  
0 if  $j'j \notin t_{o}^{n}j_{N}$ :

is the Dirac-delta function. It follows from the sifting property

$$
P_{s}^{n}j_{N} = \int_{1}^{2} P_{s} j'j
$$
  
=  $P_{s} j'j' f_{o}^{n}j_{N}$  : (19)

For a BM-CDR based on the N -oversampling CDR and a CPA which selects the correct set of samples with the aid of a phase picking algorithm, the sampling error probability  $P_S^{\text{BM}}$  CDR, is expressed as

$$
P_S^{\text{BM~CDR}} = \min \bigg| P_S \bigg/ \bigg|' \bigg/ \bigg| t_{0/N}^n \bigg| \tag{20}
$$

We define the BER, denoted as  $P_{e}$ , of the CDR, N oversampling CDR, and BM-CDR, from the sampling error probabilities in (15), (19), and (20), as follows:

**BER**

\n
$$
P_e, \quad P_s / \quad 'j
$$
\nFor CDR

\n
$$
\Rightarrow P_{e_1} \quad P_{s} / \quad 'j
$$
\nfor N - CDR

\n
$$
\Rightarrow \quad \text{min } P_{s} / \quad 'j
$$
\nfor N - CDR

\n
$$
\Rightarrow \quad \text{min } P_{s} / \quad 'j
$$
\nfor BM-CDR

Fig. 4(a) shows the BER performance of the CDR and BM-CDR as a function of phase step for a zero preamble length  $(1 = 0)$ . As expected the worst-case phase steps for the CDR are rad because these represent the half-bit periods, and therefore the CDR is sampling exactly at the edge of the data eye, resulting in a BER 0:5. At phase shifts near 0 or 2 rad, we can easily achieve error-free operation, BER  $\epsilon$  10<sup>-10</sup>, because the CDR is almost sampling at the middle of each data bit. For the BM-CDR we achieve error-free operation, for any phase step  $j'$   $'j$  2 rad. Similar results have been obtained experimentally in [1], [6], [7], clearly validating our probabilistic theoretical model. In Fig. 4(b) we plot the BER performance of the CDR and BM-CDR as a function of the RMS jitter for different phase steps and zero preamble bits. As anticipated, for a given BER and phase step, the allowable RMS jitter on the sampling clock is higher with the BM-CDR than the CDR in each case. More importantly, it can be perceived that the BM-CDR achieves far superior BERs for any given phase step and RMS jitter.



Fig. 4. BER performance of the CDR and BM-CDR (for zero preamble length) versus: (a) phase step; and (b) sampling clock RMS jitter.

### IV. CONCLUSION

We have developed a unified probabilistic theory for conventional CDRs, N -oversampling CDRs (in time or space), and BM-CDRs built from oversampling CDRs. The theoretical model quantitatively explains the performance of these circuits in terms of the BER by taking into account the phase steps between successive packets, preamble length, and jitter on the sampling clock. This model will help refine theoretical models of PONs and provide input for establishing realistic power budgets.

#### **REFERENCES**

- [1] B. J. Shastri, et. al., *Analog Integr. Circuit and Signal Process.*, vol. 60, no. 1-2, 2009.
- [2] C. A. Eldering, *J. Lightw. Technol.*, vol. 11, no. 12, 1993.
- [3] P. Menendez-Valdes, *J. Lightw. Technol.*, vol. 13, no. 11, 1995.
- 
- [4] C. Su, et. al., *J. Lightw. Technol.*, vol. 15, no. 4, 1997. [5] P. Ossieur, et. al., *IEEE J. Lightw. Technol.*, vol. 24, no. 3, 2006.
- [6] B. J. Shastri, et. al., *J. Opt. Commun. and Netw.*, vol. 2, no. 1, 2010.
- [7] B. J. Shastri, et. al., *IEEE J. Sel. Topic Quantum Electron.*, vol. 16, no. 5, 2010, to appear.
- [8] P. P. Webb, et. al., *RCA Review*, vol. 35, no. 2, 1974.
- [9] F. M. Gardner, *Phaselock Techniques*, 2nd ed. New York: Wiley, 1979.
- [10] M. van Ierssel, et. al., *IEEE J. Solid-State Circuits*, vol. 42, no. 10, 2007.