Clock Distribution
Based on material by Dennis Sylvester, Univ. of Michigan (www.eecs.umich.edu), [email protected]
Contents
- Introduction
- Clock Power
- Clock Distribution for Uniform Delay
- Clock Integrity
Function of clock distribution network
- Synchronize millions (billions) of separate elements
- Within a time scale on the order of ~10 ps
- At distances spanning 2-4 cm
- Ratio of synchronizing distance to element size on the order of 10^5
- Reference: light travels <1 cm in 10 ps (at c ~ 3 x 10^8 m/s, 10 ps covers only ~3 mm)
Metrics/Goals
Besides basic connectivity, what makes a clock network good or bad?
- Skew
- Jitter
- Power
- Area
- Slew rates
Clock Skew
- The most high-profile of the clock network metrics
- Defined as the maximum difference in arrival times of the clock signal at any two latches/FFs fed by the network
Skew = max |t1 - t2|
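As a quick illustration, skew is just the spread of arrival times. A minimal Python sketch (the arrival values are hypothetical):

```python
# Skew = maximum difference in clock arrival times across all latches/FFs.
# Hypothetical arrival times (ps) at four sequential elements.
arrivals_ps = [102.0, 98.5, 110.2, 104.7]

skew_ps = max(arrivals_ps) - min(arrivals_ps)
print(f"skew = {skew_ps:.1f} ps")  # 11.7 ps
```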
Clock Skew
Causes:
- Designed (unavoidable) variations: mismatch in buffer load sizes, interconnect lengths
- Process variation: process spread across the die yields different Leff, Tox, etc. values
- Temperature gradients: change MOSFET performance across the die
- IR voltage drop in the power supply: changes MOSFET performance across the die
Note: Delay from clock generator to fan-out points (clock latency) is not important by itself
BUT: increased latency leads to larger skew for the same amount of relative variation
Clock Skew
Effect:
- Eats into the timing budget
- Must be considered for both maximum-path (setup) and minimum-path (hold) timing
(Figure: skew eating into the cycle time. Ref: Simplex website)
Jitter
- Clock network delay uncertainty
- From one clock cycle to the next, the period is not exactly the same
- Jitter is the maximum difference in clock phase between any two periods
Notes: jitter J1 = t2 - t1; jitter J2 = t3 - t2
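A minimal sketch of measuring cycle-to-cycle jitter from edge timestamps, following the J1/J2 definition above (the timestamps are hypothetical):

```python
# Cycle-to-cycle jitter: variation of the period between consecutive edges.
# Hypothetical rising-edge timestamps (ns) for a nominal 1 ns period clock.
edges_ns = [0.000, 1.003, 1.998, 3.005, 4.001]

# Each period is the difference of consecutive edges (J_i = t_{i+1} - t_i).
periods = [b - a for a, b in zip(edges_ns, edges_ns[1:])]
jitter_ns = max(periods) - min(periods)
print("periods (ns):", [round(p, 3) for p in periods])
print(f"peak-to-peak jitter = {jitter_ns * 1e3:.0f} ps")  # 12 ps
```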
Jitter
Caused by variations in the clock period that result from:
- Phase-locked loop (PLL) oscillation frequency
- Various noise sources affecting clock generation and distribution (e.g., power supply noise, which dynamically alters the drive strength of intermediate buffer stages)
Jitter can be reduced by minimizing power supply noise (IR and L*di/dt)
Jitter Impact on Timing Budget
- Must be considered in maximum-path (setup) timing (see the sketch below)
- Typically on the order of 50 ps in high-end microprocessors
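A minimal setup-budget sketch showing how skew and jitter subtract from the cycle time available to logic; all numbers are hypothetical except the 50 ps jitter figure cited above:

```python
# Max-path (setup) check: T_cycle >= t_clk2q + t_logic + t_setup + skew + jitter
T_cycle = 1000.0  # ps (1 GHz cycle), hypothetical
t_clk2q = 80.0    # ps, clock-to-Q delay, hypothetical
t_setup = 60.0    # ps, latch setup time, hypothetical
skew    = 50.0    # ps, hypothetical skew budget
jitter  = 50.0    # ps, typical high-end value cited above

t_logic_max = T_cycle - t_clk2q - t_setup - skew - jitter
print(f"time left for logic: {t_logic_max:.0f} ps")  # 760 ps
```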
Clock Power
Power consumption in clocks due to:
- Clock drivers
- Long interconnections
- Large clock loads: all clocked elements (latches, FFs) are driven
Different components dominate depending on the type of clock network used
- Ex. grid: huge pre-driver and wire capacitance drown out the load capacitance
Clocks: Power-Hungry
P = C · Vdd^2 · f
Not only is the clock capacitance large, it switches every cycle!
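A back-of-the-envelope evaluation of this formula; all values are hypothetical, chosen for a large high-performance chip:

```python
# Dynamic clock power: P = C * Vdd^2 * f.
# The clock switches every cycle, so no activity factor discounts it.
C_clock = 4e-9  # F, total switched clock capacitance (4 nF), hypothetical
Vdd     = 1.5   # V, supply voltage, hypothetical
f       = 1e9   # Hz, clock frequency (1 GHz), hypothetical

P = C_clock * Vdd**2 * f
print(f"clock power = {P:.1f} W")  # 9.0 W
```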
Low Power Clocking Techniques
Gated clocks
- Prevent switching in areas of the chip that are not being used
- Easier in static designs
Edge-triggered flip-flops (as in ARM) rather than transparent latches (as in Alpha)
- Reduces the clock load of each flip-flop and eliminates spurious power-consuming transitions during latch flow-through
Clock Distribution Metric: Area
- Clock networks consume silicon area (clock drivers, PLL, etc.) and routing area
- Routing area is the most vital: top-level metals are used to reduce RC delays
- These levels are precious, unscaled resources shared by power routing, clock routing, and key global signals
- Minimizing the area used also reduces wiring capacitance and power
- Typical numbers: Intel Itanium uses 4% of M4/M5 for clock routing
Slew Rates
- To maintain signal integrity and latch performance, minimum slew rates are required
- Too slow: the clock is more susceptible to noise, latches slow down, and the timing budget shrinks
- Too fast: too much power is burned, the network is overdesigned, and ground bounce increases
- Rule of thumb: Trise and Tfall of the clock are each 10-20% of the clock period (10% is an aggressive target)
- Ex. 1 GHz clock: Trise = Tfall = 100-200 ps
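A sketch of the rule of thumb applied to an arbitrary clock frequency:

```python
# Rule of thumb: clock rise/fall times of 10-20% of the period.
f_clk = 1e9      # Hz; the 1 GHz clock from the example above
T = 1.0 / f_clk  # s; 1000 ps period

t_edge_aggressive = 0.10 * T  # 10% target
t_edge_relaxed    = 0.20 * T  # 20% target
print(f"target Trise/Tfall: {t_edge_aggressive * 1e12:.0f}-"
      f"{t_edge_relaxed * 1e12:.0f} ps")  # 100-200 ps
```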
Slew Rates
- Latch setup times depend on clock input slew rates (eats into the timing budget)
- Short-circuit power grows with slower edges; this can be significant for large clock drivers
Ref: IBM website, Carrig
Clock Distribution Example
- Alpha 21264 clock distribution: grid + H-tree approach
- Power = 32% of total
- Wire usage = 3% of metals 3 and 4
- 4 major clock quadrants, each with a large driver connected to local grid structures
Technology Trends: Timing
- Clock period is dropping fast, so skew must scale down accordingly
- Slew rates must also scale with cycle time
- Jitter: PLLs improve with CMOS scaling, but other noise sources increase
  - Power supply noise becomes more important
  - Switching-dependent temperature gradients
Technology Trends: New Interconnect Materials
- Copper reduces RC slew degradation and potential skew
- Low-k dielectrics decrease clock power and improve latency/skew/slew rates
Ref: IBM, JSSC, 11/
Technology Trends: Power
- Heavily pipelined designs: more latches, more capacitive load on the clock
- Larger chips: more wire length needed to cover the entire die
- Complexity: more functionality and devices means more clocked elements
- Dynamic logic: more clocked elements
Fundamental Design Decision
Power vs. Skew
- Meeting skew requirements is relatively easy, given an unlimited power budget!
- Ex. wide wires minimize the RC product but increase total C (see the sketch below)
- Ex. sizing up drivers limits latency (which translates to skew), but buffer capacitance jumps
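A sketch of the wide-wire tradeoff with hypothetical per-unit-length parameters: widening the wire cuts resistance (and hence the RC product) but raises total capacitance, and therefore power:

```python
# Widening a clock wire: R ~ 1/W, C ~ c_area*W + c_fringe.
# The RC product falls with W, but total C (and hence power) rises.
rho_sheet = 0.05  # ohm/sq, hypothetical sheet resistance
c_area    = 0.04  # fF/um^2, hypothetical parallel-plate capacitance
c_fringe  = 0.08  # fF/um, hypothetical fringe capacitance (both edges)

for W_um in (0.5, 1.0, 2.0, 4.0):
    r = rho_sheet / W_um          # ohm per um of length
    c = c_area * W_um + c_fringe  # fF per um of length
    print(f"W = {W_um:3.1f} um: RC = {r * c:.4f} ohm*fF/um^2, "
          f"C = {c:.3f} fF/um")
```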
SoC power requirements
- SoCs have more stringent power limitations due to packaging constraints
- Plastic packaging: power ~ 2-3 W
- This pushes the skew-power tradeoff toward higher skew, which is intolerable given the drive for high performance
- SoCs are good candidates for power-friendly skew-reducing tricks
Network Types: Grid
Gridded clock distribution was common on earlier DEC Alpha microprocessors
Advantages:
- Skew is determined by grid density and is not overly sensitive to load position
- Clock signals are available everywhere
- Tolerant of process variations
- Usually yields extremely low skew values
(Figure: global grid driven by pre-drivers)
Clock Distribution for Uniform Delay
Grid Disadvantages
- Huge amounts of wiring and power: wire capacitance is large, strong drivers are needed (so pre-driver capacitance is large), and routing area is large
- To minimize all these penalties, make the grid pitch coarser
- But then skew gets worse, losing the main advantage
- Don't overdesign: let the skew be as large as tolerable
- Still, grids seem infeasible for SoCs
Network Types: Tree
Original H-tree (Bakoglu)
- One large central driver
- Recursive H-style structure to match wire lengths
- Wire width halved at branching points to reduce reflections (see the sketch below)
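A sketch of the recursion, assuming a square die: in the original matched-impedance scheme, width halves at every branch, while segment length halves every other level as the H pattern alternates direction (die size and root width are hypothetical):

```python
# Bakoglu-style H-tree: each branch splits the signal in two, so wire
# width is halved to keep the driver seeing a matched load.
die_edge_um = 20000.0            # 2 cm die edge, hypothetical
width_um    = 10.0               # root wire width, hypothetical
length_um   = die_edge_um / 2.0  # first segment spans half the die

for level in range(4):
    print(f"level {level}: length {length_um:7.1f} um, "
          f"width {width_um:.2f} um")
    width_um /= 2.0        # halve width at each branch point
    if level % 2 == 1:     # length halves every other level (H pattern)
        length_um /= 2.0
```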
H-Tree Problems
Drawbacks of the original tree concept:
- Slew degradation along long RC paths
- Unrealistically large central driver; clock drivers can create large temperature gradients (e.g., ~30 °C on the Alpha 21064)
- Non-uniform load distribution
- Inherently non-scalable (wire resistance skyrockets)
Solution to some of these problems:
- Introduce intermediate buffers along the way, specifically at branching points
Buffered Clock Tree
Buffered H-tree
Advantages
- Ideally zero skew
- Can be low power (depending on skew requirements)
- Low area (silicon and wiring)
- CAD-tool friendly (regular)
Disadvantages
- Sensitive to process variations
- Local clocking loads are inherently non-uniform
Balancing a Tree
Some techniques:
- (a) Introduce dummy loads
- (b) Snake wires to match delays
- Con: routing area is often more valuable than silicon
Clock Skew and Clock balancing
Clock skew
- Hold-time violations are critical to working silicon
- Aggressive skew budget is needed for high-speed operation
- Clock tree synthesis at the P&R stage has a large turnaround time
- Skew sources: process + voltage + temperature + load + jitter
- Skew budget ~ (target cycle time)/20, min clk->Q (see the sketch below)
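A quick evaluation of the /20 rule for a hypothetical target frequency (note that the example later in this section budgets an even tighter 100 ps at 200 MHz):

```python
# Skew budget ~ (target cycle time) / 20.
f_target = 200e6              # Hz, hypothetical 200 MHz SoC target
T_cycle_ps = 1e12 / f_target  # 5000 ps cycle time

skew_budget_ps = T_cycle_ps / 20.0
print(f"skew budget = {skew_budget_ps:.0f} ps")  # 250 ps
```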
Solution
- CTS (clock tree synthesis)
- Insert dummy delays at synthesis
- Over-design
Clock tree style
1. H-Tree Model
2. Binary Tree Model
3. Fanout-Balanced Tree Model
4. Spine-and-Trunk Model (fishbone)
Tradeoffs noted across these styles: ease of construction vs. flexibility, weakness for non-uniform latch distribution, applicability to the actual placement, skew sensitivity to process scattering, die-size increase, and ease of adjusting net loading vs. the many dummy cells needed.
Practical problem in Clock tree synthesis
Problems
- Large chip size due to SoC integration
- Enormous number of FFs; memory
- Unbalanced FF distribution
- Top level: interconnect RC dominant
- Block level: turnaround time, iteration cost
- Test clock; multiple clock frequencies
Solution
- Plan from the early design stage
- Skew budgeting: 100 ps @ 200 MHz
Block level clock tree
Block-level clock skew
- Driver-limited: optimize buffer strength and count
Clock tree synthesis
- Commercial tool at the P&R stage
- Many iterations, long turnaround time
Clock tree planning
- Virtual clock tree generation
- Needs engineering approximation
Real Clock Tree
(Figure: real clock tree, trunk-and-branch style; nets clk.3.1, clk.4.1, clk.5.1)
Virtual Clock Tree Model
- Assumption: uniform distribution of clock buffers and flip-flops
- Model: hierarchical trunk-and-branch
(Figure: virtual clock tree model; L3 and L4 clock buffers driven from the L2 buffer, with fanouts N = 6, 7, 8)
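A sketch of the kind of estimate such a model enables: assuming uniformly distributed flip-flops and a fixed fanout per buffer, the number of buffer levels follows from a logarithm (all numbers hypothetical):

```python
import math

# Levels in a balanced clock tree: levels = ceil(log_fanout(#FFs)).
num_ffs = 200_000  # hypothetical flip-flop count for a large SoC
fanout  = 7        # each buffer drives ~6-8 loads, per the model above

levels = math.ceil(math.log(num_ffs) / math.log(fanout))
buffers = sum(fanout**k for k in range(levels))  # buffers above the leaves
print(f"levels = {levels}, total buffers ~ {buffers}")  # 7 levels, ~137k
```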
Top Level Clock Distribution
(Figure: the PLL drives L2 buffers that distribute the clock to the NW, NE, SW, and SE quadrants of the system)
Real Example
Clock Integrity
Shield everywhere
- Laterally and above/below
- Provides current return paths and eliminates coupled noise effects (both C and L)
(Figure: shield arrangement: GND / Vdd / CLK / Vdd / GND)
Clock Integrity
di/dt for clock drivers can be enormous
All clocks should be switching at the same instant
- Potential for L·di/dt noise on the power supply
- Explicit decoupling capacitance is the usual solution (see the sketch below)
- Thin gate oxide in silicon white space is used to create large (100+ nF) capacitance to supply charge
- The Alpha 21264 required additional decoupling capacitance at the package level to limit switching noise (!)
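A rough decap sizing sketch: the charge drawn during a clock edge must come from the decoupling capacitance without letting the supply droop more than an allowed amount (all numbers hypothetical):

```python
# Decap sizing: C >= I * dt / dV (charge delivered / allowed droop).
I_peak = 20.0     # A, hypothetical peak clock-driver current
dt     = 100e-12  # s, hypothetical edge duration
dV_max = 0.05     # V, allowed supply droop, hypothetical

C_min = I_peak * dt / dV_max
print(f"required decap >= {C_min * 1e9:.0f} nF")  # 40 nF
```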
Clock Shielding
How much does shielding help? Or reference planes? Is it worth the area penalty?
Impact of Reference Planes in Power Distribution
(Simulation conditions not given; aluminum wiring)
Clock Grid Simulations / Reference Planes
- Copper wiring allows smaller wires, finer grid pitch, and lower power
- Eliminates the need for reference planes
Reduce Self Inductance
Dedicated Ground Planes
(Figure: cross-sections comparing guard traces with a dedicated ground plane; labels G, W, Wg)
Guard Traces vs. Reference Plane
Below 5 GHz, guard traces appear better
LF Current Distribution
Current spreads through outer return paths: bigger current loops
HF Current Distribution
Current concentrates underneath the signal line: smaller current loops and smaller inductance
Network of choice in high performance
Globally: a tree. Why?
- Power requirements are reduced compared to a global grid
- Smaller routing requirements free up global tracks
- Trees are easily balanced at the global level
- Keeps global skew low (given minimal process variation)
Network of Choice
Locally: a grid. Why?
- A smaller grid distribution area allows a coarser grid pitch: lower power in interconnect and pre-drivers, reduced routing area
- Local skew is kept very small
- Easy access to the clock by simply connecting to the grid
Scaling of Distribution Networks
Buffered H-trees
- Regular, low power, acceptable skew
- Scalable, although the number of sub-blocks will rise with shrinking timing budgets
Grid
- As chips get larger, so do grids
- Power and routing area penalties increase