A Priori System-Level Interconnect Prediction The Road to Future Computer Systems Dirk Stroobandt Ghent University Electronics and Information Systems Department Presentation at Northwestern University May 11th, 2000 Outline • • • • • Why do we need a priori interconnect prediction? Basic models Rent’s rule with extensions and applications A priori wirelength prediction New evolutions: • 3D and anisotropic systems • System-level predictions • Applications • Conclusions May 11th, 2000 Talk at NWU, Dirk Stroobandt 2 Outline • • • • • Why do we need a priori interconnect prediction? Basic models Rent’s rule with extensions and applications A priori wirelength prediction New evolutions: • 3D and anisotropic systems • System-level predictions • Applications • Conclusions May 11th, 2000 Talk at NWU, Dirk Stroobandt 3 Why do we need a priori interconnect prediction? • Importance of wires increases (they do not scale as components). • For future designs, very little is known. Roadmapping uses a priori estimation techniques. • To improve CAD tools for design layout generation. • CAD tools have to take into account: timing constraints, area constraints, performance, power dissipation… • All these constraints: wires should be as short as possible. • Estimation at early stage aids the CAD tools in finding a better solution through fewer design cycle iterations. May 11th, 2000 Talk at NWU, Dirk Stroobandt 4 Why do we need a priori interconnect prediction? To evaluate new computer architectures • To adhere to the increasing performance demands, new computer architectures are needed. • Each of them must be evaluated thoroughly. • A priori estimates immediately provide a ground for drawing preliminary conclusions. • Different architectures can be compared to each other. • Applications for evaluating three-dimensional (optoelectronic) architectures, FPGA’s, MCM’s,... May 11th, 2000 Talk at NWU, Dirk Stroobandt 5 Components of the physical design step circuit architecture Layout generation layout May 11th, 2000 Talk at NWU, Dirk Stroobandt 6 Circuit model Logic block Net Internal net External net Terminal / pin Multi-terminal nets have a net degree > 2 May 11th, 2000 Talk at NWU, Dirk Stroobandt 7 Model for partitioning 8 nets cut 4 nets cut Optimal partitioning: minimal number of nets cut May 11th, 2000 Talk at NWU, Dirk Stroobandt 8 Model for partitioning New net New terminal Module May 11th, 2000 Talk at NWU, Dirk Stroobandt 9 The three basic models Circuit model Model for the architecture Cell Pad Channel Manhattan grid using Manhattan metric d | x1 x2 | | y1 y2 | Placement and routing model May 11th, 2000 Talk at NWU, Dirk Stroobandt 10 The three basic models Optimal placement = placement with minimal total wire length over all possible placements. Optimal routing = routing through shortest path • requires channels with sufficiently high density • for multi-terminal nets: Steiner trees This defines the net length for known endpoints Placement and routing model May 11th, 2000 Talk at NWU, Dirk Stroobandt 11 Outline • • • • • Why do we need a priori interconnect prediction? Basic models Rent’s rule with extensions and applications A priori wirelength prediction New evolutions: • 3D and anisotropic systems • System-level predictions • Applications • Conclusions May 11th, 2000 Talk at NWU, Dirk Stroobandt 12 Rent’s rule Rent’s rule was first described by [Landman and Russo, 1971] For average number of terminals and blocks per module: p 100 T=tB T p = Rent exponent t = average # term./block 10 Measure for the complexity of the interconnection topology (simple) 0 p 1 (complex) average Rent’s rule Normal values: 0.5 p 0.75 1 1 May 11th, 2000 10 B 100 1000 Talk at NWU, Dirk Stroobandt 13 Rent’s rule If B cells are added, what is the increase T? In the absence of any other information we guess T T B B T B B T Overestimate: many of T terminals connect to T terminals and so do not contribute to the total. We introduce a factor p (p <1) which indicates how self connected the netlist is Statistically homogenous system T T p B B Or, if B & T are small compared to B and T dT dB p p T tB T B May 11th, 2000 Talk at NWU, Dirk Stroobandt 14 Rent’s rule T=tB 100 p Rent’s rule is experimentally validated for a lot of real circuits and for different partitioning methodologies. T 10 average Rent’s rule 1 1 10 100 B Deviation for high B and T: Rent’s region II (cfr. later). May 11th, 2000 1000 Distinguish between: • p* : intrinsic Rent exponent • p : Rent exponent for a given placement • p’ : Rent exponent for a given partitioning Talk at NWU, Dirk Stroobandt 15 Rent’s rule Rent’s rule is a result of the self-similarity within circuits Assumption: interconnection complexity is equal at all levels. May 11th, 2000 Talk at NWU, Dirk Stroobandt 16 Extension: the local Rent exponent Variations in Rent’s rule: • global variations (e.g., lower complexity after Technology mapping of the circuit, duplication); • local variations. Two kinds of local variations in Rent’s rule: • hierarchical locality: some hierarchical levels are more complex than others; • spatial locality: some circuit parts are more complex than others. Both are deviations from Rent’s rule that can be modelled well. May 11th, 2000 Talk at NWU, Dirk Stroobandt 17 Hierarchical locality: Rent’s region II 100 T 10 average Rent’s rule 1 1 10 100 B May 11th, 2000 1000 Causes of region II: - pin limitation problem; - parallel to serial (complexity is moved from space to time, number of pins is lowered); - coding (input and output stream compact). Talk at NWU, Dirk Stroobandt 18 Hierarchical locality: region III For some circuits: also deviation at low end. T Mismatch between the available (library) and the desired (design) complexity of interconnect topology. Only for circuits with logic blocks that have many inputs. May 11th, 2000 Talk at NWU, Dirk Stroobandt 19 Hierarchical locality: modelling Use incremental Rent exponent (proportional to the slope of Rent’s curve in a single point). T tB p(B) p2 log( T ) p( B) log( B) T p1 p3 B May 11th, 2000 Talk at NWU, Dirk Stroobandt 20 Spatial locality in Rent’s rule Inhomogeneous circuits: different parts have different interconnection complexity. For separate parts: Ti tBi pi p1 0.80 p2 0.35 Only one Rent exponent (heterogeneous) might not be realistic. Clustering: simple parts will be absorbed by complex parts. May 11th, 2000 Talk at NWU, Dirk Stroobandt 21 Local Rent exponent 1 1 2 1 T 1 2 1 1 2 2 2 B May 11th, 2000 2 Higher partitioning levels: Rent exponents will merge. Spreading of the values with steep slope (decreasing) for complex part and gentle slope (increasing) for simple part. Local Rent exponent tangent slope of the line that combines all partitions containing the local block(s). Talk at NWU, Dirk Stroobandt 22 Heterogeneous Rent’s rule Suggested by (Zarkesh-Ha, Davis, Loh, and Meindl,’98) Weighted arithmetic average of the logarithm of T: B1 log T1 B2 log T2 log Teq B1 B2 Heterogeneous Rent’s rule (for 2 parts): Teq teq B May 11th, 2000 peq 1 B B B B teq (t1 1 t 2 2 ) 1 2 p1 B1 p2 B2 peq B1 B2 Talk at NWU, Dirk Stroobandt 23 Use of Rent’s rule in CAD Rent’s rule is very powerful as a measure of interconnection complexity Can aid in the partitioning process Benchmark generators are based on Rent’s rule Is basis for a priori estimates in CAD May 11th, 2000 Talk at NWU, Dirk Stroobandt 24 Rent’s rule in partitioning Actual goal: minimize the number of pins per module. We should use a pin count criterion. External multi-terminal nets lead to only one new pin instead of two when cut. Preferring external nets to be cut will better keep clusters together. May 11th, 2000 Talk at NWU, Dirk Stroobandt 25 Rent’s rule in partitioning Solution: use a new ratio value (in ratiocut partitioning) based on terminal count: Tn Rp | A || A' | Better partitions are obtained because the total number of pins for each module is taken into account by the cost function. May 11th, 2000 Talk at NWU, Dirk Stroobandt 26 Rent’s rule in partitioning Better (ratio cut) heuristic by using terminal count prediction (Stroobandt, ISCAS‘99). • Clustering property of the ratio cut: use Rent’s rule instead of uniformly distributed random graph. • New ratio: Tn Rp p p p B1 B2 ( B1 B2 ) Instead of old ratio: May 11th, 2000 Rold Tn B1 B2 Talk at NWU, Dirk Stroobandt 27 Rent’s rule in partitioning Important (especially in pin-limited designs): terminal balancing (Stroobandt, Swiss CAD/CAM‘99). • Minimizing the terminal count alone is not enough. Additional cost function for terminal balancing: Terminal May 11th, 2000 TA TA ' Rb p p B A BA' Talk at NWU, Dirk Stroobandt 2 28 Rent’s rule in benchmark generation Generating benchmarks in a hierarchical way • Rent’s rule is used for estimating the number of connections • Other parameters have to be controlled as well: – Classical parameters: * total number of gates * total number of nets * total number of pins – Gate terminal distribution – Net degree distribution • Other issues: gate functionality, redundancy, timing constraints, ... May 11th, 2000 Talk at NWU, Dirk Stroobandt 29 Outline • • • • • Why do we need a priori interconnect prediction? Basic models Rent’s rule with extensions and applications A priori wirelength prediction New evolutions: • 3D and anisotropic systems • System-level predictions • Applications • Conclusions May 11th, 2000 Talk at NWU, Dirk Stroobandt 30 Donath’s hierarchical placement model 1. Partition the circuit into 4 modules of equal size such that Rent’s rule applies (minimal number of pins). 2. Partition the Manhattan grid in 4 subgrids of equal size in a symmetrical way. May 11th, 2000 Talk at NWU, Dirk Stroobandt 31 Donath’s hierarchical placement model 3. Each subcircuit (module) is mapped to a subgrid. mapping 4. Repeat recursively until all logic blocks are assigned to exactly one grid cell in the Manhattan grid. May 11th, 2000 Talk at NWU, Dirk Stroobandt 32 Donath’s length estimation model At each level: Rent’s rule gives number of connections • number of terminals per module directly from Rent’s rule (partitioning based Rent exponent p’); • every net not cut before (internal net): 2 new terminals; • every net previously cut (external net): 1 new terminal; • assumption: ratio f = (#internal nets)/(#nets cut) is constant over all levels k (Stroobandt and Kurdahi, GLSVLSI’98); • number of nets cut at level k (Nk) equals N k Tk where =1/(1+f); depends on the total number of nets in the circuit and is bounded by 0.5 and 1. May 11th, 2000 Talk at NWU, Dirk Stroobandt 33 Donath’s length estimation model Length of the connections at level k ? Adjacent (A-) combination Diagonal (D-) combination Donath assumes: all connection source and destination cells are uniformly distributed over the grid. May 11th, 2000 Talk at NWU, Dirk Stroobandt 34 Average interconnection length p 1 k ( p 1) Number of connections at level k: N k tG(1 4 )4 4 1 l Average length A-combination: k ,a 3 3 , Average length D-combination: lk ,d 2 , lk 149 92 Average length level k: Total average length: 2K ( 2r x ) 1 H ( K , r, x ) 2 r x 2 1 May 11th, 2000 L 14 H ( K , r,1) 2 H ( K , r,3) 9 H ( K , r,2) with and 2K = G = total number of gates Talk at NWU, Dirk Stroobandt 35 Results Donath Scaling of the average length L as a function of the number of logic blocks G : G p 0.5 ( p 0.5) L log( G ) ( p 0.5) f ( p ) ( p 0.5) 30 25 p = 0.7 20 L 15 p = 0.5 10 p = 0.3 5 0 1 10 100 103 104 105 106 107 G Similar to measurements on placed designs. May 11th, 2000 Talk at NWU, Dirk Stroobandt 36 Results Donath L 8 7 6 5 theory experiment 4 3 2 1 0 10 100 1000 10000 G Theoretical average wire length too high by a factor 2 May 11th, 2000 Talk at NWU, Dirk Stroobandt 37 Including optimal placement model • Keep wire length scaling by hierarchical placement. • Improve on uniform probability for all connections at one level (not a good model for an optimal placement). Enumeration: site density function (only architecture dependent). Occupying probability favours short interconnections (for an optimal placement) (darker) May 11th, 2000 Talk at NWU, Dirk Stroobandt 38 Including optimal placement model Wirelength distributions contain two parts: site density function and probability distribution all possibilities probability of occurrence requires enumeration shorter wires more probable (use generating polynomials) N (l ) K D(l ) q(l ) May 11th, 2000 Talk at NWU, Dirk Stroobandt 39 Wire length distribution Local distributions at each level have similar shapes (self-similarity) peak values scale. Integral of local distributions equals number of connections. Global distribution follows peaks. From this we can deduct that N (l ) l 2 p 3 For short lengths: D(l ) l N (l ) q(l ) l 2 p 4 D( L) May 11th, 2000 Talk at NWU, Dirk Stroobandt 40 Occupying probability: results Use probability on each hierarchical level (local distributions). 8 L Occupying prob. Donath experiment 7 6 5 4 3 2 1 0 10 100 1000 10000 G May 11th, 2000 Talk at NWU, Dirk Stroobandt 41 Occupying probability: results Effect of the occupying probability: boosting the local wire length distributions (per level) for short wire lengths percent of wires Donath global trend per level total Occupying prob. 100 global trend per level total 10 1 0,1 0,01 10-3 10-4 1 10 100 1000 10000 Wire length May 11th, 2000 1 10 100 1000 10000 Wire length Talk at NWU, Dirk Stroobandt 42 Occupying probability: results Effect of the occupying probability on the total distribution: more short wires = less long wires average wire length is shorter May 11th, 2000 percent wires 100 10 1 10-1 10-2 10-3 10-4 10-5 1 Donath Occupying prob. 10 100 1000 10000 Wire length Talk at NWU, Dirk Stroobandt 43 Occupying probability: results Percent wires 60 -8% 50 -23% 40 Donath Occupying prob. global trend 30 +10% +6% 20 10 1 May 11th, 2000 2 3 4 5 6 7 Wire length 8 9 10 Talk at NWU, Dirk Stroobandt 44 Occupying probability: results Number of wires 1000 Donath Occupying prob. measurement 100 10 1 0,1 May 11th, 2000 1 10 Wire length Talk at NWU, Dirk Stroobandt 100 45 Davis’ probability function Introduced by Davis, De, and Meindl (IEEE T El. Dev., ‘98). Number of interconnections at distance l is calculated for every gate separately, using Rent’s rule. Three regions: gate under investigation (A), target gates (C), and gates in between (B). Number of connections between A and C is calculated. May 11th, 2000 This approach alleviates the discrete effects at the boundaries of the hierarchical levels while maintaining the scaling behaviour. Talk at NWU, Dirk Stroobandt 46 Davis’ probability function A B C C = A B C C B C C B B B C B B A B B C B B B C C B C + A C = TAC B - C + TAB TBC A B - C - TB A B C - TABC C Assumption: net cannot connect A,B, and C C TAB t 1 BB TB tBB TBC t BB BC p p TABC t 1 BB BC p N AC TAC t 1 BB BB BC BBp 1 BB BC May 11th, 2000 p p Talk at NWU, Dirk Stroobandt p p 47 Davis’ probability function For cells placed in infinite 2D plane C C C B C C B B B C B B A B B C B B B C C B C BC 4l C l 1 BB 4l ' 2l (l 1) l '1 C N AC t 1 2l (l 1) 2l (l 1) 4l 2l (l 1) 1 2l (l 1) 4l p q(l ) May 11th, 2000 p p N A C l 2 p4 4l Talk at NWU, Dirk Stroobandt 48 p 28 Planar wirelength model A L Finite system, Btot=L2, no edges, approximate form for q(l) L Da (l ) 2lL2 q(l ) l May 11th, 2000 2 p 4 N (l ) Ntot Da (l )q(l ) p N tot t Btot Btot Talk at NWU, Dirk Stroobandt 49 29 Planar wirelength model B (Davis) L Finite system, Btot=L2, includes edge effects, use q(l) L l l 2 1 6 L L l 3 for 1 l L Db (l ) 2 L l 12 L l 2 L l 1 3 for L l 2 L 0 else N (l ) Ntot Db (l )q(l ) p N tot t Btot Btot May 11th, 2000 Talk at NWU, Dirk Stroobandt 50 30 10 Number of nets 10 10 4 3 2 Planar wirelength model comparison 10 1 0 10 0 10 1 10 Length 2 10 Btot = 1024 p = 0.66 Model A Model A: Lav = 4.53 Model B: Lav = 2.27 Model B May 11th, 2000 Talk at NWU, Dirk Stroobandt 51 2 x 10 Relationship between models from Davis (planar model B) and Stroobandt (hierarchical model C) 4 Number 1.5 1 0.5 0 0 10 20 40 30 Length 50 60 70 Db(l) Dc(l,h) H Db (l ) 4 H h Dc (l , h) h 1 same q’(l) essentially identical! May 11th, 2000 Talk at NWU, Dirk Stroobandt 52 Hierarchical wirelength model comparison 10 Number 10 10 10 4 3 2 1 0 10 0 10 1 10 Length 2 10 Ctot = 1024 p = 0.66 Model D Model C (Stroobandt): Lav = 2.05 Model D (Donath): Lav = 5.14 Model C: q(l) and hierarchy Model D: only hierarchy (q(l)=1) May 11th, 2000 Model C Talk at NWU, Dirk Stroobandt 53 Number of nets 10 10 10 Planar and hierarchical model comparison 4 3 10 2 10 Number 10 1 10 0 10 10 10 0 1 10 Length 2 10 4 3 2 1 0 10 0 10 1 10 Length 2 10 Model A Model B Model D Model C Models B (planar) and C (hierarchical ) are equivalent if the Rent exponent used for the probability function (depends on placement) and the one used for the number of nets per hierarchical level (based on partitioning) are the same May 11th, 2000 Talk at NWU, Dirk Stroobandt 54 Outline • • • • • Why do we need a priori interconnect prediction? Basic models Rent’s rule with extensions and applications A priori wirelength prediction New evolutions: • 3D and anisotropic systems • System-level predictions • Applications • Conclusions May 11th, 2000 Talk at NWU, Dirk Stroobandt 55 Extension to three-dimensional grids May 11th, 2000 Talk at NWU, Dirk Stroobandt 56 Three-dimensional grids: basic results Average length converges (for G) up to r = 2/3 1/2 Average wire length is lower than for 2D (no long wires) May 11th, 2000 Talk at NWU, Dirk Stroobandt 57 Anisotropic systems May 11th, 2000 Talk at NWU, Dirk Stroobandt 58 Anisotropic systems Basic method: Donath’s method in 3D Not all dimensions are equal (e.g., optical links in 3rd D) • possibly larger latency of the optical link (compared to intrachip connection); • influence of the spacing of the optical links across the area (detours may have to be made); • limitation of number of optical layers Introducing an optical cost May 11th, 2000 Talk at NWU, Dirk Stroobandt 59 Anisotropic systems If limited number of layers: use third dimension for topmost hierarchical levels (fewest interconnections). For lower levels: 2D method. 2D and 1D partitioning are sometimes used to get closer to the (optimal in isotropic grids) cubic form. Depending on the optical cost, it is advantageous either to strive for getting to the electrical plains as soon as possible (high optical cost, use at high levels only) or to partition the electrical planes first (low optical cost). May 11th, 2000 Talk at NWU, Dirk Stroobandt 60 External nets Importance of good wire length estimates for external nets during the placement process: For highly pin-limited designs: placement will be in a ring-shaped fashion (along the border of the chip). May 11th, 2000 Talk at NWU, Dirk Stroobandt 61 Wire lengths at system level At system level: many long wires (peak in distribution). How to model these? Davis and Meindl ‘98: estimation based on Rent’s rule with the floorplanning blocks as logic blocks. IMPORTANT! May 11th, 2000 Talk at NWU, Dirk Stroobandt 62 Improving CAD tools for design layout Digital design is complex Computer-aided design (CAD) More efficient layout generation requires good wire length estimates. • Layer assignment in routing • effects of vias, blockages • congestion, ... A priori estimates are rough but can already provide us with a lot of information. May 11th, 2000 Talk at NWU, Dirk Stroobandt 63 Evaluating new computer architectures Estimation for evaluating and comparing different architectures Circuit characterization We need parameters to classify circuits in classes and to optimize them. Benchmark generation based on Rent’s rule. May 11th, 2000 Talk at NWU, Dirk Stroobandt 64 Conclusion • Wire length estimates are becoming more and more important. • A priori estimates can provide a lot of information at virtually no cost. • Methods are based on Rent’s rule. • Important for future research: how can we build a priori estimates into CAD layout tools? More information at http://www.elis.rug.ac.be/~dstr/dstr.html May 11th, 2000 Talk at NWU, Dirk Stroobandt 65
© Copyright 2025 Paperzz