Enabling Technologies for Reconfigurable Computing July 8, 2002, ENST, Paris, France Reiner Hartenstein University of Kaiserslautern Enabling Technologies for Reconfigurable Computing and Software / Configware Co-Design Part 4: Recent developments -. Schedule Xputer Lab University of Kaiserslautern time slot 10.00 – 11.00 Reconfigurable Computing (RC) 11.00 – 11.30 coffee break 11.30 – 12.30 Data-Stream-based Computing 12.30 – 14.00 lunch break 14.00 – 15.00 15.00 – 15.30 Resources for RC and Data-Stream-based Computing Recent developments 15.30 – 16.00 Discussion © 2002, [email protected] 2 http://kressarray.de >> Configware Market Xputer Lab University of Kaiserslautern • Configware Market (also see appendix) • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 3 http://kressarray.de bleeding edge designs Xputer Lab University of Kaiserslautern • Infinite amount of gates not yet available on a chip • 3 mio gates (10 mio in 2003 ?) far away from "infinite" • Bleeding edge designs only with sophisticated EDA tools • Excessive optimization needed • Hardware epertise is inevitable for the designer. • improve and simplify the design flow the user • provide rich configware libraries of soft IP cores, • control appl., networking, wireless telecommunication, data communication, embedded and consumer markets. © 2002, [email protected] 4 http://kressarray.de Xputer Lab EDA as the Key Enabler (major EDA vendors) University of Kaiserslautern • Select EDA quality / productivity, not FPGA architectures • EDA often has massive software quality problems • Customer: highest priority EDA center of excellence – – – – – collecting EDA expertise and EDA user experience to assemble best possible tool environments for optimum support design teams to cope with interoperability problems to keep track with the EDA scene as a rapidly moving target • being fabless, FPGA vendors spend most qualified manpower in development of EDA, IP cores, applications , support • Xilinx and Altera are morphing into EDA companies. 5 http://kressarray.de © 2002, [email protected] >> FPGA Market Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 6 http://kressarray.de Xputer Lab Top 4 PLD Manufacturers 2000 University of Kaiserslautern Lattice 15% Altera 37% © 2002, [email protected] Actel 6% Xilinx 42% $3.7 Bio Top 4 PLD Manufacturers 2000 7 http://kressarray.de FPGA market 1998 / 1999 Xputer Lab University of Kaiserslautern Source: IC Insights Inc. 1999 rank 1 2 3 4 5 6 7 8 Xilinx Altera Lattice Actel Lucent Cypress Quicklogic Atmel © 2002, [email protected] 8 Meanwhile, Xilinx acquired Philips' MOS PLD business, Lattice purchased Vantis. . global sales 1998 629 654 206 154 100 41 30 32 (mio $) 1999 899 837 410 172 120 43 40 38 http://kressarray.de .... going into every type of application Xputer Lab University of Kaiserslautern • [Dataquest] PLD market > $7 billion by 2003. [Gordon Bell] • „fastest growing segment of semiconductor market.“ • IP reuse: "pre-fabricated" components for the efficiency of design and use for PLDs • FPGAs are going into every type of application. © 2002, [email protected] 9 http://kressarray.de >> Embedded Systems (Co-Design) Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 10 http://kressarray.de Triscend CSoC Xputer Lab University of Kaiserslautern [Kean] Configurable system logic ARM Digital Filter Display Interface Viterbi A/D Interface CSI Socket Configurable System Interconnect (CSI) Bus Memory © 2002, [email protected] Other System Resources 11 http://kressarray.de Goal: away from complex design flow Xputer Lab University of Kaiserslautern [à la S. Guccione] Schematics/ HDL Netlister Netlist Place and Route Bitstream HLL Compiler © 2002, [email protected] 12 http://kressarray.de Xputer Lab Overcome traditional separate design flow University of Kaiserslautern [à la S. Guccione] HLL Schematics/ HDL Netlister Netlist Compiler Place and Route . . Bitstream User Code Compiler Executable © 2002, [email protected] 13 http://kressarray.de Overcome traditional co-processing design Xputer Lab separate flow -> JBits Design Flow University of Kaiserslautern [à la S. Guccione] Schematics/ HDL JBits API Netlister Netlist Place and Route User Java Code Java Compiler Executable . . Bitstream User Code Compiler Executable © 2002, [email protected] 14 http://kressarray.de >> hardwired IP cores on board Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 15 http://kressarray.de Xputer Lab University of Kaiserslautern HLL Embedded hardw. CPU & memory cores on chip. Compiler FPGA core HLL Compiler CPU Memory core core [à la S. Guccione] © 2002, [email protected] 16 http://kressarray.de Xputer Lab new directions in application development University of Kaiserslautern • new directions in application development. • aut. partitioning compilers: designer productivity • like CoDe-X (Jürgen Becker, Univ. of Karlsruhe), • supports Run-Time Reconfiguration (RTR), a key enabler of error handling and fault correction by partial re-routing the FPGA at run time, as well as remote patching for upgrading, remote debugging, and remote repair by reconfiguration - even over the internet. © 2002, [email protected] 17 http://kressarray.de Xputer Lab >> Run-Time Reconfiguration (RTR) University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 18 http://kressarray.de Xputer Lab CPU use for configuration management University of Kaiserslautern • on-board microprocessor CPU is available anyhow - even along with a little RTOS • use this CPU for configuration management RTR System Design HLL © 2002, [email protected] 19 Compiler http://kressarray.de hard CPU & memory core on same chip Xputer Lab University of Kaiserslautern HLL Compiler FPGA core RTR System Design HLL © 2002, [email protected] Compiler 20 CPU Memory core core http://kressarray.de Xputer Lab Converging factors for RTR University of Kaiserslautern • Converging factors make RTR based system design viable • 1) million gate FPGA devices and co-processing with standard microprocessors are commonplace • direct implementation of complex algorithms in FPGAs. • This alone has already revolutionized FPGA design. JBits • 2) new tools like Xilinx Jbits API software tool suite directly support coprocessing and RTR. User Java Code © 2002, [email protected] 21 Java Compiler Executable http://kressarray.de RTR Xputer Lab University of Kaiserslautern • divides application into a series of sequentially executed stages, each mapped as a separate execution module. • Excellent example :Xtrem platform by PACT AG, Munich • Without RTR, all configurable platforms just ASIC emulators. • directly support development and debugging of RTR applications • will also heavily influence the future system organization © 2002, [email protected] 22 http://kressarray.de >> Rapid Prototyping & ASIC Emulation Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 23 http://kressarray.de Xputer Lab ASIC emulation: a new business model ? University of Kaiserslautern • ASIC emulation / Rapid Prototyping: to replace simulation • Quickturn (Cadence), IKOS (Synopsys), Celaro (Mentor) • from rack to board to chip (from other vendors, e. g. Virtex and VirtexE family (emulate up to 3 million gates) • Easy configuration using SmartMedia FLASH cards • ASIC emulators will become obsolete within years • By RTR: in-circuit execution debugging instead of emulation © 2002, [email protected] 24 http://kressarray.de >> Evolvable Hardware (EH) Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 25 http://kressarray.de Xputer Lab EH, EM, ... University of Kaiserslautern • "Evolvable Hardware" (EH), "Evolutionary Methods" (EM), „digital DNA“, "Darwinistic Methods", and biologically inspired electronic systems • new research area, also a new application area of FPGAs • revival of cybernetics or bionics: stimulated by technology • „evolutionary“ and „DNA“ metaphor create awareness • EM sucks, also thru mushrooming funds in the EU, in Japan, Korea, and the USA • EM-related international conference series are in their stormy visionary phase, like EH, ICES, EuroGP, GP, CEC, GECCO, EvoWorkshops, MAPLD, ICGA 26 http://kressarray.de © 2002, [email protected] EH, EM, ... Xputer Lab University of Kaiserslautern • Shake-out phenomena expected, like in the past with „Artificial Intelligence“ • should be considered as a specialized EDA scene, focusing on theoretical issues. • Genetic algorithms suck - often replacable by more efficient ones from EDA • It is recommendable to set-up an interwoven competence in both scenes, EM scene and the highly commercialized EDA scene • EH should be done by EDA people, rather than EM freaks. © 2002, [email protected] 27 http://kressarray.de >> Academic Expertise Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise (see appendix) • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 28 http://kressarray.de >> ASICs dead ? Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise • ASICs dead ? • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 29 http://kressarray.de Xputer Lab (When) Will FPGAs Kill ASICs? [Jonathan Rose] University of Kaiserslautern ASICs Are Already Dead My Position [Jonathan Rose] They Just Don’t Know It Yet! © 2002, [email protected] 30 http://kressarray.de Xputer Lab What’s Wrong with This Picture? University of Kaiserslautern What About PLD Cores on ASICs ? Embedded FPGA Fabric [Jonathan Rose] 1. Still Have to Make the Chip 2. Need Two Sets of Software to Build It – The ASIC Flow – The PLD Flow 3. Have No Idea What to Connect the PLD Pins to – Chances Are, You Are Going to Get It Wrong! 31 http://kressarray.de © 2002, [email protected] Xputer Lab What’s Right with This Picture! University of Kaiserslautern Embedded CPU Serial Link, Analog, “etc.” [Jonathan Rose] 1. Pre-Fabricated 2. One CAD Tool Flow! 3. Can Connect Anything to Anything PLDs are built for general connectivity © 2002, [email protected] 32 http://kressarray.de >> Soft CPU Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 33 http://kressarray.de Xputer Lab Free 32 bit processor core University of Kaiserslautern © 2002, [email protected] 34 http://kressarray.de Soft CPU: new job for compilers Xputer Lab University of Kaiserslautern Memory FPGA core HLL © 2002, [email protected] soft CPU Compiler 35 FPGA http://kressarray.de Some soft CPU core examples Xputer Lab University of Kaiserslautern core architecture platform MicroBlaze 125 MHz 70 D-MIPS 32 bit standard RISC 32 reg. by 32 LUT RAMbased reg. Xilinx up to 100 on one FPGA Nios 16-bit instr. set Nios 50 MHz Nios core architecture platform Leon 25 Mhz SPARC ARM7 clone ARM uP1232 8-bit CISC, 32 reg. Altera Mercury 200 XC4000E CLBs REGIS 32-bit instr. set Altera 22 D-MIPS 8 bits Instr. + ext. ROM 2 XILINX 3020 LCA Reliance-1 12 bit DSP 8 bit Altera – Mercury Lattice 4 isp30256, 4 isp1016 1Popcorn-1 8 bit CISC Altera, Lattice, Xilinx gr1040 16-bit gr1050 32-bit My80 i8080A FLEX10K30 or EPF6016 YARD-1A 16-bit RISC, 2 opd. Instr. old Xilinx FPGA Board DSPuva16 16 bit DSP Spartan-II xr16 RISC integer C SpartanXL © 2002, [email protected] Acorn-1 36 1 Flex 10K20 http://kressarray.de Xputer Lab free DSP or Processor Cores University of Kaiserslautern CPU core Description Language Implementation Reliance 1 12bit DSP and peripherals Schematic Viewlogic 7 Lattice CPLDs PopCorn 1 small 8 bit CISC Verilog 1 Lattice CPLD isp3256-90 Acorn 1 small 8 bit CISC VHDL Max2PlusII+ 1 Altera 10k20 16-bit DSP A 16-bit Harvard DSP with 5 pipeline stages. VHDL Xilinx XC4000 Free-6502 6502 compatible core VHDL DLX Generic 32-bit RISC CPU VHDL DLX2 Generic 32-bit RISC CPU VHDL GL85 i8085 clone VHDL AMD 2901 AMD 2901 4-bit slice VHDL AMD 2910 AMD 2910 bit slice VHDL i8051 8-bit micro-controller VHDL Synopsys i8051 another i8051 clone VHDL Mentor Graphics © 2002, [email protected] 37 Synopsys http://kressarray.de Xputer Lab University of Kaiserslautern FPGA CPUs in teaching and academic research • Michigan State • Universidad de Valladolid, Spain • Virginia Tech • Washington University, St. Louis • New Mexico Tech • UC Riverside • Tokai University, Japan • UCSC: 1990! • Märaldalen University, Eskilstuna, Sweden • Chalmers University, Göteborg, Sweden • Cornell University • Gray Research • Georgia Tech • Hiroshima City University, Japan © 2002, [email protected] 38 http://kressarray.de Xputer Lab Xilinx 10Mg, 500Mt, .12 mic University of Kaiserslautern © 2002, [email protected] 39 http://kressarray.de Soft rDPA feasible ? Xputer Lab University of Kaiserslautern [à la S. Guccione] © 2002, [email protected] 40 http://kressarray.de Array I/O examples Xputer Lab University of Kaiserslautern data streams, or, from / to embedded memory banks Performance 1000 100 µProc 60%/yr.. 10 1 1980 Processor-Memory Performance Gap: (grows 50% / year) CPU DRAM 1990 [à la S. Guccione] 2000 DRAM 7%/yr.. data streams, or, from / to embedded memory banks © 2002, [email protected] 41 http://kressarray.de HLL 2 Soft Array Xputer Lab University of Kaiserslautern miscellanous HLL Compiler soft CPU Memory [à la S. Guccione] © 2002, [email protected] 42 http://kressarray.de HLL 2 „flex“ rDPA Xputer Lab University of Kaiserslautern miscellanous HLL Compiler CPU Memory [à la S. Guccione] © 2002, [email protected] 43 http://kressarray.de >> HLLs Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 44 http://kressarray.de HLLs for Hardware Design vs. System Design vs. RTR System Design Xputer Lab University of Kaiserslautern HLL Compiler System Design HLL [à la S. Guccione] © 2002, [email protected] Compiler RTR System Design 45 http://kressarray.de HLLs for Hardware Design vs. System Design vs. RTR System Design Xputer Lab University of Kaiserslautern HLL Compiler HLL Compiler System Design [à la S. Guccione] © 2002, [email protected] 46 http://kressarray.de Embedded System Design Xputer Lab University of Kaiserslautern FPGA core HLL Compiler CPU Memory core core HLL Memory core soft CPU FPGA Compiler [à la S. Guccione] © 2002, [email protected] FPGA 47 http://kressarray.de >> Problems to be solved Xputer Lab University of Kaiserslautern • Configware Market • FPGA Market • Embedded Systems (Co-Design) • Hardwired IP Cores on Board • Run-Time Reconfiguration (RTR) • Rapid Prototyping & ASIC Emulation • Evolvable Hardware (EH) • Academic Expertise • ASICs dead • Soft CPU • HLLs • Problems to be solved © 2002, [email protected] 48 http://kressarray.de Xputer Lab Why Can’t Reconfig. Software Survive? University of Kaiserslautern • Resource constraints/sizes are exposed: – to programmer – in low-level representation (netlist) • Design revolves around device size – Algorithmic structure – Exploited parallelism © 2002, [email protected] 49 http://kressarray.de The future GIGA FPGA Xputer Lab University of Kaiserslautern • Map DPA plus environment onto FPGA – – – – no domain-specific silicon needed turbo turn-around upgrades via internet on customer’s site data-stream-based: no v.N bottleneck • Configware to repeat Software success story ? • Problems to be solved – Current FPGA architectures are not scalable – Configuration Code is not relocatable – no real Compilers available commercially • wrong educational background is dominant © 2002, [email protected] 50 http://kressarray.de Retargetting Xputer Lab University of Kaiserslautern • FPGA vendors are working at Retargetting Tools © 2002, [email protected] 51 http://kressarray.de Annihilation Xputer Lab University of Kaiserslautern - + © 2002, [email protected] 52 + http://kressarray.de Xputer Lab University of Kaiserslautern structural CS education ..... Configware / Software Co-Design? Hardware / Software Co-Design? procedural hardware person © 2002, [email protected] 53 software person http://kressarray.de Xputer Xputer LabLab University Kaiserslautern University of of Kaiserslautern CS Education …. …However, is basedcurrent on the Submarine Model This model disables ... Algorithm procedural high level Programming Language Brain usage: procedural-only Assembly Language Hardware invisible: under the surface Hardware © 2002, [email protected] © 2001, [email protected] 54 http://kressarray.de http://www.fpl.uni-kl.de Hardware and Software as Alternatives Xputer Xputer LabLab University Kaiserslautern University of of Kaiserslautern procedural structural Algorithm partitioning Brain Usage: both Hemispheres Hardware, Configware Software Software only & Hardw/Configw Software only Hardw/Configw © 2002, [email protected] © 2001, [email protected] 55 http://kressarray.de http://KressArray.de Dominance of the Submarine Model ... Xputer Xputer LabLab University Kaiserslautern University of of Kaiserslautern (procedural) structurally disabled Hardware ... indicates, that our CS education system produces zillions of mentally disabled Persons It‘s time to attack the software faculty dictatorship. Get involved! © 2002, [email protected] © 2001, [email protected] … completely disabled to cope with solutions other than software only 56 http://kressarray.de http://KressArray.de >>> thank you Xputer Xputer LabLab University Kaiserslautern University of of Kaiserslautern thank you for your patience © 2002, [email protected] © 2001, [email protected] 57 http://kressarray.de http://KressArray.de >>> Coarse Grain Xputer Lab University of Kaiserslautern - END © 2002, [email protected] 58 http://kressarray.de Notes – July 8, 2002 Xputer Lab University of Kaiserslautern • • • • Area efficieny: add ASIC figures (Edelin) Area efficieny: figures on new FPGAs (9 metal layers) Survey on algorithms from parallel computing (Edelin) Power efficieny on reconfigurable platforms: R&D needed © 2002, [email protected] 59 http://kressarray.de Academic Expertise Xputer Lab University of Kaiserslautern EDA & Configware © 2002, [email protected] 60 http://kressarray.de Xputer Lab Configware heading for mainstream University of Kaiserslautern • Configware market taking off for mainstream • FPGA-based designs more complex, even SoC • No design productivity and quality without good configware libraries (soft IP cores) from various application areas. • Growing no. of independent configware houses (soft IP core vendors) and design services • AllianceCORE & Reference Design Alliance • Currently the top FPGA vendors are the key innovators and meet most configware demand. © 2002, [email protected] 61 http://kressarray.de Xputer Lab Configware (soft IP Products) University of Kaiserslautern • For libraries, creation and reuse of configware • To search for IPs see: List of all available IP • The AllianceCORE program is a cooperation between Xilinx and third-party core developers • The Xilinx Reference Design Alliance Program • The Xilinx University Program • LogiCORE soft IP with LogiCORE PCI Interface. • Consultants © 2002, [email protected] 62 http://kressarray.de OS for FPGAs Xputer Lab University of Kaiserslautern • separate EDA software market, comparable to the compiler / OS market in computers, • Cadence, Mentor, Synopsys just jumped in. • < 5% Xilinx / Altera income from EDA SW • Changing EDA Tools Market • Major configware EDA vendors – – – – – © 2002, [email protected] 63 Altera Cadence Mentor Graphics Synopsys Xilinx http://kressarray.de Xputer Lab EDA Software for Xilinx University of Kaiserslautern •Full design flow from Cadence, Mentor, & Synopsys •Xilinx Software AllianceEDA Program: –Alliance Series Development System. –Foundation Series Development Systems. –Xilinx Foundation Series ISE (Integrated Synthesis Environment) –free WebPOWERED SW w. WebFitter & WebPACK-ISE –StateCAD XE and HDL Bencher –Foundation Base Express –Foundation ISE Base Express © 2002, [email protected] 64 http://kressarray.de Xputer Lab Foundation ISE Base Express University of Kaiserslautern • ModelSim Xilinx Edition (ModelSim XE) • Forge Compiler • Modular Design • Chipscope ILA • The Xilinx System Generator • XPower © 2002, [email protected] • JBits SDK • The Xilinx XtremeDSP Initiative • MathWorks / Xilinx Alliance • System Generator • Wind River / Xilinx alliance 65 http://kressarray.de Altera EDA Xputer Lab University of Kaiserslautern • Altera was founded in June 1983 • EDA: synthesis, place & route, and, verification • Quartus II: APEX, Excalibur, Mercury, FLEX 6000 families • MAX+PLUS II: FLEX, ACEX & MAX families • Flow with Quartus II: Mentor Graphics, Synopsys, Synplicity deliver a design design software to support Altera SOPC solutions. • Mentor: only EDA vendor w. complete design environment f. APEX II incl. IP, design capture, simulation, synthesis, and h/s coverification • Configware: Altera offers over a hundred IP cores • Third party IP core design services and consultants © 2002, [email protected] 66 http://kressarray.de Cadence Xputer Lab University of Kaiserslautern • FPGA Designer: top-down FPGA design system, • high-level mapping, architecture-specific optimization, • Verilog,VHDL, schematic-level design entry. • Verilog, VHDL to Synergy (logic synthesis) and FPGA Designer • FPGAs simulated by themselves using Cadence's VerilogXL or Leapfrog VHDL simulators and • simulated w. rest of the system design w. Logic Workbench board/system verification env‘ment. • Libraries for the leading FPGA manufacturers. © 2002, [email protected] 67 http://kressarray.de Mentor Graphics Xputer Lab University of Kaiserslautern • System Design and Verification. • PCB design and analysis: • IC Design and Verification • shifts ASIC design flow to FPGAs (Altera, Xilinx) – – – – by FPGA Advantage with IP support by ModuleWare, Xilinx CORE Generator Altera MegaWizard integration, © 2002, [email protected] 68 http://kressarray.de Synopsys Xputer Lab University of Kaiserslautern • FPGA Compiler II • Version of ASIC Design Compiler Ultra • Block Level Incremental Synthesis (BLIS) • ASIC <-> FPGA migration • Actel, Altera, Atmel, Cypress, Lattice, Lucent, Quicklogic, Triscend, Xilinx © 2002, [email protected] 69 http://kressarray.de Xilinx Xputer Lab University of Kaiserslautern • fabless FPGA semi vendor, San Jose, Ca, founded 1984 • key patents on FPGAs (expiring in a few years) • Fortune 2001: No. 14 Best Company to work for in (intel: no. 42, hp no. 64, TI no. 65). • DARPA grant (Nov‘99) to develop Jbits API tools for internet reconfigurable / upgradable logic (w. VT) • Less brilliant early/mid 90ies (president Curt Wozniak): 1995 market share from 84% down to 62% [Dataquest] • As designs get larger, Xilinx losed its advantage (bugfixes did not require to burn new chips) • meanwhile, weeks of expensive debug time needed © 2002, [email protected] 70 http://kressarray.de Xilinx Flexware Xputer Lab University of Kaiserslautern • Virtex, Virtex-II, first w. 1 mio system gates. – Virtex-E series > 3 mio system gates. • Virtex-EM on a copper process & addit. on chip memory f. network switch appl. • The Virtex XCV3200E > 3 million gates, 0.15-micron technology, • Spartan, Spartan-XL, Spartan-II – for low-cost, high volume applications as ASIC replacements – Multiple I/O standards, on-chip block RAM, digital delay lock loops – eliminate phase lock loops, FIFOs, I/O xlators , system bus drivers • XC4000XV, XC4000XL/XLA, CPLD: low-cost families – rapid development, longer system life, robust field upgradability – support In-System Programming (ISP), in-board debugging, – test during manufacturing, field upgrades, full JTAG compliant interface • CoolRunner: low power, high speed/density, standby mode. • Military & Aerospace: QPRO high-reliability QML certified • Configuration Storage Devices © 2002, [email protected] 71 http://kressarray.de Altera Flexware Xputer Lab University of Kaiserslautern • Newer families: APEX 20KE, APEX 20KC, APEX II, MAX 7000B, ACEX 1K, Excalibur, Mercury families. – Apex EP20K1500E (0.18-µ), up to 2.4 mio system gates, – APEX II (all-copper 0.13-µ) f. data path applications, supports many I/O standards. 1-Gbps True-LVDS performance – wQ2001, an ARM-based Excalibur device • Altera mainstream: MAX 7000A, 3000A; FLEX 6000, 10KA, 10KE; APEX 20K families. • Mature and other : Classic, MAX 7000, 7000S, 9000; FLEX 8000, 10K families. © 2002, [email protected] 72 http://kressarray.de Academic Expertise Xputer Lab University of Kaiserslautern Academic Expertise © 2002, [email protected] 73 http://kressarray.de Xputer Lab BRASS (1) University of Kaiserslautern • UC Berkeley, the BRASS group: Prof. Dr. John Wawrzynek • The Pleiades Project, Prof. Jan Rabaey, ultra-low power highperformance multimedia computing through reconfiguration of heterogeneous system modules, reducing energy by overhead elimination, programmability at just right granularity, parallellism, pipelining, dynamic voltage scaling. • Garp integrates processor and FPGA; dev. in parallel w. compiler - software compile techniques (VLIW SW pipelining): simple pipelining schema f. broad class of loops. • SCORE, a stream-based computation model - a unifying computational model. Fast Mapping for Datapaths: by a treeparsing compiler tool for datapath module mapping 74 http://kressarray.de © 2002, [email protected] BRASS (2) Xputer Lab University of Kaiserslautern • HSRA. new FPGA (& related tools) supports pipelining, w. retiming capable CLB architecture, implemented in a 0.4um DRAM process supporting 250MHz operation • OOCG. Object Oriented Circuit-Generators in Java • MESCAL (GSRC), the goal is: to provide a programmer's model and software development environment for efficient implementation of an interesting set of applications onto a family of fully-programmable architectures / microarchitectures. © 2002, [email protected] 75 http://kressarray.de Berkeley claiming (1) Xputer Lab University of Kaiserslautern • SCORE, a stream-based computation model: the BRASS group claims having solved the problem of primary impediment to wide-spread reconfigurable computing, by a unifying computational model. • Remark: clean stream-based model introduced ~1980: Systolic Array • 1995: Rainer Kress. Introduces reconfigurable stream-based model • Fast Mapping for Datapaths (SCORE): BRASS claims having introduced 1998 the first tree-parsing compiler tool for datapath module mapping ." Further, it is the first work to integrate simultaneous placement with module mapping in a way that preserves linear time complexity." © 2002, [email protected] 76 http://kressarray.de Berkeley claiming (2) Xputer Lab University of Kaiserslautern • Remark: The DPSS (Data Path Synthesis System) using tree covering simultanous datapath placement and routing has been published in 1995 by Rainer Kress • „Chip-in-a-Day Bee Project. Prof. Dr. Bob Broderson‘s „radical rethink of the ASIC design flow aimed at shortening design time, relying on stream-based DPU arrays.“ [published in 2000] • Remark: the KressArray, a scalable rDPU array [1995] is data-stream-based © 2002, [email protected] 77 http://kressarray.de Xputer Lab .... Stream Processors - MSP-3 University of Kaiserslautern • 3rd Workshop on Media and Stream Processors (MSP-3) • http://www.pdcl.eng.wayne.edu/msp01 • in conj. w. 34th Int‘l Symp. on Microarchitecture (MICRO-34) • http://www.microarch.org/micro34 • Austin, Texas, December 1-2, 2001 • Topics of interest include, but are not limited to: – Hardware/Compiler techniques for improving memory performance of media and stream-based processing – Application-specific hardware architectures for graphics, video, audio, communications, and other media and streaming applications – System-on-a-chip architectures for media & stream processors – Hardware/Software Co-Design of media and stream processors – and others .... © 2002, [email protected] 78 http://kressarray.de Xputer Lab Berkeley: „Chip-in-a-Day“ Bee Project University of Kaiserslautern • Chip-in-a-Day Project. Prof. Dr. Bob Broderson, BWRD: targeting a radical rethink of the ASIC design flow aimed at shortening design time. Relying on stream-based DPU arrays (not rDPU and related EDA tools. Davis: „ „... 50x decrease in power requ. over typical TI C64X design.“ • New design flow to break up the highly iterative EDA process, allowing designers to spend more time defining the device and far less time implementing it in silicon. „... developers to start by creating data flow graphs rather than C code,„ • It is stream-based computing by DPU array (hardwired DPA) • For hardwired and reconfigurable DPU array and rDPU array © 2002, [email protected] 79 http://kressarray.de Xputer Lab Stanford thru BYU University of Kaiserslautern • Stanford: Prof. Flynn went emeritus, Oskar Menzer moved to Bell Labs • no activities seen other than YAFA (yet another FPGA application) • UCLA: Prof. Jason Cong, expert on FPGA architectures and R& P algorithms. 9 projects, mult. sponsors under California MICRO Program • Prof. Majid Sarrafzadeh directs the SPS project: "versatile IPs„, a new routing architecture, architecture-aware CAD, IP-aware SPS compiler • USC: Prof. Viktor Prasanna (EE dept.) works 20% on reconfigurable computing: MAARC project, DRIVE project and Efficient SelfReconfiguration. - Prof. Dubois: RPM Project, FPGA-based emulation o scalable multiprocessors. • DEFACTO proj.: compilation - architecture-independent at all levels • MIT. MATRIX web pages removed `99. „RAW project“: a conglomerate • VT. Prof. Athanas: Jbits API f. internet RTR logic ($2.7 mio DARPA). w Prof. Brad Hutchings, BYU on programming approaches for RTR System • BYU. Prof. Brad Hutchings works on the JHDL (JAVA Hardware 80 of JHDL sources http://kressarray.de © Description 2002, [email protected] Language) and compilation into FPGAs. Toronto thru Karlsruhe Xputer Lab University of Kaiserslautern • U. Toronto. Prof. J.Rose, expert in FPGA architectures and R & P alg. • The group has dev. Transmogrifier C, a C compiler creating netlist for Xilinx XC4000 and Altera's Flex 8000 and Flex 10000 series FPGAs. • Founder of Right Track CAD Corporation acquired by Altera in 1999 • Los Alamos National Laboratory, Los Alamos, New Mexico (Jeff Arnold) – Project Streams-C: programming FPGAs from C sources. • Katholic University of Leuven, and IMEC: Prof. Rudy Lauwereins, methods for MPEG-4 like multimedia applications on dynamically reconfigurable platforms, & on reconf. instruction set processors. • University of Karlsruhe. Prof. Dr.-Ing. Juergen Becker: hardware/software co-design, reconfigurable architectures & rel. synthesis for future mobile communication systems & synthesis w. • distributed internet-based CAD methods, partitioning co-compilers © 2002, [email protected] 81 http://kressarray.de Academic Expertise Xputer Lab University of Kaiserslautern © 2002, [email protected] 82 http://kressarray.de Academic Expertise Xputer Lab ASICs dead? University of Kaiserslautern © 2002, [email protected] 83 http://kressarray.de Xputer Lab (When) Will FPGAs Kill ASICs? [Jonathan Rose] University of Kaiserslautern ASICs Are Already Dead My Position [Jonathan Rose] They Just Don’t Know It Yet! © 2002, [email protected] 84 http://kressarray.de Xputer Lab Why? [Jonathan Rose] University of Kaiserslautern 1. You have to fabricate an ASIC Very hard, getting harder 2. An FPGA is pre-fabricated A standard part immense economic advantages © 2002, [email protected] 85 http://kressarray.de Xputer Lab University of Kaiserslautern • • • • • • • Making ASICs is Damn Difficult [Jonathan Rose] Testing Yield Cross Talk Noise Leakage Clock Tree Design Horrible very deep submicron effects we don’t even know about yet © 2002, [email protected] 86 http://kressarray.de Xputer Lab Did I Mention Inventory? [Jonathan Rose] University of Kaiserslautern • ASIC users must predict # parts – 2 or 3 months in advance! • Never guess the Right Amount – Make Too Many – You Pay holding costs – Make Too Few – Competitor gets the Sale © 2002, [email protected] 87 [Jonathan Rose] http://kressarray.de [Jonathan Rose] FPGAs Give You Xputer Lab University of Kaiserslautern • Instant Fabrication – Get to Market Fast – Fix ‘em quick • Zero NRE Charges – Low Risk – Low Cost at good volume © 2002, [email protected] 88 http://kressarray.de Xputer Lab FPGAs: “Too Pricey & Too Slow ?” University of Kaiserslautern [Jonathan Rose] • 9 Times Out of 10 – You make can the thing fast by breaking it into multiple parallel slower pieces • Custom IC Designer Can Make Logic – 20x Faster, – 20x Smaller than Programmable © 2002, [email protected] 89 http://kressarray.de Xputer Lab What’s Wrong with This Picture? University of Kaiserslautern What About PLD Cores on ASICs ? Embedded FPGA Fabric [Jonathan Rose] 1. Still Have to Make the Chip 2. Need Two Sets of Software to Build It – The ASIC Flow – The PLD Flow 3. Have No Idea What to Connect the PLD Pins to – Chances Are, You Are Going to Get It Wrong! 90 http://kressarray.de © 2002, [email protected] Xputer Lab What’s Right with This Picture! University of Kaiserslautern Embedded CPU Serial Link, Analog, “etc.” [Jonathan Rose] 1. Pre-Fabricated 2. One CAD Tool Flow! 3. Can Connect Anything to Anything PLDs are built for general connectivity © 2002, [email protected] 91 http://kressarray.de >> HLLs Xputer Lab University of Kaiserslautern © 2002, [email protected] 92 http://kressarray.de HLLs for Hardware Design vs. System Design vs. RTR System Design Xputer Lab University of Kaiserslautern HLL Compiler System Design HLL [à la S. Guccione] © 2002, [email protected] Compiler RTR System Design 93 http://kressarray.de HLLs for Hardware Design vs. System Design vs. RTR System Design Xputer Lab University of Kaiserslautern HLL Compiler HLL Compiler System Design HLL [à la S. Guccione] © 2002, [email protected] Compiler RTR System Design 94 http://kressarray.de CPU and memory on Chip Xputer Lab University of Kaiserslautern HLL Compiler FPGA core RTR System Design HLL Compiler CPU Memory core core [à la S. Guccione] © 2002, [email protected] 95 http://kressarray.de Jbit Environment Xputer Lab University of Kaiserslautern RTP Core Library [à la S. Guccione] JRoute API JBits API User Code BoardScope Debugger XHWIF TCP/IP Device Simulator © 2002, [email protected] 96 http://kressarray.de HLLs for Hardware Design vs. System Design vs. RTR System Design Xputer Lab University of Kaiserslautern HLL Compiler HLL Compiler System Design [à la S. Guccione] © 2002, [email protected] 97 http://kressarray.de Embedded System Design Xputer Lab University of Kaiserslautern FPGA core HLL Compiler CPU Memory core core HLL Memory core soft CPU FPGA Compiler [à la S. Guccione] © 2002, [email protected] FPGA 98 http://kressarray.de >> RTR Xputer Lab University of Kaiserslautern © 2002, [email protected] 99 http://kressarray.de RTR Xputer Lab University of Kaiserslautern • divides application into a series of sequentially executed stages, each implemented as a separate execution module. • Partial RTR partitions these stages into finer-grain sub-modules to be swapped in as needed. • Without RTR, all configurable platforms just ASIC emulators. • needs a new kind of application development environments. • directly support development and debugging of RTR applications • essential for the advancement of reconfigurable computing • will also heavily influence the future system organization • Xilinx, VT, BYU work on run-time kernels, run-time support, RTR debugging tools and other associated tools. • smaller, faster circuits, simplified hardware interfacing, fewer IOBs; smaller, cheaper packages, simplified software interfaces. © 2002, [email protected] 100 http://kressarray.de Run-time Mapping Xputer Lab University of Kaiserslautern • run-time reconfigurable are: Xilinx VIRTEX FPGA family • RAs being part of Chameleon CS2000 series systems • Using such devices changes many of the basic assumptions in the HW/SW co-design process: • host/RL interaction is dynamic, needs a tiny OS like eBIOS, also to organize RL reconfiguration under host control • typical goal is minimization of reconfiguration latency (especially important in communication processors), to hide configuration loading latency, and, • Scheduling to find ’best’ schedule for eBIOS calls (C~side). © 2002, [email protected] 101 http://kressarray.de Xputer Lab Processors in PLDs: Excalibur University of Kaiserslautern Dual-Port RAM Single-Port RAM ARM 922T Core •High-Speed Processors Integrated with PLDs General Purpose PLD [Jonathan Rose] © 2002, [email protected] 102 Today! Available http://kressarray.de Xputer Lab Nios Architecture (Altera) University of Kaiserslautern © 2002, [email protected] 103 http://kressarray.de
© Copyright 2026 Paperzz