# Virtex-II Platform FPGA User Guide





The Xilinx logo shown above is a registered trademark of Xilinx, Inc.



The shadow X shown above is a trademark of Xilinx, Inc.

"Xilinx" and the Xilinx logo are registered trademarks of Xilinx, Inc. Any rights not expressly granted herein are reserved.

CoolRunner, RocketChips, Rocket IP, Spartan, StateBENCH, StateCAD, Virtex, XACT, XC2064, XC3090, XC4005, XC5210 are registered Trademarks of Xilinx, Inc.

ACE Controller, ACE Flash, A.K.A. Speed, Alliance Series, AllianceCORE, Bencher, ChipScope, Configurable Logic Cell, CORE Generator, CoreLINX, Dual Block, EZTag, Fast CLK, Fast CONNECT, Fast FLASH, FastMap, Fast Zero Power, Foundation, Gigabit Speeds...and Beyond!, HardWire, HDL Bencher, IRL, J Drive, JBits, LCA, LogiBLOX, Logic Cell, LogiCORE, LogicProfessor, MicroBlaze, MicroVia, MultiLINX, Nano-Blaze, PicoBlaze, PLUSASM, PowerGuide, PowerMaze, QPro, Real-PCI, Rocket I/O, SelectI/O, SelectRAM, SelectRAM+, Silicon Xpresso, Smartguide, Smart-IP, SmartSearch, SMARTswitch, System ACE, Testbench In A Minute, TrueMap, UIM, VectorMaze, VersaBlock, VersaRing, Virtex-II PRO, Wave Table, WebFITTER, WebPACK, WebPOWERED, XABEL, XACT-Floorplanner, XACT-Performance, XACTstep Advanced, XACTstep Foundry, XAM, XAPP, X-BLOX +, XC designated products, XChecker, XDM, XEPLD, Xilinx Foundation Series, Xilinx XDTV, Xinfo, XSI, XtremeDSP and ZERO+ are trademarks of Xilinx, Inc.

The Programmable Logic Company is a service mark of Xilinx, Inc.

All other trademarks are the property of their respective owners.

Xilinx does not assume any liability arising out of the application or use of any product described or shown herein; nor does it convey any license under its patents, copyrights, or maskwork rights or any rights of others. Xilinx reserves the right to make changes, at any time, in order to improve reliability, function or design and to supply the best product possible. Xilinx will not assume responsibility for the use of any circuitry described herein other than circuitry entirely embodied in its products. Xilinx devices and products are protected under one or more U.S. and International Patents. Xilinx does not represent that its devices or products are free from patent infringement or from any other third party right. Xilinx assumes no obligation to correct any errors contained herein or to advise any user of this text of any correction if such be made. Xilinx will not assume any liability for the accuracy or correctness of any engineering or software support or assistance provided to a user.

Xilinx products are not intended for use in life support appliances, devices, or systems. Use of a Xilinx product in such applications without the written consent of the appropriate Xilinx officer is prohibited.

Copyright 2001-2002 Xilinx, Inc. All Rights Reserved.

## Virtex-II Platform FPGA User Guide

#### UG002 (v1.5) 2 December 2002

## **Revision History**

The following table summarizes changes made to each version of this document.

| Date     | Version | Revision                                                                         |  |  |  |
|----------|---------|----------------------------------------------------------------------------------|--|--|--|
| 12/06/00 | 1.0     | Printed version with data sheet modules 1-3 in Part I) Initial Release.          |  |  |  |
| 04/02/01 | 1.1     | Electronic version only) Misc updates throughout.                                |  |  |  |
| 10/12/01 | 1.2     | (Electronic version only) Misc technical edits throughout.                       |  |  |  |
| 12/03/01 | 1.3     | (Printed version with data sheet modules 1-3 in Part I) Misc updates throughout. |  |  |  |
| 11/01/02 | 1.4     | (Electronic version only) Misc updates throughout.                               |  |  |  |
| 12/02/02 | 1.5     | (Electronic version only) Misc updates throughout.                               |  |  |  |

# Contents

## **About This Guide**

| Additional Resources      | 7 |
|---------------------------|---|
| Typographical Conventions | 8 |

## Introduction to the Virtex-II FPGA Family

| Virtex-II Platform                                                   | . 9 |
|----------------------------------------------------------------------|-----|
| Virtex-II Target Applications                                        | . 9 |
| Interconnect Engine for Fast, Wide Busses in Networking Applications | 10  |
| Complete Solution For Rapid Time-to-Production                       | 10  |

## **Chapter 1: Timing Models**

| <b>Summary</b> 1                     | 1 |
|--------------------------------------|---|
| CLB / Slice Timing Model 1           | 2 |
| Block SelectRAM Timing Model 2       | 1 |
| Embedded Multiplier Timing Model 2   | 4 |
| IOB Timing Model 2                   | 7 |
| Pin-to-Pin Timing Model              | 7 |
| Digital Clock Manager Timing Model 4 | 1 |

## **Chapter 2: Design Considerations**

| Summary                                            |
|----------------------------------------------------|
| Introduction 47                                    |
| Using Global Clock Networks 48                     |
| Using Digital Clock Managers (DCMs) 67             |
| Using Block SelectRAM <sup>™</sup> Memory          |
| Using Distributed SelectRAM Memory 113             |
| Using Look-Up Tables as Shift Registers (SRLs) 123 |
| Designing Large Multiplexers 133                   |
| Implementing Sum of Products (SOP) Logic 143       |
| Using Embedded Multipliers 150                     |
| Using Single-Ended SelectI/O-Ultra Resources 157   |
| Using Digitally Controlled Impedance (DCI) 188     |
| Using Double-Data-Rate (DDR) I/O 203               |
| Using LVDS I/O 217                                 |
| Using LVPECL I/O 222                               |
| Using Bitstream Encryption 226                     |
| Using the CORE Generator System 230                |

## **Chapter 3: Configuration**

| Summary                              | 245 |
|--------------------------------------|-----|
| Introduction                         | 245 |
| Configuration Solutions              | 253 |
| Master Serial Programming Mode       | 260 |
| Slave Serial Programming Mode        | 261 |
| Master SelectMAP Programming Mode    | 263 |
| Slave SelectMAP Programming Mode     | 265 |
| JTAG/ Boundary Scan Programming Mode | 269 |
| Configuration Details                | 287 |
| Readback                             | 296 |
|                                      |     |

## Chapter 4: PCB Design Considerations

| Summary                              | 301 |
|--------------------------------------|-----|
| Pinout Information                   | 302 |
| Pinout Diagrams                      | 311 |
| Package Specifications               | 351 |
| Flip-Chip Packages                   | 362 |
| Thermal Data                         | 363 |
| Printed Circuit Board Considerations | 365 |
| Board Routability Guidelines         | 370 |
| Power Consumption                    | 392 |
| IBIS Models                          | 400 |
| BSDL and Boundary Scan Models        | 405 |

# Appendix A: BitGen and PROMGen Switches and Options

| Using BitGen  | 407 |
|---------------|-----|
| Using PROMGen | 413 |

## Appendix B: XC18V00 Series PROMs

| PROM Package Specifications       | 419 |
|-----------------------------------|-----|
| Features                          | 1   |
| Description                       | 1   |
| Pinout and Pin Description        | 2   |
| Pinout Diagrams                   | 4   |
| Xilinx FPGAs and Compatible PROMs | 5   |
| Capacity                          | 6   |
| In-System Programming             | 6   |
| External Programming              | 6   |
| Reliability and Endurance         | 6   |

| Design Security                                                                                    |
|----------------------------------------------------------------------------------------------------|
| IEEE 1149.1 Boundary-Scan (JTAG)7                                                                  |
| Instruction Register7                                                                              |
| XC18V00 TAP Characteristics 8                                                                      |
| TAP AC Parameters   9                                                                              |
| Connecting Configuration PROMs 9                                                                   |
| Master Serial Mode Summary 10                                                                      |
| Reset Activation                                                                                   |
| Standby Mode 13                                                                                    |
| 5V Tolerant I/Os                                                                                   |
| Customer Control Bits 13                                                                           |
| Absolute Maximum Ratings 14                                                                        |
| <b>Recommended Operating Conditions</b> 14                                                         |
| Quality and Reliability Characteristics14                                                          |
| DC Characteristics Over Operating Conditions                                                       |
| AC Characteristics Over Operating Conditions for XC18V04 and XC18V02 16                            |
| AC Characteristics Over Operating Conditions for XC18V01, XC18V512,                                |
| and XC18V256 17                                                                                    |
| AC Characteristics Over Operating Conditions When Cascading for XC18V04<br>and XC18V02             |
| AC Characteristics Over Operating Conditions When Cascading for XC18V01,<br>XC18V512, and XC18V256 |
| Ordering Information                                                                               |
| Valid Ordering Combinations                                                                        |
| Marking Information                                                                                |
| Revision History                                                                                   |

# Appendix C: Glossary

Index



# About This Guide

This document describes the function and operation of Virtex-II devices and also includes information on FPGA configuration techniques and PCB design considerations. For Virtex-II device specifications, refer to the <u>Virtex-II Data Sheet (DS031)</u>.

This guide provides details on the following topics:

- Chapter 1: Timing Models
- Chapter 2: Design Considerations
- Chapter 3: Configuration
- Chapter 4: PCB Design Considerations
- Appendix A: BitGen and PROMGen Switches and Options
- Appendix B: XC18V00 Series PROMs
- Appendix C: Glossary

## **Additional Resources**

The following table lists URLs for resources available on the web. For additional information, go to <u>http://www.xilinx.com</u>.

| Resource             | Description/URL                                                                                                                                                                                                                                                           |  |  |
|----------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| Handbook             | This site contains the latest <i>Virtex-II User Guide</i> and <i>Virtex-II Data Sheet</i> :                                                                                                                                                                               |  |  |
|                      | http://www.xilinx.com/products/virtex/handbook/                                                                                                                                                                                                                           |  |  |
| Application<br>Notes | This site contains device-specific design techniques and approaches:<br>http://www.xilinx.com/apps/appsweb.htm                                                                                                                                                            |  |  |
| Data Book            | <i>The Programmable Logic Data Book</i> describes device-specific information<br>on Xilinx device characteristics, including readback, boundary scan,<br>configuration, length count, and debugging:<br><u>http://www.xilinx.com/xlnx/xweb/xil_publications_index.jsp</u> |  |  |
| Xcell Journals       | This site contains quarterly journals for Xilinx programmable logic users:<br>http://www.xilinx.com/xcell/xcell.htm                                                                                                                                                       |  |  |
| Tech Tips            | See this site for the latest news, design tips, and patch information on the Xilinx design environment:<br>http://www.xilinx.com/support/techsup/journals/index.htm                                                                                                       |  |  |
| Answers<br>Database  | This database provides a current listing of solution records for Xilinx software tools. Search this database using the search function at: <a href="http://www.xilinx.com/support/searchtd.htm">http://www.xilinx.com/support/searchtd.htm</a>                            |  |  |

## **Typographical Conventions**

The following typographical conventions are used in this manual:

- **Red text** indicates a cross-reference to information within this document. Click red text to open the specified cross-reference.
- <u>Blue-underlined text</u> indicates a link to a Web page. Click blue-underlined text to browse the specified Web site.
- Courier font indicates prompts or program outputs displayed by the system. speed grade: 5
- **Courier bold** indicates literal commands that you enter in a syntactical statement. However, braces "{}" in Courier bold are not literal and square brackets "[]" in Courier bold are literal only in the case of bus specifications, such as bus [7:0].

```
rpt_del_net=
```

Courier bold also indicates menu commands: File  $\rightarrow$  Open

- *Italic font* denotes the following items:
  - Variables that are substituted with user-defined values edif2ngd design\_name
  - References to other documents. See the *Libraries Guide* for more information.
  - Emphasis in text If a wire is drawn so that it overlaps the pin of a symbol, the two nets are *not* connected.
- Square brackets "[]" indicate an optional entry or parameter. However, in bus specifications, such as bus [7:0], they are required.
   edif2ngd [option\_name] design\_name
- Braces "{ }" enclose a list of items from which you must choose one or more, and a vertical bar " | " separates items in a list of choices:
   lowpwr = {on | off}
- A vertical ellipsis indicates repetitive material that has been omitted.

```
IOB #1: Name = QOUT'
IOB #2: Name = CLKIN'
.
.
```

• A horizontal ellipsis "..." indicates that an item can be repeated one or more times. allow block *block\_name loc1 loc2 ... locn;* 

# 

# Introduction to the Virtex-II FPGA Family

## **Virtex-II Platform**

The Virtex-II Platform FPGA solution is the result of the largest silicon and software R&D effort in the history of programmable logic, with the goal of revolutionizing the design of complex single-chip sub-systems in terms of engineering productivity, silicon efficiency, and system flexibility.

The Virtex-II product family provides IP-Immersion<sup>™</sup> technology which incorporates an abundance of on-chip memory options and advanced routing resources for supporting complex designs that use IP (intellectual property), such as on-chip hard-macro building blocks and a rapidly growing library of soft-IP blocks. For the first time in the programmable logic industry, innovative Virtex-II features enable system designers to:

- Eliminate external termination resistors with on-chip precision-controlled output impedance
- Manage 16 pre-engineered low-skew clock domains, with on-chip frequency and phase control
- Protect chip designs with bit-stream encryption

These unique capabilities increase engineering productivity and time-to-production by supply pre-engineered solutions for signal integrity and RF noise challenges, as well as providing a secure means to deliver designs rapidly to production.

The Virtex-II Platform FPGA family is a complete programmable solution that allows digital system designers to rapidly implement a single-chip solution with density up to 10 million system gates, in weeks rather than months or years. The inherent flexibility of Xilinx FPGA devices allows unlimited design changes throughout the development and production phases of the system, with important benefits in improved productivity, reduced design risk, and higher system flexibility. This further accelerates the industry -- from custom ASICs to FPGAs -- in fields such as optical networking systems, gigabit routers, wireless cellular base stations, modem arrays, and professional video broadcast systems.

## **Virtex-II Target Applications**

The Virtex-II solution is developed specifically to enable rapid development of two of the most technically challenging digital system applications: data communications and digital signal processing (DSP) systems. High logic integration, fast and complex routing of wide busses, and extensive pipeline and FIFO memory requirements characterize these systems.

The Virtex-II family incorporates high logic capacity, up to 10 million system gates, a new Active Interconnect<sup>™</sup> architecture optimized for predictable routing delays, an advanced memory array architecture with up to 4.5Mbits of on-chip memory, and built-in support for high-speed I/O standards at up to 1108 user pins.

Applications incorporating DSP functionality, such as echo cancellation, forward errorcorrection, and image compression/decompression, benefit from the abundance of embedded high-speed 18-bit x 18-bit multiplier blocks within the Virtex-II solution. The unique features of the revolutionary Virtex-II architecture make it ideal for optical networking products, storage area networks (SANs), Voice-over-Internet-Protocol (VoIP), video broadcasting, medical imaging, wireless base-stations, and Internet infrastructure products, as well as many other products.

# Interconnect Engine for Fast, Wide Busses in Networking Applications

The Virtex-II architecture incorporates a number of novel features specifically to support wide data widths in complex networking and transmission systems. Modern complex systems operate with multiple clock domains, with large IP-based subsystems operating independently. Large, wide FIFOs and buffer memories are needed for handling fast and wide inter-subsystem data transfer. These wide busses are required both internally for intra-chip communications and externally for switched fabric communications.

For example, wide 32-bit and larger data busses can drive multiple Ultra Low-Voltage Differential Signal (ULVDS) high-speed interface standards for data transfer across a backplane or for point-to-point communications, or be used for implementing high-speed multi-cast bus standards.

These requirements challenge and exceed the capabilities of current programmable logic devices, which lack the gate capacity, memory and routing resources, performance, and architecture flexibility to fully support these designs. The Virtex-II solution is the first platform FPGA specifically targeted to improve the "ease of speed" in the development and production of these complex systems.

## **Complete Solution For Rapid Time-to-Production**

The Virtex-II solution combines the most flexible FPGA architecture, advanced process technology, powerful software synthesis technology, and robust IP library, to provide the most complete system integration solution today. In addition, the Virtex-II solution provides powerful features, such as Xilinx Digitally Controlled Impedance (DCI) technology, digital clock manager to help designers further reduce overall system cost and design development cycle, making Virtex-II the ideal solution for tomorrow's high-performance system designs.



# Chapter 1

# **Timing Models**

## Summary

The following topics are covered in this chapter:

- CLB / Slice Timing Model
- Block SelectRAM Timing Model
- Embedded Multiplier Timing Model
- IOB Timing Model
- Pin-to-Pin Timing Model
- Digital Clock Manager Timing Model

#### Introduction

Due to the large size and complexity of Virtex-II FPGAs, understanding the timing associated with the various paths and functional elements has become a difficult and important problem. Although it is not necessary to understand the various timing parameters in order to implement most designs using Xilinx, Inc. software, a thorough timing model can assist advanced users in analyzing critical paths, or planning speed-sensitive designs.

The Timing Model chapter is broken up into five sections consisting of three basic components:

- Functional Element Diagram basic architectural schematic illustrating pins and connections.
- Timing Parameters <u>Virtex-II Data Sheet (DS031)</u> timing parameter definitions.
- Timing Diagram illustrates functional element timing parameters relative to each other.

This chapter was written with the Xilinx Timing Analyzer software (TRCE) in mind. All pin names, parameter names, and paths are consistent with Post Route Timing and Pre-Route Static Timing reports. Use the models in this chapter in conjunction with both the Timing Analyzer software and the section on switching characteristics in the *Virtex-II Data Sheet*. Most of the timing parameters found in the section on switching characteristics are described in this chapter.

## **CLB / Slice Timing Model**

#### Introduction

This section describes all timing parameters reported in the <u>Virtex-II Data Sheet (DS031</u>) that are associated with slices and Configurable Logic Blocks (CLBs). It consists of three parts corresponding to their respective (switching characteristics) sections in the data sheet:

- General Slice Timing Model and Parameters (CLB Switching Characteristics)
- Slice Distributed RAM Timing Model and Parameters (CLB Distributed RAM Switching Characteristics)
- Slice SRL Timing Model and Parameters (CLB SRL Switching Characteristics)

### General Slice Timing Model and Parameters

Figure 1-1 illustrates the details of a Virtex-II slice.

Note: Some elements of the Virtex-II slice have been omitted for clarity. Only the elements relevant to the timing paths described in this section are shown.





## **Timing Parameters**

| Parameter                                                                                | Function                                       | Control<br>Signal | Description                                                                                                                                                                   |
|------------------------------------------------------------------------------------------|------------------------------------------------|-------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Combinatorial                                                                            | Delays                                         |                   | ·                                                                                                                                                                             |
| T <sub>ILO</sub>                                                                         | F/G inputs to<br>X/Y outputs                   |                   | Propagation delay from the F/G inputs of<br>the slice, through the look-up tables<br>(LUTs), to the X/Y outputs of the slice.                                                 |
| T <sub>IF5</sub>                                                                         | F/G inputs to F5<br>output                     |                   | Propagation delay from the F/G inputs of the slice, through the LUTs and MUXF5 to the F5 output of the slice.                                                                 |
| T <sub>IF5X</sub>                                                                        | F/G inputs to X output                         |                   | Propagation delay from the F/G inputs of the slice, through the LUTs and MUXF5 to the X output of the slice.                                                                  |
| T <sub>IFXY</sub>                                                                        | FXINA/FXINB<br>inputs to Y<br>output           |                   | Propagation delay from the<br>FXINA/FXINB inputs, through MUXFX<br>to the Y output of the slice.                                                                              |
| T <sub>IFNCTL</sub>                                                                      | Transparent<br>Latch input to<br>XQ/YQ outputs |                   | Incremental delay through a transparent latch to XQ/YQ outputs.                                                                                                               |
| Sequential Del                                                                           | ays                                            |                   |                                                                                                                                                                               |
| Т <sub>СКО</sub>                                                                         | FF Clock (CLK)<br>to XQ/YQ<br>outputs          |                   | Time after the clock that data is stable at the XQ/YQ outputs of the slice sequential elements (configured as a flip-flop).                                                   |
| T <sub>CKLO</sub>                                                                        | Latch Clock<br>(CLK) to<br>XQ/YQ outputs       |                   | Time after the clock that data is stable at the XQ/YQ outputs of the slice sequential elements (configured as a latch).                                                       |
| Setup and Hold                                                                           | for Slice Sequent                              | tial Elements     | 5                                                                                                                                                                             |
| $T_{xxCK}$ = Setup time (before clock edge)<br>$T_{CKxx}$ = Hold time (after clock edge) |                                                |                   | The following descriptions are for setup times only.                                                                                                                          |
| T <sub>DICK</sub> /T <sub>CKDI</sub>                                                     | BX/BY inputs                                   |                   | Time before Clock (CLK) that data from<br>the BX or BY inputs of the slice must be<br>stable at the D-input of the slice sequential<br>elements (configured as a flip-flop).  |
| T <sub>DYCK</sub> /T <sub>CKDY</sub>                                                     | DY input                                       |                   | Time before Clock (CLK) that data from<br>the DY input of the slice must be stable at<br>the D-input of the slice sequential<br>elements (configured as a flip-flop).         |
| T <sub>DXCK</sub> /T <sub>CKDX</sub>                                                     | DX input                                       |                   | Time before Clock (CLK) that data from<br>the DX input of the slice must be stable at<br>the D-input of the slice sequential<br>elements (configured as a flip-flop).         |
| T <sub>CECK</sub> /T <sub>CKCE</sub>                                                     | CE input                                       |                   | Time before Clock (CLK) that the CE<br>(Clock Enable) input of the slice must be<br>stable at the CE-input of the slice sequen-<br>tial elements (configured as a flip-flop). |

| Parameter                          | Function     | Control<br>Signal | Description                                                                                                                                                                                                             |
|------------------------------------|--------------|-------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| T <sub>RCK</sub> /T <sub>CKR</sub> | SR/BY inputs |                   | Time before CLK that the SR (Set/Reset)<br>and the BY (Rev) inputs of the slice must<br>be stable at the SR/Rev-inputs of the slice<br>sequential elements (configured as a flip-<br>flop). Synchronous set/reset only. |
| Clock CLK                          | ·            |                   |                                                                                                                                                                                                                         |
| T <sub>CH</sub>                    |              |                   | Minimum Pulse Width, High.                                                                                                                                                                                              |
| T <sub>CL</sub>                    |              |                   | Minimum Pulse Width, Low.                                                                                                                                                                                               |
| Set/Reset                          |              |                   | ·                                                                                                                                                                                                                       |
| T <sub>RPW</sub>                   |              |                   | Minimum Pulse Width for the SR<br>(Set/Reset) and BY (Rev) pins.                                                                                                                                                        |
| T <sub>RQ</sub>                    |              |                   | Propagation delay for an asynchronous<br>Set/Reset of the slice sequential elements.<br>From SR/BY inputs to XQ/YQ outputs.                                                                                             |
| F <sub>TOG</sub>                   |              |                   | Toggle Frequency - Maximum Frequency<br>that a CLB flip-flop can be clocked:<br>$1/(T_{CH}+T_{CL})$                                                                                                                     |

Figure 1-2 illustrates general timing characteristics of a Virtex-II slice.



Figure 1-2: General Slice Timing Diagram

- At time T<sub>CECK</sub> before Clock Event 1, the Clock-Enable signal becomes valid-high at the CE input of the slice register.
- At time T<sub>DYCK</sub> before Clock Event 1, data from the DY input becomes valid-high at the D input of the slice register and is reflected on the YQ pin at time T<sub>CKO</sub> after Clock Event 1\*.
- At time T<sub>RCK</sub> before Clock Event 3, the SR signal (configured as synchronous reset in this case) becomes valid-high, resetting the slice register, and this is reflected on the YQ pin at time T<sub>CKO</sub> after Clock Event 3.

\* NOTE: In most cases software uses the DX/DY inputs to route data to the slice registers when at all possible. This is the fastest path to the slice registers and saves other slice routing resources.

### Slice Distributed RAM Timing Model and Parameters

Figure 1-3 illustrates the details of distributed RAM implemented in a Virtex-II slice.

Note: Some elements of the Virtex-II slice have been omitted for clarity. Only the elements relevant to the timing paths described in this section are shown.



Figure 1-3: Slice Distributed RAM Diagram

## **Timing Parameters**

| Parameter                                       | Function                                             | Control<br>Signal | Description                                                                                                                                            |  |  |
|-------------------------------------------------|------------------------------------------------------|-------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| Sequential D                                    | Pelays for Slice LUT C                               | onfigured as      | RAM (Distributed RAM)                                                                                                                                  |  |  |
| T <sub>SHCKO16</sub>                            | CLK to X/Y outputs<br>(WE active) in 16x1<br>mode    |                   | Time after the Clock (CLK) of a WRITE operation that the data written to the distributed RAM (in 16x1 mode) is stable on the X/Y outputs of the slice. |  |  |
| T <sub>SHCKO32</sub>                            | CLK to X/Y outputs<br>(WE active) in 32x1<br>mode    |                   | Time after the Clock (CLK) of a WRITE operation that the data written to the distributed RAM (in 32x1 mode) is stable on the X/Y outputs of the slice. |  |  |
| T <sub>SHCKOF5</sub>                            | CLK to F5 output<br>(WE active)                      |                   | Time after the Clock (CLK) of a WRITE operation that the data written to the distributed RAM is stable on the F5 output of the slice.                  |  |  |
| Setup and H                                     | old for Slice LUT Con                                | figured as R      | AM (Distributed RAM)                                                                                                                                   |  |  |
| T <sub>xS</sub> = Setu<br>T <sub>xH</sub> = Hol | p time (before clock edą<br>d time (after clock edge | ge)<br>)          | The following descriptions are for setup times only.                                                                                                   |  |  |
| T <sub>DS</sub> /T <sub>DH</sub>                | BX/BY Data inputs<br>(DI)                            |                   | Time before the clock that data must be<br>stable at the DI input of the slice LUT<br>(configured as RAM), via the slice<br>BX/BY inputs.              |  |  |
| $T_{AS}/T_{AH}$                                 | F/G Address inputs                                   |                   | Time before the clock that address<br>signals must be stable at the F/G inputs<br>of the slice LUT (configured as RAM).                                |  |  |
| T <sub>WES</sub> /T <sub>WEH</sub>              | WE input (SR)                                        |                   | Time before the clock that the Write<br>Enable signal must be stable at the WE<br>input of the slice LUT (configured as<br>RAM).                       |  |  |
| Clock CLK                                       |                                                      |                   |                                                                                                                                                        |  |  |
| T <sub>WPH</sub>                                |                                                      |                   | Minimum Pulse Width, High (for a Distributed RAM clock).                                                                                               |  |  |
| T <sub>WPL</sub>                                |                                                      |                   | Minimum Pulse Width, Low (for a Distributed RAM clock).                                                                                                |  |  |
| T <sub>WC</sub>                                 |                                                      |                   | Minimum clock period to meet address write cycle time.                                                                                                 |  |  |



Figure 1-4 illustrates the timing characteristics of a 16-bit distributed RAM implemented in a Virtex-II slice (LUT configured as RAM).

Figure 1-4: Slice Distributed RAM Timing Diagram

#### Clock Event 1: WRITE Operation

During a WRITE operation, the contents of the memory at the address on the ADDR inputs is changed. The data written to this memory location is reflected on the X/Y outputs synchronously.

- At time T<sub>WES</sub> before Clock Event 1, the Write Enable signal (WE) becomes valid-high, enabling the RAM for the following WRITE operation.
- At time T<sub>AS</sub> before Clock Event 1, the address (2) becomes valid at the F/G inputs of the RAM.
- At time T<sub>DS</sub> before Clock Event 1, the DATA becomes valid (1) at the DI input of the RAM and is reflected on the X/Y output at time T<sub>SHCKO16</sub> after Clock Event 1.

#### **Clock Event 2: READ Operation**

All READ operations are asynchronous in distributed RAM. As long as write-enable (WE) is Low, the address bus can be asserted at any time, and the contents of the RAM at that address are reflected on the X/Y outputs after a delay of length  $T_{ILO}$  (propagation delay through a LUT). Note that the Address (F) is asserted *after* Clock Event 2, and that the contents of the RAM at that location are reflected on the output after a delay of length  $T_{ILO}$ .

## Slice SRL Timing Model and Parameters

Figure 1-5 illustrates shift register implementation in a Virtex-II slice.

Note: Some elements of the Virtex-II slice have been omitted for clarity. Only the elements relevant to the timing paths described in this section are shown.



UG002\_C3\_021\_113000

Figure 1-5: Slice SLR Diagram

1

### **Timing Parameters**

| Parameter                                                                              | Function                  | Control<br>Signal | Description                                                                                                                               |  |  |
|----------------------------------------------------------------------------------------|---------------------------|-------------------|-------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| Sequential I                                                                           | Delays for Slice LUT      | Configur          | ed as SRL (Select Shift Register)                                                                                                         |  |  |
| T <sub>REG</sub>                                                                       | CLK to<br>X/Y outputs     |                   | Time after the Clock (CLK) of a WRITE operation that the data written to the SRL is stable on the X/Y outputs of the slice.               |  |  |
| T <sub>CKSH</sub>                                                                      | CLK to Shiftout           |                   | Time after the Clock (CLK) of a WRITE operation that the data written to the SRL is stable on the Shiftout or XB/YB outputs of the slice. |  |  |
| T <sub>REGF5</sub>                                                                     | CLK to F5 output          |                   | Time after the Clock (CLK) of a WRITE<br>operation that the data written to the SRL is<br>stable on the F5 output of the slice.           |  |  |
| Setup/Hold for Slice LUT Configured as SRL (Select Shift Register)                     |                           |                   |                                                                                                                                           |  |  |
| $T_{xxS}$ = Setup time (before clock edge)<br>$T_{xxH}$ = Hold time (after clock edge) |                           | edge)<br>dge)     | The following descriptions are for setup times only.                                                                                      |  |  |
| T <sub>SRLDS</sub> /<br>T <sub>SRLDH</sub>                                             | BX/BY Data<br>inputs (DI) |                   | Time before the clock that data must be stable<br>at the DI input of the slice LUT (configured as<br>SRL), via the slice BX/BY inputs.    |  |  |
| T <sub>WSS</sub> /T <sub>WSH</sub>                                                     | CE input (WE)             |                   | Time before the clock that the Write Enable<br>signal must be stable at the WE input of the<br>slice LUT (configured as SRL).             |  |  |
| Clock CLK                                                                              |                           |                   |                                                                                                                                           |  |  |
| T <sub>SRPH</sub>                                                                      |                           |                   | Minimum Pulse Width, High (for an SRL clock).                                                                                             |  |  |
| T <sub>SRPL</sub>                                                                      |                           |                   | Minimum Pulse Width, Low (for an SRL clock).                                                                                              |  |  |

Figure 1-6 illustrates the timing characteristics of a 16-bit shift register implemented in a Virtex-II slice (LUT configured as SRL).



Figure 1-6: Slice SLR Timing Diagram

#### Clock Event 1: Shift\_In

During a WRITE (Shift\_In) operation, the single-bit content of the register at the address on the ADDR inputs is changed, as data is shifted through the SRL. The data written to this register is reflected on the X/Y outputs synchronously, if the address is unchanged during the clock event. If the ADDR inputs are changed during a clock event, the value of the data at the addressable output (D) is invalid.

- At time T<sub>WSS</sub> before Clock Event 1, the Write Enable signal (SR) becomes valid-high, enabling the SRL for the WRITE operation that follows.
- At time T<sub>SRLDS</sub> before Clock Event 1 the data becomes valid (0) at the DI input of the SRL and is reflected on the X/Y output after a delay of length T<sub>REG</sub> after Clock Event 1\*.

\* Note: Since the address 0 is specified at Clock Event 1, the data on the DI input is reflected at the D output, because it is written to Register 0.

#### Clock Event 2: Shift\_In

• At time T<sub>SRLDS</sub> before Clock Event 2, the data becomes valid (1) at the DI input of the SRL and is reflected on the X/Y output after a delay of length T<sub>REG</sub> after Clock Event 2\*.

\* Note: Since the address 0 is still specified at Clock Event 2, the data on the DI input is reflected at the D output, because it is written to Register 0.

#### Clock Event 3: Shift\_In / Addressable (Asynchronous) READ

All READ operations are asynchronous. If the address is changed (between clock events), the contents of the register at that address are reflected at the addressable output (X/Y outputs) after a delay of length  $T_{ILO}$  (propagation delay through a LUT).

- At time T<sub>SRLDS</sub> before Clock Event 3 the Data becomes valid (1) at the DI input of the SRL, and is reflected on the X/Y output T<sub>REG</sub> time after Clock Event 3.
- Notice that the address is changed (from 0 to 2) some time after Clock Event 3. The value stored in Register 2 at this time is a 0 (in this example, this was the first data shifted in), and it is reflected on the X/Y output after a delay of length T<sub>ILO</sub>.

#### Clock Event 16: MSB (Most Significant Bit) Changes

• At time T<sub>REGXB</sub> after Clock Event 16, the first bit shifted into the SRL becomes valid (logical 0 in this case) on the XB output of the slice via the MC15 output of the LUT (SRL).

## **Block SelectRAM Timing Model**

### Introduction

This section describes the timing parameters associated with the block SelectRAM (illustrated in Figure 1-7) in Virtex-II FPGA devices. This section is intended to be used with the section on switching characteristics in the <u>Virtex-II Data Sheet (DS031)</u> and the Timing Analyzer (TRCE) report from Xilinx software. For specific timing parameter values, refer to the switching characteristics section in the *Virtex-II Data Sheet*.



DS031\_10\_100300

Figure 1-7: Block SelectRAM Block Diagram

## **Timing Parameters**

| Parameter                                   | Function                                   | Control<br>Signal | Description                                                                                                         |  |  |
|---------------------------------------------|--------------------------------------------|-------------------|---------------------------------------------------------------------------------------------------------------------|--|--|
| Setup and Hold <b>F</b>                     | Relative to Clock                          | (CLK)             |                                                                                                                     |  |  |
| $T_{BxCK} = Setup t$<br>$T_{BCKx} = Hold t$ | time (before clock of ime (after clock ed) | edge)<br>ge)      | The following descriptions are for setup times only.                                                                |  |  |
| T <sub>BACK</sub> /T <sub>BCKA</sub>        | Address inputs                             | ADDR              | Time before the clock that address signals<br>must be stable at the ADDR inputs of the<br>block RAM.                |  |  |
| T <sub>BDCK</sub> /T <sub>BCKD</sub>        | Data inputs                                | DI                | Time before the clock that data must be stable at the DI inputs of the block RAM.                                   |  |  |
| T <sub>BECK</sub> /T <sub>BCKE</sub>        | Enable                                     | EN                | Time before the clock that the enable signal must be stable at the EN input of the block RAM.                       |  |  |
| T <sub>BRCK</sub> /T <sub>BCKR</sub>        | Synchronous<br>Set/Reset                   | SSR               | Time before the clock that the synchronous<br>set/reset signal must be stable at the SSR<br>input of the block RAM. |  |  |
| T <sub>BWCK</sub> /T <sub>BCKW</sub>        | Write Enable                               | WE                | Time before the clock that the write enable signal must be stable at the WE input of the block RAM.                 |  |  |
| Clock to Out                                |                                            |                   |                                                                                                                     |  |  |
| T <sub>BCKO</sub>                           | Clock to<br>Output                         | CLK to<br>DO      | Time after the clock that the output data is stable at the DO outputs of the block RAM.                             |  |  |
| Clock                                       |                                            |                   |                                                                                                                     |  |  |
| T <sub>BPWH</sub>                           | Clock                                      | CLK               | Minimum pulse width, high.                                                                                          |  |  |
| T <sub>BPWL</sub>                           | Clock                                      | CLK               | Minimum pulse width, low.                                                                                           |  |  |

The timing diagram in Figure 1-8 describes a single-port block RAM in Write-First mode. The timing for Read-First and No-Change modes are similar (see chapter 2, block RAM section.)



Figure 1-8: Block SelectRAM Timing Diagram

At time 0, the block RAM is disabled; EN (enable) is low.

#### Clock Event 1

#### **READ** Operation:

During a read operation, the contents of the memory at the address on the ADDR inputs are unchanged.

- T<sub>BACK</sub> before Clock Event 1, address 00 becomes valid at the ADDR inputs of the block RAM.
- At time T<sub>BECK</sub> before Clock Event 1, Enable goes High at the EN input of the block RAM, enabling the memory for the READ operation that follows.
- At time T<sub>BCKO</sub> after Clock Event 1, the contents of the memory at address 00 become stable at the DO pins of the block RAM.

#### Clock Event 2

#### WRITE Operation:

During a write operation, the content of the memory at the location specified by the address on the ADDR inputs is replaced by the value on the DI pins and is immediately reflected on the output latches (in WRITE-FIRST mode); EN (enable) is high.

- At time T<sub>BACK</sub> before Clock Event 2, address 0F becomes valid at the ADDR inputs of the block RAM.
- At time T<sub>BDCK</sub> before Clock Event 2, data CCCC becomes valid at the DI inputs of the block RAM.
- At time T<sub>BWCK</sub> before Clock Event 2, Write Enable becomes valid at the WE following the block RAM.
- At time T<sub>BCKO</sub> after Clock Event 2, data CCCC becomes valid at the DO outputs of the block RAM.

#### Clock Event 4

#### SSR (Synchronous Set/Reset) Operation

During an SSR operation, initialization parameter value SRVAL is loaded into the output latches of the block SelectRAM. The SSR operation does NOT change the contents of the memory and is independent of the ADDR and DI inputs.

- At time T<sub>BRCK</sub> before Clock Event 4, the synchronous set/reset signal becomes valid (High) at the SSR input of the block RAM.
- At time T<sub>BCKO</sub> after Clock Event 4, the SRVAL 0101 becomes valid at the DO outputs of the block RAM.

#### **Clock Event 5**

#### **Disable Operation:**

De-asserting the enable signal EN disables any write, read or SSR operation. The disable operation does NOT change the contents of the memory or the values of the output latches.

- At time T<sub>BECK</sub> before Clock Event 5, the enable signal becomes valid (Low) at the EN input of the block RAM.
- After Clock Event 5, the data on the DO outputs of the block RAM is unchanged.

#### **Timing Model**

Figure 1-9 illustrates the delay paths associated with the implementation of block SelectRAM. This example takes the simplest paths on and off chip (these paths can vary greatly depending on the design). This timing model demonstrates how and where the block SelectRAM timing parameters are used.





$$\begin{split} \textbf{NET} &= Varying interconnect delays\\ \textbf{T}_{IOPI} &= Pad \text{ to I-output of IOB delay}\\ \textbf{T}_{IOOP} &= O\text{-input of IOB to pad delay}\\ \textbf{T}_{GI0O} &= BUFGMUX \text{ delay} \end{split}$$

## **Embedded Multiplier Timing Model**

### Introduction

This section explains all timing parameters associated with the use of embedded 18-bit x 18-bit multipliers in Virtex-II FPGAs (see Figure 1-10). The propagation delays through the embedded multiplier differ based on the size of the multiplier function implemented. The longest delay through the multiplier is to the highest order bit output (P35). Therefore, if an 18-bit x 18-bit signed multiplier is implemented, the worst-case delay for this function is the longest delay associated with the embedded multiplier block. If smaller (LSB) multipliers are used, shorter delays can be realized.

This section is intended to be used in conjunction with the section on switching characteristics in the <u>Virtex-II Data Sheet (DS031)</u> and the Timing Analyzer (TRCE) report from Xilinx software. For specific timing parameter values, refer to the *Virtex-II Data Sheet*.



Figure 1-10: Embedded 18-bit x 18-bit Multiplier Block

### **Timing Parameters**

#### Propagation Delays (All Worst-Case)

Table 1-1 lists the different values for the  $T_{MULT}$  timing parameter reported by the Timing Analyzer software. These values correspond to the propagation delay through the multiplier to a specific output pin of the multiplier block.

Table 1-1: Multiplier Switching Characteristics

| Description                     | Symbol            |
|---------------------------------|-------------------|
| Propagation Delay to Output Pin |                   |
| Input to Pin35                  | T <sub>MULT</sub> |
| Input to Pin34                  | T <sub>MULT</sub> |
| Input to Pin33                  | T <sub>MULT</sub> |
| Input to Pin32                  | T <sub>MULT</sub> |
| Input to Pin31                  | T <sub>MULT</sub> |
| Input to Pin30                  | T <sub>MULT</sub> |
| Input to Pin29                  | T <sub>MULT</sub> |
| Input to Pin28                  | T <sub>MULT</sub> |
| Input to Pin27                  | T <sub>MULT</sub> |
| Input to Pin26                  | T <sub>MULT</sub> |
| Input to Pin25                  | T <sub>MULT</sub> |
| Input to Pin24                  | T <sub>MULT</sub> |
| Input to Pin23                  | T <sub>MULT</sub> |

| Description    | Symbol            |
|----------------|-------------------|
| Input to Pin22 | T <sub>MULT</sub> |
| Input to Pin21 | T <sub>MULT</sub> |
| Input to Pin20 | T <sub>MULT</sub> |
| Input to Pin19 | T <sub>MULT</sub> |
| Input to Pin18 | T <sub>MULT</sub> |
| Input to Pin17 | T <sub>MULT</sub> |
| Input to Pin16 | T <sub>MULT</sub> |
| Input to Pin15 | T <sub>MULT</sub> |
| Input to Pin14 | T <sub>MULT</sub> |
| Input to Pin13 | T <sub>MULT</sub> |
| Input to Pin12 | T <sub>MULT</sub> |
| Input to Pin11 | T <sub>MULT</sub> |
| Input to Pin10 | T <sub>MULT</sub> |
| Input to Pin9  | T <sub>MULT</sub> |
| Input to Pin8  | T <sub>MULT</sub> |
| Input to Pin7  | T <sub>MULT</sub> |
| Input to Pin6  | T <sub>MULT</sub> |
| Input to Pin5  | T <sub>MULT</sub> |
| Input to Pin4  | T <sub>MULT</sub> |
| Input to Pin3  | T <sub>MULT</sub> |
| Input to Pin2  | T <sub>MULT</sub> |
| Input to Pin1  | T <sub>MULT</sub> |
| Input to Pin0  | T <sub>MULT</sub> |

Table 1-1: Multiplier Switching Characteristics (Continued)

The shortest delay is to pin 0 and the longest delay to pin 35. Notice that the delay-to-pin ratio is essentially linear (see Figure 1-11). This implies that smaller multiply functions are faster than larger ones. This is true as long as the LSB inputs are used.



Figure 1-11: Pin-to-Delay Ratio Curve

Figure 1-12 illustrates the result (outputs) of a 4-bit x 4-bit unsigned multiply implemented in an embedded multiplier block.



Figure 1-12: Embedded Multiplier Block Timing Diagram

At time 0 the two 4-bit numbers to be multiplied become valid at the A[0..3], B[0..3] inputs to the embedded multiplier. The result appears on the output pins P[0..7] in a staggered fashion. First, P0 becomes valid at time  $T_{MULT}$ (P0), followed by each subsequent output pin, until P7 becomes valid at time  $T_{MULT}$  (P7). In this case, the delay for this multiply function should correspond to that of Pin 7. In other words, the result is not valid until all output pins become valid.

## **IOB Timing Model**

### Introduction

This section describes all timing parameters associated with the Virtex-II IOB. The section consists of three parts:

- IOB Input Timing Model and Parameters
- IOB Output Timing Model and Parameters
- IOB 3-State Timing Model and Parameters

This section is intended to be used in conjunction with the section on switching characteristics in the <u>Virtex-II Data Sheet (DS031)</u> and the Timing Analyzer (TRCE) report from Xilinx software. For specific timing parameter values, refer to the *Virtex-II Data Sheet*.

#### A Note on I/O Standard Adjustments:

The "IOB Input and Output Switching Characteristics Standard Adjustments" tables in the switching characteristics section of the *Virtex-II Data Sheet* are delay adders (+/-) to be added to all timing parameter values associated with the IOB and the Global Clock (see "Pin-to-Pin Timing Model" on page 37), if an I/O standard other than LVTTL is used.

All values specified in the *Virtex-II Data Sheet* for the parameters covered in this section are specified for LVTTL. If another I/O standard is used, these delays change. However, there are several exceptions. The following parameters associated with the pad going to high-impedance (3-State buffer OFF) should NOT be adjusted:

- T<sub>IOTHZ</sub>
- T<sub>IOTLPHZ</sub>
- T<sub>GTS</sub>
- T<sub>IOCKHZ</sub>
- T<sub>IOSRHZ</sub>

1

## IOB Input Timing Model and Parameters

Figure 1-13 illustrates IOB inputs.



UG002\_C3\_004\_101300

Figure 1-13: Virtex-II IOB Input Diagram

## **Timing Parameters**

| Parameter                                                  | Function                                    | Control<br>Signal | Description                                                                                                                       |  |
|------------------------------------------------------------|---------------------------------------------|-------------------|-----------------------------------------------------------------------------------------------------------------------------------|--|
| <b>Propagation Delays</b>                                  |                                             |                   |                                                                                                                                   |  |
| T <sub>IOPI</sub>                                          |                                             |                   | Propagation delay from the pad to I output of the IOB with no delay adder.                                                        |  |
| T <sub>IOPID</sub>                                         |                                             |                   | Propagation delay from the pad to I output of the IOB with the delay adder.                                                       |  |
| T <sub>IOPLI</sub>                                         |                                             |                   | Propagation delay from the pad to IQ output of the IOB via transparent latch with no delay adder.                                 |  |
| T <sub>IOPLID</sub>                                        |                                             |                   | Propagation delay from the pad to IQ output of the IOB via transparent latch with the delay adder.                                |  |
| Setup and Hold With Respect to Clock at IOB Input Register |                                             |                   |                                                                                                                                   |  |
| $T_{xxCK}$ = Setup time<br>$T_{xxCKxx}$ = Hold time        | (before clock edge)<br>e (after clock edge) |                   | The following descriptions are for setup times only.                                                                              |  |
| T <sub>IOPICK</sub> /T <sub>IOICKP</sub>                   | ID input with<br>NO delay                   |                   | Time before the clock that the input signal from the pad must be stable at the ID input of the IOB Input Register, with no delay. |  |

# 

| Parameter                                  | Function                         | Control<br>Signal | Description                                                                                                                       |
|--------------------------------------------|----------------------------------|-------------------|-----------------------------------------------------------------------------------------------------------------------------------|
| T <sub>IOPICKD</sub> /T <sub>IOICKPD</sub> | ID input with delay              |                   | Time before the clock that the input signal from the pad must<br>be stable at the ID input of the IOB Input Register, with delay. |
| T <sub>IOICECK</sub> /T <sub>IOCKICE</sub> | ICE input                        |                   | Time before the clock that the Clock Enable signal must be stable at the ICE input of the IOB Input Register.                     |
| T <sub>IOSRCKI</sub>                       | SR input (IFF,<br>synchronous)   |                   | Time before the clock that the Set/Reset signal must be stable at the SR input of the IOB Input Register.                         |
| Clock to Out                               | •                                |                   |                                                                                                                                   |
| T <sub>IOCKIQ</sub>                        | Clock (CLK) to<br>(IQ) output    |                   | Time after the clock that the output data is stable at the IQ output of the IOB Input Register.                                   |
| Set/Reset Delays                           |                                  |                   |                                                                                                                                   |
| T <sub>IOSRIQ</sub>                        | SR Input to IQ<br>(asynchronous) |                   | Time after the Set/Reset signal of the IOB is toggled that the output of the IOB input register (IQ) reflects the signal.         |
| T <sub>GSRQ</sub>                          | GSR to output IQ                 |                   | Time after the Global Set/Reset is toggled that the output of the IOB input register (IQ) reflects the set or reset.              |

#### Figure 1-14 illustrates IOB input register timing.



Figure 1-14: IOB Input Register Timing Diagram

#### **Clock Events**

- At time T<sub>IOICECK</sub> before Clock Event 1, the input clock enable signal becomes valid-high at the ICE input of the input register, enabling the input register for incoming data.
- At time T<sub>IOPICK</sub> before Clock Event 1, the input signal becomes valid-high at the I input of the input register and is reflected on the IQ output of the input register at time T<sub>IOCKIQ</sub> after Clock Event 1.
- At time T<sub>IOSRCKI</sub> before Clock Event 4 the SR signal (configured as synchronous reset in this case) becomes valid-high resetting the input register and reflected at the IQ output of the IOB at time T<sub>IOCKIQ</sub> after Clock Event 4.





Figure 1-15: IOB DDR Input Register Timing Diagram

#### **Clock Events**

- At time T<sub>IOICECK</sub> before Clock Event 1 the input clock enable signal becomes validhigh at the ICE input of both of the DDR input registers, enabling them for incoming data. Since the ICE and I signals are common to both DDR registers, care must be taken to toggle these signals between the rising edges of ICLK1 and ICLK2 as well as meeting the register setup-time relative to both clocks.
- At time T<sub>IOPICK</sub> before Clock Event 1 (rising edge of ICLK1) the input signal becomes valid-high at the I input of both registers and is reflected on the IQ1 output of input-register 1 at time T<sub>IOCKIQ</sub> after Clock Event 1.
- At time T<sub>IOPICK</sub> before Clock Event 2 (rising edge of ICLK2) the input signal becomes valid-low at the I input of both registers and is reflected on the IQ2 output of input-register 2 at time T<sub>IOCKIQ</sub> after Clock Event 2 (no change in this case).
- At time T<sub>IOSRCKI</sub> before Clock Event 9 the SR signal (configured as synchronous reset in this case) becomes valid-high resetting input-register 1 (IQ1) at time T<sub>IOCKIQ</sub> after Clock Event 9, and input-register 2 (IQ2) at time T<sub>IOCKIQ</sub> after Clock Event 10.

## IOB Output Timing Model and Parameters

Figure 1-16 illustrates IOB outputs.



Figure 1-16: Virtex-II IOB Output Diagram

### **Timing Parameters**

| Parameter                                              | Function                                 | Control<br>Signal | Description                                                                                                    |  |
|--------------------------------------------------------|------------------------------------------|-------------------|----------------------------------------------------------------------------------------------------------------|--|
| Propagation Delays                                     |                                          |                   |                                                                                                                |  |
| T <sub>IOOP</sub>                                      |                                          |                   | Propagation delay from the O input of the IOB to the pad.                                                      |  |
| T <sub>IOOLP</sub>                                     |                                          |                   | Propagation delay from the O input of the IOB to the pad via transparent latch.                                |  |
| Setup and Hold With I                                  | Respect to Clock at IOB                  | Output I          | Register                                                                                                       |  |
| $T_{xxCK}$ = Setup time (I<br>$T_{xxCKxx}$ = Hold time | pefore clock edge)<br>(after clock edge) |                   | The following descriptions are for setup times only.                                                           |  |
| T <sub>IOOCK</sub> /T <sub>IOCKO</sub>                 | O input                                  |                   | Time before the clock that data must be stable at the O input of the IOB Output Register.                      |  |
| T <sub>IOOCECK</sub> /T <sub>IOCKOCE</sub>             | OCE input                                |                   | Time before the clock that the Clock Enable signal must be stable at the OCE input of the IOB Output Register. |  |
| T <sub>IOSRCKO</sub> /T <sub>IOCKOSR</sub>             | SR input (OFF)                           |                   | Time before the clock that the Set/Reset signal must be stable at the SR input of the IOB Output Register.     |  |
| Clock to Out                                           |                                          |                   |                                                                                                                |  |
| T <sub>IOCKP</sub>                                     | Clock (CLK) to pad                       |                   | Time after the clock that the output data is stable at the pad.                                                |  |
| Set/Reset Delays                                       |                                          |                   |                                                                                                                |  |
| T <sub>IOSRP</sub>                                     | SR Input to pad<br>(asynchronous)        |                   | Time after the Set/Reset input of the IOB is toggled that the pad reflects the set or reset.                   |  |
| T <sub>IOGSRQ</sub>                                    | GSR to pad                               |                   | Time after the Global Set/Reset is toggled that the pad reflects the set or reset.                             |  |





Figure 1-17: IOB Output Register Timing Diagram

#### **Clock Events**

- At time T<sub>IOOCECK</sub> before Clock Event 1, the output clock enable signal becomes validhigh at the OCE input of the output register, enabling the output register for incoming data.
- At time T<sub>IOOCK</sub> before Clock Event 1, the output signal becomes valid-high at the O input of the output register and is reflected on the pad at time T<sub>IOCKP</sub> after Clock Event 1.
- At time T<sub>IOSRCKO</sub> before Clock Event 4, the SR signal (configured as synchronous reset in this case) becomes valid-high, resetting the output register and reflected on the pad at time T<sub>IOCKP</sub> after Clock Event 4.





Figure 1-18: IOB DDR Output Register Timing Diagram

#### **Clock Events**

- At time T<sub>IOOCECK</sub> before Clock Event 1, the output clock enable signal becomes validhigh at the OCE input of both of the DDR output registers, enabling them for incoming data. Since the OCE signal is common to both DDR registers, care must be taken to toggle this signal between the rising edges of OTCLK1 and OTCLK2 as well as meeting the register setup-time relative to both clocks.
- At time T<sub>IOOCK</sub> before Clock Event 1 (rising edge of OTCLK1), the output signal O1 becomes valid-high at the O1 input of output register 1 and is reflected on the pad at time T<sub>IOCKP</sub> after Clock Event 1.
- At time T<sub>IOOCK</sub> before Clock Event 2 (rising edge of OTCLK2), the output signal O2 becomes valid-high at the O2 input of output register 2 and is reflected on the pad at time T<sub>IOCKP</sub> after Clock Event 2 (no change on the pad in this case).
- At time T<sub>IOSRCKO</sub> before Clock Event 9, the SR signal (configured as synchronous reset in this case) becomes valid-high, resetting output-register 1 (reflected on the pad at time T<sub>IOCKP</sub> after Clock Event 9) (no change in this case) and output-register 2 (reflected on the pad at time T<sub>IOCKP</sub> after Clock Event 10) (no change in this case).

## IOB 3-State Timing Model and Parameters

Figure 1-19 illustrates IOB 3-state timing



Figure 1-19: Virtex-II IOB 3-State Diagram

### **Timing Parameters**

| Parameter                                                                                  | Function | Control<br>Signal                                    | Description                                                                                                                   |  |  |
|--------------------------------------------------------------------------------------------|----------|------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------|--|--|
| Propagation Delays                                                                         |          |                                                      |                                                                                                                               |  |  |
| T <sub>IOTHZ</sub>                                                                         |          |                                                      | Time after T input of the IOB is toggled that the pad goes to high-impedance.                                                 |  |  |
| T <sub>IOTON</sub>                                                                         |          |                                                      | Time after the T input of the IOB is toggled that the pad goes from high-impedance to valid data.                             |  |  |
| T <sub>IOTLPHZ</sub>                                                                       |          |                                                      | Time after the T input of the IOB via transparent latch is toggled that the pad goes to high-impedance.                       |  |  |
| T <sub>IOTLPON</sub>                                                                       |          |                                                      | Time after the T input of the IOB via transparent latch<br>is toggled that the pad goes from high-impedance to<br>valid data. |  |  |
| T <sub>GTS</sub>                                                                           |          |                                                      | Time after the Global 3-state signal is asserted that the pad goes to high-impedance.                                         |  |  |
| Setup and Hold With Respect to Clock at IOB 3-State Register                               |          |                                                      |                                                                                                                               |  |  |
| $T_{xxCK}$ = Setup time (before clock edge)<br>$T_{xxCKxx}$ = Hold time (after clock edge) |          | The following descriptions are for setup times only. |                                                                                                                               |  |  |
| T <sub>IOTCK</sub> /T <sub>IOCKT</sub>                                                     | T input  |                                                      | Time before the clock that the signal must be stable<br>at the T input of the IOB 3-state Register.                           |  |  |

# 

| Parameter                                  | Function                                     | Control<br>Signal | Description                                                                                                           |  |  |
|--------------------------------------------|----------------------------------------------|-------------------|-----------------------------------------------------------------------------------------------------------------------|--|--|
| T <sub>IOTCECK</sub> /T <sub>IOCKTCE</sub> | TCE input                                    |                   | Time before the clock that the clock enable signal<br>must be stable at the TCE input of the IOB 3-state<br>Register. |  |  |
| T <sub>IOSRCKT</sub> /T <sub>IOCKTSR</sub> | SR input (TFF)                               |                   | Time before the clock that the set/reset signal.                                                                      |  |  |
| Clock to Out                               |                                              |                   |                                                                                                                       |  |  |
| T <sub>IOCKHZ</sub>                        | Clock (CLK) to pad High-Z                    |                   | Time after clock that the pad goes to high-<br>impedance.                                                             |  |  |
| T <sub>IOCKON</sub>                        | Clock (CLK) to valid data<br>on pad          |                   | Time after clock that the pad goes from high-<br>impedance to valid data.                                             |  |  |
| Set/Reset Delays                           |                                              |                   |                                                                                                                       |  |  |
| T <sub>IOSRHZ</sub>                        | SR Input to pad High-Z<br>(asynchronous)     |                   | Time after the SR signal is toggled that the pad goes to high-impedance.                                              |  |  |
| T <sub>IOSRON</sub>                        | SR Input to valid data on pad (asynchronous) |                   | Time after the SR signal is toggled that the pad goes from high-impedance to valid data.                              |  |  |

Figure 1-20 illustrates IOB 3-state register timing.



Figure 1-20: IOB 3-State Register Timing Diagram

#### **Clock Events**

- At time T<sub>IOTCECK</sub> before Clock Event 1, the 3-state clock enable signal becomes validhigh at the TCE input of the 3-state register, enabling the 3-state register for incoming data.
- At time T<sub>IOTCK</sub> before Clock Event 1 the 3-state signal becomes valid-high at the T input of the 3-state register, returning the pad to high-impedance at time T<sub>IOCKHZ</sub> after Clock Event 1.
- At time T<sub>IOSRCKT</sub> before Clock Event 2, the SR signal (configured as synchronous reset in this case) becomes valid-high, resetting the 3-state register and returning the pad to valid data at time T<sub>IOSRON</sub> after Clock Event 2.





Figure 1-21: IOB DDR 3-State Register Timing Diagram

#### **Clock Events**

- At time T<sub>IOTCECK</sub> before Clock Event 1, the 3-state clock enable signal becomes validhigh at the TCE input of both of the DDR 3-state registers, enabling them for incoming data. Since the TCE signal is common to both DDR registers, care must be taken to toggle this signal between the rising edges of OTCLK1 and OTCLK2 as well as meeting the register setup-time relative to both clocks.
- At time T<sub>IOTCK</sub> before Clock Event 2 (rising edge of OTCLK2), the 3-state signal T2 becomes valid-high at the T2 input of 3-state register 2, switching the pad to high-impedance at time T<sub>IOCKHZ</sub> after Clock Event 2.
- At time T<sub>IOTCK</sub> before Clock Event 3 (rising edge of OTCLK1), the 3-state signal T1 becomes valid-high at the T1 input of 3-state register 1, keeping the pad at high-impedance for another half clock cycle (half the period of OTCLK1 or 2).
- At time T<sub>IOTCK</sub> before Clock Event 4 (rising edge of OTCLK2), the 3-state signal T2 becomes valid-low at the T2 input of 3-state register 2, switching the pad to valid data at time T<sub>IOCKON</sub> after Clock Event 4. This is repeated for 3-state signal T1 at the following clock event (5) maintaining valid data on the pad until Clock Event 8.
- At time T<sub>IOTCK</sub> before Clock Event 8 (rising edge of OTCLK2), the 3-state signal T2 becomes valid-high at the T2 input of 3-state register 2, switching the pad to high-impedance at time T<sub>IOCKHZ</sub> after Clock Event 8.
- At time T<sub>IOSRCKT</sub> before Clock Event 9 (rising edge of OTCLK1), the SR signal (configured as synchronous reset in this case) becomes valid-high at the SR input of 3-state Register 1, returning the pad to valid data at time T<sub>IOSRON</sub> after Clock Event 9.
# **Pin-to-Pin Timing Model**

# Introduction

This section explains the delays and timing parameters associated with the use of the Global Clock network and the DCM. These delays are true pin-to-pin delays relative to the Global Clock pin and an output or input pin with or without the DCM.

This section consists of two parts:

- Global Clock Input to Output
- Global Clock Setup and Hold

The former describes the delay from the Global Clock pin (with and without the DCM) to an output pin via an Output flip-flop. The latter describes the set-up time for an Input flipflop from an input pin relative to the Global Clock pin (with and without the DCM).

The values reported in the switching characteristics section of the <u>Virtex-II Data Sheet</u>. (DS031) are for LVTTL I/O standards. For different I/O standards, adjust these values with those shown in the "IOB Switching Characteristics Standard Adjustments" tables.

This section is intended to be used in conjunction with the section on switching characteristics in the *Virtex-II Data Sheet* and the Timing Analyzer (TRCE) report from Xilinx software. For specific timing parameter values, refer to the *Virtex-II Data Sheet*.

# Global Clock Input to Output

Figure 1-22 illustrates the paths associated with the timing parameters defined in this section. Note that they differ only in their use of the DCM.



UG002\_C3\_013\_101300

Figure 1-22: Global Clock Input to Output Model

www.xilinx.com 1-800-255-7778

# **Timing Parameters**

| Parameter             | Description                                                                                                                        |
|-----------------------|------------------------------------------------------------------------------------------------------------------------------------|
| T <sub>ICKOFDLL</sub> | Time after the Global Clock (pin), using the DCM, that the output data from an IOB Output flip-flop is stable at the output pin.   |
| T <sub>ICKOF</sub>    | Time after the Global Clock (pin), without the DCM, that the output data from an IOB Output flip-flop is stable at the output pin. |

The waveforms depicted in Figure 1-23 demonstrate the relation of the Global Clock pin, the output data, and the use of the timing parameters.



Figure 1-23: Global Clock Input to Output Timing Diagram

# Global Clock Setup and Hold

Figure 1-24 illustrates the paths associated with the timing parameters defined in this section. Note, they differ only in their use of the DCM.



UG002\_C3\_014\_101300

Figure 1-24: Global Clock Setup and Hold Model

# **Timing Parameters**

Setup and Hold for Input Registers Relative to the Global Clock (pin):

- T<sub>PSDLL</sub> / T<sub>PHDLL</sub> Time before the Global Clock (pin), with DCM, that the input signal must be stable at the D-input of the IOB input register.
- T<sub>PSFD</sub> / T<sub>PHFD</sub> Time before the Global Clock (pin), without DCM, that the input signal must be stable at the D-input of the IOB input register.

Note:  $T_{PSFD}$  = Setup time (before clock edge) and  $T_{PHFD}$  = Hold time (after clock edge). The previous descriptions are for setup times only.

The waveforms depicted in Figure 1-25 demonstrate the relation of the Global Clock pin, the input data, and the use of the timing parameters.



Figure 1-25: Global Clock Setup and Hold Timing Diagram

1

# **Digital Clock Manager Timing Model**

This section describes the timing parameters associated with the Digital Clock Manager (DCM), which are reported in the <u>Virtex-II Data Sheet (DS031)</u>. Note that these parameters are not used by the Timing Analyzer software in the production of timing reports; they are all measured values and are fully characterized in silicon. For specific timing parameter values, refer to the *Virtex-II Data Sheet*. This section discusses the following:

- **Operating Frequency Ranges:** The minimum and maximum frequencies supported by the DCM for all clock inputs and outputs.
- **Input Clock Tolerances:** Input clock period (pulse widths), jitter, and drift requirements for proper function of the DCM for all clock inputs.
- **Output Clock Precision:** Output clock period jitter, phase offsets, and duty cycle for all clock outputs of the DCM (worst case).
- **Miscellaneous Timing Parameters:** DCM lock times, Tap delay and shifting range.

For a detailed description of input clock tolerance, jitter, and phase offset see the waveforms at the end of this section.

# **Operating Frequency Ranges**

Figure 1-26 illustrates the DCM functional block and corresponding timing parameters for all clock inputs and outputs.



Figure 1-26: DCM Functional Block: Operating Frequency Ranges

# **Timing Parameters**

| Parameter                      | Description                                                                                                                            |
|--------------------------------|----------------------------------------------------------------------------------------------------------------------------------------|
| Low Frequency Mode             |                                                                                                                                        |
| CLKOUT_FREQ_1X_LF              | The minimum and maximum frequency for the CLK0, CLK90, CLK180, CLK270 outputs of the DCM in low-frequency mode.                        |
| CLKOUT_FREQ_2X_LF              | The minimum and maximum frequency for the CLK2X and CLK2X180 outputs of the DCM in low-frequency mode.                                 |
| CLKOUT_FREQ_DV_LF              | The minimum and maximum frequency for the CLKDV output of the DCM in low-frequency mode.                                               |
| CLKOUT_FREQ_FX_LF              | The minimum and maximum frequency for the CLKFX and CLKFX180 outputs of the DCM in low-frequency mode.                                 |
| CLKIN_FREQ_DLL_LF <sup>1</sup> | The minimum and maximum frequency for the CLKIN input to the DCM in low-frequency mode when using the delay-locked loop (DLL) outputs. |
| CLKIN_FREQ_FX_LF <sup>2</sup>  | The minimum and maximum frequency for the CLKIN input to the DCM in low-frequency mode when using the FX outputs.                      |
| PSCLK_FREQ_LF                  | The minimum and maximum frequency for the PSCLK input to the DCM in low-frequency mode.                                                |
| High Frequency Mode            |                                                                                                                                        |
| CLKOUT_FREQ_1X_HF              | The minimum and maximum frequency for the CLK0, CLK180 outputs of the DCM in high-frequency mode.                                      |
| CLKOUT_FREQ_DV_HF              | The minimum and maximum frequency for the CLKDV output of the DCM in high-frequency mode.                                              |
| CLKOUT_FREQ_FX_HF              | The minimum and maximum frequency for the CLKFX and CLKFX180 outputs of the DCM in high-frequency mode.                                |
| CLKIN_FREQ_DLL_HF              | The minimum and maximum frequency for the CLKIN input<br>to the DCM in high-frequency mode when using the DLL<br>outputs.              |
| CLKIN_FREQ_FX_HF               | The minimum and maximum frequency for the CLKIN input<br>to the DCM in high-frequency mode when using the FX<br>outputs.               |
| PSCLK_FREQ_HF                  | The minimum and maximum frequency for the PSCLK input to the DCM in high-frequency mode.                                               |

Notes:

- 1. Delay-locked loop (DLL) outputs include: CLK0, CLK90, CLK180, CLK270, CLK2X, CLK2X180, and CLKDV.
- 2. FX outputs include: CLKFX and CLKFX180

1

# Input Clock Tolerances

# **Timing Parameters**

| Parameter                | Description                                                                                                                                            |
|--------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------|
| PSCLK_PULSE <sup>1</sup> | The minimum pulse width (HIGH and LOW) that the PSCLK input to the DCM can have over a range of frequencies.                                           |
| CLKIN_PULSE              | The minimum pulse width (HIGH and LOW) that the CLKIN input to the DCM can have over a range of frequencies. Also applies to PSCLK.                    |
| CLKFB_DELAY_VAR_EXT      | The maximum allowed variation in delay (across<br>environmental changes) of the feedback clock path<br>when routed externally for board-level de-skew. |
| Low Frequency Mode       |                                                                                                                                                        |
| CLKIN_CYC_JITT_DLL_LF    | The maximum cycle-to-cycle jitter the CLKIN input to<br>the DCM can have when using the DLL outputs in<br>low-frequency mode.                          |
| CLKIN_CYC_JITT_FX_LF     | The maximum cycle-to-cycle jitter the CLKIN input to<br>the DCM can have when using the FX outputs in low-<br>frequency mode.                          |
| CLKIN_PER_JITT_DLL_LF    | The maximum period jitter the CLKIN input to the DCM can have when using the DLL outputs in low-frequency mode.                                        |
| CLKIN_PER_JITT_FX_LF     | The maximum period jitter the CLKIN input to the DCM can have when using the FX outputs in low-frequency mode.                                         |
| High Frequency Mode      |                                                                                                                                                        |
| CLKIN_CYC_JITT_DLL_HF    | The maximum cycle-to-cycle jitter the CLKIN input to<br>the DCM can have when using the DLL outputs in<br>high-frequency mode.                         |
| CLKIN_CYC_JITT_FX_HF     | The maximum cycle-to-cycle jitter the CLKIN input to<br>the DCM can have when using the FX outputs in high-<br>frequency mode.                         |
| CLKIN_PER_JITT_DLL_HF    | The maximum period jitter the CLKIN input to the DCM can have when using the DLL outputs in high-frequency mode.                                       |
| CLKIN_PER_JITT_FX_HF     | The maximum period jitter the CLKIN input to the DCM can have when using the FX outputs in high-frequency mode.                                        |

#### Notes:

 The frequencies applicable to CLKIN\_PULSE range from 1 to >400 MHz. These frequencies also apply to PSCLK\_PULSE. Since PSCLK can be less than 1 MHz, the pulse width under this condition is specified for PSCLK only.

# **Output Clock Precision**

# **Timing Parameters**

| Parameter             | Description                                                                                           |
|-----------------------|-------------------------------------------------------------------------------------------------------|
| CLKOUT_PER_JITT_0     | The maximum period jitter of the CLK0 output clock from the DCM (worst case).                         |
| CLKOUT_PER_JITT_90    | The maximum period jitter of the CLK90 output clock from the DCM (worst case).                        |
| CLKOUT_PER_JITT_180   | The maximum period jitter of the CLK180 output clock from the DCM (worst case).                       |
| CLKOUT_PER_JITT_270   | The maximum period jitter of the CLK270 output clock from the DCM (worst case).                       |
| CLKOUT_PER_JITT_2X    | The maximum period jitter of the CLK2X and CLK2X180 output clocks from the DCM (worst case).          |
| CLKOUT_PER_JITT_DV1   | The maximum period jitter of the CLKDV (integer division) output clock from the DCM (worst case).     |
| CLKOUT_PER_JITT_DV2   | The maximum period jitter of the CLKDV (non-integer division) output clock from the DCM (worst case). |
| CLKOUT_PER_JITT_FX    | The maximum period jitter of the FX output clocks from the DCM (worst case).                          |
| CLKIN_CLKFB_PHASE     | Maximum phase offset between the CLKIN and CLKFB inputs to the DCM.                                   |
| CLKOUT_PHASE          | Maximum phase offset between any DCM clock outputs.                                                   |
| CLKOUT_DUTY_CYCLE_DLL | The duty-cycle precision for all DLL outputs.                                                         |
| CLKOUT_DUTY_CYCLE_FX  | The duty-cycle precision for the FX outputs.                                                          |

# Miscellaneous DCM Timing Parameters

# Table 1-2: Miscellaneous DCM Timing Parameters

| Parameter           | Description                                                                                 |
|---------------------|---------------------------------------------------------------------------------------------|
| LOCK_DLL            | Time required for DCM to lock over a range of clock frequencies when using the DLL outputs. |
| LOCK_FX             | Time required for DCM to lock when using the FX outputs.                                    |
| LOCK_DLL_FINE_SHIFT | Additional lock time when performing fine phase shifting.                                   |
| FINE_SHIFT_RANGE    | Absolute range for fine phase shifting.                                                     |
| DCM_TAP             | Resolution of delay line.                                                                   |

The waveforms in Figure 1-27 demonstrate the relationship between clock tolerance, jitter, and phase.





Figure 1-27: DCM Jitter, Phase, and Tolerance Timing Waveforms

Output jitter is period jitter measured on the DLL output clocks, excluding input clock jitter.

Phase offset between CLKIN and CLKFB is the worst-case fixed time difference between rising edges of CLKIN and CLKFB, excluding output jitter and input clock jitter.

Phase offset between clock outputs on the DLL is the worst-case fixed time difference between rising edges of any two DLL outputs, excluding output jitter and input clock jitter.

Maximum phase difference between CLKIN an CLKFB is the sum of output jitter and phase offset between CLKIN and CLKFB, or the greatest difference between CLKIN and CLKFB rising edges due to DLL alone (excluding input clock jitter).

Maximum phase difference between clock outputs on the DLL is the sum of output jitter and phase offset between any DLL clock outputs, or the greatest difference between any two DLL output rising edges due to DLL alone (excluding input clock jitter).



# Chapter 2

# **Design Considerations**

# Summary

This chapter covers the following topics:

- Using Global Clock Networks
- Using Digital Clock Managers (DCMs)
- Using Block SelectRAM<sup>TM</sup> Memory
- Using Distributed SelectRAM Memory
- Using Look-Up Tables as Shift Registers (SRLs)
- Designing Large Multiplexers
- Implementing Sum of Products (SOP) Logic
- Using Embedded Multipliers
- Using Single-Ended SelectI/O-Ultra Resources
- Using Digitally Controlled Impedance (DCI)
- Using Double-Data-Rate (DDR) I/O
- Using LVDS I/O
- Using LVPECL I/O
- Using Bitstream Encryption
- Using the CORE Generator System

# Introduction

This chapter describes how to take advantage of the many special features of the Virtex-II architecture to achieve maximum density and performance. In many cases, the functions described can be automatically generated using the Xilinx CORE Generator<sup>™</sup> tool. This is noted throughout the chapter, specifically in the following sections:

- Using Block SelectRAM<sup>™</sup> Memory
- Using Distributed SelectRAM Memory
- Using Look-Up Tables as Shift Registers (SRLs)
- Designing Large Multiplexers
- Using Embedded Multipliers

# **Using Global Clock Networks**

# Introduction

Virtex-II devices support very high frequency designs and thus require low-skew advanced clock distribution. With device density up to 10 million system gates, numerous global clocks are necessary in most designs. Therefore, to provide a uniform and portable solution (soft-IP), all Virtex-II devices from XC2V40 to XC2V8000 have 16 global clock buffers and support 16 global clock domains. Up to eight of these clocks can be used in any quadrant of the device by the synchronous logic elements (that is, registers, 18Kb block RAM, pipeline multipliers) and the IOBs. The software tools place and route these global clocks automatically.

If the design uses between 8 and 16 clocks, it must be partitioned into quadrants, with up to 8 clocks per quadrant. If more than 16 clocks are required, the backbone (24 horizontal and vertical long lines routing resources) can be used as additional clock network.

In addition to clock distribution, the 16 clock buffers are also "glitch-free" synchronous 2:1 multiplexers. These multiplexers are capable of switching between two asynchronous (or synchronous) clocks at any time. No particular phase relations between the two clocks are needed. The clock multiplexers can also be configured as a global clock buffer with a clock enable. The clock can be stopped High or Low at the clock buffer output.

# **Clock Distribution Resources**

The various resources available to manage and distribute the clocks include:

- 16 clock pads that can be used as regular user I/Os if not used as clock inputs. The 16 clock pads can be configured for any I/O standard, including differential standards (for example, LVDS, LVPECL, and so forth).
- 16 "IBUFG" elements that represent the clock inputs in a VHDL or Verilog design.
- 8 "IBUFGDS" elements (that is, attributes LVPECL\_33, LVDS\_25, LVDS\_33, LDT\_25, or ULVDS\_25) that represent the differential clock input pairs in a VHDL or Verilog design. Each IBUFGDS replaces two IBUFG elements.
- 4 to 12 Digital Clock Managers (DCMs), depending on the device size, to de-skew and generate the clocks. For more information on DCMs, see "Using Digital Clock Managers (DCMs)" on page 67.
- 16 "BUFGMUX" elements that can consist of up to 16 global clock buffers (BUFG), global clock buffers with a clock enable (BUFGCE), or global clock multiplexers (BUFGMUX).

Figure 2-1 illustrates the placement of these clock resources in Virtex-II devices (the XC2V250 through the XC2V2000) that have eight DCMs.



Figure 2-1: Clock Resources in Virtex-II Devices

The simple scheme to distribute an external clock in the device is to implement a clock pad with an IBUFG input buffer connected to a BUFG global buffer, as shown in Figure 2-2 and Figure 2-3. The primary (GCLKP) and secondary (GCLKS) clock pads have no relationship with the P-side and N-side of differential clock inputs. In banks 0 and 1, the GCLKP corresponds to the N-side, and the GCLKS corresponds to the P-side of a differential clock input. In banks 4 and 5, this correspondence is reversed.



Figure 2-2: Simple Clock Distribution (Bank 0 and 1 Scheme)

2



Figure 2-3: Simple Clock Distribution (Bank 4 and 5 Scheme)

Major synthesis tools automatically infer the IBUFG and BUFG when the corresponding input signal is used as a clock in the VHDL or Verilog code.

A high frequency or adapted (frequency, phase, and so forth) clock distribution with low skew is implemented by using a DCM between the output of the IBUFG and the input of the BUFG, as shown in Figure 2-4. "Using Digital Clock Managers (DCMs)" on page 67 provides details about DCMs and their use.



Figure 2-4: Clock Distribution with DCM

Clock distribution from internal sources is also possible with a BUFG only or with a DCM, as shown in Figure 2-5.



Figure 2-5: Internal Logic Driving Clock Distribution

# **Global Clock Inputs**

The clock buffer inputs are fed either by one of the 16 clock pads (refer to the <u>Virtex-II Data</u> <u>Sheet (DS031)</u>), by the outputs of the DCM, or by local interconnect. Each clock buffer can be a synchronous "glitch-free" 2:1 multiplexer with two clock inputs and one select input. Internal logic (or alternatively a regular IOB) can feed the clock inputs. Any internal or external signal can drive the select input or clock enable input.

The possible inputs driving a global clock buffer or multiplexer are summarized in Table 2-1.

| Table 2-1: | Inputs Driving | <b>Global Clock</b> | Buffers or DCMs |
|------------|----------------|---------------------|-----------------|
|------------|----------------|---------------------|-----------------|

|                                         |                                           | Destination                                  |                                         |                         |                                      |  |
|-----------------------------------------|-------------------------------------------|----------------------------------------------|-----------------------------------------|-------------------------|--------------------------------------|--|
| Source                                  | BUFG(I) or<br>BUFGCE(I)                   | BUFGCE<br>(CE)                               | BUFGMUX<br>(I0 or I1)                   | BUFGMUX<br>(S)          | DCM<br>(CLKIN)                       |  |
| External Clock via IBUFG(O)             | Dedicated in same quadrant <sup>1</sup>   | NA                                           | Dedicated in same quadrant <sup>1</sup> | NA                      | Same edge                            |  |
| DCM Clock Outputs                       | Same edge (top<br>or bottom) <sup>2</sup> | NA Same edge (top<br>or bottom) <sup>2</sup> |                                         | NA                      | General<br>interconnect <sup>3</sup> |  |
| Internal Logic                          | General<br>interconnect                   | General interconnect                         | General<br>interconnect                 | General<br>interconnect | General<br>interconnect <sup>3</sup> |  |
| User I/O Pad via IBUF(O)<br>(not IBUFG) | General interconnect                      | General<br>interconnect                      | General<br>interconnect                 | General<br>interconnect | General<br>interconnect <sup>3</sup> |  |
| BUFG(O)                                 | NA                                        | NA                                           | NA                                      | NA                      | Global clock<br>net                  |  |
| BUFGMUX(O)                              | NA                                        | NA                                           | General<br>interconnect                 | NA                      | Global clock<br>net                  |  |

#### Notes:

1. Not all IBUFGs in the quadrant have a dedicated connection to a specific BUFG. Others would require general interconnect to be hooked up.

2. Same edge (top or bottom) enables use of dedicated routing resources.

3. Pad to DCM input skew is not compensated.

All BUFG (BUFGCE, BUFGMUX) outputs are available at the quadrant boundaries.

The output of the global clock buffer can be routed to non-clock pins.

# Primary and Secondary Global Multiplexers

Each global clock buffer is a self-synchronizing circuit called a clock multiplexer.

The 16 global clock buffers or multiplexers are divided as follows:

- Eight primary clock multiplexers
- Eight secondary clock multiplexers

No hardware difference exists between a primary and a secondary clock multiplexer. However, some restrictions apply to primary/secondary multiplexers, because they share input connections, as well as access to a quadrant.

Each Virtex-II device is divided into four quadrants: North-West, South-West, North-East, and South-East. Each quadrant has two primary and two secondary clock multiplexers. The clock multiplexers are indexed 0 to 7, with one primary and one secondary for each index, alternating on the top and on the bottom (i.e., clock multiplexer "0P" at the bottom is facing clock multiplexer "0S" at the top).

In each device, the eight top/bottom clock multiplexers are divided into four primary and four secondary, indexed 0 to 7, as shown in Figure 2-6.



Figure 2-6: Primary and Secondary Clock Multiplexer Locations

## Primary/Secondary: Rule 1

Considering two "facing" clock multiplexers (BUFG#P and BUFG#S), one or the other of these clock outputs can enter any quadrant of the chip to drive a clock within that quadrant, as shown in Figure 2-7. Note that the clock multiplexers "xP" and "xS" compete for quadrant access. For example, BUFG0P output cannot be used in the same quadrant as BUFG0S.



Figure 2-7: Facing BUFG#P and BUFG#S Connections

## Primary/Secondary: Rule 2

In a BUFGCE or BUFGMUX configuration, shared inputs have to be considered. Any two adjacent clock multiplexers share two inputs, as shown in Figure 2-8. The clock multiplexer "1P" and "0S" have common I0/I1 and I1/I0 inputs.



UG002\_C2\_089\_113000

#### Figure 2-8: Clock Multiplexer Pair Sharing Clock Multiplexer Inputs

Table 2-2 lists the clock multiplexer pairs in any Virtex-II device. The primary multiplexer inputs I1/I0 are common with the corresponding secondary multiplexer inputs I0/I1 (i.e., Primary I1 input is common with secondary I0 input, and primary I0 input is common with secondary I1 input).

| Table 2-2: | Top Clock | Multiplexer | Pairs |
|------------|-----------|-------------|-------|
|            |           |             |       |

| Primary I1/I0   | 1P | 3P | 5P | 7P |
|-----------------|----|----|----|----|
| Secondary I0/I1 | 0S | 2S | 4S | 6S |

#### Table 2-3: Bottom Clock Multiplexer Pairs

| Primary I1/I0   | 0P | 2P | 4P | 6P |
|-----------------|----|----|----|----|
| Secondary I0/I1 | 1S | 35 | 5S | 7S |

## Primary/Secondary Usage

For up to eight global clocks, it is safe to use the eight primary global multiplexers (1P, 3P, 5P, 7P on the top and 0P, 2P, 4P, 6P on the bottom). Because of the shared inputs, a maximum of eight independent global clock multiplexers can be used in a design, as shown in Figure 2-9.



Figure 2-9: Eight Global Clocks Design

# DCM Clocks

The four clock pins (IBUFG) in a quadrant can feed all DCMs in the same edge of the device. The clock-to-out and setup times are identical for all DCMs. Up to four clock outputs per DCM can be used to drive any clock multiplexer on the same edge (top or bottom), as shown in Figure 2-10.

### **BUFG Exclusivity**

Each DCM has a restriction on the number of BUFGs it can drive on its (top or bottom) edge. Pairs of buffers with shared dedicated routing resources exist such that only one buffer from each dedicated pair can be driven by a single DCM. The exclusive pairs for each edge are: 1:5, 2:6, 3:7, and 4:8.



Figure 2-10: DCM Clocks

# **Clock Output**

The clock distribution is based on eight clock trees per quadrant. Each clock multiplexer output is driving one global clock net. The Virtex-II device has eight dedicated low-skew clock nets. The device is divided into four quadrants (NW, NE, SW and SE) with eight global clocks available per quadrant.

Eight clock buffers are in the middle of the top edge and eight are in the middle of the bottom edge. Any of these 16 clock buffer outputs can be used in any quadrant, up to a maximum of eight clocks per quadrant, as illustrated in Figure 2-11, provided there is not a primary vs. secondary conflict.



Figure 2-11: Clock Buffer Outputs per Quadrant

Designs with more than eight clocks must be floorplanned manually or automatically, distributing the clocks in each quadrant. As an example, a design with 16 clocks can be floorplanned as shown in Figure 2-12.



Figure 2-12: 16-Clock Floorplan

| The clock r | ets and clock buffers in this example are associated as shown in Table 2-4. |
|-------------|-----------------------------------------------------------------------------|
| Table 2-4:  | Clock Net Association With Clock Buffers                                    |

| Clock Net (top edge)    | CLK_A | CLK_B | CLK_C | CLK_D | CLK_E | CLK_F | CLK_G | CLK_H |
|-------------------------|-------|-------|-------|-------|-------|-------|-------|-------|
| BUFG                    | 7P    | 6S    | 5P    | 4S    | 3P    | 2S    | 1P    | 0S    |
| Clock Net (bottom edge) | CLK_I | CLK_J | CLK_K | CLK_L | CLK_M | CLK_N | CLK_O | CLK_P |
| BUFG                    | 7S    | 6P    | 5S    | 4P    | 35    | 2P    | 1S    | 0P    |
| Quadrant NW             | CLK_A | CLK_B | CLK_C | -     | -     | CLK_F | -     | CLK_P |
| Quadrant SW             | CLK_A | -     | CLK_C | CLK_L | CLK_M | CLK_N | CLK_G | CLK_H |
| Quadrant NE             | CLK_I | CLK_B | CLK_K | CLK_D | CLK_E | CLK_N | CLK_O | CLK_H |
| Quadrant SE             | CLK_A | CLK_J | -     | -     | -     | -     | -     | CLK_P |

CLK\_A is used in three quadrants, and the other clocks are used in one or two quadrants, regardless of the position of the clock buffers (multiplexers), as long as they are not competing to access the same quadrant. (That is, CLK\_A (BUFG7P) cannot be used in the same quadrant with CLK\_I (BUFG7S). Refer to "Primary/Secondary: Rule 1" on page 52.) In other words, two buffers with the same index (0 to 7) cannot be used in the same quadrant. Each register, block RAM, registered multiplier, or DDR register (IOB) can be connected to any of the eight clock nets available in a particular quadrant.

Note that if a global clock (primary buffer) is used in four quadrants, the corresponding secondary buffer is not available.

# **Power Consumption**

Clock trees have been designed for low skew and low-power operation. Any unused branch is disconnected, as shown in Figure 2-13.



Figure 2-13: Low-Power Clock Network

Also available to reduce overall power consumption are the BUFGCE feature, for dynamically driving a clock tree only when the corresponding module is used, and the BUFGMUX feature, for switching from a high-frequency clock to a low-frequency clock. The frequency synthesizer capability of the DCM can generate the low (or high) frequency clock from a single source clock, as illustrated in Figure 2-14. (See "Using Digital Clock Managers (DCMs)" on page 67).



Figure 2-14: Dynamic Power Reduction Scheme

# Library Primitives and Submodules

The primitives in Table 2-5 are available with the input, output, and control pins listed.

#### Table 2-5: Clock Primitives

| Primitive | Input  | Output | Control |
|-----------|--------|--------|---------|
| IBUFG     | Ι      | 0      | -       |
| IBUFGDS   | I, IB  | 0      | -       |
| BUFG      | Ι      | 0      | _       |
| BUFGMUX   | I0, I1 | 0      | S       |
| BUFGMUX_1 | I0, I1 | 0      | S       |

Refer to "Using Single-Ended SelectI/O-Ultra Resources" on page 157 for a list of the attributes available for IBUFG and Refer to "Using LVDS I/O" on page 217 for a list of the attributes available for IBUFGDS.

The submodules in Table 2-6 are available with the input, output, and control pins listed.

| Table 2-6: CI | ock Su | bmodules |
|---------------|--------|----------|
|---------------|--------|----------|

| Submodule | Input | Output | Control |
|-----------|-------|--------|---------|
| BUFGCE    | Ι     | 0      | CE      |
| BUFGCE_1  | Ι     | 0      | CE      |

# **Primitive Functions**

#### IBUFG

IBUFG is an input clock buffer with one clock input and one clock output.

### IBUFGDS

IBUFGDS is a differential input clock buffer with two clock inputs (positive and negative polarity) and one clock output.

### BUFG

All Virtex-II devices have 16 global clock buffers (each of which can be used as BUFG, BUFGMUX, or BUFGCE).

BUFG is a global clock buffer with one clock input and one clock output, driving a low-skew clock distribution network. The output follows the input, as shown in Figure 2-15.



Figure 2-15: BUFG Waveforms

### BUFGMUX and BUFGMUX\_1

BUFGMUX (see Figure 2-16) can switch between two unrelated, even asynchronous clocks. Basically, a Low on S selects the I0 input, a High on S selects the I1 input. Switching from one clock to the other is done in such a way that the output High and Low time is never shorter than the shortest High or Low time of either input clock. As long as the presently selected clock is High, any level change of S has no effect for BUFGMUX. For BUFGMUX\_1, as long as the presently selected clock is low, any level change of S has no effect.

BUFGMUX is the preferred circuit for rising edge clocks, while BUFGMUX\_1 is preferred for falling edge clocks.



Figure 2-16: Virtex-II BUFGMUX or BUFGMUX 1 Function

#### Operation of the BUFGMUX Circuit

If the presently selected clock is Low while S changes, or if it goes Low after S has changed, the output is kept Low until the other ("to-be-selected") clock has made a transition from High to Low. At that instant, the new clock starts driving the output.

The two clock inputs can be asynchronous with regard to each other, and the S input can change at any time, except for a short setup time prior to the rising edge of the presently selected clock; that is, prior to the rising edge of the BUFGMUX output O. Violating this setup time requirement can result in an undefined runt pulse output.

Figure 2-17 shows a switchover from CLK0 to CLK1.



Figure 2-17: BUFGMUX Waveform Diagram

- The current clock is CLK0.
- S is activated High.
- If CLK0 is currently High, the multiplexer waits for CLK0 to go Low.
- Once CLK0 is Low, the multiplexer output stays Low until CLK1 transitions High to Low.
- When CLK1 transitions from High to Low, the output switches to CLK1.
- No glitches or short pulses can appear on the output.

### Operation of the BUFGMUX\_1 Circuit

If the presently selected clock is High while S changes, or if it goes High after S has changed, the output is kept High until the other ("to-be-selected") clock has made a transition from Low to High. At that instant, the new clock starts driving the output.

The two clock inputs can be asynchronous with regard to each other, and the S input can change at any time, except for a short setup time prior to the falling edge of the presently selected clock; that is, prior to the falling edge of the BUFGMUX output O. Violating this setup time requirement can result in an undefined runt pulse output.

Figure 2-18 shows a switchover from CLK0 to CLK1.



Figure 2-18: BUFGMUX\_1 Waveform Diagram

- The current clock is CLK0.
- S is activated High.
- If CLK0 is currently Low, the multiplexer waits for CLK0 to go High.
- Once CLK0 is High, the multiplexer output stays High until CLK1 transitions Low to High.
- When CLK1 transitions from Low to High, the output switches to CLK1.
- No glitches or short pulses can appear on the output.

# Submodules

# BUFGCE and BUFGCE\_1

BUFGCE and BUFGCE\_1 are submodules based on BUFGMUX and BUFGMUX\_1, respectively. BUFGCE and BUFGCE\_1 are global clock buffers incorporating a smart enable function that avoids output glitches or runt pulses. The select signal must meet the setup time for the clock.

BUFGCE is the preferred circuit for clocking on the rising edge, while BUFGCE\_1 is preferred when clocking on the falling edge.

### Operation of the BUFGCE Circuit

If the CE input (see Figure 2-19) is active (High) prior to the incoming rising clock edge, this Low-to-High-to-Low clock pulse passes through the clock buffer. Any level change of CE during the incoming clock High time has no effect.



Figure 2-19: Virtex-II BUFGCE or BUFGCE\_1 Function

If the CE input is inactive (Low) prior to the incoming rising clock edge, the following clock pulse does not pass through the clock buffer, and the output stays Low. Any level change of CE during the incoming clock High time has no effect. CE must not change during a short setup window just prior to the rising clock edge on the BUFGCE\_1 input I. Violating this setup time requirement can result in an undefined runt pulse output.

This means the output stays Low when the clock is disabled, but it completes the clock-High pulse when the clock is being disabled, as shown in Figure 2-20.



Figure 2-20: BUFGCE Waveforms

# Operation of the BUFGCE\_1 circuit

If the CE input is active (High) prior to the incoming falling clock edge, this High-to-Lowto-High clock pulse passes through the clock buffer. Any level change of CE during the incoming clock Low time has no effect.

If the CE input is inactive (Low) prior to the incoming falling clock edge, the following clock pulse does not pass through the clock buffer, and the output stays High. Any level change of CE during the incoming clock Low time has no effect. CE must not change during a short setup window just prior to the falling clock edge on the BUFGCE input I. Violating this setup time requirement can result in an undefined runt pulse output.

This means the output stays High when the clock is disabled, but it completes the clock-Low pulse when the clock is being disabled, as shown in Figure 2-21.



Figure 2-21: BUFGCE\_1 Waveforms

When BUFGCE (or BUFGCE\_1) is used with DCM outputs, a second BUFG can be used for clock feedback. Buffer sharing the inputs with BUFGCE is the preferred solution.

# Summary

Table 2-7 shows the maximum resources available per Virtex-II device.

Table 2-7: Resources per Virtex-II Device (from XC2V40 to XC2V8000)

| Resource                     | Maximum Number |  |
|------------------------------|----------------|--|
| Single-ended IBUFG (pads)    | 16             |  |
| Differential IBUFGDS (pairs) | 8              |  |
| BUFG (Global Clock Buffer)   | 16             |  |
| BUFGCE (or BUFGCE_1)         | 8              |  |
| BUFGMUX (or BUFGMUX_1)       | 8              |  |

# **Characteristics**

UG002 (v1.5) 2 December 2002

Virtex-II Platform FPGA User Guide

The following are characteristics of global clocks in Virtex-II devices:

- Low-skew clock distribution.
- Synchronous "glitch-free" multiplexer that avoids runt pulses. Switching between two asynchronous clock sources is usually considered unsafe, but it is safe with the Virtex-II global clock multiplexer.

- Any level change on S must meet a setup time requirement with respect to the signal on the output O (rising edge for BUFGMUX, falling edge for BUFGMUX\_1). Any level change on CE must meet a setup time requirement with respect to the signal on the Input I (rising edge for BUFGCE, falling edge for BUFGCE\_1).
- Two BUFGMUX (or BUFGMUX\_1) resources can be cascaded to create a 3 to 1 clock multiplexer.

# Location Constraints

BUFGMUX and BUFGMUX\_1 (primitives) and IBUFG (IBUFGDS) instances can have LOC properties attached to them to constrain placement. The LOC properties use the following form to constrain a clock net:

NET "clock\_name" LOC="BUFGMUX#P/S";

Each clock pad (or IBUFG) has a direct connection with a specific global clock multiplexer (input I0). A placement that does not conform to this rule causes the software to send a warning.

If the clock pad (or IBUFG) has LOC properties attached, the DCM allows place and route software maximum flexibility, as compared to a direct connection to the global clock buffer (BUFG).

# Secondary Clock Network

If more clocks are required, the 24 horizontal and vertical long lines in Virtex-II devices can be used to route additional clock nets. Skew is minimized by the place and route software, if the USELOWSKEWLINES constraint is attached to the net.

# VHDL and Verilog Instantiation

VHDL and Verilog instantiation templates are available as examples (see "VHDL and Verilog Templates" on page 62) for all primitives and submodules.

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

# VHDL and Verilog Templates

The following are templates for primitives:

- BUFGMUX\_INST
- BUFGMUX\_1\_INST

The following are templates for submodules:

- BUFGCE\_SUBM
- BUFGCE\_1\_SUBM

As examples, the BUFGMUX\_INST.vhd, BUFGMUX\_1\_INST.vhd, BUFGCE\_SUBM.vhd, and BUFGCE\_1\_SUBM.vhd VHDL templates are shown. In addition, the BUFGMUX\_INST.v, BUFGMUX\_1\_INST.v, BUFGCE\_1\_SUBM.v, and BUFGCE\_SUBM.v Verilog templates are shown.

### **VHDL** Template

- -- Module: BUFGMUX\_INST
- -- Description: VHDL instantiation template
- -- Global Clock Multiplexer (Switch Low)
- -- Device: Virtex-II Family
- -----
- -- Component Declarations:

-----

```
component BUFGMUX
 port (
     : in std_logic;
 ΙO
      I1 : in std logic;
         : in std logic;
      S
         : out std logic
      0
 );
end component;
- -
-- Architecture section:
- -
-- Global Clock Buffer Instantiation
U BUFGMUX: BUFGMUX
 port map (
       => , -- insert clock input used when select (S) is Low
 ΤO
 Ι1
       => , -- insert clock input used when select (S) is High
 S
      => , -- insert Mux-Select input
 0
       => -- insert clock output
 );
- -
_____
-- Module: BUFGMUX 1 INST
-- Description: VHDL instantiation template
-- Global Clock Multiplexer (Switch High)
_ _
-- Device: Virtex-II Family
_____
-- Component Declarations:
component BUFGMUX 1
 port (
 ТО
     : in std logic;
      I1 : in std_logic;
      S : in std logic;
      0 : out std_logic
 );
end component;
-- Architecture section:
-- Global Clock Buffer Instantiation
U_BUFGMUX_1: BUFGMUX_1
 port map (
    => , -- insert clock input used when select (S) is Low
 ΙO
 I1
      => , -- insert clock input used when select (S) is High
 S
      => , -- insert Mux-Select input
       => -- insert clock output
 0
 );
_ _
_ _ _ _ _ _ _ _
      -- Module: BUFGCE_SUBM
-- Description: VHDL instantiation template
-- Global Clock Buffer with Clock Enable:
-- Input Clock Buffer to BUFGMUX - Clock disabled = Low
-- Device: Virtex-II Family
_____
library IEEE;
use IEEE.std_logic_1164.all;
- -
-- pragma translate_off
library UNISIM;
use UNISIM.VCOMPONENTS.ALL;
-- pragma translate on
```

- -

```
entity BUFGCE SUBM is
  port (
       I: in std logic;
       CE: in std_logic;
       0: out std logic
      );
end BUFGCE_SUBM;
- -
architecture BUFGCE_SUBM_arch of BUFGCE_SUBM is
-- Component Declarations:
component BUFGMUX
  port (
      : in std logic;
  ΙO
       I1 : in std_logic;
          : in std_logic;
       S
       0
          : out std logic
 );
end component;
_ _
-- signal declarations
signal GND : std_logic;
signal CE_B : std_logic;
_ _
begin
GND <= '0';
_ _
CE_B <= not CE;
- -
-- Global Clock Buffer Instantiation
U_BUFGMUX: BUFGMUX
 port map (
 ΙO
       => I,
       => GND,
 Ι1
       => CE_B,
 S
        => O
 0
 );
- -
end BUFGCE SUBM arch;
-----
                               _____
-- Module: BUFGCE_1_SUBM
-- Description: VHDL instantiation template
-- Global Clock Buffer with Clock Enable:
-- Input Clock Buffer to BUFGMUX_1 - Clock disabled = High
-- Device: Virtex-II Family
_ _ _ _ _ _ _ _ _ _ _ _
                                                    -----
library IEEE;
use IEEE.std logic 1164.all;
- -
-- pragma translate off
library UNISIM;
use UNISIM.VCOMPONENTS.ALL;
-- pragma translate on
- -
entity BUFGCE_1_SUBM is
 port (
       I: in std logic;
       CE: in std_logic;
       0: out std_logic
       );
end BUFGCE_1_SUBM;
```

```
- -
architecture BUFGCE_1_SUBM_arch of BUFGCE_1_SUBM is
- -
-- Component Declarations:
component BUFGMUX 1
 port (
 ΙO
     : in std logic;
       I1 : in std_logic;
       S : in std logic;
        0 : out std logic
 );
end component;
-- signal declarations
signal VCC : std logic;
signal CE_B : std_logic;
begin
VCC <= '1';
- -
CE B <= not CE;
- -
-- Global Clock Buffer Instantiation
U BUFGMUX 1: BUFGMUX 1
 port map (
     => I,
 ΙO
 Ι1
        => VCC,
 S
       => CE_B,
 0
        => 0
 );
_ _
end BUFGCE_1_SUBM_arch;
```

### Verilog Template

```
//-----
        BUFGMUX INST
// Module:
// Description: Verilog Instantiation Template
// Global Clock Multiplexer (Switch Low)
11
11
// Device: Virtex-II Family
//-----
11
//BUFGMUX Instantiation
BUFGMUX U BUFGMUX
         (.IO(), // insert clock input used when select(S) is Low
          .I1(), // insert clock input used when select(S) is High
          .S(), // insert Mux-Select input
          .0()
               // insert clock output
         );
                      -----
//-----
// Module:
          BUFGMUX_1_INST
// Description: Verilog Instantiation Template
// Global Clock Multiplexer (Switch High)
11
11
// Device: Virtex-II Family
//-----
11
//BUFGMUX 1 Instantiation
BUFGMUX_1 U_BUFGMUX_1
```

www.xilinx.com 1-800-255-7778

# 

```
(.IO(), // insert clock input used when select(S) is Low
           .I1(), \ // insert clock input used when select(S) is High
                 // insert Mux-Select input
            .s(),
            .0()
                   // insert clock output
           );
//-----
                                  // Module:
             BUFGCE SUBM
// Description: Verilog Submodule
// Global Clock Buffer with Clock Enable:
// Input Clock Buffer to BUFGMUX - Clock disabled = Low
11
// Device: Virtex-II Family
//-----
module BUFGCE_SUBM (I,
                CE,
                0);
      I,
input
      CE;
output 0;
wire GND;
assign GND = 1'b0;
BUFGMUX U_BUFGMUX
           (.IO(I),
           .I1(GND),
           .S(~CE),
           .0(0)
          );
11
endmodule
//-----
// Module: BUFGCE_1_SUBM
// Description: Verilog Submodule
// Global Clock Buffer with Clock Enable:
// Input Clock Buffer to BUFGMUX_1 - Clock disabled = High
11
// Device: Virtex-II Family
//-----
                             -----
module BUFGCE_1_SUBM (I,
                  CE,
                  0);
input
      I,
      CE;
output 0;
wire VCC;
assign VCC = 1'b1;
BUFGMUX 1 U BUFGMUX 1
           (.IO(I),
           .I1(VCC),
           .S(~CE),
           .0(0)
          );
11
```

endmodule

# **Using Digital Clock Managers (DCMs)**

# Overview

Virtex-II devices have 4 to 12 DCMs, and each DCM provides a wide range of powerful clock management features:

• **Clock De-skew**: The DCM contains a delay-locked loop(DLL) that can completely eliminate clock distribution delays, hence deskewing the DCM's output clocks with respect to the input clock. The DLL contains delay elements(individual small buffers) and control logic. The incoming clock drives a chain of delay elements, thus the output of every delay element represents a version of the incoming clock delayed at a different point.

The control logic contains a phase detector and a delay line selector. The phase detector compares the incoming clock signal(CLKIN) against a feedback input (CLKFB) and steers the delay line selector, essentially adding delay to the output of DCM until the CLKIN and CLKFB coincide.

• **Frequency Synthesis**: Separate outputs provide a doubled frequency (CLK2X and CLK2X180). Another output (CLKDV) provides a frequency that is a specified fraction of the input frequency (÷1.5, ÷2, ÷2.5, and so forth, up to ÷15 and ÷16.)

Two other outputs (CLKFX and CLKFX180) provide an output frequency that is derived from the input clock by simultaneous frequency division and multiplication. The user can specify any integer multiplier (M) and divisor (D) within the range specified in the DCM Timing Parameters section of the <u>Virtex-II Data Sheet (DS031)</u>. An internal calculator figures out the appropriate tap selection, so that the output edge coincides with the input clock whenever that is mathematically possible. For example, M=9 and D=5, multiply the frequency by 1.8, and the output rising edge is coincident with the input rising edge every 5 input periods = every 9 output periods.

• **Phase Shifting**: Three outputs drive the same frequency as CLCK0 but are delayed by 1/4, 1/2, and 3/4 of a clock period. An additional control optionally shifts all nine clock outputs by a fixed fraction of the clock period (defined during configuration, and described in multiples of the clock period divided by 256).

The user can also dynamically and repetitively move the phase forwards or backwards by one unit of the clock period divided by 256. Note that any such phase shift is always invoked as a specific fraction of the clock period, but is always implemented by moving delay taps with a resolution of DCM\_TAP (see the DCM Timing Parameters section in the *Virtex-II Data Sheet*).

- **General Control Signals**: The input is an asynchronous reset; when High, it resets the entire DCM (all clock outputs, LOCKED, and STATUS signals are brought LOW). The LOCKED output is High when all enabled DCM circuits have locked. The active High STATUS outputs indicate the following:
  - Phase Shift Overflow (STATUS[0])
  - CLKIN Stopped (STATUS[1])
  - CLKFX Stopped (STATUS[2])

When using DCMs it is important to observe the V<sub>CCAUX</sub> recommended operating noise specification in the <u>Virtex-II Data Sheet (DS031</u>). Power distribution design details are available in Xilinx Application Note XAPP623.

# Clock De-Skew

The Virtex-II Digital Clock Manager (DCM) offers a fully digital, dedicated on-chip de-skew circuit providing zero propagation delay, low clock skew between output clock signals distributed throughout the device, and advanced clock domain control. These features can be used to implement several circuits that improve and simplify system level design.

Any four of the nine outputs of the DCM can be used to drive a global clock network. All DCM outputs can drive general interconnect at the same time; for example, DCM output can be used to generate board-level clocks. The well-buffered global clock distribution network minimizes clock skew caused by loading differences. By monitoring a sample of the output clock (CLK0 or CLK2X), the de-skew circuit compensates for the delay on the routing network, effectively eliminating the delay from the external input port to the individual clock loads within the device.

Figure 2-22 shows all of the inputs and outputs relevant to the DCM de-skew feature.



Figure 2-22: Clock De-Skew Outputs

The de-skew feature can also act as a clock mirror. By driving the CLK0 or CLK2X output off-chip and then back in again, the de-skew feature can be used to de-skew a board-level clock serving multiple devices.

By taking advantage of the de-skew circuit to remove on-chip clock delay, the designer can greatly simplify and improve system level design involving high-fanout, high-performance clocks.

# Operation

A de-skew circuit in its simplest form consists of variable delay line and control logic. The delay line produces a delayed version of the input clock (CLKIN). The clock distribution network routes the clock to all internal registers and to the clock feedback CLKFB pin. The control logic samples the input clock, as well as the feedback clock, and adjusts the delay line.

For optimum performance, the Virtex-II DCM uses a discrete digital delay line, which is a series of buffer elements each with an intrinsic delay of less than DCM\_TAP (see AC characteristics in the <u>Virtex-II Data Sheet (DS031)</u>).

A de-skew circuit works by inserting delay between the input clock and the feedback clock until the two rising edges align, putting the two clocks 360 degrees out of phase, which means they are in phase. When the edges from the input clock line up with the edges from the feedback clock, the DCM achieves "lock." The two clocks have no discernible difference. Thus, the DCM output clock compensates for the delay in the clock distribution network, effectively removing the delay between the source clock and its loads.

# Input Clock Requirements

The clock input of the DCM can be driven either by an IBUFG, an IBUF, or a BUFGMUX. An LVDS clock can also be used as input.

The output clock signal of a DCM, essentially a delayed version of the input clock signal, reflects any instability on the input clock in the output waveform. A DCM cannot improve the input jitter. The DCM input clock requirements are specified in the <u>Virtex-II Data Sheet</u> (DS031).

Once locked, the DCM can tolerate input clock period variations of up to the value specified by CLKIN\_PER\_JITT\_DLL\_HF (at high frequencies) or CLKIN\_PER\_JITT\_DLL\_LF (at low frequencies). Larger frequency changes can cause the DCM to lose lock, which is indicated by the LOCKED output going low. The user must then reset the DCM. The cycle-to-cycle input jitter must be kept to less than CLKIN\_CYC\_JITT\_DLL\_LF in the low frequencies and CLKIN\_CYC\_JITT\_DLL\_HF for the high frequencies.

# Input Clock Changes

Changing the period of the input clock beyond the maximum input period jitter specification requires a manual reset of the DCM. Failure to reset the DCM produces an unreliable lock signal and output clock.

It is possible to temporarily stop the input clock with little impact to the de-skew circuit, as long as CLKFX or CLKFX180 is not used.

If the input clock is stopped and CLKFX or CLKFX180 is used, the CLKFX or CLKFX180 outputs might stop toggling, and STATUS[2] ("CLKFX Stopped") is asserted. The DCM must be reset to recover from this event.

CLKFX or CLKFX180 stops within D cycles of CLKIN or when CLKFX is concurrent to CLKIN. STATUS[2] is asserted within 1 to D CLKIN + 1 CLKFX cycles of CLKFX or when CLKFX180 output stops. (D is the CLKFX\_DIVIDE value.)

In any other cases, the clock should not be stopped for more than 100 ms to minimize the effect of device cooling; otherwise, the tap delays might change. The clock should be stopped during a Low or a High phase, and must be restored with the same input clock period/frequency. During this time, LOCKED stays High and remains High when the clock is restored. Thus, a High on LOCKED does not necessarily mean that a valid clock is available.

When the input clock is being stopped (CLKIN remains High or Low for more than 1 clock cycle), one to eight more output clock cycles are still generated as the delay line is flushed. When the output clock stops, the CLKIN stopped (STATUS(1)) signal is asserted. When the clock is restarted, the output clock cycles are not generated for one to eight clocks while the delay line is filled. Similarly, the STATUS(1) signal is deasserted once the output clock is generated. The most common case is two or three clocks. In a similar manner, a phase shift of the input clock is possible. For example, the input clock can be stopped for 12 ns to achieve a 12 ns phase shift at the output. The phase shift propagates one to eight clocks to the output after the original shift, with no disruption to the DCM control. The STATUS(1) is forced Low whenever LOCKED is Low.

# **Output Clocks**

Some restrictions apply regarding the connectivity of the output pins. The DCM clock outputs can each drive an OBUF, a global clock buffer BUFGMUX, or they can route directly to the clock input of a synchronous element. To use the dedicated routing, the DCM clock outputs should drive BUFGMUXs that are on the same edge (top or bottom) of the device. If the DCM and BUFGMUX are not on the same edge, local routing is used and the DCM might not deskew properly.

Do not use the DCM output clock signals until after activation of the LOCKED signal. Prior to the activation of the LOCKED signal, the DCM output clocks are not valid and can exhibit glitches, spikes, or other spurious movement.

# External Feedback

To ensure consistent locking, if a DCM is configured with external feedback, applying a reset after configuration is strongly recommended.

For an optimum locking process, a DCM configured with external feedback requires both CLKIN and CLK0 to be present and stable when the DCM begins to lock. During configuration with external feedback, it is not possible to provide CLKFB in the beginning of the locking process. At the end of configuration, the DCM begins to lock once the device enters the startup sequence. Because a global 3-state signal (GTS) is still asserted during this time, the output I/O pins remain in a 3-state condition, effectively putting the CLKFB signal into a 3-state condition.

When CLKFB eventually appears (after the GTS is deasserted), the DCM proceeds with the lock. However, it might not lock at the optimal point and can introduce slightly more jitter (as well as greater clock cycle latency) through the DCM.

In addition, if CLKFB is coupling with another signal when it is put into a 3-state condition (a PCB signal integrity issue), the DCM might sense this invalid clock as CLKFB and use it to proceed with a lock. This second possibility can cause the DCM to not lock properly once the GTS deasserts and the true CLKFB signal is present.

Use of reset after configuration guarantees that the locking process starts with a valid CLKIN and CLKFB signal and ensures consistent locking.

#### Notes:

- 1. The default value is -g LCK\_cycle:NoWait and -g GTS\_cycle:5. When these settings are used, the startup sequence does not wait for the DCM to lock.
- 2. If the startup sequence is altered (for example, using the BitGen option), the LCK\_cycle (wait for DCM to lock) must not be placed before the GTS\_cycle (de-assert GTS); otherwise, the DCM never locks and configuration does not complete.
- 3. The workaround requires that **GTS\_cycle** be set before **GWE\_cycle** in the BitGen options (this is the default). This ensures that the DCM is reset after the I/O pins are activated.

For example code, see Answer Record 14425 on **<u>support.xilinx.com</u>**.

### Characteristics of the De-Skew Circuit

- Can eliminate clock distribution delay by effectively adding one clock period delay. Clocks are de-skewed to within CLKOUT\_PHASE, specified in the *Virtex-II Data Sheet*.
- Can be used to eliminate on-chip as well as off-chip clock delay.
- Has no restrictions on the delay in the feedback clock path.
- Requires a continuously running input clock.
- Adapts to a wide range of frequencies. However, once locked to a frequency, cannot tolerate large variations of the input frequency.
- De-skew circuit is part of the DCM, which also includes phase adjustment, frequency synthesis, and spread spectrum techniques that are described in this document.
- Does not eliminate jitter. The de-skew circuit output jitter is the sum of input jitter and some jitter value that the de-skew circuit might add.
- The completion of configuration can be delayed until after DCM locks to guarantee the system clock is established prior to initiating the device.

# Port Signals

## Source Clock Input - CLKIN

The CLKIN pin provides the user source clock (the clock signal on which the de-skew circuit operates) to the DCM. The CLKIN frequency must fall in the ranges specified in the <u>Virtex-II Data Sheet (DS031)</u>. The clock input signal can be provided by one of the following:

#### IBUF — Input buffer

IBUFG — Global clock input buffer on the same edge of the device (top or bottom)

BUFGMUX — Internal global clock buffer

**Note:** When IBUF is used as the clock input, the PAD to DCM input skew is not compensated. Refer to Table 2-1 for more information.

### Feedback Clock Input — CLKFB

A reference or feedback signal is required to delay-compensate the output. Connect only the CLK0 or CLK2X DCM outputs to the feedback clock input (CLKFB) pin to provide the necessary feedback to the DCM. The feedback clock input signal can be driven by an internal global clock buffer (BUFGMUX), one of the global clock input buffers (IBUFG) on the same edge of the device (top or bottom), or IBUF (the input buffer.) Note that when IBUF is used, the PAD to DCM input skew is not compensated.

If an IBUFG sources the CLKFB pin of a DCM, the following special rules apply:

- 1. The CLK0 or CLK2X of the DCM must directly drive an OBUF or a BUFG-to-DDR configuration to forward the clock.
- 2. External to the FPGA, the same forwarded clock signal must be connected to the IBUFG (GCLK pin) that drives the CLKFB of the DCM.

## Reset Input - RST

When the reset pin is activated, the LOCKED signal deactivates. The M and D values at configuration are maintained after the reset. The RST pin, active High, must be connected to a dynamic signal or tied to ground. For all designs that use the feedback feature of the DCM, applying a reset signal after configuration is strongly recommended for both production and ES devices in order to ensure consistent locking. As the DCM delay taps reset to zero, glitches can occur on the DCM clock output pins. After the DCM resets the clock, outputs have a DC behavior. Activation of the RST pin can also severely affect the duty cycle of the clock output pins. Furthermore, the DCM output clocks no longer deskew with respect to one another and are eventually stopped Low if Status signals are deactivated (brought to Low).

For these reasons, use the reset pin when reconfiguring the device or changing the input frequency, or after reconfiguration for DCMs with external feedback. The reset input signal is asynchronous and should be held High for at least three clock cycles. The time it takes for the DCM to achieve lock is specified as LOCK\_DLL (for DLL output) and LOCK\_FX (for DFS output). The DCM locks faster at higher frequencies. See the LOCK\_DLL timing parameter in the *Virtex-II Data Sheet*.

### Locked Output - LOCKED

In order to achieve lock, the DCM might need to sample several thousand clock cycles. After the DCM achieves lock, the LOCKED signal goes High. The DCM timing parameters section of the *Virtex-II Data Sheet* provides estimates for locking times.

To guarantee that the system clock is established prior to the device "waking up," the DCM can delay the completion of the device configuration process until after the DCM locks. The STARTUP\_WAIT attribute activates this feature.

Until the LOCKED signal activates, the DCM output clocks are not valid and can exhibit glitches, spikes, or other spurious movement. In particular, the CLK2X output appears as a 1x clock with a 25/75 duty cycle.

## Status - STATUS

The STATUS output is an 8-bit output, of which STATUS[1] reveals the loss of the input clock, CLKIN to the DCM.

### Attributes

The following attributes provide access to some of the Virtex-II series de-skew features, (for example, clock division and duty cycle correction).

#### Frequency Mode

The de-skew feature of the DCM is achieved with a delay-locked loop (DLL). This attribute specifies either the high or low-frequency mode of the DLL. The default is low-frequency mode. In high-frequency mode, the only outputs available from the DLL are the CLK0, CLK180, CLKDV, and LOCKED. (CLK90, CLK270, CLK2X, and CLK2X180 are not available in high-frequency mode.) The frequency ranges for both frequency modes are specified in the <u>Virtex-II Data Sheet (DS031)</u>. To set the DLL to high-frequency mode, attach the DLL\_FREQUENCY\_MODE=HIGH attribute in the source code or schematic.

#### Feedback Input

This attribute specifies the feedback input to the DCM (CLK0, or CLK2x). CLK0 is the default feedback. When both the CLK0 and the CLK2x outputs are used internally or externally to the device, the feedback input can be either the CLK0 or CLK2x. In order to set the feedback to CLK2X, attach the CLK\_FEEDBACK=2X attribute in the source code or schematic.

#### **Duty Cycle Correction**

The 1x clock outputs, CLK0, CLK90, CLK180, and CLK270, use the duty cycle corrected default such that they exhibit a 50/50 duty cycle. The DUTY\_CYCLE\_CORRECTION attribute (by default TRUE) controls this feature.

Setting DUTY\_CYCLE\_CORRECTION=FALSE dectivates duty cycle correction for the DCM outputs listed above. It is strongly recommended to always set the DUTY\_CYCLE\_CORRECTION attribute to TRUE (default value). Note that setting this attribute to FALSE does not necessarily produce output clocks with the same duty cycle as the source clock.

### Startup Delay

The default value of the STARTUP\_WAIT attribute is FALSE. When STARTUP\_WAIT is set to TRUE, and the LCK\_cycle BitGen option is used, then the configuration startup sequence waits in the specified cycle until the DCM locks. For details, see Chapter 3: Configuration and Appendix A: BitGen and PROMGen Switches and Options.

### Legacy Support

The Virtex/Virtex-E library primitives/sub modules are supported in Virtex-II for legacy purposes. The following are supported primitives/submodules:

- CLKDLL
- CLKDLLE
- CLKDLLHF
- BUFGDLL

### Library Primitive

Only a single library primitive is available for the DLL, a part of the DCM. It is labeled the 'DCM' primitive.
2

Submodules



UG002\_C2\_061\_112102

Figure 2-23: BUFG\_CLK0\_SUBM



UG002\_C2\_062\_112102

Figure 2-24: BUFG\_CLK2X\_SUBM



UG002\_C2\_063\_112102

Figure 2-25: BUFG\_CLK0\_FB\_SUBM

www.xilinx.com 1-800-255-7778



Figure 2-26: BUFG\_CLK2X\_FB\_SUBM



Figure 2-27: BUFG\_CLKDV\_SUBM

# **Frequency Synthesis**

The DCM provides several flexible methods for generating new clock frequencies. Each method has a different operating frequency range and different AC characteristics. The CLK2X and CLK2X180 outputs double the clock frequency. The CLKDV output provides divided output clocks with division options of 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 9, 10, 11, 12, 13, 14, 15, and 16.

The DCM also offers a fully digital, dedicated Frequency Synthesizer output (CLKFX) and its opposite phase (CLKFX180). The output frequency can be any function of the input clock frequency described by  $M \div D$ , where M is the multipler (numerator) and D is the divisor (denominator).

The two counter-phase frequency synthesized outputs can drive global clock routing networks within the device. The well-buffered global clock distribution network minimizes clock skew due to differences in distance or loading. See Figure 2-28.

# Operation

The DCM clock output CLKFX is any M/D product of the clock input to the DCM. Specifications for M and D, as well as input and output frequency ranges for the frequency synthesizer, are provided in the <u>Virtex-II Data Sheet (DS031</u>). The frequency synthesizer output is phase aligned to the clock output, CLK0, only if feedback is provided to the CLKFB input of the DCM.



*Figure 2-28:* **Frequency Synthesis Outputs** 

The internal operation of the frequency synthesizer is complex and beyond the scope of this document. The frequency synthesizer multiplies the incoming frequencies by the precalculated quotient M/D and generates the correct output frequencies as long as it is within the range specified in the <u>Virtex-II Data Sheet (DS031)</u>.

For example, assume input frequency = 50 MHz, M = 25, and D = 8 (note that M and D values have no common factors and hence cannot be reduced). The output frequency is correctly 156.25 MHz, although 25 x 50 MHz = 1.25 GHz and 50 MHz / 8 = 6.25 MHz, and both of these values are far outside the range of the input frequency.

# **Frequency Synthesizer Characteristics**

- The frequency synthesizer provides an output frequency equal to the input frequency multiplied by M and divided by D.
- The outputs CLKFX and CLKFX180 always have a 50/50 duty-cycle.
- Smaller M and D values achieve faster lock times. The user should divide M and D by the largest common factor.
- The outputs are phase aligned with CLK0 when CLKFB is connected.

## Port Signals

## Source Clock Input - CLKIN

The CLKIN pin provides the user source clock to the DCM. The CLKIN frequency must fall in the ranges specified in the *Virtex-II Data Sheet*. The clock input signal can be provided by one of the following:

- IBUF Input buffer
- IBUFG Global clock input buffer
- BUFGMUX Internal global clock buffer

**Note:** When IBUF is used as the clock input, the PAD to DCM input skew is not compensated. Refer to Table 2-1 for more information.

## 2x Clock Output — CLK2X

The CLK2X output provides a frequency-doubled clock with an automatic 50/50 duty-cycle correction. This output is not available in high-frequency mode.

Until the DCM has achieved lock, the CLK2X output appears as a 1x version of the input clock with a 25/75 duty cycle. This behavior allows the DCM to lock on the correct edge with respect to source clock.

### Clock Divide Output - CLKDV

The clock divide output pin CLKDV provides a lower frequency version of the source clock. The CLKDV\_DIVIDE property controls CLKDV such that the source clock is divided by N where N is either 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 9, 10, 11, 12, 13, 14, 15, or 16.

This feature provides automatic duty cycle correction such that the CLKDV output pin has a 50/50 duty cycle always in low-frequency mode, as well as for all all integer values of the division factor N in high-frequency mode.

## Frequency Synthesized Clock Output - CLKFX

The CLKFX output provides a frequency-synthesized clock (M/D \* CLKIN) with a 50/50 duty cycle. For the CLKFX output to be phase-aligned with CLKIN, the clock feedback (CLK0) must be provided at the CLKFB input. With M and D adjusted such that they have no common factor, the alignment occurs only once every D input clock cycles.

### Frequency Synthesized Clock Output 180° Phase Shifted - CLKFX180

The CLKFX180 output is a  $180^{\circ}$  phase shifted version of the CLKFX clock output, also with a 50/50 duty cycle.

### Locked Output - LOCKED

The LOCKED signal is activated after the DCM has achieved the parameter values set by the user parameters. To guarantee that the system clock is established prior to the device "waking up," the DCM can delay the completion of the device configuration process until after the DCM locks. The STARTUP\_WAIT attribute activates this feature. Until the LOCKED signal activates, the DCM output clocks are not valid and can exhibit glitches, spikes, or other spurious signals.

### Reset Input - RST

When the reset pin is activated, the LOCKED signal deactivates. The M and D values at configuration are maintained after the reset. The RST pin, active High, must be connected to a dynamic signal or tied to ground. For all designs that use the feedback feature of the DCM, applying a reset signal after configuration is strongly recommended for both production and ES devices in order to ensure consistent locking. As the DCM delay taps reset to zero, glitches can occur on the DCM clock output pins. After the DCM resets the clock, outputs have a DC behavior. Activation of the RST pin can also severely affect the duty cycle of the clock output pins. Furthermore, the DCM output clocks no longer deskew with respect to one another and are eventually stopped Low if Status signals are deactivated (brought to Low).

For these reasons, use the reset pin when reconfiguring the device or changing the input frequency, or after reconfiguration for DCMs with external feedback. The reset input signal is asynchronous and should be held High for at least three clock cycles. The time it takes for the DCM to achieve lock is specified as LOCK\_DLL (for DLL output) and LOCK\_FX (for DFS output). The DCM locks faster at higher frequencies. See the LOCK\_DLL timing parameter in the *Virtex-II Data Sheet*.

### Status - STATUS

The STATUS output is an 8-bit output:

- STATUS[1] indicates the loss of the input clock, CLKIN, only when CLKFB is connected.
- STATUS[2] indicates loss of CLKFX and CLKFX180 even though LOCKED might still be High. Note that the "CLKFX stopped" status functions only when CLKIN is present.

### Attributes

The following attributes provide access to some of the Virtex-II series frequency synthesis features, (for example, clock multiplication, clock division).

### **Clock Divide**

The CLKDV\_DIVIDE attribute specifies how the signal on the CLKDV pin is frequency divided with respect to the CLK0 pin. The values allowed for this attribute are 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 9, 10, 11, 12, 13, 14, 15, or 16; the default value is 2.

### Frequency Mode for Frequency Synthesis

This attribute specifies either the high or low-frequency mode of the frequency synthesizer. The default is low-frequency mode. The frequency ranges for both frequency modes are specified in the <u>Virtex-II Data Sheet (DS031)</u>.

To set the frequency synthesizer to high-frequency mode, attach the DFS\_FREQUENCY\_MODE=HIGH attribute in the source code or schematic.

### Multiply/Divide Attribute

The M and D values can be set using the CLKFX\_MULTIPLY and the CLKFX\_DIVIDE attributes. The default settings are M = 4 and D = 1.

### Startup Delay

The default value of the STARTUP\_WAIT attribute is FALSE. When STARTUP\_WAIT is set to TRUE, and the LCK\_cycle BitGen option is used, then the configuration startup sequence waits in the specified cycle until the DCM locks. For details, see Chapter 3: Configuration and Appendix A: BitGen and PROMGen Switches and Options.

### CLKIN\_PERIOD

CLKIN\_PERIOD specifies the period of the clock used to drive the CLKIN pin of the DCM. It must be specified for optimal frequency synthesis operation when using CLKFX or CLKFX180 outputs. It is not needed for other DCM clock outputs.

## Submodules





UG002\_C2\_075\_110800

Figure 2-30: BUFG\_DFS\_FB\_SUBM

# Phase Shifting

The DCM can also provide coarse and fine-grained phase shifting. The CLK0, CLK90, CLK180, and CLK270 outputs are each phase shifted by ¼ of the input clock period relative to each other, providing coarse phase control. Note that CLK90 and CLK270 are not available in high-frequency mode.

## Operation

Figure 2-31 shows a block diagram of the DCM and all of the outputs affected by the circuitry of the phase shift feature.



Figure 2-31: Phase Shift Outputs

Fine-phase adjustment affects all nine DCM output clocks. When activated, the phase shift between the rising edges of CLKIN and CLKFB is a specified fraction of the input clock period.

In variable mode, the PHASE\_SHIFT value can also be dynamically incremented or decremented as determined by PSINCDEC synchronously to PSCLK, when the PSEN input is active. Figure 2-32 illustrates the effects of fine-phase shifting.



Figure 2-32: Phase Shift Effects

Two separate components of the phase shift range must be understood:

- PHASE\_SHIFT attribute range
- FINE\_SHIFT\_RANGE DCM timing parameter range

The PHASE\_SHIFT attribute is the numerator in the following equation:

Phase Shift (ns) = (PHASE\_SHIFT/256) \* PERIOD<sub>CLKIN</sub>

The full range of this attribute is always -255 to +255, but its practical range varies with CLKIN frequency, as constrained by the FINE\_SHIFT\_RANGE component, which represents the total delay achievable by the phase shift delay line. Total delay is a function of the number of delay taps used in the circuit. Across process, voltage, and temperature, this absolute range is guaranteed to be as specified in the DCM Timing Parameters section of the <u>Virtex-II Data Sheet (DS031</u>).

Absolute range (fixed mode) = ± FINE\_SHIFT\_RANGE

Absolute range (variable mode) = ± FINE\_SHIFT\_RANGE/2

The reason for the difference between fixed and variable modes is as follows. For variable mode to allow symmetric, dynamic sweeps from -255/256 to +255/256, the DCM sets the "zero phase skew" point as the middle of the delay line, thus dividing the total delay line range in half. In fixed mode, since the PHASE\_SHIFT value never changes after configuration, the entire delay line is available for insertion into either the CLKIN or CLKFB path (to create either positive or negative skew).

Taking both of these components into consideration, the following are some usage examples:

- If PERIOD<sub>CLKIN</sub> = two times FINE\_SHIFT\_RANGE, then PHASE\_SHIFT in fixed mode is limited to ± 128, and in variable mode it is limited to ± 64.
- If PERIOD<sub>CLKIN</sub> = FINE\_SHIFT\_RANGE, then PHASE\_SHIFT in fixed mode is limited to  $\pm 255$ , and in variable mode it is limited to  $\pm 128$ .
- If PERIOD<sub>CLKIN</sub> ≤ half of the FINE\_SHIFT\_RANGE, then PHASE\_SHIFT is limited to ± 255 in either mode.

In variable mode, the phase factor can be changed by activating PSEN for one period of PSCLK. Increments or decrements to the phase factor can be made by setting the PSINCDEC pin to a High or Low, respectively. When the de-skew circuit has completed an increment or decrement operation, the signal PSDONE goes High for a single PSCLK cycle. This indicates to the user that the next change may be made.

The user interface and the physical implementation are different. The user interface describes the phase shift as a fraction of the clock period (N/256). The physical implementation adds the appropriate number of buffer stages (each DCM\_TAP) to the clock delay. The DCM\_TAP granularity limits the phase resolution at higher clock frequencies.

# Phase Shift Characteristics

- Offers fine-phase adjustment with a resolution of ±1/256 of the clock period (or ± one DCM\_TAP, whichever is greater) by configuration and also dynamically under user control.
- The phase shift settings affect all nine DCM outputs.
- V<sub>CC</sub> and temperature do not affect the phase shift.

# Fixed-Mode Phase Shifting

The use of negative, fixed-mode phase shifting with a Virtex-II DCM requires a minor modification to the clock muxing. Specifically, CLKIN must be used to drive CLKFB while the DCM is in reset (when the RST signal is held High or during the startup sequence after configuration). Figure 2-33 and the example Verilog and VHDL code below illustrate the implementation of this simple modification. This modification is NOT required for any positive phase shifting or for any variable-mode phase shifting.

Note that this requirement applies to any DCM using fixed-mode phase shifting, regardless of whether the RST of the DCM is connected to a user signal. RST must be held High for at least three clock cycles.

## Solution 1: Clock-Muxing Modification

Figure 2-33 illustrates the clock-muxing modification.



Figure 2-33: Clock Muxing for Negative PHASE\_SHIFT Value

**Note:** PAR might report a warning that the IBUFG (CLKIN)-to-BUFGMUX connection is not an optimal connection and so does not use the fast connection between the two components. This warning can be safely ignored; in this case, a dedicated connection is not necessary for the IBUFG-BUFGMUX connection.

### Verilog Example:

```
//DCM instantiation
DCM U_DCM ( .CLKIN(clkin), .CLKFB(clk0_new), .RST(rst), .CLK0(clk0),
.LOCKED(locked) );
//BUFGMUX instantiation
BUFGMUX U_BUFGMUX ( .O(clk0_new), .IO(clk0), .I1(clkin), .S(rst) );
```

### **VHDL Example:**

```
-- DCM instantiation --
U_DCM: DCM port map ( CLKIN => clkin,
CLKFB => clk0_new,
DSSEN => '0',
PSCLK => '0',
PSINCDEC => '0',
RST => rst,
CLK0 => clk0,
LOCKED => locked);
-- BUFGMUX instantiation --
U_BUFGMUX: BUFGMUX( I0 => clk0
I1 => clkin,
0 => clk0_new);
```

### Solution 2: Alternative Workaround Using Positive Phase Shift

This problem can also be resolved by using an equivalent positive PHASE\_SHIFT value. Given the current data sheet specification of FINE\_SHIFT\_RANGE = 10 ns, this restriction begins to have an effect for input frequencies below 100 MHz (input periods larger than 10 ns).

Given the equation:

Phase shift = (PS/256) \* PeriodCLKIN

For a 10 ns input clock period:

Variable mode: -128 <= PS <= 128

Fixed mode: 0 <= PS <= 255 (full range)

For a 20 ns input clock period:

Variable mode: -64 <= PS <= 64

Fixed mode: 0 <= PS <= 128

For a 40 ns input clock period:

Variable mode: -32 <= PS <= 32

Fixed mode: 0 <= PS <= 64

In either fixed or variable mode, the range can be extended by choosing CLK90, CLK180, or CLK270, rather than CLK0, choosing CLK2X180 rather than CLK2X, or choosing CLKFX180 rather than CLKFX. Even at 25 MHz (40 ns period), the fixed mode coupled with these CLK\* phases allows shifting throughout the entire input clock period range. See Figure 2-34.

Furthermore, the phase-shifting (DPS) function in the DCM requires the CLKFB for delay adjustment.

Because CLKFB must be from CLK0 or CLK2X, the DLL output is used. Hence the minimum CLKIN frequency for the DPS function is 24 MHz.

### Solution 3: Alternative Workaround Using Variable Phase Shift

Configure the DCM with variable phase shifting, as follows:

- 1. Set the CLK\_OUT\_PHASE\_SHIFT attribute to VARIABLE.
- 2. Set the PHASE\_SHIFT attribute to the required value (a negative value is acceptable).
- 3. Connect PSCLK to a clock signal.
- 4. Connect PSEN to Ground if incrementing or decrementing the phase shift is not intended.
- 5. Connect PSINCDEC to a signal, GND, or VCC.



Figure 2-34: Fixed Phase Shift Example

## Port Signals

1x Clock Outputs — CLK[0|90|180|270]

The 1x clock output pin CLK0 represents a delay-compensated version of the source clock (CLKIN) signal. In low-frequency mode, the DCM provides three phase-shifted versions of the CLK0 signal (CLK90, CLK180, and CLK270), whereas in high-frequency mode, only the 180 phase-shifted version is provided. All four (including CLK0) of the phase shifted outputs can be used simultaneously in low-frequency mode. The relationship between phase shift and the corresponding period shift appears in Table 2-8. The timing diagrams in Figure 2-35 illustrate the DLL clock output characteristics.

Table 2-8: Relationship of Phase-Shifted Output Clock to Period Shift

| Phase (degrees) | % Period Shift |
|-----------------|----------------|
| 0               | 0%             |
| 90              | 25%            |
| 180             | 50%            |
| 270             | 75%            |

By default, the DCM provides a 50/50 duty cycle correction on all 1x clock outputs. The DUTY\_CYCLE\_CORRECTION attribute (TRUE by default), controls this feature. Attach the DUTY\_CYCLE\_CORRECTION=FALSE property to the DCM symbol in order to

deactivate the DCM duty cycle correction. With duty cycle correction deactivated, the output clocks have the same duty cycle as the source clock.

The DCM clock outputs can drive an OBUF, a BUFGMUX, or they can route directly to the clock input of a synchronous element.



Figure 2-35: DLL Output Characteristics

## Source Clock Input - CLKIN

The CLKIN pin provides the user source clock to the DCM. The CLKIN frequency must fall in the ranges specified in the <u>Virtex-II Data Sheet (DS031)</u>. The clock input signal can be provided by one of the following:

- IBUF Input buffer
- IBUFG Global clock input buffer
- BUFGMUX Internal global clock buffer

**Note:** When IBUF is used as the clock input, the PAD to DCM input skew is not compensated. Refer to Table 2-1 for more information.

## Feedback Clock Input — CLKFB

A DCM requires a reference or feedback signal to provide delay-compensated output. Connect only the CLK0 or CLK2X DCM outputs to the feedback clock input (CLKFB) pin to provide the necessary feedback to the DCM. The feedback clock input signal can be driven by an internal global clock buffer (BUFGMUX), one of the global clock input buffers (IBUFG) on the same edge of the device (top or bottom), or IBUF (the input buffer.)

If an IBUFG sources the CLKFB pin, the following special rules apply:

- 1. The CLK0 or CLK2X of the DCM must directly drive an OBUF or a BUFG-to-DDR configuration to forward the clock.
- 2. External to the FPGA, the same forwarded clock signal must be connected to the IBUFG (GCLK pin) that drives the CLKFB of the DCM.

### Phase Shift Clock - PSCLK

The PSCLK input can be sourced by the CLKIN signal to the DCM, or it can be a lower or higher frequency signal provided from any clock source (external or internal). The frequency range of PSCLK is defined by PSCLK\_FREQ\_LF/HF (see the *Virtex-II Data Sheet*). This input has to be tied to ground when the CLKOUT\_PHASE\_SHIFT attribute is set to NONE or FIXED.

### Phase Shift Increment/Decrement - PSINCDEC

The PSINCDEC signal is synchronous to PSCLK and is used to increment or decrement the phase shift factor. In order to increment or decrement the phase shift by 1/256 of clock period, the PSINCDEC signal must be High for increment or Low for decrement. This input has to be tied to ground when the CLKOUT\_PHASE\_SHIFT attribute is set to NONE or FIXED.

### Phase Shift Enable - PSEN

To initiate a variable phase-shift operation, the PSEN input must be activated for one period of PSCLK. The phase change becomes effective after up to 100 CLKIN pulse cycles plus three PSCLK cycles, and is indicated by a High pulse on PSDONE. During the phase transition there are no sporadic changes or glitches on any output. PSEN must be tied to ground when the CLKOUT\_PHASE\_SHIFT attribute is set to NONE or FIXED.

### Reset Input - RST

When the reset pin is activated, the LOCKED signal deactivates. The M and D values at configuration are maintained after the reset. The RST pin, active High, must be connected to a dynamic signal or tied to ground. For all designs that use the feedback feature of the DCM, applying a reset signal after configuration is strongly recommended for both production and ES devices in order to ensure consistent locking. As the DCM delay taps reset to zero, glitches can occur on the DCM clock output pins. After the DCM resets the clock, outputs have a DC behavior. Activation of the RST pin can also severely affect the duty cycle of the clock output pins. Furthermore, the DCM output clocks no longer deskew with respect to one another and are eventually stopped Low if Status signals are deactivated (brought to Low).

For these reasons, use the reset pin when reconfiguring the device or changing the input frequency, or after reconfiguration for DCMs with external feedback. The reset input signal is asynchronous and should be held High for at least three clock cycles. The time it takes for the DCM to achieve lock is specified as LOCK\_DLL (for DLL output) and LOCK\_FX (for DFS output). The DCM locks faster at higher frequencies. See the LOCK\_DLL timing parameter in the *Virtex-II Data Sheet*.

### Locked Output - LOCKED

The LOCKED signal activates after the DCM has achieved lock. To guarantee that the system clock is established prior to the device "waking up," the DCM can delay the completion of the device configuration process until after the DCM locks. The STARTUP\_WAIT attribute activates this feature. Until the LOCKED signal activates, the DCM output clocks are not valid and can exhibit glitches, spikes, or other spurious movement. For details, refer to Chapter 3: Configuration.

### Phase Shift DONE - PSDONE

The PSDONE signal is synchronous to PSCLK and it indicates, by pulsing High for one period of PSCLK, that the requested phase shift was achieved. This signal also indicates to the user that a new change to the phase shift numerator can be made. This output signal is not valid if the phase shift feature is not being used or is in fixed mode.

### Status - STATUS

STATUS[0] indicates the overflow of the phase shift numerator (when DCM is phase shifted beyond +255 or -255) and that the absolute delay range of the phase shift delay line is exceeded.

### Attributes

The following attributes provide access to the Virtex-II fine-phase adjustment capability.

### **Clock Out Phase Shift**

The CLKOUT\_PHASE\_SHIFT attribute controls the use of the PHASE\_SHIFT value. It can be set to NONE, FIXED, or VARIABLE. By default, this attribute is set to NONE, indicating that the phase shift feature is not being used. When this attribute is set to NONE, the PHASE\_SHIFT value has no effect on the DCM outputs. If the CLKOUT\_PHASE\_SHIFT attribute is set to FIXED or NONE, then the PSEN, PSINCDEC, and the PSCLK inputs must be tied to ground. The effects of the CLKOUT\_PHASE\_SHIFT attribute are shown in Figure 2-32.

### PHASE\_SHIFT

This attribute specifies the phase shift numerator as any value from -255 to 255. This attribute can be used with both fixed and variable phase shift mode. If used with variable mode, the attribute sets the starting phase shift.

## Submodules



Figure 2-36: BUFG\_PHASE\_CLK0\_SUBM



Figure 2-37: BUFG\_PHASE\_CLKFX\_FB\_SUBM



UG002\_C2\_072\_101201







www.xilinx.com 1-800-255-7778

# Summary of All DCM Attributes

A handful of DCM attributes govern the functionality of DCM features. Below is a collection of all DCM attributes. Table 2-9 summarizes all attributes applicable to DCMs.

### Table 2-9: DCM Attributes

| DCM Attribute Name | Description                                                                                                                                                                                                                                                                                                                             | Value                                                                         | Default Value      |  |  |  |
|--------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------|--------------------|--|--|--|
| CLKDV_DIVIDE       | This attribute controls<br>CLKDV such that the source<br>clock is divided by N.<br>This feature provides<br>automatic duty cycle<br>correction such that the<br>CLKDV output pin has a<br>50/50 duty cycle always in<br>low-frequency mode, as<br>well as for all integer values<br>of the division factor N in<br>high-frequency mode. | Real:<br>1.5 <= N+0.5 <=8,<br>8< N+1 < 16                                     | 2.0                |  |  |  |
| CLKFX_DIVIDE       |                                                                                                                                                                                                                                                                                                                                         | Integer: 1 to 32                                                              | 1                  |  |  |  |
| CLKFX_MULTIPLY     |                                                                                                                                                                                                                                                                                                                                         | Integer: 2 to 32                                                              | 4                  |  |  |  |
| CLKIN_PERIOD       | This specifies the source<br>clock period to help DCM<br>adjust for optimum<br>CLKFX/CLKFX180<br>outputs.                                                                                                                                                                                                                               | Real in ns                                                                    | 0.0                |  |  |  |
| CLKIN_DIVIDE_BY_2  | This attribute allows for the<br>input clock frequency to be<br>divided in half when such a<br>reduction is necessary to<br>meet the DCM input clock<br>frequency requirements.<br>This is characterized to<br>work up to 1.15 GHz at<br>room temperature.                                                                              | Boolean: FALSE or TRUE                                                        | FALSE              |  |  |  |
| CLKOUT_PHASE_SHIFT | This controls the use of the PHASE_SHIFT value.                                                                                                                                                                                                                                                                                         | String: "NONE" or "FIXED" or<br>"VARIABLE"                                    | NONE               |  |  |  |
| CLK_FEEDBACK       | This attribute specifies the feedback input to the DCM (CLK0,or CLK2X).                                                                                                                                                                                                                                                                 | String: "1X" or "2X"                                                          | 1X                 |  |  |  |
| DESKEW_ADJUST      | This affects the amount of<br>delay in the feedback path,<br>and should be used for<br>source-synchronous<br>interfaces. [See the Clock<br>De-Skew section.]                                                                                                                                                                            | String:<br>"SYSTEM_SYNCHRONOUS"<br>or<br>"SOURCE_SYNCHRONOUS"<br>or "0 to 15" | SYSTEM_SYNCHRONOUS |  |  |  |
| DFS_FREQUENCY_MODE | This specifies the frequency mode of the frequency synthesizer.                                                                                                                                                                                                                                                                         | String: "LOW" or "HIGH"                                                       | LOW                |  |  |  |
| DLL_FREQUENCY_MODE | This specifies the DLL's<br>frequency mode. CLK90,<br>CLK270, CLK2X, and<br>CLK2X180 are not available<br>in high-frequency mode.                                                                                                                                                                                                       | String: "LOW" or "HIGH"                                                       | LOW                |  |  |  |

### Table 2-9: DCM Attributes

| DCM Attribute Name    | Description                                                                                                                                                                               | Value                  | Default Value |
|-----------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------|---------------|
| DUTY_CYCLE_CORRECTION | This controls the DCM of 1X<br>outputs (CLK0, CLK90,<br>CLK180, and CLK270), such<br>that they exhibit a 50/50<br>duty cycle. Leave this<br>attribute set at the default<br>value (TRUE). | Boolean: TRUE or FALSE | TRUE          |
| DSS_MODE              | Unsupported feature. Leave the value at default.                                                                                                                                          | String                 | NONE          |
| FACTORY_JF            | This attribute is fixed unless recommended otherwise.                                                                                                                                     | Bit_vector             | XC080         |
| PHASE_SHIFT           | This specifies the phase shift numerator.                                                                                                                                                 | Integer: -255 to 255   | 0             |
| STARTUP_WAIT          | When this attribute is set to<br>TRUE, the configuration<br>startup sequence waits in<br>the specified cycle until the<br>DCM locks.                                                      | Boolean: FALSE or TRUE | FALSE         |

For more information on applying these attributes in UCF, VHDL, or Verilog code, refer to the *Constraints Guide* at: <u>http://toolbox.xilinx.com/docsan/xilinx4/manuals.htm</u>.

# VHDL and Verilog Instantiation

VHDL and Verilog instantiation templates are available as examples (see "VHDL and Verilog Templates" on page 89) for all submodules.

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

### VHDL and Verilog Templates

The following submodules described in this section are available:

- BUFG\_CLK0\_SUBM
- BUFG\_CLK2X\_SUBM
- BUFG\_CLK0\_FB\_SUBM
- BUFG\_CLK2X\_FB\_SUBM
- BUFG\_CLKDV\_SUBM
- BUFG\_DFS\_SUBM
- BUFG\_DFS\_FB\_SUBM
- BUFG\_PHASE\_CLKFX\_FB\_SUBM
- BUFG\_PHASE\_CLK0\_SUBM
- BUFG\_PHASE\_CLK2X\_SUBM
- BUFG\_PHASE\_CLKDV\_SUBM

The corresponding submodules must be synthesized with the design. The BUFG\_CLK0\_SUBM submodule is provided in VHDL and Verilog as an example.

```
VHDL Template
   -- Module: BUFG_CLK0_SUBM
   -- Description: VHDL submodule
   -- DCM with CLK0 deskew
   -- Device: Virtex-II Family
   _____
   library IEEE;
   use IEEE.std_logic_1164.all;
   - -
   -- pragma translate off
   library UNISIM;
   use UNISIM.VCOMPONENTS.ALL;
   -- pragma translate on
   - -
   entity BUFG_CLK0_SUBM is
     port (
          CLK IN : in std logic;
          RST : in std_logic;
CLK1X : out std_logic;
          LOCK : out std_logic
         );
   end BUFG CLK0 SUBM;
   - -
   architecture BUFG CLK0 SUBM arch of BUFG CLK0 SUBM is
   -- Components Declarations:
   component BUFG
     port (
          I : in std logic;
          0 : out std logic
         );
   end component;
   component DCM
   -- pragma translate_off
      generic (
               DLL FREQUENCY MODE : string := "LOW";
               DUTY CYCLE CORRECTION : boolean := TRUE;
               STARTUP_WAIT : boolean := FALSE
              );
   -- pragma translate_on
      port ( CLKIN : in std_logic;
                      : in std logic;
             CLKFB
             DSSEN : in std_logic;
             PSINCDEC : in std_logic;
             PSEN : in std_logic;
             PSCLK
                     : in std_logic;
             RST
                     : in std logic;
             CLK0
                     : out std logic;
                    : out std_logic;
             CLK90
             CLK180 : out std_logic;
             CLK270 : out std_logic;
             CLK2X
                      : out std_logic;
             CLK2X180 : out std_logic;
             CLKDV
                     : out std logic;
                      : out std_logic;
             CLKFX
             CLKFX180 : out std_logic;
             LOCKED : out std_logic;
             PSDONE : out std_logic;
             STATUS : out std_logic_vector(7 downto 0)
            );
   end component;
   -- Attributes
```

```
attribute DLL_FREQUENCY_MODE : string;
```

www.xilinx.com

1-800-255-7778

```
attribute DUTY_CYCLE_CORRECTION : string;
attribute STARTUP WAIT : string;
attribute DLL FREQUENCY MODE of U DCM: label is "LOW";
attribute DUTY_CYCLE_CORRECTION of U_DCM: label is "TRUE";
attribute STARTUP_WAIT of U_DCM: label is "FALSE";
-- Signal Declarations:
signal GND : std logic;
signal CLK0_W: std_logic;
signal CLK1X W: std logic;
begin
GND <= '0';
CLK1X <= CLK1X W;
-- DCM Instantiation
U DCM: DCM
 port map (
            CLKIN =>
                       CLK IN,
            CLKFB =>
                       CLK1X W,
            DSSEN =>
                        GND,
            PSINCDEC => GND,
            PSEN =>
                        GND,
            PSCLK =>
                        GND,
            RST =>
                       RST,
            CLK0 =>
                    CLK0 W,
            LOCKED => LOCK
    );
-- BUFG Instantiation
U BUFG: BUFG
  port map (
      I => CLK0_W,
      O => CLK1X_W
          );
end BUFG_CLK0_SUBM_arch;
```

### Verilog Template

```
// Module:
           BUFG CLK0 SUBM
// Description: Verilog Submodule
// DCM with CLK0 deskew
11
// Device: Virtex-II Family
//-----
module BUFG_CKL0_SUBM (
                   CLK IN,
                   RST,
                   CLK1X,
                  LOCK
                  );
   input CLK_IN;
   input RST;
   output CLK1X;
   output LOCK;
   wire CLK0_W;
   wire GND;
```

```
assign GND = 1'b0;
```

```
//BUFG Instantiation
11
BUFG U BUFG
            (.I(CLK0 W),
             .O(CLK1X)
            );
// Attributes for functional simulation//
// synopsys translate off
       defparam U_DCM.DLL_FREQUENCY_MODE = "LOW";
       defparam U DCM.DUTY CYCLE CORRECTION = "TRUE";
       defparam U_DCM.STARTUP_WAIT = "FALSE";
// synopsys translate_on
// Instantiate the DCM primitive//
 DCM U_DCM (
                   .CLKFB(CLK1X),
                   .CLKIN(CLK_IN),
                   .DSSEN(GND),
                   .PSCLK(GND),
      .PSEN(GND),
                   .PSINCDEC(GND),
                   .RST(RST),
                   .CLK0(CLK0_W),
      .LOCKED(LOCK)
                  );
// synthesis attribute declarations
  /* synopsys attribute
 DLL_FREQUENCY_MODE "LOW"
 DUTY_CYCLE_CORRECTION "TRUE"
 STARTUP_WAIT "FALSE"
  */
endmodule
```

2

# **DCM Waveforms**

The DCM waveforms shown below are the results of functional simulation using Model Technology's ModelSim EE/Plus 5.3a\_p1 simulator. Note that the time scale for these simulations were set to 1ns/1ps. It is important to set the unused inputs of the DCM to logic 0 and to set the attribute values to the correct data types. For example, the PHASE\_SHIFT, CLKFX\_DIVIDE, and CLKFX\_MULTIPLY attributes are integers and should be set to values as shown.

defparam U\_DCM.DFS\_FREQUENCY\_MODE = "LOW"; defparam U\_DCM.CLKFX\_DIVIDE = 1; (this value's range is specified under Frequency Synthesis in the Virtex-II Data Sheet) defparam U\_DCM.CLKFX\_MULTIPLY = 4; (this value's range is specified under Frequency Synthesis in the Virtex-II Data Sheet) defparam U\_DCM.CLKOUT\_PHASE\_SHIFT = "FIXED"; defparam U\_DCM.PHASE\_SHIFT = 150; (Any value from 1 to 255) defparam U\_DCM.STARTUP\_WAIT = "FALSE";

The input clock, 'clk\_in' (CLKIN input of DCM) in all these waveforms is 50 MHz. The DCM\_DLL waveforms in Figure 2-40 shows four DCM outputs, namely, clk1x (CLK0 output of DCM), clk2x (CLK2X output of DCM), clk90 (CLK90 output of DCM), and clk180 (CLK180 output of DCM).



ug002\_c2\_095\_113000

Figure 2-40: DCM\_DLL Waveforms

The DCM\_DFS Waveforms in Figure 2-41 shows four DCM outputs namely, clk1x (CLK0 output of DCM), clk2x (CLK2X output of DCM), clkfx (CLKFX output of DCM), and clkfx180 (CLKFX180 output of DCM). In this case the attributes, CLKFX\_DIVIDE = 1, and the CLKFX\_MULTIPLY = 3.



### Figure 2-41: DCM\_DFS Waveforms

The DCM\_DPS waveforms in Figure 2-42 shows four DCM outputs, namely, clk1x (CLK0 output of DCM), clk2x (CLK2X output of DCM), clk90 (CLK90 output of DCM), and clk180 (CLK180 output of DCM). In this case, the attribute PHASE\_SHIFT = 150 which translates to a phase shift of  $(150 \times 20 \text{ ns})/256 = 11.719 \text{ ns}$ , where 20 ns is the clock period.





# XILINX<sup>®</sup>

2

The DCM\_DPS\_DFS waveforms in Figure 2-43 shows four DCM outputs namely, clk1x (CLK0 output of DCM), clk90 (CLK90 output of DCM), clkfx (CLKFX output of DCM), and clkfx180 (CLKFX180 output of DCM). In this case, the attributes, CLKFX\_DIVIDE = 1, and the CLKFX\_MULTIPLY = 4. The attribute, PHASE\_SHIFT = 150 which translates to a phase shift of  $(150 \times 20 \text{ ns})/256 = 11.719 \text{ ns}$ , where 20 ns is the clock period.



Figure 2-43: DCM\_DPS\_DFS Waveforms

# Using Block SelectRAM<sup>™</sup> Memory

# Introduction

In addition to distributed SelectRAM memory, Virtex-II devices feature a large number of 18 Kb block SelectRAM memories. The block SelectRAM memory is a True Dual-Port RAM, offering fast, discrete, and large blocks of memory in the device. The memory is organized in columns, and the total amount of block SelectRAM memory depends on the size of the Virtex-II device. The 18 Kb blocks are cascadable to enable a deeper and wider memory implementation, with a minimal timing penalty incurred through specialized routing resources.

Embedded dual- or single-port RAM modules, ROM modules, synchronous and asynchronous FIFOs, and data width converters are easily implemented using the Xilinx CORE Generator "Block Memory" modules. Asynchronous FIFOs can also be generated using the CORE Generator Asynchronous FIFO module. Starting with IP Update #3, the designer can also generate synchronous FIFOs using Block Memory.

# Synchronous Dual-Port and Single-Port RAM

## Data Flow

The 18Kb block SelectRAM dual-port memory consists of an 18-Kb storage area and two completely independent access ports, A and B. The structure is fully symmetrical, and both ports are interchangeable.

Data can be written to either port and can be read from the same or the other port. Each port is synchronous, with its own clock, clock enable, and write enable. Note that the read operation is also synchronous and requires a clock edge.



Figure 2-44: Dual-Port Data Flows

As described below, there are three options for the behavior of the data output during a write operation on its port. There is no dedicated monitor to arbitrate the result of identical addresses on both ports. It is up to the user to time the two clocks appropriately. However, conflicting simultaneous writes to the same location never cause any physical damage.

## **Operating Modes**

To maximize utilization of the True Dual-Port memory at each clock edge, the block SelectRAM memory supports three different write modes for each port. The "read during write" mode offers the flexibility of using the data output bus during a write operation on the same port. Output behavior is determined by the configuration. This choice increases the efficiency of block SelectRAM memory at each clock cycle and allows designs that use maximum bandwidth.

## **Read Operation**

The read operation uses one clock edge. The read address is registered on the read port, and the stored data is loaded into the output latches after the RAM access interval passes.

## Write Operations

A write operation is a single clock-edge operation. The write address is registered on the write port, and the data input is stored in memory.

Three different modes are used to determine data available on the output latches after a write clock edge.

### WRITE\_FIRST or Transparent Mode (Default)

In WRITE\_FIRST mode, the input data is simultaneously written into memory and stored in the data output (transparent write), as shown in Figure 2-45.



UG002 (v1.5) 2 December 2002 Virtex-II Platform FPGA User Guide

### READ\_FIRST or Read-Before-Write Mode

In READ\_FIRST mode, data previously stored at the write address appears on the output latches, while the input data is being stored in memory (read before write). See Figure 2-46.



Figure 2-46: **READ\_FIRST Mode Waveforms** 

### NO\_CHANGE Mode

In NO\_CHANGE mode, the output latches remain unchanged during a write operation. As shown in Figure 2-47, data output is still the last read data and is unaffected by a write operation on the same port.

Mode selection is set by configuration. One of these three modes is set individually for each port by an attribute. The default mode is WRITE\_FIRST.



Figure 2-47: NO\_CHANGE Mode Waveforms

# **Conflict Resolution**

Virtex-II block SelectRAM memory is a True Dual-Port RAM that allows both ports to simultaneously access the same memory cell. When one port writes to a given memory cell, the other port must not address that memory cell (for a write or a read) within the clock-to-clock setup window. Figure 2-48 describes this asynchronous operation.



Figure 2-48: **READ-WRITE Conditions** 

If port A and port B are configured with different widths, only the overlapping bits are invalid when conflicts occur.

## Asynchronous Clocks

The first CLK\_A clock edge violates the clock-to-clock setup parameter, because it occurs too soon after the last CLK\_B clock edge. The write operation on port B is valid, and the read operation on port A is invalid.

At the second rising edge of the CLK\_B pin, the write operation is valid. The memory location (bb) contains 4444. The second rising edge of CLK\_A reads the new data at the same location (bb), which now contains 4444.

The clock-to-clock setup timing parameter is specified together with other block SelectRAM switching characteristics in the <u>Virtex-II Data Sheet (DS031)</u>.

### Synchronous Clocks

When both clocks are synchronous or identical, the result of simultaneous accesses from both ports to the same memory cell is best described in words:

- If both ports read simultaneously from the same memory cell: Both Data\_out ports will have the same data.
- If both ports write simultaneously into the same memory cell: The data stored in that cell becomes invalid (unless both ports write identical data).
- If one port writes and the other one reads from the same memory cell: The write operation succeeds, and the write port's Data\_out behaves as determined by the read output mode (write\_first, read\_first, or no\_change).

If the write port is in read\_first mode, the read port's Data\_out represents the previous content of the memory cell. If the write port is in write\_first mode or in no\_change mode, the read port's Data\_out becomes invalid. Obviously, the read port's mode setting does not affect this operation.

# Characteristics

- A write operation requires only one clock edge.
- A read operation requires only one clock edge.
- All inputs are registered with the port clock and have a setup-to-clock timing specification.
- All outputs have a read-through function or one of three read-during-write functions, depending on the state of the WE pin. The outputs relative to the port clock are available after the clock-to-out timing interval.
- Block SelectRAM cells are true synchronous RAM memories and do not have a combinatorial path from the address to the output.
- The ports are completely independent of each other (that is, clocking, control, address, read/write functions, initialization, and data width) without arbitration.
- Output ports are latched with a self-timed circuit, guaranteeing glitch-free reads. The state of the output port does not change until the port executes another read or write operation.
- Data input and output signals are always busses; that is, in a 1-bit width configuration, the data input signal is DI[0] and the data output signal is DO[0].

# Library Primitives

The input and output data busses are represented by two busses for 9-bit width (8+1), 18-bit width (16+2), and 36-bit width (32+4) configurations. The ninth bit associated with each byte can store parity or error correction bits. No specific function is performed on this bit.

The separate bus for parity bits facilitates some designs. However, other designs safely use a 9-bit, 18-bit, or 36-bit bus by merging the regular data bus with the parity bus. Read/write and storage operations are identical for all bits, including the parity bits.

Figure 2-49 shows the generic dual-port block RAM primitive. DIA, DIPA, ADDRA, DOA, DOPA, and the corresponding signals on port B are busses.



Figure 2-49: Dual-Port Block RAM Primitive

| Primitive      | Port A Width | Port B Width |  |  |  |
|----------------|--------------|--------------|--|--|--|
| RAMB16_S1_S1   |              | 1            |  |  |  |
| RAMB16_S1_S2   |              | 2            |  |  |  |
| RAMB16_S1_S4   | 1            | 4            |  |  |  |
| RAMB16_S1_S9   | - 1          | (8+1)        |  |  |  |
| RAMB16_S1_S18  |              | (16+2)       |  |  |  |
| RAMB16_S1_S36  | _            | (32+4)       |  |  |  |
| RAMB16_S2_S2   |              | 2            |  |  |  |
| RAMB16_S2_S4   | _            | 4            |  |  |  |
| RAMB16_S2_S9   | 2            | (8+1)        |  |  |  |
| RAMB16_S2_S18  | _            | (16+2)       |  |  |  |
| RAMB16_S2_S36  |              | (32+4)       |  |  |  |
| RAMB16_S4_S4   |              | 4            |  |  |  |
| RAMB16_S4_S9   |              | (8+1)        |  |  |  |
| RAMB16_S4_S18  | - 4          | (16+2)       |  |  |  |
| RAMB16_S4_S36  | _            | (32+4)       |  |  |  |
| RAMB16_S9_S9   |              | (8+1)        |  |  |  |
| RAMB16_S9_S18  | (8+1)        | (16+2)       |  |  |  |
| RAMB16_S9_S36  |              | (32+4)       |  |  |  |
| RAMB16_S18_S18 | (16+2)       | (16+2)       |  |  |  |
| RAMB16_S18_S36 | (10+2)       | (32+4)       |  |  |  |
| RAMB16_S36_S36 | (32+4)       | (32+4)       |  |  |  |

Table 2-10 lists the available dual-port primitives for synthesis and simulation.Table 2-10:Dual-Port Block RAM Primitives

Figure 2-50 shows the generic single-port block RAM primitive. DI, DIP, ADDR, DO, and DOP are busses.



Figure 2-50: Single-Port Block RAM Primitive

Table 2-11 lists all of the available single-port primitives for synthesis and simulation.Table 2-11:Single-Port Block RAM Primitives

| Primitive  | Port Width |
|------------|------------|
| RAMB16_S1  | 1          |
| RAMB16_S2  | 2          |
| RAMB16_S4  | 4          |
| RAMB16_S9  | (8+1)      |
| RAMB16_S18 | (16+2)     |
| RAMB16_S36 | (32+4)     |

# VHDL and Verilog Instantiation

VHDL and Verilog instantiation templates are available as examples (see "VHDL and Verilog Templates" on page 106).

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

The SelectRAM\_Ax templates (with x = 1, 2, 4, 9, 18, or 36) are single-port modules and instantiate the corresponding RAMB16\_Sx module.

SelectRAM\_Ax\_By templates (with x = 1, 2, 4, 9, 18, or 36 and y = 1, 2, 4, 9, 18, or 36) are dual-port modules and instantiate the corresponding RAMB16\_Sx\_Sy module.

# Port Signals

Each block SelectRAM port operates independently of the other while accessing the same set of 18K-bit memory cells.

### Clock - CLK[AIB]

Each port is fully synchronous with independent clock pins. All port input pins have setup time referenced to the port CLK pin. The data bus has a clock-to-out time referenced to the CLK pin. Clock polarity is configurable (rising edge by default).

### Enable - EN[AIB]

The enable pin affects the read, write, and set/reset functionality of the port. Ports with an inactive enable pin keep the output pins in the previous state and do not write data to the memory cells. Enable polarity is configurable (active High by default).

## Write Enable - WE[AIB]

Both EN and WE are active when the contents of the data input bus is written to memory at the address pointed to by the address bus. The output latches are loaded or not loaded according to the write configuration (WRITE\_FIRST, READ\_FIRST, NO\_CHANGE). When inactive, a read operation occurs, and the contents of the memory cells referenced by the address bus reflect on the data-out bus, regardless of the write mode attribute. Write enable polarity is configurable (active High by default).

## Set/Reset - SSR[A|B]

The SSR pin forces the data output latches to contain the value "SRVAL" (see "Attributes" on page 104). The data output latches are synchronously asserted to 0 or 1, including the parity bit. In a 36-bit width configuration, each port has an independent SRVAL[A | B] attribute of 36 bits. This operation does not affect RAM memory cells and does not disturb write operations on the other port. Like the read and write operation, the set/reset function is active only when the enable pin of the port is active. Set/reset polarity is configurable (active High by default).

# Address Bus - ADDR[A|B]<#:0>

The address bus selects the memory cells for read or write. The width of the port determines the required address bus width, as shown in Table 2-12.

| Port Data Width | Depth  | ADDR Bus | DI Bus / DO Bus | DIP Bus / DOP Bus |
|-----------------|--------|----------|-----------------|-------------------|
| 1               | 16,384 | <13:0>   | <0>             | NA                |
| 2               | 8,192  | <12:0>   | <1:0>           | NA                |
| 4               | 4,096  | <11:0>   | <3:0>           | NA                |
| 9               | 2,048  | <10:0>   | <7:0>           | <0>               |
| 18              | 1,024  | <9:0>    | <15:0>          | <1:0>             |
| 36              | 512    | <8:0>    | <31:0>          | <3:0>             |

Table 2-12: Port Aspect Ratio

# Data-In Busses - DI[AIB]<#:0> & DIP[AIB]<#:0>

Data-in busses provide the new data value to be written into RAM. The regular data-in bus (DI) and the parity data-in bus (when available) have a total width equal to the port width. For example the 36-bit port data width is represented by DI<31:0> and DIP<3:0>, as shown in Table 2-12.

# Data-Out Busses - DO[AIB]<#:0> & DOP[AIB]<#:0>

Data-out busses reflect the contents of memory cells referenced by the address bus at the last active clock edge during a read operation. During a write operation (WRITE\_FIRST or READ\_FIRST configuration), the data-out busses reflect either the data-in busses or the stored value before write. During a write operation in NO\_CHANGE mode, data-out busses are not affected. The regular data-out bus (DO) and the parity data-out bus (DOP) (when available) have a total width equal to the port width, as shown in Table 2-12.

## **Inverting Control Pins**

For each port, the four control pins (CLK, EN, WE, and SSR) each have an individual inversion option. Any control signal can be configured as active High or Low, and the clock can be active on a rising or falling edge (active High on rising edge by default) without requiring other logic resources.

## **Unused Inputs**

Non-connected Data and/or address inputs should be connected to logic "1".

## GSR

The global set/reset (GSR) signal of a Virtex-II device is an asynchronous global signal that is active at the end of device configuration. The GSR can also restore the initial Virtex-II state at any time. The GSR signal initializes the output latches to the INIT, or to the INIT\_A and INIT\_B value (see "Attributes" on page 104). A GSR signal has no impact on internal memory contents. Because it is a global signal, the GSR has no input pin at the functional level (block SelectRAM primitive).

# Address Mapping

Each port accesses the same set of 18,432 memory cells using an addressing scheme dependent on the width of the port. The physical RAM locations addressed for a particular width are determined using the following formula (of interest only when the two ports use different aspect ratios):

END = ((ADDR + 1) \* Width) -1 START= ADDR \* Width Table 2-13 shows low-order address mapping for each port width.

| Table 2-13: | Port Address | Mapping |
|-------------|--------------|---------|
|-------------|--------------|---------|

| Port<br>Width | P<br>Loc | arity<br>ations |    | Data Locations |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |     |    |    |     |       |
|---------------|----------|-----------------|----|----------------|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|-----|----|----|-----|-------|
| 1             |          |                 | 31 | 30             | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15 | 14 | 13 | 12 | 11 | 10 | 98  | 76 | 54 | 3 2 | 2 1 0 |
| 2             | ו [      | N.A.            | 1  | 5              | 1  | 4  | 1  | .3 | 1  | 2  | 1  | 1  | 1  | .0 | ç  | 9  | :  | 8  | 2  | 7  | (  | 5  | Ę  | 5  | 4   | 3  | 2  | 1   | 0     |
| 4             |          |                 |    | 1              | 7  |    |    | (  | 5  |    |    | į  | 5  |    |    | 4  | 1  |    |    | 3  | 3  |    |    | 2  | 1 0 |    |    | 0   |       |
| 8+1           | 3 2      | 2 1 0           |    | 3 2 1 0        |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |     |    |    |     |       |
| 16 + 2        | 1        | 0               |    | 1 0            |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |     |    |    |     |       |
| 32 + 4        |          | 0               |    | 0              |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    |     |    |    |     |       |

# **Attributes**

## Content Initialization - INIT\_xx

INIT\_xx attributes define the initial memory contents. By default block SelectRAM memory is initialized with all zeros during the device configuration sequence. The 64 initialization attributes from INIT\_00 through INIT\_3F represent the regular memory contents. Each INIT\_xx is a 64-digit hex-encoded bit vector. The memory contents can be partially initialized and are automatically completed with zeros.

The following formula is used for determining the bit positions for each INIT\_xx attribute. Given yy = conversion hex-encoded to decimal (xx), INIT\_xx corresponds to the memory cells as follows:

- from [(yy + 1) \* 256] -1
- to (yy) \* 256

For example, for the attribute INIT\_1F, the conversion is as follows:

- yy = conversion hex-encoded to decimal X"1F" = 31
- from [(31+1) \* 256] -1 = 8191
- to 31 \* 256 = 7936

More examples are given in Table 2-14.

### Table 2-14: Block SelectRAM Initialization Attributes

| Attributo | Memo  | ry Cell |
|-----------|-------|---------|
| Allibule  | from  | to      |
| INIT_00   | 255   | 0       |
| INIT_01   | 511   | 256     |
| INIT_02   | 767   | 512     |
|           |       |         |
| INIT_0E   | 3839  | 3584    |
| INIT_0F   | 4095  | 3840    |
| INIT_10   | 4351  | 4096    |
|           |       |         |
| INIT_1F   | 8191  | 7936    |
| INIT_20   | 8447  | 8192    |
|           |       |         |
| INIT_2F   | 12287 | 12032   |
| INIT_30   | 12543 | 12288   |
|           |       |         |
| INIT_3F   | 16383 | 16128   |

# Content Initialization - INITP\_xx

INITP\_xx attributes define the initial contents of the memory cells corresponding to DIP/DOP busses (parity bits). By default these memory cells are also initialized to all zeros. The eight initialization attributes from INITP\_00 through INITP\_07 represent the memory contents of parity bits. Each INITP\_xx is a 64-digit hex-encoded bit vector and behaves like a regular INIT\_xx attribute. The same formula can be used to calculate the bit positions initialized by a particular INITP\_xx attribute.

# Output Latches Initialization - INIT (INIT\_A & INIT\_B)

The INIT (single-port) or INIT\_A and INIT\_B (dual-port) attributes define the output latches values after configuration. The width of the INIT (INIT\_A & INIT\_B) attribute is the port width, as shown in Table 2-15. These attributes are hex-encoded bit vectors and the default value is 0.

# Output Latches Synchronous Set/Reset - SRVAL (SRVAL\_A & SRVAL\_B)

The SRVAL (single-port) or SRVAL\_A and SRVAL\_B (dual-port) attributes define output latch values when the SSR input is asserted. The width of the SRVAL (SRVAL\_A and SRVAL\_B) attribute is the port width, as shown in Table 2-15. These attributes are hexencoded bit vectors and the default value is 0.

| Port Data Width | DOP Bus | DO Bus | INIT / SRVAL  |
|-----------------|---------|--------|---------------|
| 1               | NA      | <0>    | 1             |
| 2               | NA      | <1:0>  | 2             |
| 4               | NA      | <3:0>  | 4             |
| 9               | <0>     | <7:0>  | (1+8) = 9     |
| 18              | <1:0>   | <15:0> | (2+16) = 18   |
| 36              | <3:0>   | <31:0> | (4 + 32) = 36 |

Table 2-15: Port Width Values

# Initialization in VHDL or Verilog Codes

Block SelectRAM memory structures can be initialized in VHDL or Verilog code for both synthesis and simulation. For synthesis, the attributes are attached to the block SelectRAM instantiation and are copied in the EDIF output file to be compiled by Xilinx Alliance Series<sup>TM</sup> tools. The VHDL code simulation uses a generic parameter to pass the attributes. The Verilog code simulation uses a defparam parameter to pass the attributes.

The XC2V\_RAMB\_1\_PORT block SelectRAM instantiation code examples (in VHDL and Verilog) illustrate these techniques (see "VHDL and Verilog Templates" on page 106).

# **Location Constraints**

Block SelectRAM instances can have LOC properties attached to them to constrain placement. Block SelectRAM placement locations differ from the convention used for naming CLB locations, allowing LOC properties to transfer easily from array to array.

The LOC properties use the following form:

```
LOC = RAMB16_X#Y#
```

The RAMB16\_X0Y0 is the bottom-left block SelectRAM location on the device.

# Applications

# **Creating Larger RAM Structures**

Block SelectRAM columns have specialized routing to allow cascading blocks with minimal routing delays. Wider or deeper RAM structures are achieved with a smaller timing penalty than is encountered when using normal routing resources.

The CORE Generator program offers the designer a painless way to generate wider and deeper memory structures using multiple block SelectRAM instances. This program outputs VHDL or Verilog instantiation templates and simulation models, along with an EDIF file for inclusion in a design.

# Multiple RAM Organizations

The flexibility of block SelectRAM memories allows designs with various types of RAM in addition to regular configurations. Application notes at <u>www.xilinx.com</u> describe some of these designs, with VHDL and Verilog reference designs included.

Virtex-II block SelectRAM can be used as follows:

- Two independent single-port RAM resources
- One 72-bit single-port RAM resource
- One triple-port (1 Read/Write and 2 Read ports) RAM resource

Xilinx application notes with VHDL and Verilog reference designs also describe other implementations using block SelectRAM memory, such as:

- xapp258 "FIFOs Using Virtex-II Block RAM"
- xapp260 "Using Virtex-II Block RAM for High-Performance Read/Write CAMs"

# VHDL and Verilog Templates

VHDL and Verilog templates are available for all single-port and dual-port primitives. The A and B numbers indicate the width of the ports.

The following are single-port templates:

- SelectRAM\_A1
- SelectRAM\_A2
- SelectRAM\_A4
- SelectRAM\_A9
- SelectRAM\_A18
- SelectRAM\_A36

The following are dual-port templates:

- SelectRAM\_A1\_B1
- SelectRAM\_A1\_B2
- SelectRAM\_A1\_B4
- SelectRAM\_A1\_B9
- SelectRAM\_A1\_B18
- SelectRAM\_A1\_B36
- SelectRAM\_A2\_B2
- SelectRAM\_A2\_B4
- SelectRAM\_A2\_B9
- SelectRAM\_A2\_B18
- SelectRAM\_A2\_B36
- SelectRAM\_A4\_B4

- SelectRAM\_A4\_B9
- SelectRAM\_A4\_B18
- SelectRAM\_A4\_B36
- SelectRAM\_A9\_B9
- SelectRAM\_A9\_B18
- SelectRAM\_A9\_B36
- SelectRAM\_A18\_B18
- SelectRAM\_A18\_B36
- SelectRAM\_A36\_B36

# **VHDL** Template

As an example, the XC2V\_RAMB\_1\_PORT.vhd file uses the SelectRAM\_A36 template:

```
-- Module: XC2V_RAMB_1_PORT
-- Description: 18Kb Block SelectRAM example
-- Single Port 512 x 36 bits
-- Use template "SelectRAM_A36.vhd"
- -
-- Device: Virtex-II Family
_____
library IEEE;
use IEEE.std logic 1164.all;
- -
-- Syntax for Synopsys FPGA Express
-- pragma translate off
library UNISIM;
use UNISIM.VCOMPONENTS.ALL;
-- pragma translate on
entity XC2V RAMB 1 PORT is
  port (
   DATA_IN : in std_logic_vector (35 downto 0);
       ADDRESS : in std logic vector (8 downto 0);
       ENABLE: in std logic;
       WRITE_EN : in std_logic;
SET_RESET : in std_logic;
       CLK : in std logic;
       DATA OUT : out std logic vector (35 downto 0)
      );
end XC2V RAMB 1 PORT;
- -
architecture XC2V_RAMB_1_PORT_arch of XC2V_RAMB_1_PORT is
- -
-- Components Declarations:
component BUFG
 port (
 I: in std logic;
 0: out std logic
 );
end component;
-- Syntax for Synopsys FPGA Express
component RAMB16 S36
-- pragma translate off
 generic (
-- "Read during Write" attribute for functional simulation
WRITE_MODE : string := "READ_FIRST" ; -- WRITE_FIRST(default) /
READ_FIRST/ NO_CHANGE
```

```
-- Output value after configuration
 INIT : bit vector(35 downto 0) := X"000000000";
-- Output value if SSR active
 SRVAL : bit vector(35 downto 0) := X"012345678";
-- Plus bits initial content
    INITP 00 : bit vector(255 downto 0) :=
INITP 01 : bit vector(255 downto 0) :=
INITP 02 : bit vector(255 downto 0) :=
INITP 03 : bit vector(255 downto 0) :=
INITP_04 : bit_vector(255 downto 0) :=
INITP 05 : bit vector(255 downto 0) :=
INITP 06 : bit vector(255 downto 0) :=
INITP_07 : bit_vector(255 downto 0) :=
-- Regular bits initial content
    INIT 00 : bit vector(255 downto 0) :=
INIT_01 : bit_vector(255 downto 0) :=
INIT_02 : bit_vector(255 downto 0) :=
... (cut)
    INIT_3E : bit_vector(255 downto 0) :=
INIT_3F : bit_vector(255 downto 0) :=
);
-- pragma translate_on
 port (
    DI
         : in std_logic_vector (31 downto 0);
    DIP
         : in std_logic_vector (3 downto 0);
    ADDR : in std_logic_vector (8 downto 0);
    ΕN
         : in STD LOGIC;
    WE
        : in STD_LOGIC;
    SSR
        : in STD_LOGIC;
    CLK
       : in STD LOGIC;
        : out std_logic_vector (31 downto 0);
    DO
    DOP
         : out std_logic_vector (3 downto 0)
);
end component;
- -
-- Attribute Declarations:
attribute WRITE_MODE : string;
attribute INIT: string;
attribute SRVAL: string;
attribute INITP 00: string;
attribute INITP 01: string;
attribute INITP 02: string;
attribute INITP 03: string;
attribute INITP 04: string;
attribute INITP_05: string;
attribute INITP 06: string;
```

attribute INITP 07: string;
```
attribute INIT_00: string;
attribute INIT 01: string;
attribute INIT 02: string;
... (cut)
attribute INIT_3E: string;
attribute INIT 3F: string;
- -
-- Attribute "Read during Write mode" = WRITE FIRST(default) /
READ FIRST/ NO CHANGE
attribute WRITE MODE of U RAMB16 S36: label is "READ FIRST";
attribute INIT of U RAMB16 S36: label is "000000000";
attribute SRVAL of U RAMB16 S36: label is "012345678";
-- RAMB16 memory initialization for Alliance
-- Default value is "0" / Partial initialization strings are padded
-- with zeros to the left
attribute INITP 00 of U RAMB16 S36: label is
attribute INITP 01 of U RAMB16 S36: label is
attribute INITP 02 of U RAMB16 S36: label is
attribute INITP 03 of U_RAMB16_S36: label is
attribute INITP 04 of U RAMB16 S36: label is
attribute INITP 05 of U RAMB16 S36: label is
attribute INITP_06 of U_RAMB16_S36: label is
attribute INITP 07 of U RAMB16 S36: label is
_ _
attribute INIT 00 of U RAMB16 S36: label is
attribute INIT_01 of U_RAMB16_S36: label is
attribute INIT_02 of U_RAMB16_S36: label is
... (cut)
attribute INIT_3E of U_RAMB16_S36: label is
attribute INIT 3F of U RAMB16 S36: label is
- -
- -
-- Signal Declarations:
- -
-- signal VCC : std logic;
-- signal GND : std_logic;
signal CLK BUFG: std logic;
signal INV SET RESET : std logic;
- -
begin
-- VCC <= '1';
-- GND <= '0';
-- Instantiate the clock Buffer
U BUFG: BUFG
 port map (
 I => CLK,
 O => CLK BUFG
```

```
);
_ _
-- Use of the free inverter on SSR pin
INV_SET_RESET <= NOT SET_RESET;</pre>
-- Block SelectRAM Instantiation
U RAMB16 S36: RAMB16 S36
 port map (
             => DATA IN (31 downto 0), -- insert 32 bits data-in bus
       DI
(<31 downto 0>)
           => DATA IN (35 downto 32), -- insert 4 bits parity data-
       DIP
in bus (or <35 downto 32>)
     ADDR => ADDRESS (8 downto 0), -- insert 9 bits address bus
       EN
             => ENABLE, -- insert enable signal
             => WRITE EN, -- insert write enable signal
       WE
       SSR
             => INV_SET_RESET, -- insert set/reset signal
       CLK
            => CLK_BUFG, -- insert clock signal
           => DATA_OUT (31 downto 0), -- insert 32 bits data-out bus
      DO
(<31 \text{ downto } 0>)
      DOP
            => DATA_OUT (35 downto 32) -- insert 4 bits parity data-
out bus (or <35 downto 32>)
);
_ _
end XC2V_RAMB_1_PORT_arch;
                          _____
```

#### Verilog Template

// Module: XC2V RAMB 1 PORT // Description: 18Kb Block SelectRAM-II example // Single Port 512 x 36 bits // Use template "SelectRAM A36.v" 11 // Device: Virtex-II Family //-----module XC2V\_RAMB\_1\_PORT (CLK, SET\_RESET, ENABLE, WRITE\_EN, ADDRESS, DATA\_IN, DATA\_OUT); input CLK, SET\_RESET, ENABLE, WRITE\_EN; input [35:0] DATA\_IN; input [8:0] ADDRESS; output [35:0] DATA\_OUT; wire CLK\_BUFG, INV\_SET\_RESET; //Use of the free inverter on SSR pin assign INV\_SET\_RESET = ~SET\_RESET; // initialize block ram for simulation // synopsys translate\_off defparam //"Read during Write" attribute for functional simulation U RAMB16 S36.WRITE MODE = "READ FIRST", //WRITE FIRST(default)/ READ FIRST/ NO CHANGE //Output value after configuration U RAMB16 S36.INIT = 36'h00000000, //Output value if SSR active U\_RAMB16\_S36.SRVAL = 36'h012345678,

# XILINX<sup>®</sup>

```
//Plus bits initial content
U RAMB16 S36.INITP 00 =
U RAMB16 S36.INITP 01 =
U RAMB16 S36.INITP 02 =
U RAMB16 S36.INITP 03 =
U RAMB16 S36.INITP 04 =
U RAMB16 S36.INITP 05 =
U RAMB16 S36.INITP 06 =
U RAMB16 S36.INITP 07 =
//Regular bits initial content
U RAMB16 S36.INIT 00 =
U RAMB16 S36.INIT 01 =
U RAMB16 S36.INIT 02 =
...<cut>
U RAMB16 S36.INIT 3E =
U RAMB16 S36.INIT 3F =
// synopsys translate on
//Instantiate the clock Buffer
BUFG U BUFG ( .I (CLK), .O (CLK BUFG));
//Block SelectRAM Instantiation
RAMB16 S36 U RAMB16 S36 ( .DI(DATA IN[31:0]),
   .DIP(DATA_IN-PARITY[35:32]),
   .ADDR (ADDRESS) ,
   .EN(ENABLE),
   .WE(WRITE EN),
   .SSR(INV_SET_RESET),
   .CLK(CLK BUFG),
   .DO(DATA_OUT[31:0]),
   .DOP(DATA OUT-PARITY[35:32]));
// synthesis attribute declarations
/* synopsys attribute
WRITE_MODE "READ_FIRST"
INIT "00000000"
SRVAL "012345678"
INITP 00
INITP 01
INITP 02
INITP 03
```

# 

INITP\_04

INIT\_00

INIT\_3E

endmodule

# **Using Distributed SelectRAM Memory**

# Introduction

In addition to 18Kb SelectRAM blocks, Virtex-II devices feature distributed SelectRAM modules. Each function generator or LUT of a CLB resource can implement a 16 x 1-bit synchronous RAM resource. Distributed SelectRAM memory writes synchronously and reads asynchronously. However, a synchronous read can be implemented using the register that is available in the same slice. This 16 x 1-bit RAM is cascadable for a deeper and/or wider memory implementation, with a minimal timing penalty incurred through specialized logic resources.

Distributed SelectRAM modules up to a size of 128 x 1 are available as primitives. Two 16 x 1 RAM resources can be combined to form a dual-port 16 x 1 RAM with one dedicated read/write port and a second read-only port. One port writes into both 16 x1 RAMs simultaneously, but the second port reads independently.

This section provides generic VHDL and Verilog reference code examples implementing *n*-bit-wide single-port and dual-port distributed SelectRAM memory.

Distributed SelectRAM memory enables many high-speed applications that require relatively small embedded RAM blocks, such as FIFOs, which are close to the logic that uses them.

Virtex-II Distributed SelectRAM memories can be generated using the CORE Generator Distributed Memory module (V2.0 or later). The user can also generate Distributed RAMbased Asynchronous and Synchronous FIFOs using the CORE Generator.

### Single-Port and Dual-Port RAM

#### Data Flow

Distributed SelectRAM memory supports the following:

- Single-port RAM with synchronous write and asynchronous read
- Dual-port RAM with one synchronous write and two asynchronous read ports

As illustrated in the Figure 2-51, the dual port has one read/write port and an independent read port.



ug002\_c2\_001\_061400

Figure 2-51: Single-Port and Dual-Port Distributed SelectRAM

Any read/write operation can occur simultaneously with and independently of a read operation on the other port.

#### Write Operations

The write operation is a single clock-edge operation, with a write enable that is active High by default. When the write enable is Low, no data is written into the RAM. When the write enable is High, the clock edge latches the write address and writes the data on D into the RAM.

#### **Read Operation**

The read operation is a combinatorial operation. The address port (single or dual port) is asynchronous with an access time equivalent to the logic delay.

#### **Read During Write**

When new data is synchronously written, the output reflects the data in the memory cell addressed (transparent mode). The timing diagram in Figure 2-52 illustrates a write operation, with the previous data read on the output port, before the clock edge and then the new data.



*Figure 2-52:* Write Timing Diagram

### Characteristics

- A write operation requires only one clock edge.
- A read operation requires only the logic access time.
- Outputs are asynchronous and dependent only on the logic delay.
- Data and address inputs are latched with the write clock and have a setup-to-clock timing specification. There is no hold time requirement.
- For dual-port RAM, one address is the write and read address, the other address is an independent read address.

2

# **Library Primitives**

Seven library primitives from 16 x 1-bit to 128 x 1-bit are available. Four primitives are single-port RAM and three primitives are True Dual-Port RAM, as shown in Table 2-16.

| Primitive | RAM Size | Туре        | Address Inputs             |
|-----------|----------|-------------|----------------------------|
| RAM16X1S  | 16 bits  | single-port | A3, A2, A1, A0             |
| RAM32X1S  | 32 bits  | single-port | A4, A3, A2, A1, A0         |
| RAM64X1S  | 64 bits  | single-port | A5, A4, A3, A2, A1, A0     |
| RAM128X1S | 128 bits | single-port | A6, A5, A4, A3, A2, A1, A0 |
| RAM16X1D  | 16 bits  | dual-port   | A3, A2, A1, A0             |
| RAM32X1D  | 32 bits  | dual-port   | A4, A3, A2, A1, A0         |
| RAM64X1D  | 64 bits  | dual-port   | A5, A4, A3, A2, A1, A0     |

 Table 2-16:
 Single-Port and Dual-Port Distributed SelectRAM

The input and output data are 1-bit wide. However, several distributed SelectRAM memories can be used to implement wide memory blocks.

Figure 2-53 shows generic single-port and dual-port distributed SelectRAM primitives. The A and DPRA signals are address busses.





As shown in Table 2-17, wider library primitives are available for 2-bit, 4-bit, and 8-bit RAM.

| Table 2-17: | Wider | Library | <b>Primitives</b> |
|-------------|-------|---------|-------------------|
|-------------|-------|---------|-------------------|

| Primitive | RAM Size   | Data Inputs    | Address Inputs         | Data Outputs   |
|-----------|------------|----------------|------------------------|----------------|
| RAM16x2S  | 16 x 2-bit | D1, D0         | A3, A2, A1, A0         | O1, O0         |
| RAM32X2S  | 32 x 2-bit | D1, D0         | A4, A3, A2, A1, A0     | O1, O0         |
| RAM64X2S  | 64 x 2-bit | D1, D0         | A5, A4, A3, A2, A1, A0 | O1, O0         |
| RAM16X4S  | 16 x 4-bit | D3, D2, D1, D0 | A3, A2, A1, A0         | O3, O2, O1, O0 |
| RAM32X4S  | 32 x 4-bit | D3, D2, D1, D0 | A4,A3, A2, A1, A0      | O3, O2, O1, O0 |
| RAM16X8S  | 16 x 8-bit | D <7:0>        | A3, A2, A1, A0         | O <7:0>        |
| RAM32X8S  | 32 x 8-bit | D <7:0>        | A4,A3, A2, A1, A0      | O <7:0>        |

# VHDL and Verilog Instantiation

VHDL and Verilog instantiations templates are available as examples (see "VHDL and Verilog Templates" on page 120).

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

The SelectRAM\_xS templates (with x = 16, 32, 64, or 128) are single-port modules and instantiate the corresponding RAMxX1S primitive.

SelectRAM\_xD templates (with x = 16, 32, or 64) are dual-port modules and instantiate the corresponding RAMxX1D primitive.

## **Ports Signals**

Each distributed SelectRAM port operates independently of the other while reading the same set of memory cells.

#### Clock - WCLK

The clock is used for the synchronous write. The data and the address input pins have setup time referenced to the WCLK pin.

#### Enable - WE

The enable pin affects the write functionality of the port. An inactive Write Enable prevents any writing to memory cells. An active Write Enable causes the clock edge to write the data input signal to the memory location pointed to by the address inputs.

### Address - A0, A1, A2, A3 (A4, A5, A6)

The address inputs select the memory cells for read or write. The width of the port determines the required address inputs. Note that the address inputs are not a bus in VHDL or Verilog instantiations.

#### Data In - D

The data input provides the new data value to be written into the RAM.

### Data Out - O, SPO, and DPO

The data out O (Single-Port or SPO) and DPO (Dual-Port) reflects the contents of the memory cells referenced by the address inputs. Following an active write clock edge, the data out (O or SPO) reflects the newly written data.

#### **Inverting Control Pins**

The two control pins (WCLK and WE) each have an individual inversion option. Any control signal, including the clock, can be active at 0 (negative edge for the clock) or at 1 (positive edge for the clock) without requiring other logic resources.

#### GSR

The global set/reset (GSR) signal does not affect distributed SelectRAM modules.

### Attributes

#### **Content Initialization - INIT**

With the INIT attributes, users can define the initial memory contents after configuration. By default distributed SelectRAM memory is initialized with all zeros during the device configuration sequence. The initialization attribute INIT represents the specified memory contents. Each INIT is a hex-encoded bit vector. Table 2-18 shows the length of the INIT attribute for each primitive.

| Primitive | Template       | INIT Attribute Length |
|-----------|----------------|-----------------------|
| RAM16X1S  | SelectRAM_16S  | 4 digits              |
| RAM32X1S  | SelectRAM_32S  | 8 digits              |
| RAM64X1S  | SelectRAM_64S  | 16 digits             |
| RAM128X1S | SelectRAM_128S | 32 digits             |
| RAM16X1D  | SelectRAM_16S  | 4 digits              |
| RAM32X1D  | SelectRAM_32S  | 8 digits              |
| RAM64X1D  | SelectRAM_64S  | 16 digits             |

Table 2-18: INIT Attributes Length

# Initialization in VHDL or Verilog Codes

Distributed SelectRAM memory structures can be initialized in VHDL or Verilog code for both synthesis and simulation. For synthesis, the attributes are attached to the distributed SelectRAM instantiation and are copied in the EDIF output file to be compiled by Xilinx Alliance Series<sup>TM</sup> tools. The VHDL code simulation uses a generic parameter to pass the attributes. The Verilog code simulation uses a defparam parameter to pass the attributes.

The distributed SelectRAM instantiation templates (in VHDL and Verilog) illustrate these techniques (see "VHDL and Verilog Templates" on page 120).

# **Location Constraints**

The CLB has four slices S0, S1, S2 and S3. As an example, in the bottom left CLB, the slices have the coordinates shown below: S

| Slice S3 | Slice S2 | Slice S1 | Slice S0 |
|----------|----------|----------|----------|
| X1Y1     | X1Y0     | X0Y1     | X0Y0     |

Distributed SelectRAM instances can have LOC properties attached to them to constrain placement. The RAM16X1S primitive fits in any LUT of slices S0 or S1.

For example, the instance U\_RAM16 is placed in slice X0Y0 with the following LOC properties:

INST "U\_RAM16" LOC = "SLICE\_X0Y0";

The RAM16X1D primitive occupies half of two slices, as shown in Figure 2-54. The first slice (output SPO) implements the read/write port with the same address A[3:0] for read

and write. The second slice implements the second read port with the address DPRA[3:0] and is written simultaneously with the first slice to the address A[3:0].



Figure 2-54: RAM16X1D Placement

In the same CLB module, the dual-port RAM16X1D either occupies half of slices S0 (X0Y0) and S2 (X1Y0), or half of slices S1 (X0Y1) and S3 (X1Y1).

If a dual-port 16 x 2-bit module is built, the two RAM16X1D primitives occupy two slices, as long as they share the same clock and write enable, as illustrated in Figure 2-55.



Figure 2-55: Two RAM16X1D Placement

A RAM32X1S primitive fits in one slice, as shown in Figure 2-56.



Figure 2-56: RAM32X1S Placement

Following the same rules, a RAM32X1D primitive fits in two slices, with one slice implementing the read/write port and the second slice implementing the second read port.

The RAM64X1S primitive occupies two slices and the RAM64X1D primitive occupies four slices (one CLB element), with two slices implementing the read/write port and two other slices implementing the second read port. The RAM64X1S read path is built on the MUXF5 and MUXF6 multiplexers.

The RAM128X1S primitive occupies four slices, equivalent to one CLB element.

Distributed SelectRAM placement locations use the slice location naming convention, allowing LOC properties to transfer easily from array to array.

# **Applications**

#### **Creating Larger RAM Structures**

The memory compiler program generates wider and/or deeper memory structures using distributed SelectRAM instances. Along with an EDIF file for inclusion in a design, this program produces VHDL and Verilog instantiation templates and simulation models.

 Table 2-19 shows the generic VHDL and Verilog distributed SelectRAM examples

 provided to implement *n*-bit-wide memories.

 Table 2-19:
 VHDL and Verilog Submodules

| Submodules           | Primitive | Size                      | Туре        |
|----------------------|-----------|---------------------------|-------------|
| XC2V_RAM16XN_S_SUBM  | RAM16X1S  | 16 words x <i>n</i> -bit  | single-port |
| XC2V_RAM32XN_S_SUBM  | RAM32X1S  | 32 words x <i>n</i> -bit  | single-port |
| XC2V_RAM64XN_S_SUBM  | RAM64X1S  | 64 words x <i>n</i> -bit  | single-port |
| XC2V_RAM128XN_S_SUBM | RAM128X1S | 128 words x <i>n</i> -bit | single-port |
| XC2V_RAM16XN_D_SUBM  | RAM16X1D  | 16 words x <i>n</i> -bit  | dual-port   |
| XC2V_RAM32XN_D_SUBM  | RAM32X1D  | 32 words x <i>n</i> -bit  | dual-port   |
| XC2V_RAM64XN_D_SUBM  | RAM64X1D  | 64 words x <i>n</i> -bit  | dual-port   |

By using the read/write port for the write address and the second read port for the read address, a FIFO that can read and write simultaneously is easily generated. Simultaneous access doubles the effective throughput of the memory.

# VHDL and Verilog Templates

VHDL and Verilog templates are available for all single-port and dual-port primitives. The number in each template indicates the number of bits (for example, SelectRAM\_16S is the template for the 16 x 1-bit RAM); S indicates single-port, and D indicates dual-port.

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

The following are single-port templates:

- SelectRAM\_16S
- SelectRAM\_32S
- SelectRAM\_64S
- SelectRAM\_128S

The following are dual-port templates:

- SelectRAM\_16D
- SelectRAM\_32D
- SelectRAM\_64D

Templates for the SelectRAM\_16S module are provided in VHDL and Verilog code as examples.

# 

#### **VHDL** Template

```
- -
-- Module: SelectRAM_16S
- -
-- Description: VHDL instantiation template
            Distributed SelectRAM
- -
- -
            Single Port 16 x 1
- -
            can be used also for RAM16X1S_1
- -
-- Device: Virtex-II Family
- -
_____
_ _
-- Components Declarations:
- -
component RAM16X1S
-- pragma translate off
  generic (
-- RAM initialization ("0" by default) for functional simulation:
       INIT : bit_vector := X"0000"
      );
-- pragma translate on
 port (
      D
          : in std_logic;
       WE : in std logic;
       WCLK : in std logic;
       A0 : in std logic;
      A1
         : in std logic;
      A2 : in std logic;
       A3 : in std logic;
          : out std logic
       0
      );
end component;
- -
_____
- -
-- Architecture section:
- -
-- Attributes for RAM initialization ("0" by default):
attribute INIT: string;
attribute INIT of U_RAM16X1S: label is "0000";
-- Distributed SelectRAM Instantiation
U_RAM16X1S: RAM16X1S
 port map (
  D
      => , -- insert input signal
  WE
        => , -- insert Write Enable signal
  WCLK => , -- insert Write Clock signal
  AO
        => , -- insert Address 0 signal
  A1
        => , -- insert Address 1 signal
  A2
        => , -- insert Address 2 signal
  A3
        => , -- insert Address 3 signal
  0
        => -- insert output signal
 );
     _____
```

Verilog Template

```
11
// Module: SelectRAM 16S
11
// Description: Verilog instantiation template
             Distributed SelectRAM
11
//
             Single Port 16 x 1
11
             can be used also for RAM16X1S_1
11
// Device: Virtex-II Family
11
          -----
//---
11
11
// Syntax for Synopsys FPGA Express
// synopsys translate off
 defparam
      //RAM initialization ("0" by default) for functional simulation:
 U RAM16X1S.INIT = 16'h0000;
// synopsys translate on
//Distributed SelectRAM Instantiation
RAM16X1S U_RAM16X1S ( .D(), // insert input signal
                        // insert Write Enable signal
               .WE(),
                        // insert Write Clock signal
               .WCLK(),
               .AO(), // insert Address 0 signal
                         // insert Address 1 signal
               .A1(),
                         // insert Address 2 signal
               .A2(),
               .A3(),
                         // insert Address 3 signal
               .0()
                         // insert output signal
                );
// synthesis attribute declarations
 /* synopsys attribute
 INIT "0000"
```

\*/

# Using Look-Up Tables as Shift Registers (SRLs)

## Introduction

Virtex-II can configure any look-up table (LUT) as a 16-bit shift register without using the flip-flops available in each slice. Shift-in operations are synchronous with the clock, and output length is dynamically selectable. A separate dedicated output allows the cascading of any number of 16-bit shift registers to create whatever size shift register is needed. Each CLB resource can be configured using the 8 LUTs as a 128-bit shift register.

This section provides generic VHDL and Verilog submodules and reference code examples for implementing from 16-bit up to 128-bit shift registers. These submodules are built from 16-bit shift-register primitives and from dedicated MUXF5, MUXF6, MUXF7, and MUXF8 multiplexers.

These shift registers enable the development of efficient designs for applications that require delay or latency compensation. Shift registers are also useful in synchronous FIFO and content-addressable memory (CAM) designs. To quickly generate a Virtex-II shift register without using flip-flops (i.e., using the SRL16 element(s)), use the CORE Generator RAM-based Shift Register module.

# Shift Register Operations

#### Data Flow

Each shift register (SRL16 primitive) supports:

- Synchronous shift-in
- Asynchronous 1-bit output when the address is changed dynamically
- Synchronous shift-out when the address is fixed

In addition, cascadable shift registers (SRLC16) support synchronous shift-out output of the last (16th) bit. This output has a dedicated connection to the input of the next SRLC16 inside the CLB resource. Two primitives are illustrated in Figure 2-57.



UG002\_C2\_007\_061400



www.xilinx.com 1-800-255-7778 2

#### Shift Operation

The shift operation is a single clock-edge operation, with an active High clock enable feature. When enable is High, the input (D) is loaded into the first bit of the shift register, and each bit is shifted to the next highest bit position. In a cascadable shift register configuration (such as SRLC16), the last bit is shifted out on the Q15 output.

The bit selected by the 4-bit address appears on the Q output.

#### **Dynamic Read Operation**

The Q output is determined by the 4-bit address. Each time a new address is applied to the 4-input address pins, the new bit position value is available on the Q output after the time delay to access the LUT. This operation is asynchronous and independent of the clock and clock enable signals.

Figure 2-58 illustrates the shift and dynamic read operations.



UG002\_C2\_011\_061300

Figure 2-58: Shift- and Dynamic-Length Timing Diagrams

#### Static Read Operation

If the 4-bit address is fixed, the Q output always uses the same bit position. This mode implements any shift register length up 1 to 16 bits in one LUT. Shift register length is (N+1) where N is the input address.

The Q output changes synchronously with each shift operation. The previous bit is shifted to the next position and appears on the Q output.

2

### **Characteristics**

- A shift operation requires one clock edge.
- Dynamic-length read operations are asynchronous (Q output).
- Static-length read operations are synchronous (Q output).
- The data input has a setup-to-clock timing specification.
- In a cascadable configuration, the Q15 output always contains the last bit value.
- The Q15 output changes synchronously after each shift operation.

# Library Primitives and Submodules

Eight library primitives are available that offer optional clock enable (CE), inverted clock (CLK) and cascadable output (Q15) combinations.

Table 2-20 lists all of the available primitives for synthesis and simulation.

Control Output **Primitive Address Inputs** Length SRL16 CLK 16 bits A3, A2, A1, A0 Q SRL16E 16 bits CLK, CE A3,A2,A1,A0 Q CLK SRL16\_1 16 bits A3,A2,A1,A0 Q CLK, CE SRL16E 1 16 bits A3, A2, A1, A0 Q CLK SRLC16 16 bits A3,A2,A1,A0 Q,Q15 SRLC16E 16 bits CLK, CE A3, A2, A1, A0 Q, Q15 CLK SRLC16\_1 16 bits A3, A2, A1, A0 Q, Q15 CLK, CE SRLC16E\_1 16 bits Q, Q15 A3, A2, A1, A0

Table 2-20: Shift Register Primitives

In addition to the 16-bit primitives, three submodules that implement 32-bit, 64-bit, and 128-bit cascadable shift registers are provided in VHDL and Verilog code. Table 2-21 lists available submodules.

Table 2-21: Shift Register Submodules

| Submodule     | Length   | Control | Address Inputs          | Output  |
|---------------|----------|---------|-------------------------|---------|
| SRLC32E_SUBM  | 32 bits  | CLK, CE | A4,A3,A2,A1,A0          | Q, Q31  |
| SRLC64E_SUBM  | 64 bits  | CLK, CE | A5, A4, A3,A2,A1,A0     | Q, Q63  |
| SRLC128E_SUBM | 128 bits | CLK, CE | A6, A5, A4, A3,A2,A1,A0 | Q, Q127 |

The submodules are based on SRLC16E primitives, which are associated with dedicated multiplexers (MUXF5, MUXF6, and so forth). This implementation allows a fast static- and dynamic-length mode, even for very large shift registers.

Figure 2-59 represents the cascadable shift registers (32-bit and 64-bit) implemented by the submodules in Table 2-21.

Add. \_\_\_\_\_

D





UG002\_C2\_008\_061300

#### Figure 2-59: Shift-Register Submodules (32-bit, 64-bit)

A 128-bit shift register is built on the same scheme and uses MUXF7 (address input A6).

All clock enable (CE) and clock (CLK) inputs are connected to one global clock enable and one clock signal per submodule. If a global static- or dynamic-length mode is not required, the SRLC16E primitive can be cascaded without multiplexers.

www.xilinx.com

1-800-255-7778

# Initialization in VHDL and Verilog Code

A shift register can be initialized in VHDL or Verilog code for both synthesis and simulation. For synthesis, the attribute is attached to the 16-bit shift register instantiation and is copied in the EDIF output file to be compiled by Xilinx Alliance Series tools. The VHDL code simulation uses a generic parameter to pass the attributes. The Verilog code simulation uses a defparam parameter to pass the attributes.

The V2\_SRL16E shift register instantiation code examples (in VHDL and Verilog) illustrate these techniques (see "VHDL and Verilog Templates" on page 131). V2\_SRL16E.vhd and .v files are not a part of the documentation.

# **Port Signals**

#### Clock - CLK

Either the rising edge or the falling edge of the clock is used for the synchronous shift-in. The data and clock enable input pins have set-up times referenced to the chosen edge of CLK.

#### Data In - D

The data input provides new data (one bit) to be shifted into the shift register.

### Clock Enable - CE (optional)

The clock enable pin affects shift functionality. An inactive clock enable pin does not shift data into the shift register and does not write new data. Activating the clock enable allows the data in (D) to be written to the first location and all data to be shifted by one location. When available, new data appears on output pins (Q) and the cascadable output pin (Q15).

### Address - A0, A1, A2, A3

Address inputs select the bit (range 0 to 15) to be read. The n<sup>th</sup> bit is available on the output pin (Q). Address inputs have no effect on the cascadable output pin (Q15), which is always the last bit of the shift register (bit 15).

#### Data Out - Q

The data output Q provides the data value (1 bit) selected by the address inputs.

#### Data Out - Q15 (optional)

The data output Q15 provides the last bit value of the 16-bit shift register. New data becomes available after each shift-in operation.

#### **Inverting Control Pins**

The two control pins (CLK, CE) have an individual inversion option. The default is the rising clock edge and active High clock enable.

#### GSR

The global set/reset (GSR) signal has no impact on shift registers.

### Attributes

#### **Content Initialization - INIT**

The INIT attribute defines the initial shift register contents. The INIT attribute is a hexencoded bit vector with four digits (0000). The left-most hexadecimal digit is the most significant bit. By default the shift register is initialized with all zeros during the device configuration sequence, but any other configuration value can be specified.

# Location Constraints

Each CLB resource has four slices: S0, S1, S2, and S3. As an example, in the bottom left CLB resource, each slice has the coordinates shown in Table 2-22.

Table 2-22: Slice Coordinates in the Bottom-Left CLB Resource

| Slice S3 | Slice S2 | Slice S1 | Slice S0 |
|----------|----------|----------|----------|
| X1Y1     | X1Y0     | X0Y1     | X0Y0     |

To constrain placement, shift register instances can have LOC properties attached to them. Each 16-bit shift register fits in one LUT.

A 32-bit shift register in static or dynamic address mode fits in one slice (two LUTs and one MUXF5). This shift register can be placed in any slice.

A 64-bit shift register in static or dynamic address mode fits in two slices. These slices are either S0 and S1, or S2 and S3. Figure 2-60 illustrates the position of the four slices in a CLB resource.

The dedicated CLB shift chain runs from the top slice to the bottom slice. The data input pin must either be in slice S1 or in S3. The address selected as the output pin (Q) is the MUXF6 output.

A 128-bit shift register in static or dynamic address mode fits in a four-slice CLB resource. The data input pin has to be in slice S3. The address selected as the output pin (Q) is the MUXF7 output.



Figure 2-60: Shift Register Placement

# Fully Synchronous Shift Registers

All shift-register primitives and submodules do not use the register(s) available in the same slice(s). To implement a fully synchronous read and write shift register, output pin Q must be connected to a flip-flop. Both the shift register and the flip-flop share the same clock, as shown in Figure 2-61.



Figure 2-61: Fully Synchronous Shift Register

This configuration provides a better timing solution and simplifies the design. Because the flip-flop must be considered to be the last register in the shift-register chain, the static or dynamic address should point to the desired length minus one. If needed, the cascadable output can also be registered in a flip-flop.

# Static-Length Shift Registers

The cascadable16-bit shift register implements any static length mode shift register without the dedicated multiplexers (MUXF5, MUXF6,...). Figure 2-62 illustrates a 40-bit shift register. Only the last SRLC16E primitive needs to have its address inputs tied to "0111". Alternatively, shift register length can be limited to 39 bits (address tied to "0110") and a flip-flop can be used as the last register. (In an SRLC16E primitive, the shift register length is the address input + 1.)





www.xilinx.com

1-800-255-7778

## VHDL and Verilog Instantiation

VHDL and Verilog instantiation templates are available for all primitives and submodules.

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

The ShiftRegister\_C\_x (with x = 16, 32, 64, 128, or 256) templates are cascadable modules and instantiate the corresponding SRLCxE primitive (16) or submodule (32, 64, 128, or 256).

The ShiftRegister\_16 template can be used to instantiate an SRL16 primitive.

#### VHDL and Verilog Templates

In template names, the number indicates the number of bits (for example, SHIFT\_REGISTER\_16 is the template for the 16-bit shift register) and the "C" extension means the template is cascadable.

The following are templates for primitives:

- SHIFT\_REGISTER\_16
- SHIFT\_REGISTER\_16\_C

The following are templates for submodules:

- SHIFT\_REGISTER\_32\_C (submodule: SRLC32E\_SUBM)
- SHIFT\_REGISTER\_64\_C (submodule: SRLC64E\_SUBM)
- SHIFT\_REGISTER\_128\_C (submodule: SRLC128E\_SUBM)

The corresponding submodules have to be synthesized with the design.

Templates for the SHIFT\_REGISTER\_16\_C module are provided in VHDL and Verilog code as an example.

VHDL Template:

```
-- Module: SHIFT REGISTER C 16
-- Description: VHDL instantiation template
-- CASCADABLE 16-bit shift register with enable (SRLC16E)
-- Device: Virtex-II Family
-----
                          _____
-- Components Declarations:
- -
component SRLC16E
-- pragma translate_off
 generic (
-- Shift Register initialization ("0" by default) for functional
simulation:
       INIT : bit vector := X"0000"
 );
-- pragma translate_on
 port (
       D : in std logic;
       CE : in std logic;
       CLK : in std logic;
       A0 : in std logic;
       A1 : in std logic;
       A2 : in std logic;
       A3 : in std logic;
       Q
           : out std logic;
       Q15 : out std logic
 );
end component;
```

```
-- Architecture Section:
- -
-- Attributes for Shift Register initialization ("0" by default):
attribute INIT: string;
attribute INIT of U_SRLC16E: label is "0000";
- -
-- ShiftRegister Instantiation
U SRLC16E: SRLC16E
 port map (
        => , -- insert input signal
 D
        => , -- insert Clock Enable signal (optional)
 CE
        => , -- insert Clock signal
 CLK
        => , -- insert Address 0 signal
 A0
        => , -- insert Address 1 signal
 Α1
 A2
        => , -- insert Address 2 signal
        => , -- insert Address 3 signal
 A3
        => , -- insert output signal
 Q
 Q15
        => -- insert cascadable output signal
 );
```

#### Verilog Template:

```
// Module: SHIFT REGISTER 16
// Description: Verilog instantiation template
// Cascadable 16-bit Shift Register with Clock Enable (SRLC16E)
// Device: Virtex-II Family
//-----
// Syntax for Synopsys FPGA Express
// synopsys translate_off
 defparam
//Shift Register initialization ("0" by default) for functional
simulation:
 U SRLC16E.INIT = 16'h0000;
// synopsys translate on
//SelectShiftRegister-II Instantiation
  SRLC16E U SRLC16E ( .D(),
                       .AO(),
                       .A1(),
                       .A2(),
                      .A3(),
                       .CLK(),
                       .CE(),
                       .Q(),
                      .Q15()
         );
// synthesis attribute declarations
 /* synopsys attribute
 INIT "0000"
 */
```

# **Designing Large Multiplexers**

# Introduction

Virtex-II slices contain dedicated two-input multiplexers (one MUXF5 and one MUXFX per slice). These multiplexers combine the 4-input LUT outputs or the outputs of other multiplexers. Using the multiplexers MUXF5, MUXF6, MUXF7 and MUXF8 allows to combine 2, 4, 8 and 16 LUTs. Specific routing resources are associated with these 2-input multiplexers to guarantee a fast implementation of any combinatorial function built upon LUTs and MUXFX.

The combination of the LUTs and the MUXFX offers an unique solution to the design of wide-input functions. This section illustrates the implementation of large multiplexers up to 32:1. Any Virtex-II slice can implement a 4:1 multiplexer, any CLB can implement a 16:1 multiplexer, and 2 CLBs can implement a 32:1 multiplexer. Such multiplexers are just one example of wide-input combinatorial function taking advantage of the MUXFX feature. Many other logic functions can be mapped in the LUT and MUXFX features.

This section provides generic VHDL and Verilog reference code implementing multiplexers. These submodules are built from LUTs and the dedicated MUXF5, MUXF6, MUXF7, and MUXF8 multiplexers. To automatically generate large multiplexers using these dedicated elements, use the CORE Generator Bit Multiplexer and Bus Multiplexer modules.

For applications like comparators, encoder-decoders or "case" statement in VHDL or Verilog, these resources offer an optimal solution.

# Virtex-II CLB Resources

#### Slice Multiplexers

Each Virtex-II slice has a MUXF5 to combine the outputs of the 2 LUTs and an extra MUXFX. Figure 2-63 illustrates a combinatorial function with up to 9 inputs in one slice.



Figure 2-63: LUTs and MUXF5 in a Slice

Each Virtex-II CLB contains 4 slices. The second MUXFX implements a MUXF6, MUXF7 or MUXF8 according to the position of the slice in the CLB. These MUXFX are designed to allow LUTs combination up to 16 LUTs in two adjacent CLBs.

Figure 2-64 shows the relative position of the slices in the CLB.



UG002\_C2\_017\_081600

*Figure 2-64:* **Slice Positions in a CLB** 

Slices S0 and S2 have a MUXF6, designed to combine the outputs of two MUXF5 resources. Figure 2-65 illustrates a combinatorial function up to 18 inputs in the slices S0 and S1, or in the slices S2 and S3.







The slice S1 has a MUXF7, designed to combine the outputs of two MUXF6. Figure 2-66 illustrates a combinatorial function up to 35 inputs in a Virtex-II CLB.

UG002\_C2\_019\_081600

Figure 2-66: LUTs and (MUXF5, MUXF6, and MUXF7) in One CLB

www.xilinx.com 1-800-255-7778 The slice S3 of each CLB has a MUXF8. combinatorial functions of up to 68 inputs fit in two CLBs as shown in Figure 2-67. The outputs of two MUXF7 are combined through dedicated routing resources between two adjacent CLBs in a column.



UG002\_C2\_020\_081600



2

# Wide-Input Multiplexers

Each LUT can implement a 2:1 multiplexer. In each slice, the MUXF5 and two LUTs can implement a 4:1 multiplexer. As shown in Figure 2-68, the MUXF6 and two slices can implement a 8:1 multiplexer. The MUFXF7 and the four slices of any CLB can implement a 16:1 and the MUXF8 and two CLBs can implement a 32:1 multiplexer.



*Figure 2-68:* 8:1 and 16:1 Multiplexers

# Characteristics

- Implementation in one level of logic (LUT) and dedicated MUXFX
- Full combinatorial path

# Library Primitives and Submodules

Four library primitives are available that offer access to the dedicated MUXFX in each slice. In the example shown in Table 2-23, MUXF7 is available only in slice S1.

| Primitive | Slice          | Control | Input  | Output |
|-----------|----------------|---------|--------|--------|
| MUXF5     | S0, S1, S2, S3 | S       | I0, I1 | О      |
| MUXF6     | S0, S2         | S       | I0, I1 | 0      |
| MUXF7     | S1             | S       | I0, I1 | О      |
| MUXF8     | S3             | S       | I0, I1 | О      |

Table 2-23: MUXFX Resources

In addition to the primitives, five submodules that implement multiplexers from 2:1 to 32:1 are provided in VHDL and Verilog code. Synthesis tools can automatically infer the above primitives (MUXF5, MUXF6, MUXF7, and MUXF8); however, the submodules described in this section used instantiation of the new MUXFX to guarantee an optimized result. Table 2-24 lists available submodules:

Table 2-24: Available Submodules

| Submodule     | Multiplexer | Control       | Input        | Output |
|---------------|-------------|---------------|--------------|--------|
| MUX_2_1_SUBM  | 2:1         | SELECT_I      | DATA_I[1:0]  | DATA_O |
| MUX_4_1_SUBM  | 4:1         | SELECT_I[1:0] | DATA_I[3:0]  | DATA_O |
| MUX_8_1_SUBM  | 8:1         | SELECT_I[2:0] | DATA_I[8:0]  | DATA_O |
| MUX_16_1_SUBM | 16:1        | SELECT_I[3:0] | DATA_I[15:0] | DATA_O |
| MUX_32_1_SUBM | 32:1        | SELECT_I[4:0] | DATA_I[31:0] | DATA_O |

# **Port Signals**

### Data In - DATA\_I

The data input provides the data to be selected by the SELECT\_I signal(s).

#### Control In - SELECT\_I

The select input signal or bus determines the DATA\_I signal to be connected to the output DATA\_O. For example, the MUX\_4\_1\_SUBM multiplexer has a 2-bit SELECT\_I bus and a 4-bit DATA\_I bus. Table 2-25 shows the DATA\_I selected for each SELECT\_I value.

#### Table 2-25: Selected Inputs

| SELECT_I[1:0] | DATA_O    |
|---------------|-----------|
| 0 0           | DATA_I[0] |
| 0 1           | DATA_I[1] |
| 10            | DATA_I[2] |
| 11            | DATA_I[3] |

### Data Out - DATA\_O

The data output O provides the data value (1 bit) selected by the control inputs.

## Applications

Multiplexers are used in various applications. These are often inferred by synthesis tools when a "case" statement is used (see the example below). Comparators, encoder-decoders and wide-input combinatorial functions are optimized when they are based on one level of LUTs and dedicated MUXFX resources of the Virtex-II CLBs.

# VHDL and Verilog Instantiation

The primitives (MUXF5, MUXF6, and so forth) can be instantiated in VHDL or Verilog code, to design wide-input functions.

The submodules (MUX\_2\_1\_SUBM, MUX\_4\_1\_SUBM, and so forth) can be instantiated in VHDL or Verilog code to implement multiplexers. However the corresponding submodule must be added to the design directory as hierarchical submodule. For example, if a module is using the MUX\_16\_1\_SUBM, the MUX\_16\_1\_SUBM.vhd file (VHDL code) or MUX\_16\_1\_SUBM.v file (Verilog code) must be compiled with the design source code. The submodule code can also be "cut and pasted" into the designer source code.

#### VHDL and Verilog Submodules

VHDL and Verilog submodules are available to implement multiplexers up to 32:1. They illustrate how to design with the MUXFX resources. When synthesis infers the corresponding MUXFX resource(s), the VHDL or Verilog code is behavioral code ("case" statement). Otherwise, the equivalent "case" statement is provided in comments and the correct MUXFX are instantiated. However, most synthesis tools support the inference of all of the MUXFX. The following examples can be used as guidelines for designing other wide-input functions.

The following submodules are available:

- MUX\_2\_1\_SUBM (behavioral code)
- MUX\_4\_1\_SUBM
- MUX\_8\_1\_SUBM
- MUX\_16\_1\_SUBM
- MUX\_32\_1\_SUBM

The corresponding submodules have to be synthesized with the design

The submodule MUX\_16\_1\_SUBM in VHDL and Verilog are provided as example.

#### **VHDL** Template

| Module: MUX_16_1_SUBM<br>Description: Multiplexer 16:1<br><br>Device: Virtex-II Family                                                                                               |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <pre>library IEEE;<br/>use IEEE.std_logic_1164.all;</pre>                                                                                                                            |
| Syntax for Synopsys FPGA Express<br>pragma translate_off<br>library UNISIM;<br>use UNISIM.VCOMPONENTS.ALL;<br>pragma translate_on                                                    |
| <pre>entity MUX_16_1_SUBM is    port (         DATA_I: in std_logic_vector (15 downto 0);         SELECT_I: in std_logic_vector (3 downto 0);         DATA_0: out std_logic );</pre> |

```
end MUX_16_1_SUBM;
architecture MUX 16 1 SUBM arch of MUX 16 1 SUBM is
-- Component Declarations:
component MUXF7
   port (
   IO: in std logic;
   I1: in std_logic;
   S: in std logic;
   0: out std logic
   );
end component;
-- Signal Declarations:
signal DATA MSB : std logic;
signal DATA_LSB : std_logic;
begin
- -
-- If synthesis tools support MUXF7 :
--SELECT_PROCESS: process (SELECT_I, DATA I)
--begin
--case SELECT I is
-- when "0000" => DATA O <= DATA I (0);
-- when "0001" => DATA O <= DATA I (1);
-- when "0010" => DATA_O <= DATA_I (2);
-- when "0011" => DATA_O <= DATA_I (3);
-- when "0100" => DATA_O <= DATA_I (4);
-- when "0101" => DATA_O <= DATA_I (5);
-- when "0110" => DATA_O <= DATA_I (6);
-- when "0111" => DATA_O <= DATA_I (7);
-- when "1000" => DATA_O <= DATA_I (8);
-- when "1001" => DATA O <= DATA I (9);
-- when "1010" => DATA_O <= DATA_I (10);
-- when "1011" => DATA_O <= DATA_I (11);
-- when "1100" => DATA_O <= DATA_I (12);
-- when "1101" => DATA_O <= DATA_I (13);
-- when "1110" => DATA_O <= DATA_I (14);
-- when "1111" => DATA_O <= DATA_I (15);
-- when others => DATA O <= 'X';
--end case;
--end process SELECT_PROCESS;
- -
-- If synthesis tools DO NOT support MUXF7 :
SELECT_PROCESS_LSB: process (SELECT_I, DATA_I)
begin
 case SELECT_I (2 downto 0) is
   when "000" => DATA_LSB <= DATA_I (0);
   when "001" => DATA LSB <= DATA I (1);
   when "010" => DATA_LSB <= DATA_I (2);
   when "011" => DATA LSB <= DATA I (3);
   when "100" => DATA LSB <= DATA I (4);
   when "101" => DATA_LSB <= DATA_I (5);
   when "110" => DATA LSB <= DATA I (6);
   when "111" => DATA LSB <= DATA I (7);
   when others => DATA LSB <= 'X';
 end case;
end process SELECT PROCESS LSB;
SELECT_PROCESS_MSB: process (SELECT_I, DATA_I)
begin
 case SELECT I (2 downto 0) is
```

```
2
```

```
when "000" => DATA_MSB <= DATA_I (8);
   when "001" => DATA MSB <= DATA I (9);
   when "010" => DATA_MSB <= DATA_I (10);
   when "011" => DATA MSB <= DATA I (11);
   when "100" => DATA MSB <= DATA I (12);
   when "101" => DATA MSB <= DATA I (13);
   when "110" => DATA MSB <= DATA I (14);
   when "111" => DATA_MSB <= DATA_I (15);
   when others => DATA MSB <= 'X';
 end case;
end process SELECT_PROCESS_MSB;
- -
-- MUXF7 instantiation
U MUXF7: MUXF7
    port map (
    IO => DATA LSB,
    I1 => DATA MSB,
    S => SELECT_I (3),
    O => DATA O
    );
_ _
end MUX_16_1_SUBM_arch;
```

#### Verilog Template

```
// Module: MUX 16 1 SUBM
11
// Description: Multiplexer 16:1
// Device: Virtex-II Family
//-----
11
module MUX 16 1 SUBM (DATA I, SELECT I, DATA O);
input [15:0]DATA I;
input [3:0]SELECT_I;
output DATA O;
wire [2:0] SELECT;
reg DATA_LSB;
reg DATA_MSB;
assign SELECT[2:0] = SELECT_I[2:0];
/*
//If synthesis tools supports MUXF7 :
always @ (DATA_I or SELECT_I)
   case (SELECT I)
      4'b0000 : DATA O <= DATA I[0];
 4'b0001 : DATA_O <= DATA_I[1];
 4'b0010 : DATA_O <= DATA_I[2];
 4'b0011 : DATA_O <= DATA_I[3];
       4'b0100 : DATA O <= DATA I[4];
 4'b0101 : DATA_O <= DATA_I[5];
 4'b0110 : DATA_O <= DATA_I[6];
 4'b0111 : DATA_O <= DATA_I[7];
       4'b1000 : DATA_O <= DATA_I[8];
 4'b1001 : DATA_O <= DATA_I[9];
 4'b1010 : DATA O <= DATA I[10];
 4'b1011 : DATA_O <= DATA_I[11];
```

```
4'b1100 : DATA_O <= DATA_I[12];
 4'b1101 : DATA_O <= DATA_I[13];
 4'b1110 : DATA O <= DATA I[14];
 4'b1111 : DATA_O <= DATA_I[15];
 default : DATA_0 <= 1'bx;</pre>
    endcase
*/
always @ (SELECT or DATA I)
    case (SELECT)
        3'b000 : DATA_LSB <= DATA_I[0];
 3'b001 : DATA LSB <= DATA I[1];
 3'b010 : DATA_LSB <= DATA_I[2];
 3'b011 : DATA LSB <= DATA I[3];
        3'b100 : DATA_LSB <= DATA_I[4];
 3'b101 : DATA_LSB <= DATA_I[5];
 3'b110 : DATA_LSB <= DATA_I[6];
 3'b111 : DATA LSB <= DATA I[7];
 default : DATA_LSB <= 1'bx;</pre>
    endcase
always @ (SELECT or DATA_I)
    case (SELECT)
        3'b000 : DATA_MSB <= DATA_I[8];
 3'b001 : DATA_MSB <= DATA_I[9];
 3'b010 : DATA_MSB <= DATA_I[10];
 3'b011 : DATA_MSB <= DATA_I[11];
        3'b100 : DATA_MSB <= DATA_I[12];
 3'b101 : DATA_MSB <= DATA_I[13];
 3'b110 : DATA_MSB <= DATA_I[14];
 3'b111 : DATA_MSB <= DATA_I[15];
 default : DATA_MSB <= 1'bx;</pre>
    endcase
// MUXF7 instantiation
MUXF7 U MUXF7
               (.IO(DATA_LSB),
     .I1(DATA_MSB),
     .S(SELECT_I[3]),
     .O(DATA_O)
     );
endmodule
11
*/
```

# Implementing Sum of Products (SOP) Logic

## Introduction

Virtex-II slices contain a dedicated two-input multiplexer (MUXCY) and a two-input OR gate (ORCY) to perform operations involving wide AND and OR gates. These combine the four-input LUT outputs. These gates can be cascaded in a chain to provide the wide AND functionality across slices. The output from the cascaded AND gates can then be combined with the dedicated ORCY to produce the Sum of Products (SOP).

# Virtex-II CLB Resources

Each Virtex-II slice has a MUXCY, which uses the output from the LUTs as a SELECT signal. Depending on the width of data desired, several slices can be used to provide the SOP output. Figure 2-69 illustrates the logic involved in designing a 16-input AND gate. It utilizes the 4-input LUT to provide the necessary SELECT signal for the MUXCY. Only when all of the input signals are High, can the V<sub>CC</sub> at the bottom reach the output. This use of carry logic helps to perform AND functions at high speed and saves logic resources.



Figure 2-69: Implementing a 16-bit Wide AND Gate Using MUXCY & ORCY

The output from the chain of AND gates is passed as one of the inputs of the dedicated OR gate, ORCY. To calculate the SOP, these AND chains can be cascaded vertically across several CLBs, depending on the width of the input data. Figure 2-70 illustrates how the AND outputs are then passed in through the ORCY gates in a horizontal cascade, the sum of which is the Sum of Products.



Figure 2-70: 64-bit Input SOP Design

# **VHDL** Parameters

#### AND\_WIDTH Parameter

The width of each AND gate used in the cascade.

#### **PROD\_TERM** Parameter

The number of AND gates used along each vertical cascade.

#### AND\_IN Parameter

Data input to the AND gates. The total width of data is calculated from the product of AND\_WIDTH and PROD\_TERM

#### SOP\_OUT Parameter

The Sum of Products (SOP) output data from the cascade chain.

### **Applications**

These logic gates can be used in various applications involving very wide AND gates and Sum of Products (SOP) functions.
# VHDL and Verilog Instantiation

To implement wide-input AND functions, MUXCY and ORCY primitives can be instantiated in VHDL or Verilog code. The submodule code provided can be used to implement wide-input AND gates for any width of input data.

### VHDL and Verilog Submodules

VHDL and Verilog submodules are available to implement the cascade chain of wideinput AND gates and OR gates to calculate the Sum of Products (SOP). The VHDL module provided uses a generic case, where the width of data and the product terms can be specified in the case. The Verilog module provides a 64-bit input example, using four wide AND chains, each of which handle 16 bits of data.

### **VHDL** Templates

```
-- Module : AND CHAIN
-- Description : 16 input AND gate
- -
-- Device : Virtex-II Family
-----
library IEEE;
use IEEE.std_logic_1164.all;
--library UNISIM;
--use UNISIM.VCOMPONENTS.ALL;
entity AND CHAIN is
generic (
   input_width : integer); --must be a 4x value
port (
   data in : in std logic vector( input width-1 downto 0);
   carry in : in std logic;
   out_andor_chain : out std_logic);
end AND_CHAIN;
architecture AND_CHAIN_arch of AND_CHAIN is
component ORCY
  port( i : std_logic;
       ci : in std_logic;
       o : out std_logic);
end component;
component AND LOGIC
  port( sel_data : in std_logic_vector(3 downto 0);
       data_cin : in std_logic;
       data out : out std logic);
end component;
signal VCC, GND : std logic;
signal cout : std logic vector(input width/4 downto 0);
signal out_and_chain : std_logic;
begin
VCC <= '1';
GND <= '0';
--initialisation of first input for MUXCY
cout(0) <= VCC;</pre>
and chain x : for i in (input width/4) - 1 downto 0 generate
```

www.xilinx.com 1-800-255-7778

```
AND_LOGIC_inst : AND_LOGIC
       port map (
          sel data => data in((4 * i + 3) downto (4 * i)),
          data_cin => cout(i),
          data_out => cout(i + 1));
end generate;
out_and_chain <= cout(input_width/4);</pre>
orcy inst : ORCY
 port map( i => out_and_chain,
       ci => carry_in,
       o => out_andor_chain);
end AND CHAIN arch;
_____
-- Module AND LOGIC
-- Description : 4-input AND gate
- -
-- Device : Virtex-II Family
-----
                           library IEEE;
use IEEE.std_logic_1164.all;
--library UNISIM;
--use UNISIM.VCOMPONENTS.ALL;
entity AND_LOGIC is
 port (
     sel_data : in std_logic_vector(3 downto 0); -- data for select
signal for MUXCY from LUT
     data_cin : in std_logic; -- result from previous stage
     data_out : out std_logic);
end AND_LOGIC;
architecture AND_LOGIC_arch of AND_LOGIC is
component MUXCY
port(
    DI : in std_logic;
    CI : in std_logic;
    s : in std_logic;
    o : out std_logic);
end component;
signal GND : std logic;
signal sel:std_logic;
begin
GND <= '0';
sel <= sel data(0) and sel data(1) and sel data(2) and sel data(3);</pre>
--Wide AND gate using MUXCY
MUX : MUXCY
    port map (
       DI => GND,
       CI => data_cin,
       s => sel,
       o => data_out);
```

end AND\_LOGIC\_arch;

```
_____
-- Module : SOP SUBM
-- Description : Implementing SOP using MUXCY and ORCY
- -
-- Device : Virtex-II Family
-----
                                             _ _ _ _ _ _ _ _ _ _
library ieee;
use ieee.std logic 1164.all;
--library UNISIM;
--use UNISIM.VCOMPONENTS.ALL;
entity SOP SUBM is
 generic(
    and width : integer :=16 ;
    prod_term : integer := 4 );
 port (
    and_in : in std_logic_vector(and_width * prod_term - 1 downto 0);
    sop out : out std logic);
end SOP_SUBM;
architecture SOP_SUBM_arch of SOP_SUBM is
component AND CHAIN
generic (
   input_width : integer); --must be a 4x value
port (
   data_in : in std_logic_vector( input_width-1 downto 0);
   carry_in : in std_logic;
   out_andor_chain : out std_logic);
end component;
signal VCC, GND : std logic;
signal carry : std_logic_vector(prod_term downto 0);
begin
VCC <= '1';
GND <= '0';
carry(0) <= GND;</pre>
andor_inst : for i in 0 to (prod_term - 1) generate
     and_chainx : AND_CHAIN
            generic map(
               input_width => and_width)
            port map(
            data_in => and_in((and_width * i + (and_width -1)) downto
(and_width * i)),
               carry in => carry(i),
               out_andor_chain => carry(i + 1));
end generate;
sop out <= carry(prod term);</pre>
end SOP SUBM arch;
```

### Verilog Templates

```
// Module : AND_CHAIN
// Description : 16 input AND gate
11
// Device : Virtex-II Family
_____
                               module AND_CHAIN(data_in, carry_in, out_andor_chain);
input [15:0] data_in;
input carry_in;
output out_andor_chain;
wire VCC = 1'b1;
wire out_and_chain;
wire dat_out1, data_out2, data_out3;
AND LOGIC OR u4(.sel_data(data_in[15:12]), .data_cin(data_out3),
.carry_in(carry_in), .data_out(out_andor_chain));
AND LOGIC u3(.sel data(data in[11:8]), .data cin(data out2),
.data out(data out3));
AND LOGIC u2(.sel data(data in[7:4]), .data cin(data out1),
.data_out(data_out2));
AND_LOGIC u1(.sel_data(data_in[3:0]), .data_cin(VCC),
.data out(data out1));
endmodule
//-----
// Module AND LOGIC
// Description : 4-input AND gate
11
// Device : Virtex-II Family
_____
                               // Module : init and
11
module AND_LOGIC(sel_data, data_cin, data_out);
input[3:0] sel_data;
input data cin;
output data_out;
wire GND = 1'b0;
wire VCC = 1'b1;
wire and out;
assign and_out = sel_data[3] & sel_data[2] & sel_data[1] & sel_data[0];
MUXCY muxcy_inst (.DI(GND), .CI(data_cin), .S(and_out), .O(data_out));
endmodule
// Module AND LOGIC + ORCY
module AND_LOGIC_OR(sel_data, data_cin, carry_in, data_out);
input[3:0] sel_data;
input data_cin;
input carry_in;
output data out;
wire data mux out;
wire GND = 1'b0;
wire VCC = 1'b1;
wire and out;
assign and out = sel data[3] & sel data[2] & sel data[1] & sel data[0];
MUXCY muxcy_inst (.DI(GND), .CI(data_cin), .S(and_out),
.O(data mux out)) /* synthesis RLOC="x0y0" */;
ORCY u5(.I(carry_in), .CI(data_mux_out), .O(data_out)) /* synthesis
RLOC="x0y0" */;
endmodule
```

```
//----
// Module : SOP_SUBM
// Description : Implementing SOP using MUXCY and ORCY
11
// Device : Virtex-II Family
//-----
                                     module SOP_SUBM(and_in, sop_out);
input [63:0] and_in;
output sop out;
wire out_andor_chain1, out_andor_chain2, out_andor_chain3;
wire GND = 1'b0;
AND_CHAIN u4(.data_in(and_in[63:48]), .carry_in(out_andor_chain3),
.out_andor_chain(sop_out));
AND_CHAIN u3(.data_in(and_in[47:32]), .carry_in(out_andor_chain2),
.out andor chain(out andor chain3));
AND_CHAIN u2(.data_in(and_in[31:16]), .carry_in(out_andor_chain1),
.out_andor_chain(out_andor_chain2));
AND_CHAIN u1(.data_in(and_in[15:0]), .carry_in(GND),
.out_andor_chain(out_andor_chain1));
endmodule
```

# **Using Embedded Multipliers**

# Introduction

Virtex-II devices feature a large number of embedded 18-bit X 18-bit two's-complement embedded multipliers. The embedded multipliers offer fast, efficient means to create 18-bit signed by 18-bit signed multiplication products. The multiplier blocks share routing resources with the Block SelectRAM memory, allowing for increased efficiency for many applications. Cascading of multipliers can be implemented with additional logic resources in local Virtex-II slices.

Applications such as signed-signed, signed-unsigned, and unsigned-unsigned multiplication, logical, arithmetic, and barrel shifters, two's-complement and magnitude return are easily implemented.

Using the CORE Generator, the designer can quickly generate multipliers that make use of the embedded 18-bit x 18-bit two's-complement multipliers (V2.0 or later) of the Multiplier core for Virtex-II devices.

# Two's-Complement Signed Multiplier

## Data Flow

Each embedded multiplier block (MULT18X18 primitive) supports two independent dynamic data input ports: 18-bit signed or 17-bit unsigned. The MULT18X18 primitive is illustrated in Figure 2-71.

In addition, efficient cascading of multipliers up to 35-bit X 35-bit signed can be accomplished by using 4 embedded multipliers, one 36-bit adder, and one 53-bit adder. See Figure 2-72.



UG002\_C2\_025\_082100

Figure 2-71: Embedded Multiplier

# Library Primitives and Submodules

One library primitive (MULT18X18) is available. Table 2-26 lists the attributes of this primitive.

### Table 2-26: Embedded Multiplier Primitive

| Primitive | A width | B width | P width | Signed/Unsigned         |
|-----------|---------|---------|---------|-------------------------|
| MULT18X18 | 18      | 18      | 36      | Signed (2's complement) |

2

In addition to the primitive, 15 submodules that implement various widths of signed and unsigned multipliers and two's-complement return functions are provided in VHDL and Verilog code. Multipliers using cascaded MULT18X18 primitives are included with registers between stages causing three cycles of latency. Multipliers that make use of the embedded Virtex-II 18-bit by 18-bit two's complement multipliers can be easily generated using V2.0 of the CORE Generator Multiplier module. Table 2-27 lists cascaded multiplier submodules.

 Table 2-27:
 Embedded Multiplier Submodules - Cascaded MULT18X18

| Submodule   | A Width | B Width | P Width | Signed/Unsigned |
|-------------|---------|---------|---------|-----------------|
| MULT35X35_S | 35      | 35      | 70      | Signed          |
| MULT34X34_U | 34      | 34      | 68      | Unsigned        |

Figure 2-72 represents the cascaded scheme used to implement a 35-bit by 35-bit signed multiplier utilizing four embedded multipliers and two adders.



Figure 2-72: MULT35X35\_S Submodule

The fixed adder is 53 bits wide (17 LSBs are always 0 on one input).

The 34-bit by 34-bit unsigned submodule is constructed in a similar manner with the most significant bit on each operand being tied to logic low.

Table 2-27 lists multipliers and two's-complement return functions that utilize oneMULT18X18 primitive and are not registered.

| Submodule      | A width | B width | P width  | Signed/Unsigned      |
|----------------|---------|---------|----------|----------------------|
| MULT17X17_U    | 17      | 17      | 34       | Unsigned             |
| MULT8X8_S      | 8       | 8       | 16       | Signed               |
| MULT8X8_U      | 8       | 8       | 16       | Unsigned             |
| MULT4X4_S      | 4       | 4       | 8        | Signed               |
| MULT4X4_U      | 4       | 4       | 8        | Unsigned             |
| MULT_6X6S_5X5U | 6<br>5  | 6<br>5  | 12<br>10 | Signed<br>Unsigned   |
| MULT_5X5S_6X6U | 5<br>6  | 5<br>6  | 10<br>12 | Signed<br>Unsigned   |
| MULT_5X5U_5X5U | 5<br>5  | 5<br>5  | 10<br>10 | Unsigned<br>Unsigned |
| MULT_4X4S_7X7U | 4<br>7  | 4<br>7  | 8<br>14  | Signed<br>Unsigned   |
| MULT_4X4S_3X3S | 4<br>3  | 4<br>3  | 8<br>6   | Signed<br>Signed     |
| TWOS_CMP18     | 18      | -       | 18       | -                    |
| TWOS_CMP9      | 9       | -       | 9        | -                    |
| MAGNTD_18      | 18      | -       | 17       | -                    |

Table 2-28: Embedded Multiplier Submodules - Single MULT18X18

Multipliers of form MULT\_aXaS\_bXbU use one embedded multiplier to implement two multipliers with separate outputs. The submodules listed above use optimized pin assignments to achieve shortest possible through-delay.

Figure 2-73 and Figure 2-74 represent 4-bit by 4-bit signed multiplier and 4-bit by 4-bit unsigned multiplier implementations, respectively.



UG002\_C2\_022\_032901

Figure 2-73: MULT4X4\_S Submodule



*Figure 2-74:* **MULT4X4\_U Submodule** 

Submodule MAGNTD\_18 performs a magnitude return (i.e., absolute value) of a two'scomplement number. An incoming negative number returns with a positive number, while an incoming positive number remains unchanged. Submodules TWOS\_CMP18 and TWOS\_CMP9 perform a two's-complement return function. The incoming number in two's-complement form (either signed or unsigned) is complemented when the DO\_COMP pin is asserted High. Additional slice logic can be used with these submodules to efficiently convert sign-magnitude to two's-complement or vice-versa. Figure 2-75 shows the connections to a MULT18X18 to create the submodule TWOS\_CMP9.



Figure 2-75: TWOS\_CMP9 Submodule

## Two Multipliers in a Single Primitive

Two multipliers can be implemented in a single primitive. For simplified illustration purposes, an assumption of two squares being implemented in the same MULT18X18 primitive is used. The following equation shows the form of the multiplication.

#### **Two Multipliers per Primitive:**

 $(X * 2^{n} + Y)(X * 2^{n} + Y) = (X^{2} * 2^{2n}) + (Y^{2}) + (XY * 2^{n+1})$ 

 $(X * 2^n)$  is the input X appearing on the MSBs while Y appears on the LSBs to form the value  $(X * 2^n + Y)$ . Two multipliers can coexist in one MULT18X18 primitive, if the conditions in the following inequalities are met when neither X nor Y are 0.

### Inequality Conditions for Two Multipliers per Primitive:

 $(X^2 * 2^{2n})_{min} > (XY * 2^{n+1})_{max}, (XY * 2^{n+1})_{min} > (Y^2)_{max}$ For values 0 on X or Y, the equation becomes:

| X <sup>2</sup> ^ 2 <sup>2</sup> 11 | {Y=0}      |
|------------------------------------|------------|
| Y <sup>2</sup>                     | {X=0}      |
| 0                                  | {X=0, Y=0} |

Figure 2-76 represents the MULT\_6X6S\_5X5U submodule.





Table 2-29 shows values for X and Y where these conditions are met.

*Table 2-29:* **Two Multipliers per MULT18X18 Allowable Sizes** 

| X * X       |               | Y           | * Y           |
|-------------|---------------|-------------|---------------|
| Signed Size | Unsigned Size | Signed Size | Unsigned Size |
| 7 X 7       | 6 X 6         | -           | 4 X 4         |
| 6 X 6       | 5 X 5         | -           | 5 X 5         |
| 5 X 5       | 4 X 4         | 3 X 3       | 6 X 6         |
| 4 X 4       | 3 X 3         | 3 X 3       | 7 X 7         |
| 3 X 3       | 2 X 2         | 4 X 4       | 8 X 8         |

# VHDL and Verilog Instantiation

VHDL and Verilog instantiation templates are available as examples of primitives and submodules (see "VHDL and Verilog Templates" on page 155).

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signals names.

# **Port Signals**

### Data In - A

The data input provides new data (up to 18 bits) to be used as one of the multiplication operands.

### Data In - B

The data input provides new data (up to 18 bits) to be used as one of the multiplication operands.

## Data Out - P

The data output bus P provides the data value (up to 36 bits) of two's-complement multiplication for operands A and B.

# **Location Constraints**

Each embedded multiplier has location coordinates of the form XrowYcolumn. To constrain placement, multiplier instances can have LOC properties attached to

MULT18X18 embedded multiplier instances can have LOC properties attached to them to constrain placement. MULT18X18 placement locations differ from the convention used for naming CLB locations, allowing LOC properties to transfer easily from array to array.

The LOC properties use the following form:

LOC = MULT18X18\_X#Y#

For example, MULT18X18\_X0Y0 is the bottom-left MULT18X18 location on the device.

# VHDL and Verilog Templates

VHDL and Verilog templates are available for the primitive and submodules.

The following is a template for the primitive:

• SIGNED\_MULT\_18X18 (primitive: MULT18X18)

The following are templates for submodules:

- SIGNED\_MULT\_35X35 (submodule: MULT35X35\_S)
- UNSIGNED\_MULT\_34X34 (submodule: MULT34X34\_U)
- UNSIGNED\_MULT\_17X17 (submodule: MULT17X17\_U)
- SIGNED\_MULT\_8X8 (submodule: MULT8X8\_S)
- UNSIGNED\_MULT\_8X8 (submodule: MULT8X8\_U)
- SIGNED\_MULT\_4X4 (submodule: MULT4X4\_S)
- UNSIGNED\_MULT\_4X4 (submodule: MULT4X4\_U)
- DUAL\_MULT\_6X6S\_5X5U (submodule: MULT\_6X6S\_5X5U)
- DUAL\_MULT\_5X5S\_6X6U (submodule: MULT\_5X5S\_6X6U)
- DUAL\_MULT\_5X5U\_5X5U (submodule: MULT\_5X5U\_5X5U)
- DUAL\_MULT\_4X4S\_7X7U (submodule: MULT\_4X4S\_7X7U)
- DUAL\_MULT\_4X4S\_3X3S (submodule: MULT\_4X4S\_3X3S)
- TWOS\_COMPLEMENTER\_18BIT (submodule: TWOS\_CMP18)
- TWOS\_COMPLEMENTER\_9BIT (submodule: TWOS\_CMP9)
- MAGNITUDE\_18BIT (submodule: MAGNTD\_18)

The corresponding submodules have to be synthesized with the design.

Templates for the SIGNED\_MULT\_18X18 module are provided in VHDL and Verilog code as an example.

## VHDL Template:

```
-- Module: SIGNED_MULT_18X18
-- Description: VHDL instantiation template
-- 18-bit X 18-bit embedded signed multiplier (asynchronous)
- -
-- Device: Virtex-II Family
_____
-- Components Declarations
component MULT18X18
 port(
      A : in std_logic_vector (17 downto 0);
      B : in std_logic_vector (17 downto 0);
      P : out std_logic_vector (35 downto 0)
 );
end component;
- -
-- Architecture Section
_ _
U MULT18X18 : MULT18X18
 port map (
   A => , -- insert input signal #1
   B => , -- insert input signal #2
   P => -- insert output signal
 );
```

## Verilog Template:

# **Using Single-Ended Selectl/O-Ultra Resources**

# Summary

The Virtex-II FPGA series includes a highly configurable, high-performance single-ended SelectI/O-Ultra resource that supports a wide variety of I/O standards. The SelectI/O-Ultra resource includes a robust set of features, including programmable control of output drive strength, slew rate, and input delay and hold time. Taking advantage of the flexibility of SelectI/O-Ultra features and the design considerations described in this document can improve and simplify system-level design.

# Introduction

As FPGAs continue to grow in size and capacity, the larger and more complex systems designed for them demand an increased variety of I/O standards. Furthermore, as system clock speeds continue to increase, the need for high-performance I/O becomes more important. Chip-to-chip delays have an increasingly substantial impact on overall system speed. The task of achieving the desired system performance is becoming more difficult with the proliferation of low-voltage I/O standards. SelectI/O-Ultra resolves this potential problem by providing a highly configurable, high-performance alternative to I/O resources used in more conventional programmable devices.

Virtex-II SelectI/O-Ultra blocks can support up to 19 single-ended I/O standards. Supporting such a variety of I/O standards allows support for a wide variety of applications.

Each Input/Output Block (IOB) includes six registers, two each from the input, output, and 3-state signals within the IOB. These registers are optionally configured as either a D-type flip-flop or as a level-sensitive latch. The purpose of having six registers is to allow designers to design double-data-rate (DDR) logic in the I/O blocks. Each pair of the flip-flop (FF) has different clocks so that the flip-flops can be driven by two clocks with a 180-degree phase shift to achieve DDR. All I/O flip-flops still share the same reset/preset line.

The input buffer has an optional delay element used to guarantee a zero hold time requirement for input signals registered within the IOB.

Virtex-II SelectI/O-Ultra features also provide dedicated resources for input reference voltage ( $V_{REF}$ ) and input output source voltage ( $V_{CCO}$ ), along with a convenient banking system that simplifies board design. Virtex-II inputs and outputs are powered from  $V_{CCO}$ . Differential amplifier inputs, such as GTL and SSTL, are powered from  $V_{REF}$ .

# **Fundamentals**

Modern bus applications, pioneered by the largest and most influential components in the digital electronics industry, are commonly introduced with a new I/O standard tailored specifically to the needs of that application. The bus I/O standards provide specifications to other vendors who create products designed to interface with these applications. Each standard often has its own specifications for current, voltage, I/O buffering, and termination techniques.

The ability to provide the flexibility and time-to-market advantages of programmable logic is increasingly dependent on the capability of the programmable logic device to support an ever increasing variety of I/O standards.

SelectI/O-Ultra resources feature highly configurable input and output buffers that provide support for a wide variety of I/O standards. An input buffer can be configured as either a simple buffer or as a differential amplifier input. An output buffer can be configured as either a Push-Pull output or as an Open Drain output.

Table 2-30 illustrates all of the supported single-ended I/O standards in Virtex-II devices.Each buffer type can support a variety of current and voltage requirements.

| I/O<br>Standard | Input Reference<br>Voltage (V <sub>REF</sub> ) | Input Source<br>Voltage (V <sub>CCO</sub> ) | Output Source<br>Voltage (V <sub>CCO</sub> ) | Board Termination<br>Voltage (V <sub>TT</sub> ) |
|-----------------|------------------------------------------------|---------------------------------------------|----------------------------------------------|-------------------------------------------------|
| LVTTL           | N/A                                            | 3.3                                         | 3.3                                          | N/A                                             |
| LVCMOS15        | N/A                                            | 1.5                                         | 1.5                                          | N/A                                             |
| LVCMOS18        | N/A                                            | 1.8                                         | 1.8                                          | N/A                                             |
| LVCMOS25        | N/A                                            | 2.5                                         | 2.5                                          | N/A                                             |
| LVCMOS33        | N/A                                            | 3.3                                         | 3.3                                          | N/A                                             |
| PCI33_3         | N/A                                            | 3.3                                         | 3.3                                          | N/A                                             |
| PCI66_3         | N/A                                            | 3.3                                         | 3.3                                          | N/A                                             |
| PCIX            | N/A                                            | 3.3                                         | 3.3                                          | N/A                                             |
| GTL             | 0.80                                           | N/A                                         | N/A                                          | 1.2                                             |
| GTL+            | 1.0                                            | N/A                                         | N/A                                          | 1.5                                             |
| HSTL_I          | 0.75                                           | N/A                                         | 1.5                                          | 0.75                                            |
| HSTL_II         | 0.75                                           | N/A                                         | 1.5                                          | 0.75                                            |
| HSTL_III        | 0.9                                            | N/A                                         | 1.5                                          | 1.5                                             |
| HSTL_IV         | 0.9                                            | N/A                                         | 1.5                                          | 1.5                                             |
| HSTL_I          | 0.9                                            | N/A                                         | 1.8                                          | 0.9                                             |
| HSTL_II         | 0.9                                            | N/A                                         | 1.8                                          | 0.9                                             |
| HSTL_III        | 1.1                                            | N/A                                         | 1.8                                          | 1.8                                             |
| HSTL_IV         | 1.1                                            | N/A                                         | 1.8                                          | 1.8                                             |
| SSTL3_I         | 1.5                                            | N/A                                         | 3.3                                          | 1.5                                             |
| SSTL3_II        | 1.5                                            | N/A                                         | 3.3                                          | 1.5                                             |
| SSTL2_I         | 1.25                                           | N/A                                         | 2.5                                          | 1.25                                            |
| SSTL2_II        | 1.25                                           | N/A                                         | 2.5                                          | 1.25                                            |
| AGP-2X          | 1.32                                           | N/A                                         | 3.3                                          | N/A                                             |

Table 2-30: Supported Single-Ended I/O Standards

# Overview of Supported I/O Standards

This section provides a brief overview of I/O standards supported by all Virtex-II devices.

While most I/O standards specify a range of allowed voltages, this document records typical voltage values only. Detailed information on each specification can be found on the Electronic Industry Alliance JEDEC website at: <u>http://www.jedec.org</u>

# LVTTL - Low-Voltage TTL

The low-voltage TTL, or LVTTL, standard is a general purpose EIA/JESDSA standard for 3.3 V applications that use an LVTTL input buffer and a Push-Pull output buffer. This standard requires a 3.3 V input and output source voltage ( $V_{CCO}$ ), but does not require the use of a reference voltage ( $V_{REF}$ ) or a termination voltage ( $V_{TT}$ ).

## LVCMOS33 - 3.3 Volt Low-Voltage CMOS

This standard is an extension of the LVCMOS standard (JESD 8.-5). It is used in general purpose 3.3 V applications. The standard requires a 3.3 V input/output source voltage ( $V_{CCO}$ ), but does not require the use of a reference voltage ( $V_{REF}$ ) or termination voltage ( $V_{TT}$ ).

## LVCMOS25 - 2.5 Volt Low-Voltage CMOS

This standard is an extension of the LVCMOS standard (JESD 8.-5). It is used in general purpose 2.5 volts or lower applications. This standard requires a 2.5 V input /output source voltage ( $V_{CCO}$ ), but does not require the use of a reference voltage ( $V_{REF}$ ) or a board termination voltage ( $V_{TT}$ ).

## LVCMOS18 - 1.8 Volt Low-Voltage CMOS

This standard is an extension of the LVCMOS standard. It is used in general purpose 1.8 V applications. The use of a reference voltage ( $V_{REF}$ ) or board termination voltage ( $V_{TT}$ ) is not required.

## LVCMOS15 - 1.5 Volt Low-Voltage CMOS

This standard is an extension of the LVCMOS standard. It is used in general purpose 1.5 V applications. The use of a reference voltage ( $V_{REF}$ ) or a board termination voltage ( $V_{TT}$ ) is not required.

### PCI - Peripheral Component Interface

The PCI standard specifies support for 33 MHz, 66 MHz and 133 MHz PCI bus applications. It uses a LVTTL input buffer and a Push-Pull output buffer. This standard does not require the use of a reference voltage ( $V_{REF}$ ) or a board termination voltage ( $V_{TT}$ ), however, it does require 3.3 V input output source voltage ( $V_{CCO}$ ).

## GTL -Gunning Transceiver Logic Terminated

The GTL standard is a high-speed bus standard (JESD8.3) invented by Xerox. Xilinx has implemented the terminated variation for this standard. This standard requires a differential amplifier input buffer and a open Drain output buffer.

### GTL+ - Gunning Transceiver Logic Plus

The Gunning Transceiver Logic Plus, or GTL+ standard is a high-speed bus standard (JESD8.3) first used by the Pentium Pro Processor.

## HSTL - High-speed Transceiver Logic

The high-speed Transceiver Logic, or HSTL standard is a general purpose high-speed, 1.5V bus standard sponsored by IBM (EIA/JESD8-6). This standard has four variations or classes. Virtex-II SelectI/O-Ultra supports all four Classes. This standard requires a Differential Amplifier input buffer and a Push-pull output buffer.

## SSTL3 - Stub Series Terminated Logic for 3.3V

The Stub Series Terminated Logic for 3.3V, or SSTL3 standard is a general purpose 3.3V memory bus standard also sponsored by Hitachi and IBM (JESD8-8). This standard has two classes, I and II. Virtex-II SelectI/O-Ultra supports both classes for the SSTL3 standard. This standard requires a Differential Amplifier input buffer and a Push-Pull output buffer.

## SSTL2 - Stub Series Terminated Logic for 2.5V

The Stub Series Terminated Logic for 2.5V, or SSTL2 standard is a general purpose 2.5V memory bus standard also sponsored by Hitachi and IBM (JESD8-8). This standard has two classes, I and II. Virtex-II SelectI/O-Ultra supports both classes for the SSTL2 standard. This standard requires a Differential Amplifier input buffer and a Push-Pull output buffer

## AGP-2X - Advanced Graphics Port

The Intel AGP standard is a 3.3V Advanced Graphics Port-2X bus standard used with the Pentium II processor for graphic applications. This standard requires a Push-Pull output buffer and a Differential Amplifier input buffer.

# Library Symbols

The Xilinx library includes an extensive list of symbols designed to provide support for the variety of SelectI/O-Ultra features. Most of these symbols represent variations of the five generic SelectI/O-Ultra symbols.

- IBUF (input buffer)
- IBUFG (clock input buffer)
- OBUF (output buffer)
- OBUFT (3-state output buffer)
- IOBUF (input/output buffer)

### IBUF

Signals used as inputs to a Virtex-II device must source an input buffer (IBUF) via an external input port. The generic Virtex-II IBUF symbol is shown in Figure 2-77. The extension to the base name defines which I/O standard the IBUF uses. The assumed standard is LVTTL when the generic IBUF has no specified extension.



Figure 2-77: Input Buffer (IBUF) Symbols

Table 2-31 details variations of the IBUF symbol for single-ended Virtex-II I/O standards:

Table 2-31: Variations of the IBUF Symbol

| IBUF          | IBUF_HSTL_III |
|---------------|---------------|
| IBUF_LVCMOS15 | IBUF_HSTL_IV  |
| IBUF_LVCMOS18 | IBUF_SSTL2_I  |
| IBUF_LVCMOS25 | IBUF_SSTL2_II |
| IBUF_LVCMOS33 | IBUF_SSTL3_I  |
| IBUF_APG      | IBUF_SSTL3_II |
| IBUF_GTL      | IBUF_PCI33_3  |
| IBUF_GTLP     | IBUF_PCI66_3  |
| IBUF_HSTL_I   | IBUF_PCIX     |
| IBUF_HSTL_II  | IBUF_AGP      |

When the IBUF symbol supports an I/O standard that requires a differential amplifier input, the IBUF is automatically configured as a differential amplifier input buffer. The low-voltage I/O standards with a differential amplifier input require an external reference voltage input  $V_{REF}$ .

The voltage reference signal is "banked" within the Virtex-II device on a half-edge basis, such that for all packages there are eight independent  $V_{REF}$  banks internally. For a representation of the Virtex-II I/O banks, see Figure 2-79. Within each bank approximately one of every six I/O pins is automatically configured as a  $V_{REF}$  input. After placing a differential amplifier input signal within a given  $V_{REF}$  bank, the same external source must drive all I/O pins configured as a  $V_{REF}$  input.

IBUF placement restrictions require that any differential amplifier input signals within a bank be of the same standard. How to specify a specific location for the IBUF via the LOC property is described below. Table 2-32 summarizes compatibility requirements of Virtex-II input standards.

An optional delay element in the input data path is associated with each IBUF. When the IBUF drives a flip-flop within the IOB, the delay element is activated by default to ensure a zero hold-time requirement at the device input pin. The IOBDELAY = NONE property overrides this default, thus reducing the input set-up time, but risking a hold-time requirement.

When the IBUF does not drive a flip-flop within the IOB, the delay element is deactivated by default to provide a shorter input set-up time. To delay the input signal, activate the delay element with the IOBDELAY = BOTH property.



*Figure 2-78:* Virtex-II I/O Banks: Top View for Flip-Chip Packages (FF & BF)



ug002\_c2\_014\_112900



Table 2-32: Xilinx Input Standard Compatibility Requirements

| Rule 1 | Standards with the same $\rm V_{CCO}$ , and $\rm V_{REF}$ can be placed within the same bank.                                              |
|--------|--------------------------------------------------------------------------------------------------------------------------------------------|
| Rule 2 | Standards that don't require a $\rm V_{REF}$ can be placed within the same bank with the standards that have the same $\rm V_{CCO}$ values |

Each bank has its own  $V_{CCO}$  and  $V_{REF}$  voltage. Details on compatible input standards for each  $V_{CCO}$  /  $V_{REF}$  voltage combination are available in the <u>Virtex-II Data Sheet (DS031)</u>.

### OBUF

An OBUF must drive outputs through an external output port. Figure 2-80 shows the generic output buffer (OBUF) symbol.



Figure 2-80: Virtex-II Output Buffer (OBUF) Symbol

The extension to the base name defines which I/O standard the OBUF uses. With no extension specified for the generic OBUF symbol, the assumed standard is slew rate limited LVTTL with 12mA drive strength.

The LVTTL and LVCMOS OBUFs can additionally support one of two slew rate modes to minimize bus transients. By default, the slew rate for each output buffer is reduced to minimize power bus transients, when switching non-critical signals.

LVTTL and LVCMOS output buffers have selectable drive strengths. The format for these OBUF symbol names is as follows:

OBUF\_<slew\_rate>\_<drive\_strength>

<slew\_rate> is either F (fast) or S (slow) and <drive\_strength> is specified in milliamperes. For LVTTL, LVCMOS25, and LVCMOS33, the supported drive strengths are 2, 4, 6, 8, 12, 16, and 24. For LVCMOS15, and LVCMOS18, the supported drive strengths are 2, 4, 6, 8, 12, and 16.

Table 2-33 details variations of the OBUF symbol.

| Table 2-33: 🛛 🛛 | ariations | of the | OBUF | Symbol |
|-----------------|-----------|--------|------|--------|
|-----------------|-----------|--------|------|--------|

| OBUF              | OBUF_LVCMOS18_S_2  | OBUF_LVCMOS33_S_4  |
|-------------------|--------------------|--------------------|
| OBUF_S_2          | OBUF_LVCMOS18_S_4  | OBUF_LVCMOS33_S_6  |
| OBUF_S_4          | OBUF_LVCMOS18_S_6  | OBUF_LVCMOS33_S_8  |
| OBUF_S_6          | OBUF_LVCMOS18_S_8  | OBUF_LVCMOS33_S_12 |
| OBUF_S_8          | OBUF_LVCMOS18_S_12 | OBUF_LVCMOS33_S_16 |
| OBUF_S_12         | OBUF_LVCMOS18_S_16 | OBUF_LVCMOS33_S_24 |
| OBUF_S_16         | OBUF_LVCMOS18_F_2  | OBUF_LVCMOS33_F_2  |
| OBUF_S_24         | OBUF_LVCMOS18_F_4  | OBUF_LVCMOS33_F_4  |
| OBUF_F_2          | OBUF_LVCMOS18_F_6  | OBUF_LVCMOS33_F_6  |
| OBUF_F_4          | OBUF_LVCMOS18_F_8  | OBUF_LVCMOS33_F_8  |
| OBUF_F_6          | OBUF_LVCMOS18_F_12 | OBUF_LVCMOS33_F_12 |
| OBUF_F_8          | OBUF_LVCMOS18_F_16 | OBUF_LVCMOS33_F_16 |
| OBUF_F_12         | OBUF_LVCMOS25      | OBUF_LVCMOS33_F_24 |
| OBUF_F_16         | OBUF_LVCMOS25_S_2  | OBUF_PCI33_3       |
| OBUF_F_24         | OBUF_LVCMOS25_S_4  | OBUF_PCI66-3       |
| OBUF_LVCMOS15     | OBUF_LVCMOS25_S_6  | OBUF_PCIX          |
| OBUF_LVCMOS15_S_2 | OBUF_LVCMOS25_S_8  | OBUF_GTL           |
| OBUF_LVCMOS15_S_4 | OBUF_LVCMOS25_S_12 | OBUF_GTLP          |
| OBUF_LVCMOS15_S_6 | OBUF_LVCMOS25_S_16 | OBUF_HSTL_I        |

|                    |                    | -/            |
|--------------------|--------------------|---------------|
| OBUF_LVCMOS15_S_8  | OBUF_LVCMOS25_S_24 | OBUF_HSTL_II  |
| OBUF_LVCMOS15_S_12 | OBUF_LVCMOS25_F_2  | OBUF_HSTL_III |
| OBUF_LVCMOS15_S_16 | OBUF_LVCMOS25_F_4  | OBUF_HSTL_IV  |
| OBUF_LVCMOS15_F_2  | OBUF_LVCMOS25_F_6  | OBUF_SSTL3_I  |
| OBUF_LVCMOS15_F_4  | OBUF_LVCMOS25_F_8  | OBUF_SSTL3_II |
| OBUF_LVCMOS15_F_6  | OBUF_LVCMOS25_F_12 | OBUF_SSTL2_I  |
| OBUF_LVCMOS15_F_8  | OBUF_LVCMOS25_F_16 | OBUF_SSTL2_II |
| OBUF_LVCMOS15_F_12 | OBUF_LVCMOS25_F_24 | OBUF_AGP      |
| OBUF_LVCMOS15_F_16 | OBUF_LVCMOS33      |               |
| OBUF_LVCMOS18      | OBUF_LVCMOS33_S_2  |               |
|                    |                    |               |

 Table 2-33:
 Variations of the OBUF Symbol (Continued)

OBUF placement restrictions require that within a given  $V_{CCO}$  bank each OBUF share the same output source drive voltage. Input buffers with the same  $V_{CCO}$  and output buffers that do not require  $V_{CCO}$  can be placed within any  $V_{CCO}$  bank. Table 2-34 summarizes Virtex-II output compatibility requirements. The LOC property can specify a location for the OBUF.

Table 2-34: Output Standards Compatibility Requirements

| Rule 1 | Only outputs with standards which share compatible $\mathrm{V}_{\mathrm{CCO}}$ can be used within the same bank. |
|--------|------------------------------------------------------------------------------------------------------------------|
| Rule 2 | There are no placement restrictions for outputs with standards that do not require a $\mathrm{V}_{\mathrm{CCO}}$ |

Each bank has its own  $V_{CCO}$  voltage. Details on compatible output standards for each  $V_{CCO}$  voltage combination are available in the <u>Virtex-II Data Sheet (DS031)</u>.

### OBUFT

The generic 3-state output buffer OBUFT, shown in Figure 2-81, typically implements 3-state outputs or bidirectional I/O.



Figure 2-81: 3-State Output Buffer Symbol (OBUFT)

The extension to the base name defines which I/O standard OBUFT uses. With no extension specified for the generic OBUFT symbol, the assumed standard is slew rate limited LVTTL with 12mA drive strength.

The LVTTL and LVCMOS OBUFTs additionally can support one of two slew rate modes to minimize bus transients. By default, the slew rate for each output buffer is reduced to minimize power bus transients, when switching non-critical signals.

LVTTL and LVCMOS 3-state buffers have selectable drive strengths. The format for these OBUFT symbol names is as follows:

OBUFT\_<slew\_rate>\_<drive\_strength>

<slew\_rate> is either F(fast) or S(slow) and <drive\_strength> is specified in milliamperes. For LVTTL, LVCMOS25, and LVCMOS33, the supported drive strengths are 2, 4, 6, 8, 12, 16, and 24. For LVCMOS15 and LVCMOS18, the supported drive strengths are 2, 4, 6, 8, 12, and 16.

Table 2-35 details variations of the OBUFT symbol.

|                     | • • = • • • • • • • • • • |                     |
|---------------------|---------------------------|---------------------|
| OBUFT               | OBUFT_LVCMOS18_S_2        | OBUFT_LVCMOS33_S_4  |
| OBUFT_S_2           | OBUFT_LVCMOS18_S_4        | OBUFT_LVCMOS33_S_6  |
| OBUFT_S_4           | OBUFT_LVCMOS18_S_6        | OBUFT_LVCMOS33_S_8  |
| OBUFT_S_6           | OBUFT_LVCMOS18_S_8        | OBUFT_LVCMOS33_S_12 |
| OBUFT_S_8           | OBUFT_LVCMOS18_S_12       | OBUFT_LVCMOS33_S_16 |
| OBUFT_S_12          | OBUFT_LVCMOS18_S_16       | OBUFT_LVCMOS33_S_24 |
| OBUFT_S_16          | OBUFT_LVCMOS18_F_2        | OBUFT_LVCMOS33_F_2  |
| OBUFT_S_24          | OBUFT_LVCMOS18_F_4        | OBUFT_LVCMOS33_F_4  |
| OBUFT_F_2           | OBUFT_LVCMOS18_F_6        | OBUFT_LVCMOS33_F_6  |
| OBUFT_F_4           | OBUFT_LVCMOS18_F_8        | OBUFT_LVCMOS33_F_8  |
| OBUFT_F_6           | OBUFT_LVCMOS18F_12        | OBUFT_LVCMOS33_F_12 |
| OBUFT_F_8           | OBUFT_LVCMOS18_F_16       | OBUFT_LVCMOS33_F_16 |
| OBUFT_F_12          | OBUFT_LVCMOS25            | OBUFT_LVCMOS33_F_24 |
| OBUFT_F_16          | OBUFT_LVCMOS25_S_2        | OBUFT_PCI33_3       |
| OBUFT_F_24          | OBUFT_LVCMOS25_S_4        | OBUFT_PCI66-3       |
| OBUFT_LVCMOS15      | OBUFT_LVCMOS25_S_6        | OBUFT_PCIX          |
| OBUFT_LVCMOS15_S_2  | OBUFT_LVCMOS25_S_8        | OBUFT_GTL           |
| OBUFT_LVCMOS15_S_4  | OBUFT_LVCMOS25_S_12       | OBUFT_GTLP          |
| OBUFT_LVCMOS15_S_6  | OBUFT_LVCMOS25_S_16       | OBUFT_HSTL_I        |
| OBUFT_LVCMOS15_S_8  | OBUFT_LVCMOS25_S_24       | OBUFT_HSTL_II       |
| OBUFT_LVCMOS15_S_12 | OBUFT_LVCMOS25_F_2        | OBUFT_HSTL_III      |
| OBUFT_LVCMOS15_S_16 | OBUFT_LVCMOS25_F_4        | OBUFT_HSTL_IV       |
| OBUFT_LVCMOS15_F_2  | OBUFT_LVCMOS25_F_6        | OBUFT_SSTL3_I       |
| OBUFT_LVCMOS15_F_4  | OBUFT_LVCMOS25_F_8        | OBUFT_SSTL3_II      |
| OBUFT_LVCMOS15_F_6  | OBUFT_LVCMOS25_F_12       | OBUFT_SSTL2_I       |
| OBUFT_LVCMOS15_F_8  | OBUFT_LVCMOS25_F_16       | OBUFT_SSTL2_II      |
| OBUFT_LVCMOS15_F_12 | OBUFT_LVCMOS25_F_24       | OBUFT_AGP           |
| OBUFT_LVCMOS15_F_16 | OBUFT_LVCMOS33            |                     |
| OBUFT_LVCMOS18      | OBUFT_LVCMOS33_S_2        |                     |

Table 2-35: Variations of the OBUFT Symbol

OBUFT placement restrictions require that within a given V<sub>CCO</sub> bank each OBUFT share the same output source drive voltage. Input buffers with the same V<sub>CCO</sub> and output buffers that do not require V<sub>CCO</sub> can be placed within any V<sub>CCO</sub> bank. The LOC property can specify a location for the OBUFT.

3-state output buffers and bidirectional buffers can have either a weak pull-up resistor, a weak pull-down resistor, or a weak "keeper" circuit. Control this feature by adding the appropriate symbol to the output net of the OBUFT (PULLUP, PULLDOWN, or KEEPER).

The weak "keeper" circuit requires the input buffer within the IOB to sample the I/O signal. Thus, OBUFTs programmed for an I/O standard that requires a  $V_{REF}$  have

automatic placement of a  $V_{REF}$  in the bank with an OBUFT configured with a weak "keeper" typically implement a bidirectional I/O. In this case, the IBUF (and the corresponding  $V_{REF}$ ) are placed explicitly.

### IOBUF

Use the IOBUF symbol for bidirectional signals that require both an input buffer and a 3-state output buffer with an active High 3-state pin. Figure 2-82 shows the generic input/output IOBUF buffer.



2

Figure 2-82: Input/Output Buffer Symbol (IOBUF)

The extension to the base name defines which I/O standard the IOBUF uses. With no extension specified for the generic IOBUF symbol, the assumed standard is LVTTL input buffer and slew rate limited LVTTL with 12mA drive strength for the output buffer.

The LVTTL and LVCMOS IOBUFs can additionally support one of two slew rate modes to minimize bus transients. By default, the slew rate for each output buffer is reduced to minimize power bus transients, when switching non-critical signals.

LVTTL and LVCMOS output buffers have selectable drive strengths. The format for these IOBUF symbol names is as follows:

#### IOBUF\_<slew\_rate>\_<drive\_strength>

<slew\_rate> is either F (fast) or S (slow) and <drive\_strength> is specified in milliamperes. For LVTTL, LVCMOS25 and LVCMOS33, the supported drive strengths are 2, 4, 6, 8, 12, 16, and 24. For LVCMOS15, and LVCMOS18, the supported drive strengths are 2, 4, 6, 8, 12, and 16. Table 2-36 details variations of the IOBUF symbol.

| IOBUF      | IOBUF_LVCMOS18      | IOBUF_LVCMOS33      |
|------------|---------------------|---------------------|
| IOBUF_S_2  | IOBUF_LVCMOS18_S_2  | IOBUF_LVCMOS33_S_2  |
| IOBUF_S_4  | IOBUF_LVCMOS18_S_4  | IOBUF_LVCMOS33_S_4  |
| IOBUF_S_6  | IOBUF_LVCMOS18_S_6  | IOBUF_LVCMOS33_S_6  |
| IOBUF_S_8  | IOBUF_LVCMOS18_S_8  | IOBUF_LVCMOS33_S_8  |
| IOBUF_S_12 | IOBUF_LVCMOS18_S_12 | IOBUF_LVCMOS33_S_12 |
| IOBUF_S_16 | IOBUF_LVCMOS18_S_16 | IOBUF_LVCMOS33_S_16 |
| IOBUF_S_24 | IOBUF_LVCMOS18_F_2  | IOBUF_LVCMOS33_S_24 |
| IOBUF_F_2  | IOBUF_LVCMOS18_F_4  | IOBUF_LVCMOS33_F_2  |
| IOBUF_F_4  | IOBUF_LVCMOS18_F_6  | IOBUF_LVCMOS33_F_4  |
| IOBUF_F_6  | IOBUF_LVCMOS18_F_8  | IOBUF_LVCMOS33_F_6  |

Table 2-36: Variations of the IOBUF Symbol

|                     | <i>,</i> ,          |                     |
|---------------------|---------------------|---------------------|
| IOBUF_F_8           | IOBUF_LVCMOS18F_12  | IOBUF_LVCMOS33_F_8  |
| IOBUF_F_12          | IOBUF_LVCMOS18_F_16 | IOBUF_LVCMOS33_F_12 |
| IOBUF_F_16          | IOBUF_LVCMOS25      | IOBUF_LVCMOS33_F_16 |
| IOBUF_F_24          | IOBUF_LVCMOS25_S_2  | IOBUF_LVCMOS33_F_24 |
| IOBUF_LVCMOS15      | IOBUF_LVCMOS25_S_4  | IOBUF_PCI33_3       |
| IOBUF_LVCMOS15_S_2  | IOBUF_LVCMOS25_S_6  | IOBUF_PCI66-3       |
| IOBUF_LVCMOS15_S_4  | IOBUF_LVCMOS25_S_8  | IOBUF_PCIX          |
| IOBUF_LVCMOS15_S_6  | IOBUF_LVCMOS25_S_12 | IOBUF_GTL           |
| IOBUF_LVCMOS15_S_8  | IOBUF_LVCMOS25_S_16 | IOBUF_GTLP          |
| IOBUF_LVCMOS15_S_12 | IOBUF_LVCMOS25_S_24 | IOBUF_HSTL_II       |
| IOBUF_LVCMOS15_S_16 | IOBUF_LVCMOS25_F_2  | IOBUF_HSTL_IV       |
| IOBUF_LVCMOS15_F_2  | IOBUF_LVCMOS25_F_4  | IOBUF_AGP           |
| IOBUF_LVCMOS15_F_4  | IOBUF_LVCMOS25_F_6  |                     |
| IOBUF_LVCMOS15_F_6  | IOBUF_LVCMOS25_F_8  |                     |
| IOBUF_LVCMOS15_F_8  | IOBUF_LVCMOS25_F_12 |                     |
| IOBUF_LVCMOS15_F_12 | IOBUF_LVCMOS25_F_16 |                     |
| IOBUF_LVCMOS15_F_16 | IOBUF_LVCMOS25_F_24 |                     |
|                     |                     |                     |

Table 2-36: Variations of the IOBUF Symbol (Continued)

When the IOBUF symbol supports an I/O standard that requires a differential amplifier input, IOBUF is automatically configured as a differential amplifier input buffer. Low-voltage I/O standards with a differential amplifier input require an external reference voltage input  $V_{REF}$ .

The voltage reference signal is "banked" within the Virtex-II device on a half-edge basis, such that for all packages there are eight independent  $V_{REF}$  banks internally. For a representation of the Virtex-II I/O banks, see Figure 2-79. Within each bank approximately one of every twelve I/O pins is automatically configured as a  $V_{REF}$  input. After placing a differential amplifier input signal within a given  $V_{REF}$  bank, the same external source must drive all I/O pins configured as a  $V_{REF}$  input.

IOBUF placement restrictions require any differential amplifier input signals within a bank be of the same standard.

Additional restrictions on Virtex-II SelectI/O-Ultra IOBUF placement require that within a given V<sub>CCO</sub> bank each IOBUF share the same output source drive voltage. Input buffers with the same V<sub>CCO</sub> and output buffers that do not require V<sub>CCO</sub> can be placed within any V<sub>CCO</sub> bank. The LOC property can specify a location for the IOBUF.

An optional delay element is associated with the input path in each IOBUF. When the IOBUF drives an input flip-flop within the IOB, the delay element is activated by default to ensure the zero hold-time requirement. Override this default with the IOBDELAY = NONE property.

In the case when the IOBUF does not drive an input flip-flop within the IOB, the delay element is deactivated by default to provide higher performance. To delay the input signal, activate the delay element with the IOBDELAY = BOTH property.

3-state output buffers and bidirectional buffers can have a weak pull-up resistor, a weak pull-down resistor, or a weak "keeper" circuit. Control this feature by adding the appropriate symbol to the output net of the IOBUF (PULLUP, PULLDOWN, or KEEPER).

## Selectl/O-Ultra Properties

Access to some SelectI/O-Ultra features (for example, location constraints, input delay, output drive strength, and slew rate) is available through properties associated with these features.

### **Input Delay Properties**

An optional delay element is associated with the input path in each IBUF. When the IBUF drives an input flip-flop within the IOB, the delay element activates by default to ensure the zero hold-time requirement. Override this default with the IOBDELAY = NONE property.

In the case when the IBUF does not drive an input flip-flop within the IOB, the delay element is deactivated by default to provide higher performance. To delay the input signal, activate the delay element with the IOBDELAY = BOTH property.

## **IOB Flip-Flop/Latch Properties**

The Virtex-II series I/O block (IOB) includes two optional registers on the input path, two optional registers on the output path, and two optional registers on the 3-state control pin. The design implementation software automatically takes advantage of these registers when the following option for the MAP program is specified.

### Map -pr b <filename>

Alternatively, the IOB = TRUE property can be placed on a register to force the mapper to place the register in an IOB.

The two registers for each path makes designing double-data-rate (DDR) logic much simpler. Each pair of the registers has separate clock inputs, which can be driven by either the positive edge or the negative edge of the clock. Users can use both edges of the clocks to clock data in and out from the IOB. For details on DDR, see "Using Double-Data-Rate (DDR) I/O" on page 203.

## Location Constraints

Specify the location of each SelectI/O-Ultra symbol with the location constraint LOC attached to the SelectI/O-Ultra symbol. The external port identifier indicates the value of the location constrain. The format of the port identifier depends on the package chosen for the specified design.

The LOC properties use the following form:

- LOC=A42;
- LOC=P37;

# **Output Slew Rate Property**

As mentioned above, a variety of symbol names provide the option of choosing the desired slew rate for the output buffers. In the case of the LVTTL or LVCMOS output buffers (OBUF, OBUFT, and IOBUF), slew rate control can be alternatively programmed with the SLEW = property. By the default, the slew rate for each output buffer is reduced to minimize power bus transients when switching non-critical signals. The SLEW = property has one of the two following values:

- SLEW = SLOW
- SLEW = FAST

# **Output Drive Strength Property**

The desired output drive strength can be additionally specified by choosing the appropriate library symbol. The Xilinx library also provides an alternative method for specifying this feature. For the LVTTL, and LVCMOS output buffers (OBUF, OBUFT, and

IOBUF), the desired drive strength can be specified with the DRIVE = property. This property could have one of the following values:

- DRIVE = 2
- DRIVE = 4
- DRIVE = 6
- DRIVE = 8
- DRIVE = 12
- DRIVE = 16
- DRIVE = 24

# **Design Considerations**

### Reference Voltage (V<sub>REF</sub>) Pins

Low-voltage I/O standards with a differential amplifier input buffer require an input reference voltage ( $V_{REF}$ ). Provide the  $V_{REF}$  as an external signal to the device.

The voltage reference signal is "banked" within the Virtex-II device on a half-edge basis such that for all packages there are eight independent  $V_{REF}$  banks internally. See Figure 2-79 for a representation of the Virtex-II I/O banks. Within each bank approximately one of every twelve I/O pins is automatically configured as a  $V_{REF}$  input. After placing a differential amplifier input signal within a given  $V_{REF}$  bank, the same external source must drive all I/O pins configured as a  $V_{REF}$  input.

Within each  $V_{REF}$  bank, any input buffers that require a  $V_{REF}$  signal must be of the same type. Output buffers that have the same  $V_{CCO}$  values as the input buffers can be placed within the same  $V_{REF}$  bank.

## Output Drive Source Voltage (V<sub>CCO</sub>) Pins

Many of the low-voltage I/O standards supported by SelectI/O-Ultra devices require a different output drive source voltage ( $V_{CCO}$ ). As a result each device can often have to support multiple output drive source voltages.

Output buffers within a given V<sub>CCO</sub> bank must share the same output drive source voltage. Input buffers for LVTTL, LVCMOS15, LVCMOS18, LVCMOS25, LVCMOS33, PCI33\_3, PCI66\_3, PCIX use the V<sub>CCO</sub> voltage for input V<sub>CCO</sub> voltage.

### **Transmission Line Effects**

The delay of an electrical signal along a wire is dominated by the rise and fall times when the signal travels a short distance. Transmission line delays vary with inductance and capacitance. But a well-designed board can experience delays of approximately 180ps per inch. Transmission line effects, or reflections, typically start at 1.5" for fast (1.5ns) rise and fall times. Poor (or non-existent) termination or changes in the transmission line impedance cause these reflections and can cause additional delay in longer traces. As a system speeds continue to increase, the effect of I/O delays can become a limiting factor and therefore transmission line termination becomes increasingly more important.

### **Termination Techniques**

A variety of termination techniques reduce the impact of transmission line effects.

The following are output termination techniques:

- None
- Series
- Parallel (Shunt)
- Series and Parallel (Series-Shunt)

The following are input termination techniques:

- None
- Parallel (Shunt)

These termination techniques can be applied in any combination. A generic example of each combination of termination methods appears in Figure 2-83.



*Figure 2-83:* **Overview of Standard Input and Output Termination Methods** 

# Simultaneous Switching Output (SSO) Guidelines

Ground bounce can occur with high-speed digital integrated circuits when multiple outputs change states simultaneously, causing undesired transient behavior on an output or in the internal logic. This problem is also referred to as the Simultaneous Switching Output (SSO) problem.

Ground bounce is primarily due to current changes in the combined inductance of ground pins, bond wires, and group metallization. The IC internal ground level deviates from the external system ground level for a short duration (a few nanoseconds) after multiple outputs change state simultaneously.

Ground bounce affects stable low outputs and all inputs because they interpret the incoming signal by comparing it to the internal ground. If the ground bounce amplitude exceeds the actual instantaneous noise margin, then a non-changing input can be interpreted as changing. Table 2-37 provides the guidelines for the maximum number of simultaneously switching outputs allowed per output power/ground pair to avoid the effects of ground bounce. Refer to Table 2-38 for the number of effective output power/ground pairs for each Virtex-II device and package combination.

## Note on LVDCI Drivers

All entries in the SSO table for LVDCI assume  $50\Omega$  reference resistors. To calculate the SSO limit for an LVDCI driver with different resistors, the following formula is used:

Higher Reference Resistor (RR) Value = Less Drive Strength = Higher SSO Limit

SSO Limit for New RR = (RR Value /  $50\Omega$ ) × (SSO Limit for  $50\Omega$ )

### Example:

If using the LVDCI\_18 driver with  $65\Omega$  reference resistors, the LVDCI\_18 SSO limit is taken from the table. The SSO limit for LVDCI\_18 at  $50\Omega$  is 11 per power/ground pin pair. The SSO limit for LVDCI\_18 at  $65\Omega$  is  $(65\Omega / 50\Omega) \times 11 = 14$  per power/ground pin pair.

| Standard                           | Package     |    |           |           |  |  |
|------------------------------------|-------------|----|-----------|-----------|--|--|
| Standard                           | FG,BG,FF,BF | CS | XC2V40-FG | XC2V40-CS |  |  |
| LVTTL2_slow                        | 68          | 51 | 51        | 34        |  |  |
| LVTTL4_slow                        | 41          | 31 | 31        | 21        |  |  |
| LVTTL6_slow                        | 29          | 22 | 22        | 15        |  |  |
| LVTTL8_slow                        | 22          | 17 | 17        | 11        |  |  |
| LVTTL12_slow                       | 15          | 11 | 11        | 8         |  |  |
| LVTTL16_slow                       | 11          | 8  | 8         | 6         |  |  |
| LVTTL24_slow                       | 7           | 5  | 5         | 4         |  |  |
| LVTTL2_fast                        | 40          | 30 | 30        | 20        |  |  |
| LVTTL4_fast                        | 24          | 18 | 18        | 12        |  |  |
| LVTTL6_fast                        | 17          | 13 | 13        | 9         |  |  |
| LVTTL8_fast                        | 13          | 10 | 10        | 7         |  |  |
| LVTTL12_fast                       | 10          | 8  | 8         | 5         |  |  |
| LVTTL16_fast                       | 8           | 6  | 6         | 4         |  |  |
| LVTTL24_fast                       | 5           | 4  | 4         | 3         |  |  |
| LVDCI_15 50 $\Omega$ impedance     | 10          | 8  | 8         | 5         |  |  |
| LVDCI_DV2_15 25 $\Omega$ impedance | 5           | 4  | 4         | 3         |  |  |
| LVCMOS15_2_slow                    | 51          | 38 | 38        | 26        |  |  |
| LVCMOS15_4_slow                    | 31          | 23 | 23        | 16        |  |  |
| LVCMOS15_6_slow                    | 22          | 17 | 17        | 11        |  |  |
| LVCMOS15_8_slow                    | 17          | 13 | 13        | 9         |  |  |
| LVCMOS15_12_slow                   | 11          | 8  | 8         | 6         |  |  |
| LVCMOS15_16_slow                   | 8           | 6  | 6         | 4         |  |  |
| LVCMOS15_2_fast                    | 30          | 23 | 23        | 15        |  |  |
| LVCMOS15_4_fast                    | 18          | 14 | 14        | 9         |  |  |
| LVCMOS15_6_fast                    | 13          | 10 | 10        | 7         |  |  |
| LVCMOS15_8_fast                    | 10          | 8  | 8         | 5         |  |  |
| LVCMOS15_12_fast                   | 8           | 6  | 6         | 4         |  |  |
| LVCMOS15_16_fast                   | 6           | 5  | 5         | 3         |  |  |
| LVDCI_18 50 $\Omega$ impedance     | 11          | 8  | 8         | 6         |  |  |
| LVDCI_DV2_18 25 $\Omega$ impedance | 6           | 4  | 4         | 3         |  |  |
| LVCMOS18_2_slow                    | 58          | 44 | 44        | 29        |  |  |
| LVCMOS18_4_slow                    | 35          | 26 | 26        | 18        |  |  |

*Table 2-37:* Guidelines for Max Number of Simultaneously Switching Outputs per Power/Ground Pair

| Standard                           | Package     |    |           |           |  |  |
|------------------------------------|-------------|----|-----------|-----------|--|--|
| Standard                           | FG,BG,FF,BF | CS | XC2V40-FG | XC2V40-CS |  |  |
| LVCMOS18_6_slow                    | 25          | 19 | 19        | 13        |  |  |
| LVCMOS18_8_slow                    | 19          | 14 | 14        | 10        |  |  |
| LVCMOS18_12_slow                   | 13          | 10 | 10        | 7         |  |  |
| LVCMOS18_16_slow                   | 10          | 8  | 8         | 5         |  |  |
| LVCMOS18_2_fast                    | 34          | 26 | 26        | 17        |  |  |
| LVCMOS18_4_fast                    | 20          | 15 | 15        | 10        |  |  |
| LVCMOS18_6_fast                    | 15          | 11 | 11        | 8         |  |  |
| LVCMOS18_8_fast                    | 11          | 8  | 8         | 6         |  |  |
| LVCMOS18_12_fast                   | 9           | 7  | 7         | 5         |  |  |
| LVCMOS18_16_fast                   | 7           | 5  | 5         | 4         |  |  |
| LVDCI_25 50 $\Omega$ impedance     | 13          | 10 | 10        | 7         |  |  |
| LVDCI_DV2_25 25 $\Omega$ impedance | 7           | 5  | 5         | 3         |  |  |
| LVCMOS25_2_slow                    | 68          | 51 | 51        | 34        |  |  |
| LVCMOS25_4_slow                    | 41          | 31 | 31        | 21        |  |  |
| LVCMOS25_6_slow                    | 29          | 22 | 22        | 15        |  |  |
| LVCMOS25_8_slow                    | 22          | 17 | 17        | 11        |  |  |
| LVCMOS25_12_slow                   | 15          | 11 | 11        | 8         |  |  |
| LVCMOS25_16_slow                   | 11          | 8  | 8         | 6         |  |  |
| LVCMOS25_24_slow                   | 7           | 5  | 5         | 4         |  |  |
| LVCMOS25_2_fast                    | 40          | 30 | 30        | 20        |  |  |
| LVCMOS25_4_fast                    | 24          | 18 | 18        | 12        |  |  |
| LVCMOS25_6_fast                    | 17          | 13 | 13        | 9         |  |  |
| LVCMOS25_8_fast                    | 13          | 10 | 10        | 7         |  |  |
| LVCMOS25_12_fast                   | 10          | 8  | 8         | 5         |  |  |
| LVCMOS25_16_fast                   | 8           | 6  | 6         | 4         |  |  |
| LVCMOS25_24_fast                   | 5           | 4  | 4         | 2         |  |  |
| LVDCI_33 50 Ω impedance            | 13          | 10 | 10        | 7         |  |  |
| LVDCI_DV2_33 25 Ω impedance        | 7           | 5  | 5         | 3         |  |  |
| LVCMOS33_2_slow                    | 68          | 51 | 51        | 34        |  |  |
| LVCMOS33_4_slow                    | 41          | 31 | 31        | 21        |  |  |
| LVCMOS33_6_slow                    | 29          | 22 | 22        | 15        |  |  |
| LVCMOS33_8_slow                    | 22          | 17 | 17        | 11        |  |  |

Table 2-37: Guidelines for Max Number of Simultaneously Switching Outputs per Power/Ground Pair (Continued)

| Chandard         | Package     |    |           |           |  |  |
|------------------|-------------|----|-----------|-----------|--|--|
| Standard         | FG,BG,FF,BF | CS | XC2V40-FG | XC2V40-CS |  |  |
| LVCMOS33_12_slow | 15          | 11 | 11        | 8         |  |  |
| LVCMOS33_16_slow | 11          | 8  | 8         | 6         |  |  |
| LVCMOS33_24_slow | 7           | 5  | 5         | 4         |  |  |
| LVCMOS33_2_fast  | 40          | 30 | 30        | 20        |  |  |
| LVCMOS33_4_fast  | 24          | 18 | 18        | 12        |  |  |
| LVCMOS33_6_fast  | 17          | 13 | 13        | 9         |  |  |
| LVCMOS33_8_fast  | 13          | 10 | 10        | 7         |  |  |
| LVCMOS33_12_fast | 10          | 8  | 8         | 5         |  |  |
| LVCMOS33_16_fast | 8           | 6  | 6         | 4         |  |  |
| LVCMOS33_24_fast | 5           | 4  | 4         | 2         |  |  |
| PCI33/66/X       | 8           | 6  | 6         | 4         |  |  |
| GTL              | 4           | 3  | 3         | 2         |  |  |
| GTL_DCI          | 3           | 2  | 2         | 1         |  |  |
| GTL+             | 4           | 3  | 3         | 2         |  |  |
| GTL+_DCI         | 3           | 2  | 2         | 1         |  |  |
| HSTLI            | 20          | 15 | 15        | 10        |  |  |
| HSTLI_DCI        | 20          | 15 | 15        | 10        |  |  |
| HSTLII           | 10          | 8  | 8         | 5         |  |  |
| HSTLII_DCI       | 7           | 5  | 5         | 4         |  |  |
| HSTLIII          | 8           | 6  | 6         | 4         |  |  |
| HSTLIII_DCI      | 8           | 6  | 6         | 4         |  |  |
| HSTLIV           | 4           | 3  | 3         | 2         |  |  |
| HSTLIV_DCI       | 4           | 3  | 3         | 2         |  |  |
| SSTL2I           | 15          | 11 | 11        | 8         |  |  |
| SSTL2I_DCI       | 15          | 11 | 11        | 8         |  |  |
| SSTL2II          | 10          | 8  | 8         | 5         |  |  |
| SSTL2II_DCI      | 5           | 4  | 4         | 3         |  |  |
| SSTL3I           | 12          | 9  | 9         | 6         |  |  |
| SSTL3I_DCI       | 12          | 9  | 9         | 6         |  |  |
| SSTL3II          | 8           | 6  | 6         | 4         |  |  |
| SSTL3II_DCI      | 4           | 3  | 3         | 2         |  |  |
| AGP              | 9           | 7  | 7         | 5         |  |  |

*Table 2-37:* Guidelines for Max Number of Simultaneously Switching Outputs per Power/Ground Pair (Continued)

| Paakaga               | XC2V Device |    |     |     |      |      |      |      |      |      |      |
|-----------------------|-------------|----|-----|-----|------|------|------|------|------|------|------|
| Гаскауе               | 40          | 80 | 250 | 500 | 1000 | 1500 | 2000 | 3000 | 4000 | 6000 | 8000 |
| CS144 <sup>(1)</sup>  | 1           | 1  | 1   | -   | -    | -    | -    | -    | -    | -    | -    |
| FG256 <sup>(1)</sup>  | 1           | 2  | 3   | 3   | 3    | -    | -    | -    | -    | -    | -    |
| FG456 <sup>(1)</sup>  | -           | -  | 3   | 4   | 5    | -    | -    | -    | -    | -    | -    |
| FG676 <sup>(1)</sup>  | -           | -  | -   | -   | -    | 6    | 7    | 7    | -    | -    | -    |
| BG575 <sup>(1)</sup>  | -           | -  | -   | -   | 5    | 6    | 6    | -    | -    | -    | -    |
| BG728 <sup>(1)</sup>  | -           | -  | -   | -   | -    | -    | -    | 8    | -    | -    | -    |
| FF896 <sup>(2)</sup>  | -           | -  | -   | -   | 7    | 8    | 10   | -    | -    | -    | -    |
| FF1152 <sup>(2)</sup> | -           | -  | -   | -   | -    | -    | -    | 11   | 13   | 13   | 13   |
| FF1517 <sup>(2)</sup> | -           | -  | -   | -   | -    | -    | -    | -    | 14   | 17   | 17   |
| BF957 <sup>(2)</sup>  | -           | -  | -   | -   | -    | -    | 10   | 10   | 10   | 11   | -    |

*Table 2-38:* Virtex-II Equivalent Power/Ground Pairs per Bank

Notes:

1. Wire-bond only.

2. Flip-chip only.

### Application Example

Creating a design with the SelectI/O-Ultra feature requires either assignment of the IOSTANDARD attribute in the constraint file or instantiation of the desired library symbol within the design code.

To enter the IOSTANDARD attribute in the constraint file (UCF file), the following syntax can be used:

NET <pad net name> IOSTANDARD=<the name of the standard>

For example, to enter PCIX standard, use

NET <pad net name> IOSTANDARD=PCIX;

To instantiate a library symbol in the HDL code, use the proper input or output buffer name, and follow the standard syntax of instantiation.

For example, to instantiate a GTL input buffer in VHDL, the following syntax can be used:

GTL\_buffer : IBUF\_GTL port map (I=>data\_in, O=>data\_gtl\_in);

At the board level, designers need to know the termination techniques required for each I/O standard.

This section describes some common application examples illustrating the termination techniques recommended by each of the single-ended standard supported by the SelectI/O-Ultra features.

#### Termination Example

Circuit examples involving typical termination techniques for each of the SelectI/O-Ultra standards follow. For a full range of accepted values for the DC voltage specifications for each standard, refer to the table associated with each figure.

The resistors used in each termination technique example and the transmission lines depicted represent board level components and are not meant to represent components on the device.

# GTL

A sample circuit illustrating a valid termination technique for GTL is shown in Figure 2-84.



Figure 2-84: GTL Terminated

 Table 2-39 lists DC voltage specifications.

Table 2-39: GTL Voltage Specifications

| Parameter                                        | Min  | Тур  | Мах  |
|--------------------------------------------------|------|------|------|
| V <sub>CCO</sub>                                 | -    | N/A  | -    |
| $V_{\text{REF}} = N \times V_{\text{TT}}^{(1)}$  | 0.74 | 0.8  | 0.86 |
| V <sub>TT</sub>                                  | 1.14 | 1.2  | 1.26 |
| $V_{IH} \ge V_{REF} + 0.05$                      | 0.79 | 0.85 | -    |
| $V_{IL} \le V_{REF} - 0.05$                      | -    | 0.75 | 0.81 |
| V <sub>OH</sub>                                  | -    | -    | -    |
| V <sub>OL</sub>                                  | -    | 0.2  | 0.4  |
| I <sub>OH</sub> at V <sub>OH</sub> (mA)          | -    | -    | -    |
| $I_{OL} at V_{OL} \mbox{ (mA)} at 0.4 \mbox{ V}$ | 32   | -    | -    |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) at 0.2 V | -    | -    | 40   |

#### Notes:

1. N must be greater than or equal to 0.653 and less than or equal to 0.68.

# GTL +

Figure 2-85 shows a sample circuit illustrating a valid termination technique for GTL+.



*Figure 2-85:* **GTL+ Terminated** 

 Table 2-40 lists DC voltage specifications.

Table 2-40: GTL+ Voltage Specifications

| Parameter                                       | Min  | Тур  | Мах  |
|-------------------------------------------------|------|------|------|
| V <sub>CCO</sub>                                | -    | -    | -    |
| $V_{REF} = N \times V_{TT}^{(1)}$               | 0.88 | 1.0  | 1.12 |
| V <sub>TT</sub>                                 | 1.35 | 1.5  | 1.65 |
| $V_{IH} \ge V_{REF} + 0.1$                      | 0.98 | 1.1  | -    |
| $V_{IL} \le V_{REF} - 0.1$                      | -    | 0.9  | 1.02 |
| V <sub>OH</sub>                                 | -    | -    | -    |
| V <sub>OL</sub>                                 | 0.3  | 0.45 | 0.6  |
| I <sub>OH</sub> at V <sub>OH</sub> (mA)         | -    | -    | -    |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) at 0.6V | 36   | -    | -    |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) at 0.3V | -    | -    | 48   |

Notes:

1. N must be greater than or equal to 0.653 and less than or equal to 0.68.

## **HSTL Class I**

Figure 2-86 shows a sample circuit illustrating a valid termination technique for HSTL\_I.

### HSTL Class I



Figure 2-86: Terminated HSTL Class I

 Table 2-41 lists DC voltage specifications.

Table 2-41: HSTL Class I Voltage Specification

|                                         | • •                    |                      |                 |
|-----------------------------------------|------------------------|----------------------|-----------------|
| Parameter                               | MIN                    | ТҮР                  | MAX             |
| V <sub>CCO</sub>                        | 1.40                   | 1.50                 | 1.60            |
| V <sub>REF</sub>                        | 0.68                   | 0.75                 | 0.90            |
| V <sub>TT</sub>                         | -                      | $V_{CCO} \times 0.5$ | -               |
| V <sub>IH</sub>                         | V <sub>REF</sub> + 0.1 | -                    | -               |
| V <sub>IL</sub>                         | -                      | -                    | $V_{REF} - 0.1$ |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                    | -               |
| V <sub>OL</sub>                         | -                      | -                    | 0.4             |
| $I_{\mbox{OH}}$ at $V_{\mbox{OH}}$ (mA) | -8                     | -                    | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 8                      | -                    | -               |
|                                         |                        |                      |                 |

# HSTL Class II

Figure 2-87 shows a sample circuit illustrating a valid termination technique for HSTL\_II.





Figure 2-87: Terminated HSTL Class II

Table 2-42 lists DC voltage specifications.

| Table 2-42: | HSTL | Class II | Voltage | Specification |
|-------------|------|----------|---------|---------------|
|-------------|------|----------|---------|---------------|

| Parameter                               | MIN                    | ТҮР                  | МАХ             |
|-----------------------------------------|------------------------|----------------------|-----------------|
| V <sub>CCO</sub>                        | 1.40                   | 1.50                 | 1.60            |
| V <sub>REF</sub> <sup>(1)</sup>         | -                      | 0.75                 | -               |
| V <sub>TT</sub>                         | -                      | $V_{CCO} \times 0.5$ | -               |
| V <sub>IH</sub>                         | $V_{REF} + 0.1$        | -                    | -               |
| V <sub>IL</sub>                         | -                      | -                    | $V_{REF} - 0.1$ |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                    | -               |
| V <sub>OL</sub>                         | -                      | -                    | 0.4             |
| $I_{\mbox{OH}}$ at $V_{\mbox{OH}}$ (mA) | -16                    | -                    | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 16                     | -                    | -               |

#### Notes:

 Per EIA/JESD8-6, "The value of V<sub>REF</sub> is to be selected by the user to provide optimum noise margin in the use conditions specified by the user."

### HSTL Class III

Figure 2-88 shows a sample circuit illustrating a valid termination technique for HSTL\_III.

HSTL Class III



Figure 2-88: Terminated HSTL Class III

 Table 2-43 lists DC voltage specifications.

Table 2-43: HSTL Class III Voltage Specification

| Darameter                               | MIN                    | TVD              | ΜΑΧ             |
|-----------------------------------------|------------------------|------------------|-----------------|
| Faiailletei                             | IVIIIN                 | 11F              | INAA            |
| V <sub>CCO</sub>                        | 1.40                   | 1.50             | 1.60            |
| V <sub>REF</sub> <sup>(1)</sup>         | -                      | 0.90             | -               |
| V <sub>TT</sub>                         | -                      | V <sub>CCO</sub> | -               |
| V <sub>IH</sub>                         | $V_{REF} + 0.1$        | -                | -               |
| V <sub>IL</sub>                         | -                      | -                | $V_{REF} - 0.1$ |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                | -               |
| V <sub>OL</sub>                         | -                      | -                | 0.4             |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -8                     | -                | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 24                     | -                | -               |

#### Notes:

1. Per EIA/JESD8-6, "The value of  $V_{REF}$  is to be selected by the user to provide optimum noise margin in the use conditions specified by the user."

### HSTL Class IV

Figure 2-89 shows a sample circuit illustrating a valid termination technique for HSTL\_IV.

#### HSTL Class IV



Figure 2-89: Terminated HSTL Class IV

Table 2-44 lists DC voltage specifications.

#### Table 2-44: HSTL Class IV Voltage Specification

| Parameter                               | MIN                    | ТҮР              | MAX             |
|-----------------------------------------|------------------------|------------------|-----------------|
| V <sub>CCO</sub>                        | 1.40                   | 1.50             | 1.60            |
| V <sub>REF</sub>                        | -                      | 0.90             | -               |
| V <sub>TT</sub>                         | -                      | V <sub>CCO</sub> | -               |
| V <sub>IH</sub>                         | V <sub>REF</sub> + 0.1 | -                | -               |
| V <sub>IL</sub>                         | -                      | -                | $V_{REF} - 0.1$ |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                | -               |
| V <sub>OL</sub>                         | -                      | -                | 0.4             |
| $I_{OH}$ at $V_{OH}$ (mA)               | -8                     | -                | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 48                     | -                | -               |

#### Notes:

1. Per EIA/JESD8-6, "The value of  $V_{REF}$  is to be selected by the user to provide optimum noise margin in the use conditions specified by the user.

# HSTL Class I (1.8V)

Figure 2-90 shows a sample circuit illustrating a valid termination technique for HSTL\_I.





Figure 2-90: Terminated HSTL Class I (1.8V)

Table 2-45 lists DC voltage specifications.

Table 2-45: HSTL Class I (1.8V) Voltage Specification

| Parameter                               | MIN                    | ТҮР                  | MAX             |  |
|-----------------------------------------|------------------------|----------------------|-----------------|--|
| V <sub>CCO</sub>                        | 1.7                    | 1.8                  | 1.9             |  |
| V <sub>REF</sub>                        | 0.8                    | 0.9                  | 1.1             |  |
| V <sub>TT</sub>                         | -                      | $V_{CCO} \times 0.5$ | -               |  |
| V <sub>IH</sub>                         | V <sub>REF</sub> + 0.1 | -                    | -               |  |
| V <sub>IL</sub>                         | -                      | -                    | $V_{REF} - 0.1$ |  |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                    | -               |  |
| V <sub>OL</sub>                         | -                      | -                    | 0.4             |  |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -8                     | -                    | -               |  |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 8                      | -                    | -               |  |

# HSTL Class II (1.8V)

Figure 2-91 shows a sample circuit illustrating a valid termination technique for HSTL\_II.

### HSTL Class II (1.8V)



Figure 2-91: Terminated HSTL Class II (1.8V)

 Table 2-46 lists DC voltage specifications.

#### Table 2-46: HSTL Class II (1.8V) Voltage Specification

| Parameter MIN                           |                        | ТҮР                  | MAX             |  |
|-----------------------------------------|------------------------|----------------------|-----------------|--|
| V <sub>CCO</sub>                        | 1.7                    | 1.8                  | 1.9             |  |
| V <sub>REF</sub> <sup>(1)</sup>         | -                      | 0.9                  | -               |  |
| V <sub>TT</sub>                         | -                      | $V_{CCO} \times 0.5$ | -               |  |
| V <sub>IH</sub>                         | $V_{REF} + 0.1$        | -                    | -               |  |
| V <sub>IL</sub>                         | -                      | -                    | $V_{REF} - 0.1$ |  |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                    | -               |  |
| V <sub>OL</sub>                         | -                      | -                    | 0.4             |  |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -16                    | -                    | -               |  |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 16                     | -                    | -               |  |
|                                         |                        |                      |                 |  |

#### Notes:

1. Per EIA/JESD8-6, "The value of  $V_{REF}$  is to be selected by the user to provide optimum noise margin in the use conditions specified by the user."

## HSTL Class III (1.8V)

Figure 2-92 shows a sample circuit illustrating a valid termination technique for HSTL\_III.

#### HSTL Class III (1.8V)



Figure 2-92: Terminated HSTL Class III (1.8V)

Table 2-47 lists DC voltage specifications.

Table 2-47: HSTL Class III (1.8V) Voltage Specification

| Parameter                               | MIN                    | ТҮР              | MAX             |
|-----------------------------------------|------------------------|------------------|-----------------|
| V <sub>CCO</sub>                        | 1.7                    | 1.8              | 1.9             |
| V <sub>REF</sub> <sup>(1)</sup>         | -                      | 1.1              | -               |
| V <sub>TT</sub>                         | -                      | V <sub>CCO</sub> | -               |
| V <sub>IH</sub>                         | $V_{REF} + 0.1$        | -                | -               |
| V <sub>IL</sub>                         | -                      | -                | $V_{REF} - 0.1$ |
| V <sub>OH</sub>                         | V <sub>CCO</sub> – 0.4 | -                | -               |
| V <sub>OL</sub>                         | -                      | -                | 0.4             |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -8                     | -                | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 24                     | -                | -               |

#### Notes:

1. Per EIA/JESD8-6, "The value of  $V_{REF}$  is to be selected by the user to provide optimum noise margin in the use conditions specified by the user."

2

# HSTL Class IV (1.8V)

Figure 2-93 shows a sample circuit illustrating a valid termination technique for HSTL\_IV.





Figure 2-93: Terminated HSTL Class IV (1.8V)

Table 2-48 lists DC voltage specifications.

#### Table 2-48: HSTL Class IV (1.8V) Voltage Specification

| Parameter                               | MIN             | ТҮР              | МАХ             |
|-----------------------------------------|-----------------|------------------|-----------------|
| V <sub>CCO</sub>                        | 1.7             | 1.8              | 1.9             |
| V <sub>REF</sub>                        | -               | 1.1              | -               |
| V <sub>TT</sub>                         | -               | V <sub>CCO</sub> | -               |
| V <sub>IH</sub>                         | $V_{REF} + 0.1$ | -                | -               |
| V <sub>IL</sub>                         | -               | -                | $V_{REF} - 0.1$ |
| V <sub>OH</sub>                         | $V_{CCO} - 0.4$ | -                | -               |
| V <sub>OL</sub>                         | -               | -                | 0.4             |
| $I_{OH}$ at $V_{OH}$ (mA)               | -8              | -                | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 48              | -                | -               |

Notes:

1. Per EIA/JESD8-6, "The value of  $V_{REF}$  is to be selected by the user to provide optimum noise margin in the use conditions specified by the user.
## SSTL3\_I

Figure 2-94 shows a sample circuit illustrating a valid termination technique for SSTL3\_I.



Figure 2-94: Terminated SSTL3\_I

Table 2-49 lists DC voltage specifications.

### Table 2-49: SSTL3\_I Voltage Specifications

| Parameter                                 | Min      | Тур | Max     |
|-------------------------------------------|----------|-----|---------|
| V <sub>CCO</sub>                          | 3.0      | 3.3 | 3.6     |
| $V_{REF} = 0.45 \times V_{CCO}$           | 1.3      | 1.5 | 1.7     |
| $V_{TT} = V_{REF}$                        | 1.3      | 1.5 | 1.7     |
| $V_{IH} \ge V_{REF} + 0.2$                | 1.5      | 1.7 | 3.9 (1) |
| $V_{IL} \le V_{REF} - 0.2$                | -0.3 (2) | 1.3 | 1.5     |
| $V_{OH} \ge V_{REF} + 0.6^{(3)}$          | 1.9      | 2.1 | -       |
| $V_{OL} \le V_{REF} - 0.6$ <sup>(3)</sup> | -        | 0.9 | 1.1     |
| I <sub>OH</sub> at V <sub>OH</sub> (mA)   | -8       | -   | -       |
| I <sub>OL</sub> at V <sub>OL</sub> (mA)   | 8        | -   | _       |

### Notes:

1.  $V_{IH}$  maximum is  $V_{CCO} + 0.3$ 

- 2.  $V_{IL}$  minimum does not conform to the formula
- 3. V<sub>OH</sub> and V<sub>OL</sub> values are different for SSTL3\_I\_DCI, which uses a controlled impedance driver.

### SSTL3\_II

Figure 2-95 shows a sample circuit illustrating a valid termination technique for SSTL3\_II.



Figure 2-95: Terminated SSTL3\_II

 Table 2-50 lists DC voltage specifications.

Table 2-50: SSTL3\_II Voltage Specifications

| Parameter                                 | Min      | Тур | Max                |
|-------------------------------------------|----------|-----|--------------------|
| V <sub>CCO</sub>                          | 3.0      | 3.3 | 3.6                |
| $V_{REF} = 0.45 \times V_{CCO}$           | 1.3      | 1.5 | 1.7                |
| $V_{TT} = V_{REF}$                        | 1.3      | 1.5 | 1.7                |
| $V_{IH} \ge V_{REF} + 0.2$                | 1.5      | 1.7 | 3.9 <sup>(1)</sup> |
| $V_{IL} \le V_{REF} - 0.2$                | -0.3 (2) | 1.3 | 1.5                |
| $V_{OH} \ge V_{REF} + 0.8$ <sup>(3)</sup> | 2.1      | 2.3 | -                  |
| $V_{OL} \le V_{REF} - 0.8$ <sup>(3)</sup> | -        | 0.7 | 0.9                |
| I <sub>OH</sub> at V <sub>OH</sub> (mA)   | -16      | -   | -                  |
| I <sub>OL</sub> at V <sub>OL</sub> (mA)   | 16       | -   | -                  |

Notes:

1.  $V_{IH}$  maximum is  $V_{CCO}$  + 0.3.

- 2. V<sub>IL</sub> minimum does not conform to the formula.
- 3.  $V_{OH}$  and  $V_{OL}$  values are different for SSTL3\_II\_DCI, which uses a controlled impedance driver.

### SSTL2\_I

Figure 2-96 shows a sample circuit illustrating a valid termination technique for SSTL2\_I.



Figure 2-96: Terminated SSTL2\_I

 Table 2-51 lists DC voltage specifications.

Table 2-51: SSTL2\_I Voltage Specifications

| Parameter                                    | Min      | Тур  | Max     |
|----------------------------------------------|----------|------|---------|
| V <sub>CCO</sub>                             | 2.3      | 2.5  | 2.7     |
| $V_{\text{REF}} = 0.5 \times V_{\text{CCO}}$ | 1.15     | 1.25 | 1.35    |
| $V_{TT} = V_{REF} + N^{(1)}$                 | 1.11     | 1.25 | 1.39    |
| $V_{IH} \ge V_{REF} + 0.18$                  | 1.33     | 1.43 | 3.0 (2) |
| $V_{IL} \le V_{REF} - 0.18$                  | -0.3 (3) | 1.07 | 1.17    |
| $V_{OH} \ge V_{REF} + 0.61$ <sup>(4)</sup>   | 1.76     | 1.82 | 1.96    |
| $V_{OL} \le V_{REF} - 0.61$ <sup>(4)</sup>   | 0.54     | 0.64 | 0.74    |
| I <sub>OH</sub> at V <sub>OH</sub> (mA)      | -7.6     | -    | -       |
| I <sub>OL</sub> at V <sub>OL</sub> (mA)      | 7.6      | -    | -       |

### Notes:

- 1. N must be greater than or equal to -0.04 and less than or equal to 0.04.
- 2.  $V_{IH}$  maximum is  $V_{CCO}$  + 0.3.
- 3.  $V_{IL}$  minimum does not conform to the formula.
- 4. V<sub>OH</sub> and V<sub>OL</sub> values are different for SSTL2\_I\_DCI, which uses a controlled impedance driver.

# SSTL2\_II

Figure 2-97 shows a sample circuit illustrating a valid termination technique for SSTL2\_II.



Figure 2-97: Terminated SSTL2\_II

 Table 2-52 lists DC voltage specifications.

### Table 2-52: SSTL2\_II Voltage Specifications

| Parameter                                    | Min      | Тур  | Max     |
|----------------------------------------------|----------|------|---------|
| V <sub>CCO</sub>                             | 2.3      | 2.5  | 2.7     |
| $V_{\text{REF}} = 0.5 \times V_{\text{CCO}}$ | 1.15     | 1.25 | 1.35    |
| $V_{TT} = V_{REF} + N^{(1)}$                 | 1.11     | 1.25 | 1.39    |
| $V_{IH} \ge V_{REF} + 0.18$                  | 1.33     | 1.43 | 3.0 (2) |
| $V_{IL} \le V_{REF} - 0.18$                  | -0.3 (3) | 1.07 | 1.17    |
| $V_{OH} \ge V_{REF} + 0.8$ <sup>(4)</sup>    | 1.95     | 2.05 | -       |
| $V_{OL} \le V_{REF} - 0.8$ <sup>(4)</sup>    | -        | 0.45 | 0.55    |
| I <sub>OH</sub> at V <sub>OH</sub> (mA)      | -15.2    | -    | -       |
| I <sub>OL</sub> at V <sub>OL</sub> (mA)      | 15.2     | _    | _       |

### Notes:

- 1. N must be greater than or equal to -0.04 and less than or equal to 0.04.
- 2.  $V_{IH}$  maximum is  $V_{CCO} + 0.3$ .
- 3.  $V_{IL}$  minimum does not conform to the formula.
- $4. \quad V_{OH} \ \text{and} \ V_{OL} \ \text{values are different for SSTL2\_II\_DCI, which uses a controlled impedance driver.}$

# PCI33\_3, PCI66\_3, and PCIX

Table 2-53 lists DC voltage specifications.

| Table 2-53: PCI33 | _3, PCI66_ | _3, and PCIX | Voltage S | pecifications |
|-------------------|------------|--------------|-----------|---------------|
|-------------------|------------|--------------|-----------|---------------|

| Parameter                               | Min    | Тур  | Max             |
|-----------------------------------------|--------|------|-----------------|
| V <sub>CCO</sub>                        | 3.0    | 3.3  | 3.5             |
| V <sub>REF</sub>                        | -      | -    | -               |
| V <sub>TT</sub>                         | -      | -    | -               |
| $V_{IH} = 0.5 \times V_{CCO}$           | 1.5    | 1.65 | $V_{CCO} + 0.5$ |
| $V_{IL} = 0.3 \times V_{CCO}$           | - 0.5  | 0.99 | 1.08            |
| $V_{OH} = 0.9 \times V_{CCO}$           | 2.7    | -    | -               |
| $V_{OL} = 0.1 \times V_{CCO}$           | -      | -    | 0.36            |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | Note 1 | -    | -               |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | Note 1 | _    | -               |

### Notes:

1. Tested according to the relevant specification.

# LVTTL

Table 2-54 lists DC voltage specifications.

Table 2-54: LVTTL Voltage Specifications

| Parameter                               | Min  | Тур | Мах |
|-----------------------------------------|------|-----|-----|
| V <sub>CCO</sub>                        | 3.0  | 3.3 | 3.6 |
| V <sub>REF</sub>                        | -    | -   | -   |
| V <sub>TT</sub>                         | -    | -   | -   |
| V <sub>IH</sub>                         | 2.0  | -   | 3.6 |
| V <sub>IL</sub>                         | -0.5 | -   | 0.8 |
| V <sub>OH</sub>                         | 2.4  | -   | -   |
| V <sub>OL</sub>                         | -    | -   | 0.4 |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -24  | -   | -   |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 24   | -   | -   |

### Notes:

1.  $V_{OL}$  and  $V_{OH}$  for lower drive currents are sample tested.

# LVCMOS15

Table 2-55 lists DC voltage specifications.

| Parameter                               | Min  | Тур  | Мах  |
|-----------------------------------------|------|------|------|
| V <sub>CCO</sub>                        | -    | 1.5  | -    |
| V <sub>REF</sub>                        | -    | -    | -    |
| V <sub>TT</sub>                         | -    | -    | -    |
| $V_{IH} = 0.7 \times V_{CCO}$           | 1.05 | -    | 1.65 |
| $V_{IL} = 0.2 \times V_{CCO}$           | -0.5 | -    | 0.3  |
| $V_{OH} = V_{CCO} - 0.45$               | -    | 1.05 | -    |
| V <sub>OL</sub>                         | -    | -    | 0.4  |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -16  | -    | -    |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 16   | -    | -    |

Table 2-55: LVCMOS15 Voltage Specifications

# LVCMOS18

Table 2-56 lists DC voltage specifications.

| Table 2-56: | LVCMOS18 | Voltage | Specifications |
|-------------|----------|---------|----------------|
|-------------|----------|---------|----------------|

| Parameter                               | Min  | Тур | Max  |
|-----------------------------------------|------|-----|------|
| V <sub>CCO</sub>                        | 1.7  | 1.8 | 1.9  |
| V <sub>REF</sub>                        | -    | -   | -    |
| V <sub>TT</sub>                         | -    | -   | -    |
| $V_{IH} = 0.7 \times V_{CCO}$           | 1.19 | -   | 1.95 |
| $V_{IL} = 0.2 \times V_{CCO}$           | -0.5 | -   | 0.4  |
| $V_{OH} = V_{CCO} - 0.4$                | 1.3  | -   | -    |
| V <sub>OL</sub>                         | -    | -   | 0.4  |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -16  | -   | -    |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 16   | -   | -    |

# LVCMOS25

Table 2-57 lists DC voltage specifications.

| Parameter                               | Min  | Тур | Мах |
|-----------------------------------------|------|-----|-----|
| V <sub>CCO</sub>                        | 2.3  | 2.5 | 2.7 |
| V <sub>REF</sub>                        | -    | -   | -   |
| V <sub>TT</sub>                         | -    | -   | -   |
| V <sub>IH</sub>                         | 1.7  | -   | 2.7 |
| V <sub>IL</sub>                         | -0.5 | -   | 0.7 |
| V <sub>OH</sub>                         | 1.9  | -   | -   |
| V <sub>OL</sub>                         | -    | -   | 0.4 |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -24  | -   | -   |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 24   | -   | -   |

Table 2-57: LVCMOS25 Voltage Specifications

### LVCMOS33

 Table 2-58 lists DC voltage specifications.

Table 2-58: LVCMOS33 Voltage Specifications

| Parameter                               | Min  | Тур | Max |
|-----------------------------------------|------|-----|-----|
| V <sub>CCO</sub>                        | 3.0  | 3.3 | 3.6 |
| V <sub>REF</sub>                        | -    | -   | -   |
| V <sub>TT</sub>                         | -    | -   | -   |
| V <sub>IH</sub>                         | 2.0  | -   | 3.6 |
| V <sub>IL</sub>                         | -0.5 | -   | 0.8 |
| V <sub>OH</sub>                         | 2.6  | -   | -   |
| V <sub>OL</sub>                         | -    | -   | 0.4 |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | -24  | -   | -   |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | 24   | -   | -   |

# AGP-2X

Table 2-59 lists DC voltage specifications.

| Parameter                               | Min    | Тур  | Max  |
|-----------------------------------------|--------|------|------|
| V <sub>CCO</sub>                        | 3.0    | 3.3  | 3.6  |
| $V_{REF} = N \times V_{CCO}^{(1)}$      | 1.17   | 1.32 | 1.48 |
| V <sub>TT</sub>                         | -      | -    | -    |
| $V_{IH} \ge V_{REF} + 0.2$              | 1.37   | 1.52 | -    |
| $V_{IL} \le V_{REF} - 0.2$              | -      | 1.12 | 1.28 |
| $V_{OH} = 0.9 \times V_{CCO}$           | 2.7    | 3.0  | -    |
| $V_{OL} = 0.1 \times V_{CCO}$           | -      | 0.33 | 0.36 |
| I <sub>OH</sub> at V <sub>OH</sub> (mA) | Note 2 | -    | -    |
| I <sub>OL</sub> at V <sub>OL</sub> (mA) | Note 2 | _    | -    |

Notes:

- 1. N must be greater than or equal to 0.39 and less than or equal to 0.41.
- 2. Tested according to the relevant specification.

# 5V Tolerance in Virtex-II Devices

Virtex-II I/Os are not 5V tolerant without adding an external current-limiting resistor. Each Virtex-II I/O has a pair of clamp diodes that connects to  $V_{CCO}$  and GND, as shown in the Virtex-II *Data Sheet*.

However, for LVCMOS/LVTTL I/O standards, a workaround for this problem is to use a resistor in series to limit the current into the clamp diode. (This works only for a higher voltage that is driving Virtex-II input. It does not work for a Virtex-II output or bidirectional signal, because the resulting  $V_{OH}$  is lower than the specification of the other device.) Given that the forward-bias voltage of the clamp diode is 0.5V, and the limit that any I/O pin can be overdriven above or below the limits of GND and  $V_{CCO}$  is 10 mA, the following is a calculation example:

Assume that driving 5V from a device to the LVCMOS33 input in a Virtex-II device is wanted:

- V<sub>CCO</sub> minimum for LVCMOS33 is 3.0V.
- V<sub>OH</sub> maximum for the LVCMOS5 is 5.25V.

This means that the diode is forward-biased at a voltage of 3.0V + 0.5V = 3.5V or higher.

- The maximum voltage difference is 5.25V 3.5V = 1.75V.
- To limit the current to 10 mA, a resistor (R) of 1.75/10 mA = 175 ohms is needed.

### Notes:

- 1. The minimum V<sub>CCO</sub> value is specified in this section.
- 2. The minimum  $V_{CCO}$  was used for the worst-case calculation.
- 3. A stable, clean, and properly bypassed V<sub>CCO</sub> must be maintained at all times.
- 4. Placing the resistor closer to the driver provides better signal integrity.
- 5. IBIS simulation can be performed to verify the result.
- 6. The clamp diodes are always there (programmed, unprogrammed, during configuration), and there is never a need to add external clamp diodes in the event that the Virtex-II device is unprogrammed and has 5V signals driving the pins.

# **Using Digitally Controlled Impedance (DCI)**

# Introduction

As FPGAs get bigger and system clock speeds get faster, PCB board design and manufacturing has become more difficult. With ever faster edge rates, maintaining signal integrity becomes a critical issue. Designers must make sure that most PC board traces are terminated properly to avoid reflections or ringing.

To terminate a trace, resistors are traditionally added to make the output and/or input match the impedance of the receiver or driver to the impedance of the trace. However, due to the increase in the device I/O counts, adding resistors close to the device pins increases the board area and component count and might even be physically impossible. To address these issues and to achieve better signal integrity, Xilinx developed a new I/O technology for the Virtex-II device family, Digitally Controlled Impedance (DCI).

DCI adjusts the output impedance or input termination to accurately match the characteristic impedance of the transmission line. DCI actively adjusts the impedance of the I/O to equal an external reference resistance. This compensates for changes in I/O impedance due to process variation. It also continuously adjusts the impedance of the I/O to compensate for variations of temperature and supply voltage fluctuations.

In the case of controlled impedance drivers, DCI controls the driver impedance to match two reference resistors, or optionally, to match half the value of these reference resistors. DCI eliminates the need for external series termination resistors.

DCI provides parallel or series termination for transmitters or receivers. This eliminates the need for termination resistors on the board, reduces board routing difficulties and component count, and improves signal integrity by eliminating stub reflection. Stub reflection occurs when termination resistors are located too far from the end of the transmission line. With DCI, the termination resistors are as close as possible to the output driver or the input buffer, thus, eliminating stub reflections completely.

# Xilinx DCI

DCI uses two multi-purpose reference pins in each bank to control the impedance of the driver or the parallel termination value for all of the I/Os of that bank. The N reference pin (VRN) must be pulled up to  $V_{CCO}$  by a reference resistor, and the P reference pin (VRP) must be pulled down to ground by another reference resistor. The value of each reference resistor should be equal to the characteristic impedance of the PC board traces, or should be twice that value (configuration option).

When a DCI I/O standard is used on a particular bank, the two multi-purpose reference pins cannot be used as regular I/Os. However, if DCI I/O standards are not used in the bank, these pins are available as regular I/O pins. Check the Virtex-II pinout for detailed pin descriptions.

DCI adjusts the impedance of the I/O by selectively turning transistors in the I/Os on or off. The impedance is adjusted to match the external reference resistors. The impedance adjustment process has two phases. The first phase, which compensates for process variations, is done during the device startup sequence. The second phase, which maintains the impedance in response to temperature and supply voltage changes, begins immediately after the first phase and continues indefinitely, even while the part is operating. By default, the DONE pin does not go High until the first phase of the impedance adjustment process has completed. If users do not want to have the second phase enabled, they should use the FreezeDCI option in BitGen. If FreezeDCI is used, phase one is the only impedance adjustment.

For controlled impedance output drivers, the impedance can be adjusted either to match the reference resistors or half the resistance of the reference resistors. For on-chip termination, the termination is always adjusted to match the reference resistors. DCI can configure output drivers to be the following types:

- 1. Controlled Impedance Driver (Source Termination)
- 2. Controlled Impedance Driver with Half Impedance (Source Termination)

It can also configure inputs to have he following types of on-chip terminations:

- 1. Termination to V<sub>CCO</sub> (Single Termination)
- 2. Termination to V<sub>CCO</sub>/2 (Split Termination, Thevenin equivalent)

For bidirectional operation, both ends of the line can be DCI-terminated permanently:

- 1. Termination to V<sub>CCO</sub> (Single Termination)
- 2. Termination to V<sub>CCO</sub>/2 (Split Termination, Thevenin equivalent)

Alternatively, bidirectional point-to-point lines can use controlled-impedance drivers (with 3-state buffers) on both ends.

### Controlled Impedance Driver (Source Termination)

Some I/O standards, such as LVTTL, LVCMOS, etc., must have a drive impedance that matches the characteristic impedance of the driven line. DCI can provide a controlled impedance output drivers that eliminate reflections without an external source termination. The impedance is set by the external reference resistors, whose resistance should be equal to the trace impedance. Figure 2-98 illustrates a controlled impedance driver inside Virtex-II device. The DCI I/O standards that support Controlled Impedance Driver are: LVDCI\_15, LVDCI\_18, LVDCI\_25, and LVDCI\_33.



Figure 2-98: Controlled Impedance Driver

### Controlled Impedance Driver With Half Impedance (Source Termination)

DCI can also provide drivers with one half of the impedance of the reference resistors. The DCI I/O standards that support controlled impedance driver with half impedance are: LVDCI\_DV2\_15, LVDCI\_DV2\_18, LVDCI\_DV2\_25, and LVDCI\_DV2\_33

Figure 2-99 illustrates a controlled driver with half impedance inside a Virtex-II device.

Note that to match the drive impedance to Z0 when using a driver with half impedance, reference resistor R must be 2(Z0).



Figure 2-99: Controlled Impedance Driver With Half Impedance

www.xilinx.com 1-800-255-7778

# Termination to V<sub>CCO</sub> (Single Termination)

Some I/O standards, such as HSTL Class III, IV, etc., require an input termination to V<sub>CCO</sub>. See Figure 2-100.



Figure 2-100: Single Termination Without DCI

DCI can provide this termination to  $V_{CCO}$  using single termination. The termination resistance is set by the reference resistors. For GTL and HSTL standards, they should be controlled by 50-ohm reference resistors. The DCI I/O standards that support single termination are: GTL\_DCI, GTLP\_DCI, HSTL\_III\_DCI, and HSTL\_IV\_DCI.

Figure 2-101 illustrates single termination inside a Virtex-II device.



Figure 2-101: Single Termination Using DCI

## Termination to $V_{CCO}/2$ (Split Termination)

Some I/O standards, such as HSTL Class I, II, SSTL3\_I, etc., require an input termination voltage of  $V_{CCO}/2$ . See Figure 2-102.



2



This is equivalent to having a split termination composed of two resistors. One terminates to  $V_{CCO}$ , the other to ground. The resistor values are 2R. DCI provides termination to  $V_{CCO}/2$  using split termination. The termination resistance is set by the external reference resistors, i.e., the resistors to  $V_{CC}$  and ground are each twice the reference resistor value. If users are planning to use HSTL or SSTL standards, the reference resistors should be 50-ohms. The DCI I/O standards that support split termination are: HSTL\_I\_DCI, HSTL\_II\_DCI, SSTL2\_I\_DCI, SSTL2\_II\_DCI, SSTL3\_I\_DCI, and SSTL3\_II\_DCI.

Figure 2-103 illustrates split termination inside a Virtex-II device.



Figure 2-103: Split Termination Using DCI

### **Driver With Single Termination**

Some I/O standards, such as HSTL Class IV, require an output termination to  $V_{CCO}$ . Figure 2-104 illustrates the output termination to  $V_{CCO}$ .



Figure 2-104: Driver With Single Termination Without DCI

DCI can provide this termination to  $V_{CCO}$  using single termination. In this case, DCI only controls the impedance of the termination, but not the driver. If users are planning to use GTL or HSTL standards, the external reference resistors should be 50-ohms. The DCI I/O standards that support a driver with single termination are: GTL\_DCI, GTLP\_DCI, and HSTL\_IV\_DCI.

Figure 2-105 illustrates a driver with single termination inside a Virtex-II device



Figure 2-105: Driver With Single Termination Using DCI

### **Driver With Split Termination**

Some I/O standards, such as HSTL Class II, require an output termination to  $V_{CCO}/2$ . See Figure 2-106.



Figure 2-106: Driver With Split Terminating

DCI can provide this termination to  $V_{CCO}/2$  using split termination. It only controls the impedance of the termination, but not the driver. For HSTL or SSTL standards, the external reference resistors should be 50-ohms. The DCI I/O standards that support a Driver with split termination are: HSTL\_II\_DCI, SSTL2\_II\_DCI, and SSTL3\_II\_DCI.

Figure 2-107 illustrates a driver with split termination inside a Virtex-II device.



Figure 2-107: Driver With Split Termination Using DCI

# Software Support

This section lists the valid DCI I/O buffer library components and describes how to use DCI in the Xilinx software.

# DCI I/O Buffer Library Components

The DCI input buffer library components, including global clock buffer, are the following:

- IBUFG\_GTLP\_DCI
- IBUFG\_GTL\_DCI
- IBUFG\_HSTL\_I\_DCI
- IBUFG\_HSTL\_II\_DCI
- IBUFG\_HSTL\_III\_DCI
- IBUFG\_HSTL\_IV\_DCI
- IBUFG\_LVDCI\_15
- IBUFG\_LVDCI\_18
- IBUFG\_LVDCI\_25
- IBUFG\_LVDCI\_33
- IBUFG\_LVDCI\_DV2\_15
- IBUFG\_LVDCI\_DV2\_18
- IBUFG\_LVDCI\_DV2\_25
- IBUFG\_LVDCI\_DV2\_33
- IBUFG\_SSTL2\_I\_DCI
- IBUFG\_SSTL2\_II\_DCI
- IBUFG\_SSTL3\_I\_DCI
- IBUFG\_SSTL3\_II\_DCI
- IBUF\_GTLP\_DCI
- IBUF\_GTL\_DCI
- IBUF\_HSTL\_I\_DCI
- IBUF\_HSTL\_II\_DCI
- IBUF\_HSTL\_III\_DCI
- IBUF\_HSTL\_IV\_DCI
- IBUF\_LVDCI\_15
- IBUF\_LVDCI\_18
- IBUF\_LVDCI\_25
- IBUF\_LVDCI\_33
- IBUF\_LVDCI\_DV2\_15
- IBUF\_LVDCI\_DV2\_18
- IBUF\_LVDCI\_DV2\_25
- IBUF\_LVDCI\_DV2\_33
- IBUF\_SSTL2\_I\_DCI
- IBUF\_SSTL2\_II\_DCI
- IBUF\_SSTL3\_I\_DCI
- IBUF\_SSTL3\_II\_DCI

The following are DCI output buffer library components:

- OBUF\_GTLP\_DCI
- OBUF\_GTL\_DCI
- OBUF\_HSTL\_I\_DCI
- OBUF\_HSTL\_II\_DCI
- OBUF\_HSTL\_III\_DCI
- OBUF\_HSTL\_IV\_DCI
- OBUF\_LVDCI\_15
- OBUF\_LVDCI\_18
- OBUF\_LVDCI\_25
- OBUF\_LVDCI\_33
- OBUF\_LVDCI\_DV2\_15
- OBUF\_LVDCI\_DV2\_18
- OBUF\_LVDCI\_DV2\_25
- OBUF\_LVDCI\_DV2\_33
- OBUF\_SSTL2\_I\_DCI
- OBUF\_SSTL2\_II\_DCI
- OBUF\_SSTL3\_I\_DCI
- OBUF\_SSTL3\_II\_DCI

The following are DCI 3 state output buffer library components:

- OBUFT\_GTLP\_DCI
- OBUFT\_GTL\_DCI
- OBUFT\_HSTL\_I\_DCI
- OBUFT\_HSTL\_II\_DCI
- OBUFT\_HSTL\_III\_DCI
- OBUFT\_HSTL\_IV\_DCI
- OBUFT\_LVDCI\_15
- OBUFT\_LVDCI\_18
- OBUFT\_LVDCI\_25
- OBUFT\_LVDCI\_33
- OBUFT\_LVDCI\_DV2\_15
- OBUFT\_LVDCI\_DV2\_18
- OBUFT\_LVDCI\_DV2\_25
- OBUFT\_LVDCI\_DV2\_33
- OBUFT\_SSTL2\_I\_DCI
- OBUFT\_SSTL2\_II\_DCI
- OBUFT\_SSTL3\_I\_DCI
- OBUFT\_SSTL3\_II\_DCI

The following are DCI I/O buffer library components:

- IOBUF\_GTLP\_DCI
- IOBUF\_GTL\_DCI
- IOBUF\_HSTL\_II\_DCI
- IOBUF\_HSTL\_IV\_DCI
- IOBUF\_SSTL2\_II\_DCI
- IOBUF\_SSTL3\_II\_DCI
- IOBUF\_LVDCI\_15
- IOBUF\_LVDCI\_18
- IOBUF\_LVDCI\_25
- IOBUF\_LVDCI\_33
- IOBUF\_LVDCI\_DV2\_15
- IOBUF\_LVDCI\_DV2\_18
- IOBUF\_LVDCI\_DV2\_25
- IOBUF\_LVDCI\_DV2\_33

### How to Use DCI in the Software

There are two ways for users to use DCI for Virtex-II devices:

- 1. Use the IOSTANDARD attribute in the constraint file.
- 2. Instantiate DCI input or output buffers in the HDL code.

### **IOSTANDARD** Attribute

The IOSTANDARD attribute can be entered through the NCF or UCF file. The syntax is as follows:

NET <net name> IOSTANDARD = LVDCI\_25;

Where <net name> is the name between the IPAD and IBUF or OPAD or OBUF. For HDL designs, this name is the same as the port name.

The following are valid DCI attributes for output drivers:

- LVDCI\_15
- LVDCI\_18
- LVDCI\_25
- LVDCI\_33
- LVDCI\_DV2\_15
- LVDCI\_DV2\_18
- LVDCI\_DV2\_25
- LVDCI\_DV2\_33

The following are valid DCI attributes for terminations:

- GTL\_DCI
- GTLP\_DCI
- HSTL\_I\_DCI
- HSTL\_II\_DCI

- HSTL\_III\_DCI
- HSTL\_IV\_DCI
- SSTL2\_I\_DCI
- SSTL2\_II\_DCI
- SSTL3\_I\_DCI
- SSTL3\_II\_DCI

### **VHDL Example**

Instantiating DCI input and output buffers is the same as instantiating any other I/O buffers. Users must make sure that the correct I/O buffer names are used and follow the standard syntax of instantiation.

For example, to instantiate a HSTL Class I output DCI buffer, the following syntax can be used:

HSTL\_DCI\_buffer: OBUF\_HSTL\_I\_DCI port map (I=>data\_out, O=>data\_out\_DCI); Below is an example VHDL code that instantiates four 2.5 V LVDCI drivers and four HSTL Class I outputs.

```
-- Module: DCI TEST
- -
-- Description: VHDL example for DCI SelectI/O-Ultra
-- Device: Virtex-II Family
-----
                                library ieee;
use ieee.std logic 1164.all;
use ieee.std logic unsigned.all;
entity dci_test is
port (clk, reset, ce, control : in std logic;
 A, B : in std logic vector (3 downto 0);
 Dout : out std_logic_vector (3 downto 0);
 muxout : out std logic vector (3 downto 0));
end dci_test;
architecture dci arch of dci test is
--DCI output buffer component declaration
component OBUF LVDCI 25 port (I : in std logic; 0 : out std logic);
end component;
attribute syn black box of OBUF LVDCI 25 : component is true;
attribute black box pad pin of OBUF LVDCI 25 : component is "O";
--HSTL Class I DCI output buffer component declaration
component OBUF HSTL I DCI port (I : in std logic; O: out std logic);
end component;
attribute syn black box of OBUF HSTL I DCI : component is true;
attribute black box pad pin of OBUF HSTL I DCI : component is "O";
signal muxout_int : std_logic_vector (3 downto 0);
signal dout int : std logic vector (3 downto 0);
begin
process (clk, reset)
begin
 if (reset = '1') then
        dout int<="0000";</pre>
   elsif (clk'event and clk='1') then
```

```
dout_int<=dout_int+1;</pre>
end if;
end process;
process (controls, A, B, DOUT_INT)
begin
 if (control='1') then
   muxout int<=A and B;</pre>
else
   muxout_int<=Dout_int;</pre>
 end if;
end process;
U0 : OBUF LVDCI 25 port map(
 I=>dout_int(0),
 O=>dout(0));
U1 : OBUF_LVDCI_25 port map(
      I=>dout_int(1),
      O=>dout(1));
U2 : OBUF_LVDCI_25 port map(
      I=>dout int(2),
      O=>dout(2));
U3 : OBUF_LVDCI_25 port map(
      I=>dout_int(3),
      O=>dout(3));
K0 : OBUF_HSTL_I_DCI port map(
 I=>muxout int(0),
 O=>muxout(0));
K1 : OBUF HSTL I DCI port map(
      I=>muxout int(1),
      O = > muxout(1));
K2 : OBUF_HSTL_I_DCI port map(
      I=>muxout int(2),
      O=>muxout(2));
K3 : OBUF_HSTL_I_DCI port map(
      I=>muxout_int(3),
      O = > muxout(3));
```

end dci\_arch;

# DCI in Virtex-II Hardware

DCI only works with certain single-ended I/O standards and does not work with any differential I/O standard. DCI supports the following Virtex-II standards:

LVDCI, LVDCI\_DV2, GTL\_DCI, GTLP\_DCI, HSTL\_I\_DCI, HSTL\_II\_DCI, HSTL\_III\_DCI, HSTL\_IV\_DCI, SSTL2\_I\_DCI, SSTL2\_II\_DCI, SSTL3\_I\_DCI, and SSTL3\_II\_DCI.

To correctly use DCI in a Virtex-II device, users must follow the following rules:

- 1.  $V_{CCO}$  pins must be connected to the appropriate  $V_{CCO}$  voltage based on the IOSTANDARDs in that bank.
- Correct DCI I/O buffers must be used in the software either by using IOSTANDARD attributes or instantiations in the HDL code.
- 3. External reference resistors must be connected to multi-purpose pins (VRN and VRP) in the bank cannot be used as regular I/Os. Refer to the Virtex-II pinouts for the

198

specific pin locations. Pin VRN must be pulled up to  $V_{CCO}$  by its reference resistor. Pin VRP must be pulled down to ground by its reference resistor.

- 4. The value of the external reference resistors should be selected to give the desired output impedance. If using GTL\_DCI, HSTL\_DCI, or SSTL\_DCI I/O standards, then they should be 50 ohms.
- 5. The values of the reference resistors must be within the supported range. Availability of this range is planned for the next release of the <u>Virtex-II Data Sheet (DS031)</u>. (~30 to  $100 \Omega$ )
- 6. Follow the DCI I/O banking rules.

The DCI I/O banking rules are the following:

- 1. V<sub>REF</sub> must be compatible for all of the inputs in the same bank.
- 2. V<sub>CCO</sub> must be compatible for all of the inputs and outputs in the same bank.
- 3. No more than one DCI I/O standard using Single Termination type is allowed per bank.
- 4. No more than one DCI I/O standard using Split Termination type is allowed per bank.
- 5. Single Termination and Split Termination, Controlled Impedance Driver, and Controlled Impedance Driver with Half Impedance can co-exist in the same bank.

The behavior of DCI 3-state outputs is as follows:

If a LVDCI or LVDCI\_DV2 driver is in 3-state, the driver is 3-stated. If a Driver with Single or Split Termination is in 3-state, the driver is 3-stated but the termination resistor remains.

The following section lists any special care actions that must be taken for each DCI I/O standard.

### LVDCI\_15, LVDCI\_18, LVDCI\_25, LVDCI\_33

Using these buffers configures the outputs as controlled impedance drivers. The number extension at the end indicates the V<sub>CCO</sub> voltage that should be used. For example, 15 means V<sub>CCO</sub>=1.5 V, etc. There is no slew rate control or drive strength settings for LVDCI drivers.

# LVDCI\_DV2\_15, LVDCI\_DV2\_18, LVDCI\_DV2\_25, LVDCI\_DV\_33

Using these buffers configures the outputs as controlled drivers with half impedance. The number extension at the end indicates the  $V_{CCO}$  voltage that should be used. For example, 15 means  $V_{CCO}$ =1.5 V, etc. There is no slew rate control or drive strength settings for LVDCI\_DV2 drivers.

### GTL\_DCI

GTLP does not require a V<sub>CCO</sub> voltage. However, for GTL\_DCI, V<sub>CCO</sub> must be connected to 1.2 V. GTL\_DCI provides single termination to V<sub>CCO</sub> for inputs or outputs.

### GTLP\_DCI

GTL+ does not require a V<sub>CCO</sub> voltage. However, for GTLP\_DCI, V<sub>CCO</sub> must be connected to 1.5 V. GTLP\_DCI provides single termination to V<sub>CCO</sub> for inputs or outputs.

### HSTL\_I\_DCI, HSTL\_III\_DCI

HSTL\_I\_DCI provides split termination to  $V_{\rm CCO}/2$  for inputs. HSTL\_III\_DCI provides single termination to  $V_{\rm CCO}$  for inputs.

# HSTL\_ II\_DCI, HSTL\_ IV\_DCI

HSTL\_II\_DCI provides split termination to  $V_{\rm CCO}/2$  for inputs or outputs. HSTL\_IV\_DCI provides single termination to  $V_{\rm CCO}$  for inputs or outputs.

# SSTL2\_I\_DCI, SSTL3\_I\_DCI

SSTL2\_I\_DCI and SSTL3\_I\_DCI provide split termination to  $V_{CCO}/2$  for inputs. Then I/O standards are SSTL compatible. SSTL2\_I\_DCI and SSTL3\_I\_DCI drivers have different  $V_{OH}$  and  $V_{OL}$  values than non-DCI SSTL2\_I and SSTL3\_I drivers.

## SSTL2\_II\_DCI, SSTL3\_II\_DCI

 $\label{eq:STL2_II_DCI and SSTL3_II_DCI provide split termination to V_{CCO}/2 for inputs. Then I/O standards are SSTL compatible. SSTL2_II_DCI and SSTL3_II_DCI drivers have different V_{OH} and V_{OL} values than non-DCI SSTL2_II and SSTL3_II drivers.$ 

Figure 2-108 provides examples illustrating the use of the HSTL\_I\_DCI, HSTL\_II\_DCI, HSTL\_III\_DCI, and HSTL\_IV\_DCI I/O standards.



DS031\_65a\_100201

### Figure 2-108: HSTL DCI Usage Examples

# Figure 2-109 provides examples illustrating the use of the SSTL2\_I\_DCI, SSTL2\_II\_DCI, SSTL3\_I\_DCI, and SSTL3\_II\_DCI I/O standards.



Notes:

1. The SSTL-compatible  $25\Omega$  series resistor is accounted for in the DCI buffer, and it is not DCI controlled.

2. Z<sub>0</sub> is the recommended PCB trace impedance.

DS031\_65b\_112502

Figure 2-109: SSTL DCI Usage Examples

**Figure 2-110** provides examples illustrating the use of the LVDS\_DCI and LVDSEXT\_DCI I/O standards. For a complete list, see the Virtex-II *User Guide*.



LVDS\_DCI and LVDSEXT\_DCI Receiver

Figure 2-110: LVDS DCI Usage Examples

# Using Double-Data-Rate (DDR) I/O

# Introduction

Virtex-II devices have dedicated registers in a single IOB to implement input, output, and output with 3-state control Double-Data-Rate (DDR) registers. Input and output DDR is accomplished with the use of two registers in the IOB. A single clock triggers one register on a Low to High transition and a second register on a High to Low transition. Output DDR with 3-state requires the use of four registers in the IOB clocked in a similar fashion. Since the introduction of DLLs, Xilinx devices can generate low-skew clock signals that are 180 degrees out of phase, with a 50/50 duty cycle. These clocks reach the DDR registers in the IOB via dedicated routing resources.

# **Data Flow**

# Input DDR

Input DDR is accomplished via a single input signal driving two registers in the IOB. Both registers are clocked on the rising edge of their respective clocks. With proper clock forwarding, alternating bits from the input signal are clocked in on the rising edge of the two clocks, which are 180 degrees out of phase. Figure 2-111 depicts the input DDR registers and the signals involved.



UG002\_C2\_036\_031301

Figure 2-111: Input DDR

CLK0 and CLK1 are 180 degrees out of phase. Both registers share the SET/PRE and RESET/CLR lines. As shown in Figure 2-112, alternating bits on the DATA line are clocked in via Q0 and Q1 while CE is High. The clocks are shifted out of phase by the DCM (CLK0 and CLK180 outputs) or by the inverter available on the CLK1 clock input.



Figure 2-112: Input DDR Timing Diagram

### Output DDR

Output DDR registers are used to clock output from the chip at twice the throughput of a single rising-edge clocking scheme. Clocking for output DDR is the same as input DDR. The clocks driving both registers are 180 degrees out of phase. The DDR MUX selects the register outputs. The output consists of alternating bits from DATA\_1 and DATA\_2. Figure 2-113 depicts the output DDR registers and the signals involved.

2



UG002\_C2\_038\_101300

Figure 2-113: Output DDR

Both registers share the SET/PRE and RESET/CLR line. Both registers share the CE line which must be High for outputs to be seen on Q1 and Q2. Figure 2-114 shows the data flow for the output DDR registers.



UG002\_C2\_039\_101300

Figure 2-114: Output DDR Timing Diagram

### **Output DDR With 3-State Control**

The 3-state control allows the output to have one of two values, either the output from the DDR MUX or high impedance.

The Enable signal is driven by a second DDR MUX (Figure 2-115). This application requires the instantiation of two output DDR primitives.



Figure 2-115: Output DDR With 3-State Control

www.xilinx.com 1-800-255-7778 All four registers share the SET/PRESET and RESET/CLEAR lines. Two registers are required to accomplish the DDR task and two registers are required for the 3-state control. There are two Clock Enable signals, one for output DDRs performing the DDR function and another for the output DDRs performing the 3-state control function. Two 180 degree out of phase clocks are used. CLK1 clocks one of the DDR registers and a 3-state register. CLK2 clocks the other DDR register and the other 3-state register.

The DDR registers and 3-state registers are associated by the clock that is driving them. Therefore, the DDR register that is clocked by CLK1 is associated to the 3-state register being clocked by CLK1. The remaining two registers are associated by CLK2. If both 3-state registers are driving a logic High, the output sees a high impedance. If both 3-state registers are driving a logic Low, the output sees the values from the DDR MUX see Figure 2-116).



Figure 2-116: Timing Diagram for Output DDR With 3-State Control

When the 3-state registers are not driving the same logic value, the 3-state register being clocked by CLK1 is called TREG1. The other 3-state register TREG2 is clocked by CLK2. Similarly, the DDR register being clocked by CLK1 is called DREG1, and the other DDR register DREG2 is clocked by CLK2. If TREG1 is driving a logic High and TREG2 is driving a logic Low, the output sees a high impedance when CLK1 is High and the value out of DREG2 when CLK2 is High. If TREG2 is driving a logic High and TREG1 is driving a logic Low, the output sees a high impedance when CLK2 is High and TREG1 is driving a logic Low, the output sees a high impedance when CLK2 is High and TREG1 is driving a logic Low, the output sees a high impedance when CLK2 is High and the value out of DREG1 when CLK1 is High.

# Characteristics

- All registers in an IOB share the same SET/PRE and RESET/CLR lines.
- The 3-State and Output DDR registers have common clocks (OTCLK1 & OTCLK2).
- All signals can be inverted (with no added delay) inside the IOB.
- DDR MUXing is handled automatically within the IOB. There is no manual control of the MUX-select. This control is generated from the clock.
- When several clocks are used, and when using DDR registers, the floorplan of a design should take into account that the input clock to an IOB is shared with a pair of IOBs.

# **Library Primitives**

Input DDR registers are inferred, and dedicated output DDR registers have been provided as primitives for Virtex-II designs. Input DDR registers consist of two inferred registers that clock in a single data line on each edge. Generating 3-state output with DDR registers is as simple as instantiating a primitive.



Figure 2-117: FDDRRSE Symbol: DDR Flip-Flop With Clock Enable and Synchronous Reset and Set



Figure 2-118: FDDRCPE Symbol: DDR Flip-Flop With Clock Enable and Asynchronous PRESET and CLR

# VHDL and Verilog Instantiation

Examples are available in "VHDL and Verilog Templates" on page 211.

In VHDL, each template has a component declaration section and an architecture section. Each part of the template should be inserted within the VHDL design file. The port map of the architecture section should include the design signal names.

Constraints file syntax is provided where input registers need to be used. These settings force the input DDR registers into the IOB. The output registers should be instantiated and do not require any constraints file syntax to be pushed into the IOB.

# **Port Signals**

# FDDRRSE

### Data inputs - D0 and D1

D0 and D1 are the data inputs into the DDR flip-flop. Data on the D0 input is loaded into the flip-flop when R and S are Low and CE is High during a Low-to-High C0 clock transition. Data on the D1 input is loaded into the flip-flop when R and S are Low and CE is High during a Low-to-High C1 clock transition.

### Clock Enable - CE

The enable pin affects the loading of data into the DDR flip-flop. When Low, new data is not loaded into the flip-flop. CE must be High to load new data into the flip-flop.

### Clocks - C0 and C1

These two clocks are phase shifted 180 degrees (via the DLL) and allow selection of two separate data inputs (D0 and D1).

### Synchronous Set - S and Synchronous Reset - R

The Reset (R) input, when High, overrides all other inputs and resets the output Low during any Low-to-High clock transition (C0 or C1). Reset has precedence over Set. When the Set (S) input is High and R is Low, the flip-flop is set, output High, during a Low-to-High clock transition (C0 or C1).

### Data Output - Q

When power is applied, the flip-flop is asynchronously cleared and the output is Low.

During normal operation, The value of Q is either D0 or D1. The Data Inputs description above states how the value of Q is chosen.

### FDDRCPE

### Data inputs - D0 and D1

D0 and D1 are the data inputs into the DDR flip-flop. Data on the D0 input is loaded into the flip-flop when PRE and CLR are Low and CE is High during a Low-to-High C0 clock transition. Data on the D1 input is loaded into the flip-flop when PRE and CLR are Low and CE is High during a Low-to-High C1 clock transition.

### Clock Enable - CE

The enable pin affects the loading of data into the DDR flip-flop. When Low, clock transitions are ignored and new data is not loaded into the flip-flop. CE must be High to load new data into the flip-flop.

### Clocks - C0 and C1

These two clocks are phase shifted 180 degrees (via the DLL) and allow selection of two separate data inputs (D0 and D1).

### Asynchronous Preset - PRE and Asynchronous Clear - CLR

The Preset (PRE) input, when High, sets the Q output High. When the Clear (CLR) input is High, the output is reset to Low.

### Data Output - Q

When power is applied, the flip-flop is asynchronously cleared and the output is Low. During normal operation, The value of Q is either D0 or D1. The Data Inputs description above states how the value of Q is chosen.

# Initialization in VHDL or Verilog

Output DDR primitives can be initialized in VHDL or Verilog code for both synthesis and simulation. For synthesis, the attributes are attached to the output DDR instantiation and are copied in the EDIF output file to be compiled by Xilinx tools. The VHDL code simulation uses a generic parameter to pass the attributes. The Verilog code simulation uses the defparam parameter to pass the attributes.

The DDR code examples (in VHDL ad Verilog) illustrate the following techniques.

# **Location Constraints**

DDR instances can have LOC properties attached to them to constrain pin placement.

The LOC constraint uses the following form.

NET <net\_name> LOC=A8;

Where "A8" is a valid I/O pin location.

# **Applications**

### DDR SDRAM

The DDR SDRAM is an enhancement to the Synchronous DRAM by effectively doubling the data throughput of the memory device. Commands are registered at every positive clock edge. Input data is registered on both edges of the data strobe, and output data is referenced to both edges of the data strobe, as well as both edges of the clock.

### **Clock Forwarding**

DDR can be used to forward a copy of the clock on the output. This can be useful for propagating a clock along with double-data-rate data that has an identical delay. It is also useful for multiple clock generation, where there is a unique clock driver for every clock load.

# VHDL and Verilog Templates

VHDL and Verilog templates are available for output, output with 3-state enable, and input DDR registers.

### Input DDR

To implement an Input DDR application, paste the following template in your code.

### DDR\_input.vhd

```
library IEEE;
use IEEE.STD_LOGIC_1164.ALL;
entity DDR_Input is
    Port (
        clk : in std_logic;
        d : in std_logic;
        rst : in std_logic;
        q1 : out std_logic;
        q2 : out std_logic
            );
end DDR_Input;
--Describe input DDR registers (behaviorally) to be inferred
architecture behavioral of DDR Input is
```

```
begin
glreg : process (clk, d, rst)
begin
   if rst='1' then --asynchronous reset, active high
    q1 <= '0';
   elsif clk'event and clk='1' then --Clock event - posedge
     q1 <= d;
   end if;
end process;
q2reg : process (clk, d, rst)
begin
   if rst='1' then --asynchronous reset, active high
      q2 <= '0';
   elsif clk'event and clk='0' then --Clock event - negedge
      q2 <= d;
   end if;
end process;
end behavioral;
-- NOTE: You must include the following constraints in the .ucf
-- file when running back-end tools,
-- in order to ensure that IOB DDR registers are used:
- -
-- INST "q2_reg" IOB=TRUE;
-- INST "q1_reg" IOB=TRUE;
_ _
-- Depending on the synthesis tools you use, it may be required to
-- check the edif file for modifications to
-- original net names...in this case, Synopsis changed the
-- names: q1 and q2 to q1_reg and q2_reg
```

### DDR\_input.v

module DDR\_Input (data\_in , q1, q2, clk, rst);

input data\_in, clk, rst; output q1, q2; reg q1, q2;

//Describe input DDR registers (behaviorally) to be inferred

always @ (posedge clk or posedge rst) //rising-edge DDR reg. and asynchronous reset

```
begin
if (rst)
  q1 = 1'b0;
else
  q1 = data_in;
end
```

2

```
always @ (negedge clk or posedge rst) //falling-edge DDR reg. and
asynchronous reset
begin
 if (rst)
    q2 = 1'b0;
 else
    q2 = data_in;
  end
assign data_out = q1 & q2;
endmodule
/* NOTE: You must include the following constraints in the .ucf file
when running back-end tools, \backslash
 in order to ensure that IOB DDR registers are used:
INST "q2_reg" IOB=TRUE;
INST "q1_reg" IOB=TRUE;
Depending on the synthesis tools you use, it may be required to check
the edif file for modifications to
original net names...in this case, Synopsis changed the names: q1 and q2
to q1_reg and q2_reg
```

\*/

### Output DDR

To implement an Output DDR application, paste the following template in your code.

### DDR\_out.vhd

| <pre>library IEEE;<br/>use IEEE.STD_LOGIC_1164.ALL;<br/> pragma translate_off<br/>LIBRARY UNISIM;<br/>use UNISIM.VCOMPONENTS.ALL;<br/>pragma translate_on</pre>                                                                                                                                                                                                                                                                                             |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <pre>entity DDR_Output is   Port(     clk : in std_logic;clk and clk180 can be outputs from the DCM or   clk180 can be the     clk180 : in std_logic;logical inverse of clk (the inverter is   located in the IOB and will be inferred.     d0 : in std_logic;data in to fddr     d1 : in std_logic;data in to fddr     ce : in std_logic;clock enable     rst : in std_logic;reset     set : in std_logic;set     q : out std_logicDDR output     );</pre> |
| end DDR_Output;                                                                                                                                                                                                                                                                                                                                                                                                                                             |
| architecture behavioral of DDR_Output is                                                                                                                                                                                                                                                                                                                                                                                                                    |
| component FDDRRSE port(                                                                                                                                                                                                                                                                                                                                                                                                                                     |

www.xilinx.com 1-800-255-7778

```
Q : out std_logic;
      D0 : in std_logic;
      D1 : in std logic;
      C0 : in std_logic;
      C1 : in std_logic;
      CE : in std_logic;
      R : in std logic;
      S : in std_logic
      );
   end component;
   begin
   U0: FDDRRSE
    port map (
      Q => q
      D0 => d0,
      D1 => d1,
      C0 => clk,
      C1 => clk180,
      CE => ce,
      R => rst,
      S => set
      );
   end behavioral;
DDR_out.v
   module DDR_Output (d0 , d1, q, clk, clk180, rst, set, ce);
   input d0, d1, clk, clk180, rst, set, ce;
   output q;
   //Synchronous Output DDR primitive instantiation
   FDDRRSE U1 (.D0(d0),
                 .D1(d1),
                 .CO(clk),
                 .C1(clk180),
                 .CE(ce),
                 .R(rst),
```

endmodule

### Output DDR With 3-State Enable

);

To implement an Output DDR with 3-state Enable, paste the following template in your code:

### DDR\_3state.vhd

```
library IEEE;
use IEEE.STD_LOGIC_1164.ALL;
-- pragma translate_off
LIBRARY UNISIM;
use UNISIM.VCOMPONENTS.ALL;
--pragma translate_on
```

.S(set), .Q(q)

# XILINX<sup>®</sup>

```
entity DDR_3state is
 Port(
   clk : in std logic; --clk and clk180 can be outputs from the DCM or
clk180 can be the
   clk180 : in std logic; --logical inverse of clk (the inverter is
located in the IOB and will be inferred.
   d0 : in std_logic; --data in to fddr
   d1 : in std_logic; --data in to fddr
   ce : in std logic; --clock enable
   set : in std logic; --set
   rst : in std logic; --reset
   en0 : in std_logic; --enable signal
   en1 : in std logic; --enable signal
   data_out : out std_logic --data seen at pad
 );
end DDR_3state;
architecture behavioral of DDR 3state is
signal ddr_out, tri : std_logic;
component FDDRRSE
   port (
   Q : out std_logic;
   D0 : in std_logic;
   D1 : in std_logic;
   C0 : in std_logic;
   C1 : in std_logic;
   CE : in std_logic;
   R : in std_logic;
   S : in std_logic
       );
end component;
begin
--Instantiate Ouput DDR registers
U0: FDDRRSE port map(Q => tri,
      D0 => en0,
      D1 => en1,
      C0 => clk,
      C1 => clk180,
      CE => ce,
      R => rst,
      S => set
      );
--Instantiate three-state DDR registers
U1: FDDRRSE port map( Q => ddr out,
      D0 => d0,
      D1 => d1,
      C0 => clk,
      C1 => clk180,
      CE => ce,
      R => rst,
      S => set
      );
--inferr the 3-State buffer
```

```
process(tri, ddr_out)
```

www.xilinx.com 1-800-255-7778

```
begin
     if tri = '1' then
        data out <= 'Z';</pre>
     elsif tri = '0' then
        data_out <= ddr_out;</pre>
     end if;
   end process;
   end behavioral;
DDR 3state.v
   module DDR_3state (d0 , d1, data_out, en_0, en_1, clk, clk180, rst, set,
   ce);
   input d0, d1, clk, clk180, rst, set, ce, en_0, en_1;
   output data out;
   reg data_out;
   wire q, q_tri;
   //Synchronous Output DDR primitive instantiation
   FDDRRSE U1 ( .D0(d0),
                 .D1(d1),
                 .C0(clk),
                 .C1(clk180),
                 .CE(ce),
                 .R(rst),
                 .S(set),
                 .Q(q)
                );
   //Synchronous 3-State DDR primitive instantiation
   FDDRRSE U2 ( .D0(en_0),
                 .D1(en_1),
                 .CO(clk),
                 .C1(clk180),
                 .CE(ce),
                 .R(rst),
                 .S(set),
                 .Q(q_tri)
                );
   //3-State buffer description
   always @ (q_tri or q)
     begin
     if (q_tri)
       data_out = 1'bz;
     else
       data_out = q;
     end
```

endmodule
# **Using LVDS I/O**

# Introduction

Low Voltage Differential Signaling (LVDS) is a very popular and powerful high-speed interface in many system applications. Virtex-II I/Os are designed to comply with IEEE electrical specifications for LVDS to make system and board design easier. With the addition of an LVDS current-mode driver in the IOBs, which eliminates the need for external source termination in point-to-point applications, and with the choice of two different voltage modes and an extended mode, Virtex-II devices provide the most flexible solution for doing an LVDS design in an FPGA.

Table 2-60 lists all LVDS primitives that are available for Virtex-II devices.

| Input                 | Output            | 3-State            | Clock                  | <b>Bi-Directional</b> |
|-----------------------|-------------------|--------------------|------------------------|-----------------------|
| IBUF_LVDS             | OBUF_LVDS         | OBUFT_LVDS         | IBUFG_LVDS             | IOBUF_LVDS            |
| IBUFDS_LVDS_25        | OBUFDS_LVDS_25    | OBUFTDS_LVDS_25    | IBUFGDS_LVDS_25        |                       |
| IBUFDS_LVDS_33        | OBUFDS_LVDS_33    | OBUFTDS_LVDS_33    | IBUFGDS_LVDS_33        |                       |
| IBUFDS_LVDSEXT_25     | OBUFDS_LVDSEXT_25 | OBUFTDS_LVDSEXT_25 | IBUFGDS_LVDSEXT_25     |                       |
| IBUFDS_LVDSEXT_33     | OBUFDS_LVDSEXT_33 | OBUFTDS_LVDSEXT_33 | IBUFGDS_LVDSEXT_33     |                       |
| IBUFDS_LVDSEXT_25_DCI |                   |                    | IBUFGDS_LVDSEXT_25_DCI |                       |
| IBUFDS_LVDS_25_DCI    |                   |                    | IBUFGDS_LVDS_25_DCI    |                       |

Table 2-60: Available Virtex-II LVDS Primitives

Primitives in **bold** type are pre-existing LVDS primitives used in Virtex-E and earlier designs. They are not current-mode drivers and are still required for BLVDS (Bus LVDS) applications.

\*DS\_LVDS\_25 = 2.5V V<sub>CCO</sub> LVDS Buffer

\*DS\_LVDS\_33 = 3.3V V<sub>CCO</sub> LVDS Buffer

There is no difference in the AC characteristics of either voltage-mode LVDS I/O. These choices now provide more flexibility for mixed-I/O banking rules; that is, an LVTTL I/O can coexist with the 3.3V LVDS buffer in the same bank.

\*DS\_LVDSEXT\* = Extended mode LVDS buffer

This buffer provides a higher drive capability and voltage swing (350 - 750 mV), which makes it ideal for long-distance or cable LVDS links.

The output AC characteristics of this LVDS driver are not within the EIA/TIA specifications. This LVDS driver is intended for situations that require higher drive capabilities in order to produce an LVDS signal that is within EIA/TIA specification at the receiver.

# Creating an LVDS Input/Clock Buffer

Figure 2-119 illustrates the LVDS input and clock buffer primitives shown in Table 2-61. The pin names used are the same as those used in the HDL library primitives.

Table 2-61: LVDS Input and Clock Buffer Primitives

| LVDS Inputs           | LVDS Clocks            |
|-----------------------|------------------------|
| IBUFDS_LVDS_25        | IBUFGDS_LVDS_25        |
| IBUFDS_LVDS_33        | IBUFGDS_LVDS_33        |
| IBUFDS_LVDSEXT_25     | IBUFGDS_LVDSEXT_25     |
| IBUFDS_LVDSEXT_33     | IBUFGDS_LVDSEXT_33     |
| IBUFDS_LVDS_25_DCI    | IBUFGDS_LVDS_25_DCI    |
| IBUFDS_LVDSEXT_25_DCI | IBUFGDS_LVDSEXT_25_DCI |

IBUFDS\_LVDS\*/IBUFGDS\_LVDS\*



Figure 2-119: LVDS Input and Clock Primitives

To create an LVDS input, instantiate the desired mode (2.5 V, 3.3 V, or Extended) LVDS input buffer. Notice that the P and N channels are included in the primitive (I = P, IB = N). Software automatically uses the appropriate pin from an adjacent IOB for the N channel. The same applies to LVDS clocks: Use IBUFGDS\_LVDS\*

### LVDS Input HDL Examples

#### **VHDL** Instantiation

Verilog Instantiation

IBUFDS\_LVDS\_25 U1 ( .I(data\_in\_P), .IB(data\_in\_N), .O(data\_in) );

### Port Signals

I = P-channel data input to the LVDS input buffer

IB = N-channel data input to the LVDS input buffer

O = Non-differential input data from LVDS input buffer

### Location Constraints

NET "data\_in\_P" LOC= "NS";

### LVDS Receiver Termination

All LVDS receivers require standard termination. Figure 2-120 is an example of a typical termination for an LVDS receiver on a board with  $50\Omega$  transmission lines.



Figure 2-120: LVDS Receiver Termination

# Creating an LVDS Output Buffer

Figure 2-121 illustrates the LVDS output buffer primitives:

- OBUFDS\_LVDS\_25
- OBUFDS\_LVDS\_33
- OBUFDS\_LVDSEXT\_25
- OBUFDS\_LVDSEXT\_33

The pin names used are the same as those used in the HDL library primitives.

OBUFDS\_LVDS\*



Figure 2-121: LVDS Output Buffer Primitives

To create an LVDS output, instantiate the desired mode (2.5, 3.3V, or Extended) LVDS output buffer. Notice that the P and N channels are included in the primitive (O = P, OB = N). Software automatically uses the appropriate pin from an adjacent IOB for the N channel.

### LVDS Output HDL Examples

### **VHDL** Instantiation

### Verilog Instantiation

OBUFDS\_LVDS\_25 U1 ( .I(data\_out), .O(data\_out\_P), .OB(data\_out\_N) );

### Port Signals

I = data input to the LVDS input buffer

O = P-channel data output

OB = N-channel data output

### Location Constraints

NET "data\_out\_P" LOC= "NS";

### LVDS Transmitter Termination

The Virtex-II LVDS transmitter does not require any termination. Table 2-60 lists primitives that correspond to the Virtex-II LVDS current-mode drivers. Virtex-II LVDS current-mode drivers are a true current source and produce the proper (IEEE/EIA/TIA compliant) LVDS

signal. Figure 2-122 illustrates a Virtex-II LVDS transmitter on a board with  $50\Omega$  transmission lines.





# Creating an LVDS Output 3-State Buffer

Figure 2-123 illustrates the LVDS 3-State buffer primitives:

- OBUFTDS\_LVDS\_25
- OBUFTDS\_LVDS\_33
- OBUFTDS\_LVDSEXT\_25
- OBUFTDS\_LVDSEXT\_33

The pin names used are the same as those used in the HDL library primitives.



UG002\_C2\_033\_100200

Figure 2-123: LVDS 3-State Primitives

To create an LVDS 3-State output, instantiate the desired mode (2.5V, 3.3V, or Extended) LVDS 3-State buffer. Notice that the P and N channels are included in the primitive (O = P, OB = N). Software automatically uses the appropriate pin from an adjacent IOB for the N channel.

## LVDS 3-State HDL Example

#### VHDL Instantiation

### Verilog Instantiation

```
OBUFTDS_LVDS_25 U1 ( .I(data_out),
.T(tri),
.O(data_out_P),
.OB(data_out_N)
);
```

### Port Signals

I = data input to the 3-state output buffer

T = 3-State control signal

O = P-channel data output

OB = N-channel data output

### Location Constraints

NET "data\_out\_P" LOC = "NS";

### LVDS 3-State Termination

The Virtex-II LVDS 3-state buffer does not require any termination. Table 2-60 lists primitives that correspond to Virtex-II LVDS current-mode drivers. These drivers are a true current source, and they produce the proper (IEEE/EIA/TIA compliant) LVDS signal. Figure 2-124 illustrates a simple redundant point-to-point LVDS solution with two LVDS 3-state transmitters sharing a bus with one LVDS receiver and the required termination for the circuit.



Figure 2-124: LVDS 3-State Termination

# Creating a Bidirectional LVDS Buffer

Since LVDS is intended for point-to-point applications, BLVDS (Bus-LVDS) is not an IEEE/EIA/TIA standard implementation and requires careful adaptation of I/O and PCB layout design rules. The primitive supplied in the software library for bi-directional LVDS does not use the Virtex-II LVDS current-mode driver. Therefore, source termination is required. Refer to <u>xapp243</u> for examples of BLVDS termination.

The following are VHDL and Verilog instantiation examples of Virtex-II BLVDS primitves.

### **VHDL** Instantiation

### Verilog Instantiation

```
IOBUFDS_BLVDS_25 blvds_io ( .I(data_out),
        .O(data_in),
        .T(tri),
        .IO(data_IO_P),
        .IOB(data_IO_N)
);
```

### Port Signals

I = data output: internal logic to LVDS I/O buffer

T = 3-State control to LVDS I/O buffer

IO = P-channel data I/O to or from BLVDS pins

IOB = N-channel data I/O to or from BLVDS pins

O = Data input: off-chip data to LVDS I/O buffer

### Location Constraints

Only the P or N channel must be constrained. Software automatically places the corresponding channel of the pair on the appropriate pin.

# LDT

Lightning Data Transport (LDT) is a new high speed interface and protocol introduced by Advanced Micro Devices. LDT is a differential signaling based interface that is very similar to LVDS. Virtex-II IOBs are equipped with LDT buffers. These buffers also have corresponding software primitives as follows:

```
IBUFDS_LDT_25
IBUFGDS_LDT_25
OBUFDS_LDT_25
OBUFTDS_LDT_25
```

# LDT Implementation

LDT implementation is the same as LVDS with DDR, so follow all of the rules and guidelines set forth earlier in this chapter for LVDS-DDR, and replace the LVDS buffer with the corresponding LDT buffer. For more information on Virtex-II LDT electrical specification, refer to the <u>Virtex-II Data Sheet (DS031)</u>.

# **Using LVPECL I/O**

## Introduction

Low Voltage Positive Emitter-Coupled Logic (LVPECL) is a very popular and powerful high-speed interface in many system applications. Virtex-II I/Os are designed to comply with the EIA/TIA electrical specifications for 2.5V LVPECL to make system and board design easier.

Table 2-62 lists all LVPECL primitives that are available for Virtex-II devices.

Table 2-62: Available Virtex-II LVPECL Primitives

| Input            | Output           | 3-State           | Clock             | <b>Bi-Directional</b> |
|------------------|------------------|-------------------|-------------------|-----------------------|
| IBUFDS_LVPECL_25 | OBUFDS_LVPECL_25 | OBUFTDS_LVPECL_25 | IBUFGDS_LVPECL_25 |                       |

# Creating an LVPECL Input/Clock Buffer

Figure 2-125 illustrates the LVPECL input and clock buffer primitives shown in Table 2-63. The pin names used are the same as those used in the HDL library primitives.

Table 2-63: LVPECL Input and Clock Buffer Primitives

| LVPECL Inputs    | LVPECL Clocks     |
|------------------|-------------------|
| IBUFDS_LVPECL_25 | IBUFGDS_LVPECL_25 |

IBUFDS\_LVPECL\*/IBUFGDS\_LVPECL\*



2

#### Figure 2-125: LVPECL Input and Clock Primitives

To create an LVPECL input, instantiate the desired mode (2.5V or Extended) LVPECL input buffer. Notice that the P and N channels are included in the primitive (I = P, IB = N). Software automatically uses the appropriate pin from an adjacent IOB for the N channel. The same applies to LVPECL clocks: Use IBUFGDS\_LVPECL\*.

### LVPECL Input HDL Examples

### **VHDL** Instantiation

### Verilog Instantiation

IBUFDS\_LVPECL\_25 U1 ( .I(data\_in\_P), .IB(data\_in\_N), .O(data\_in) );

### Port Signals

I = P-channel data input to the LVDS input buffer

IB = N-channel data input to the LVDS input buffer

O = Non-differential input data from LVDS input buffer

### Location Constraints

NET "data\_in\_P" LOC= "NS";

### LVPECL Receiver Termination

All LVPECL receivers require standard termination. Figure 2-126 is an example of a typical termination for an LVPECL receiver on a board with  $50\Omega$  transmission lines.



Figure 2-126: LVPECL Receiver Termination

# Creating an LVPECL Output Buffer

Figure 2-127 illustrates the LVDS output buffer primitives:

- OBUFDS\_LVPECL\_25
- OBUFTDS\_LVPECL\_25

The pin names used are the same as those used in the HDL library primitives.



Figure 2-127: LVPECL Output Buffer Primitives

To create an LVPECL output, instantiate the desired mode LVPECL output buffer. Notice that the P and N channels are included in the primitive (O = P, OB = N). Software automatically uses the appropriate pin from an adjacent IOB for the N channel.

### LVPECL Output HDL Examples

### **VHDL** Instantiation

### Verilog Instantiation

```
OBUFDS_LVPECL_25U1 ( .I(data_out),
.O(data_out_P),
.OB(data_out_N)
);
```

## Port Signals

- I = data input to the LVPECL input buffer
- O = P-channel data output
- OB = N-channel data output

### Location Constraints

NET "data\_out\_P" LOC= "NS";

### LVPECL Transmitter Termination

The Virtex-II LVPECL transmitter does not require any termination. Table 2-62 lists primitives that correspond to the Virtex-II LVPECL drivers. Figure 2-128 illustrates a Virtex-II LVPECL transmitter on a board with  $50\Omega$  transmission lines.



Figure 2-128: LVPECL Transmitter Termination

# **Using Bitstream Encryption**

Virtex-II devices have an on-chip decryptor that can be enabled to make the configuration bitstream (and thus the whole logic design) secure. The user can encrypt the bitstream in the Xilinx software, and the Virtex-II chip then performs the reverse operation, decrypting the incoming bitstream, and internally recreating the intended configuration.

This method provides a very high degree of design security. Without knowledge of the encryption/decryption key or keys, potential pirates cannot use the externally intercepted bitstream to analyze, or even to clone the design. System manufacturers can be sure that their Virtex-II implemented designs cannot be copied and reverse engineered. Also, IP Virtex-II chips that contain the correct decryption key.

The Virtex-II devices store the internal decryption keys in a few hundred bits of dedicated RAM, backed up by a small externally connected battery. At <100 nA load, the endurance of the battery is only limited by its shelf life.

The method used to encrypt the data is Data Encryption Standard (DES). This is an official standard supported by the National Institute of Standards and Technology (NIST) and the U. S. Department of Commerce. DES is a symmetric encryption standard that utilizes a 56bit key. Because of the increased sophistication and speed of today's computing hardware, single DES is no longer considered to be secure. However, the Triple Data Encryption Algorithm (TDEA), otherwise known as triple DES, is authorized for use by U. S. federal organizations to protect sensitive data and is used by many financial institutions to protect their transactions. Triple DES has yet to be cracked. Both DES and triple DES are available in Virtex-II devices.

## What DES Is

DES and triple DES are symmetric encryption algorithms. This means that the key to encrypt and the key to decrypt are the same. The security of the data is kept by keeping the key secret. This contrasts to a public key system, like RSA or PGP. One thing to note is that Virtex-II devices use DES in Cipher Block Chaining mode. This means that each block is combined with the previous encrypted block for added security. DES uses a single 56-bit key to encrypt 64-bit blocks one at a time.

# How Triple DES is Different

Triple DES uses three keys (known as a key bundle or key set), and the encryption algorithm is repeated for each of those keys. If  $E_K(I)$  and  $D_K(I)$  denote the encryption and decryption of a data block I using key K, the Triple DES encryption algorithm is as follows (known as E-D-E):

 $Output_{encrypted} = E_{K3}(D_{K2}(E_{K1}(I)))$ 

And the decryption algorithm is as follows (known as D-E-D):

 $Output_{decrypted} = D_{K1}(E_{K2}(D_{K3}(I)))$ 

 $K_1 = K_2 = K_3$  gives the same result as single DES.

For a detailed description of the DES standard, refer to:

### http://www.itl.nist.gov/fipspubs/fip46-2.htm

For a popular description of the origin and the basic concept of DES and many other older and newer encryption schemes, see the recent best-seller:

The Code Book by Simon Singh, Doubleday 1999, ISBN 0-385-49531-5

# **Classification and Export Considerations**

Virtex-II FPGAs have been classified by the U. S. Department of Commerce as an FPLD (3A001.a.7), which is the same classification as current FPGAs. Only the decryptor is onchip and can only be used to decrypt an incoming bitstream, so the classification has not changed and no new paperwork is required. The software has been classified under ECCN#:5D002 and can be exported globally under license exception ENC. No changes to current export practices are necessary.

# **Creating Keys**

For Virtex-II, DES or triple DES (TDEA) can be used. DES uses a single 56-bit key, where triple-DES always uses three such keys. All of the keys can be chosen by the BitGen program at random, or can be explicitly specified by the user.

Virtex-II devices can have six separate keys programmed into the device. A particular Virtex-II device can store two sets of triple-DES keys and can thus accept alternate bitstreams from two competing IP vendors, without providing access to each other's design. However, all of the keys must be programmed at once.

An encrypted bitstream is created by the BitGen program. Keys and key options can be chosen in two ways: by command-line arguments to BitGen, or by specifying a KeyFile (with the –g KeyFile command-line option). The BitGen options relevant to encryption are listed in Table 2-64.:

| Option   | Description                                                                  | Values (default first where<br>appropriate) |
|----------|------------------------------------------------------------------------------|---------------------------------------------|
| Encrypt  | Whether to encrypt the bitstream                                             | No, Yes                                     |
| Key0     | DES Key 0                                                                    | pick, <hex string=""></hex>                 |
| Key1     | DES Key 1                                                                    | pick, <hex string=""></hex>                 |
| Key2     | DES Key 2                                                                    | pick, <hex string=""></hex>                 |
| Key3     | DES Key 3                                                                    | pick, <hex string=""></hex>                 |
| Key4     | DES Key 4                                                                    | pick, <hex string=""></hex>                 |
| Key5     | DES Key 5                                                                    | pick, <hex string=""></hex>                 |
| KeyFile  | Location of separate key definition file                                     | <string></string>                           |
| Keyseq0  | Set the key sequence for key 0 (S = single, F = first, M = middle, L = last) | S,F,M,L                                     |
| Keyseq1  | Set the key sequence for key 1                                               | S,F,M,L                                     |
| Keyseq2  | Set the key sequence for key 2                                               | S,F,M,L                                     |
| Keyseq3  | Set the key sequence for key 3                                               | S,F,M,L                                     |
| Keyseq4  | Set the key sequence for key 4                                               | S,F,M,L                                     |
| Keyseq5  | Set the key sequence for key 5                                               | S,F,M,L                                     |
| StartKey | Key number to start decryption                                               | 0,3                                         |
| StartCBC | Constant Block Chaining start value                                          | pick, <string></string>                     |

Table 2-64: BitGen Encryption Options

The key sequence (Keyseq) is set to S for single key encryption, F for first key in multi-key encryption, M for middle key in multi-key encryption, and L for last key in multi-key

encryption. When the KeyFile option is specified, BitGen looks in that file for all other DES key options listed above. An example for the input KeyFile using triple DES is:

```
# Comment for key file
Key 0 0x9ac28ebeb2d83b;
Key 1 pick;
Key 2 string for my key;
Key 3 0x000000000000;
Key 4 8774eb3ebb4f84;
Keyseq 0 F;
Keyseq 1 M;
Keyseq 2 L;
Keyseq 2 L;
Keyseq 3 F;
Keyseq 4 M;
Keyseq 5 L;
Key StartCBC 503f2f655b1b2f82;
StartKey 0;
```

The command line equivalent of the input key file above is as follows:

```
bitgen -g security:level2 ?g Encrypt:Yes ?g Key0: 0x9ac28ebeb2d83b ?g
Key1:pick ?g Key2:? string for my key? ?g Key30x000000000000000 ?g
Key4:8774eb3ebb4f84 ?g Keyseq0:F, -g Keyseq1:M, -gKeyseq2:L ?g
Keyseq3:F ?g Keyseq4:M ?g Keyseq5:L -g StartCBC:503f2f655b1b2f82 ?g
StartKey:0 myinput.ncd
```

If the key file is used, the command line is as follows:

```
Bitgen -g security:level2 ?g Encrypt:Yes ?g KeyFile: mykeyfile
myinput.ncd
```

The output key file from either of the above inputs looks something like this:

```
Device 2v40CS144;

Key 0 0x9ac28ebeb2d83b;

Key 1 0xdb1adb5f08b972;

Key 2 0x5452032773c286;

Key 3 0x000000000000;

Key 4 0x8774eb3ebb4f84;

Key 5 0x000000000000;

Keyseq 0 F;

Keyseq 1 M;

Keyseq 2 L;

Keyseq 3 F;

Keyseq 4 M;

Keyseq 5 L;

Key StartCBC 0x503f2f655b1b2f82;

StartKey 0;
```

In the case of the string for Key2, if the keyvalue is a character string, BitGen encodes the string into a 56-bit hex string. The same character string gives the same 56-bit hex string every time. This enables passwords or phrases to be used instead of hex strings.

The above keys are all specified as 64 bits each. The first 8 bits are used by Xilinx as header information and the following 56 bits as the key. BitGen accepts 64 bit keys, but automatically overrides the header, if necessary.

Because of security issues, the ?g Compress option cannot be used with bitstream encryption, and the Bitgen security level must be set for "level1" or "level2" (-g security:level1 or -g security:level2) to prevent readback. Setting the security level to "none" enables readback, allowing the decrypted bitstream to be read back from a configured device.. Also, partial reconfiguration is not allowed.

# Loading Keys

DES keys can only be loaded through JTAG. The JTAG Programmer and iMPACT<sup>TM</sup> tools have the capability to take a .nky file and program the device with the keys. In order to program the keys, a "key-access mode" is entered. When this mode is entered, all of the FPGA memory, including the keys and configuration data, is cleared. Once the keys are programmed, they cannot be reprogrammed without clearing the entire device. This "key access mode" is completely transparent to most users.

Keys are programmed using the ISC\_PROGRAM instruction, as detailed in the JTAG 1532 specification. SVF generation is also supported, if keys are to be programmed using a different method, such as a microprocessor or JTAG test software.

# Loading Encrypted Bitstreams

Once the device has been programmed with the correct keys, the device can be configured with an encrypted bitstream. Non-encrypted bitstreams may also be used to configure the device, and the stored keys are ignored. The method of configuration is not at all affected by encryption. Any of the modes may be used, and the signaling does not change (refer to Chapter 3: Configuration). However, *all* bitstreams must configure the entire device, since partial reconfiguration is not permitted.

Once the device has been configured with an encrypted bitstream, it cannot be reconfigured without toggling the PROG pin, cycling power, or performing the JTAG JSTART instruction. All of these events fully clear the configuration memory, but none of these events reset the keys as long as  $V_{BATT}$  or  $V_{CCAUX}$  are maintained.

# V<sub>BATT</sub>

 $V_{BATT}$  is a separate battery voltage to allow the keys to remain programmed in the Virtex-II device.  $V_{BATT}$  draws very little current (on the order of nA) to keep the keys programmed. A small watch battery is suitable (refer to  $V_{BATT}$  DC Characteristics in the <u>Virtex-II Data</u> <u>Sheet (DS031)</u> and the battery's specifications to estimate its lifetime).

While the auxiliary voltage ( $V_{CCAUX}$ ) is applied,  $V_{BATT}$  does not draw any current, and the battery can be removed or exchanged.

# **Using the CORE Generator System**

## Introduction

This section on the Xilinx CORE Generator System<sup>TM</sup> and the Xilinx Intellectual Property (IP) Core offerings is provided as an overview of products that facilitate the Virtex-II design process. For more detailed and complete information, consult the *CORE Generator Guide*, which can be accessed online in the Xilinx software installation, as well as at the http://toolbox.xilinx.com/docsan/xilinx4/manuals.htm site, under the "Design Entry Tools" heading.

# The CORE Generator System

The Xilinx CORE Generator System is the cataloging, customization, and delivery vehicle for IP cores targeted to Xilinx FPGAs. This tool is included with all Xilinx ISE BaseX, ISE Foundation, and ISE Alliance Series software packages. The CORE Generator provides centralized access to a catalog of ready-made IP functions ranging in complexity from simple arithmetic operators, such as adders, accumulators, and multipliers, to systemlevel building blocks, such as filters, transforms, and memories. Cores can be displayed alphabetically, by function, by vendor, or by type. Each core comes with its own data sheet, which documents the core's functionality in detail.

The CORE Generator User Interface (see Figure 2-129) has direct links to key Xilinx web support pages, such as the Xilinx IP Center website (<u>www.xilinx.com/ipcenter</u>) and Xilinx Technical Support, making it very easy to access the latest Virtex-II IP releases and get helpful, up-to-date specifications and information on technical issues. Links to partner IP providers are also built into the informational GUIs for the various partner-supplied AllianceCORE products described under "AllianceCORE Program" on page 234.

The use of CORE Generator IP cores in Virtex-II designs enables designers to shorten design time, and it also helps them realize high levels of performance and area efficiency without any special knowledge of the Virtex-II architecture. The IP cores achieve these high levels of performance and logic density by using Xilinx Smart-IP<sup>™</sup> technology.

| Xilinx CORE Generator                         |                      |                            |        |              |            |                |         |     |                                  |     | _ [    | □ × |
|-----------------------------------------------|----------------------|----------------------------|--------|--------------|------------|----------------|---------|-----|----------------------------------|-----|--------|-----|
| File Project Core Tools Help                  |                      |                            |        |              |            |                |         |     |                                  |     |        |     |
| Current Project: C:tprojectstcoregenitest1    |                      |                            |        |              |            |                |         |     |                                  |     |        |     |
| View Catalog: by Function                     |                      |                            |        |              |            |                |         |     |                                  |     |        |     |
| Target Family: 📲 Virtex2                      |                      | Contents of: Communica     | tion & | Network      | ing > Te   | lecommun       | ication | s   |                                  |     |        |     |
| 🦲 Basic Elements                              |                      | Name                       |        | Type         | Versi      | on <u>4K</u> 😻 | V 🖏     |     | Vendor                           | S S | status |     |
| Communication & Networking                    |                      | 1024 Channel ADPCM         |        | Alliance     | 1.0        |                | *       | +   | Amphion \$                       | 3   |        |     |
| Asynchronous Transfer Mode                    |                      | 16 Channel ADPCM           |        | Alliance     | 1.0        |                |         | •   | Amphion :                        | 3   |        |     |
| Encryption Decryption                         |                      | 256 Channel ADPCM          |        | Alliance     | 1.0        |                |         |     | 🔶 Amphion (                      | З   |        |     |
| Encrypaon Decrypaon                           |                      | 512 Channel ADPCM          |        | Alliance     | 1.0        |                | •       | •   | 🔶 Amphion S                      | 3   |        |     |
| Ethernet                                      |                      | 768 Channel ADPCM          |        | Alliance     | 1.0        |                | ۰       |     | 🔶 Amphion 🕯                      | З   |        |     |
| Telecommunications                            |                      | ADPCM32                    |        | LOQIC COPE   | 1.0        |                | . •     | •   | Xilinx, Inc.                     |     |        |     |
| 🚞 Digital Signal Processing                   |                      | BOOSTLite Bluetooth bas    | eb     | OTRE         | 1.0        |                |         | •   | NewLogic                         |     |        |     |
| Building Blocks                               |                      | Convolution Encoder        |        | LOQIC RAPE   | 1.0        |                | ÷ •     | • • | Xilinx, Inc.                     |     |        |     |
| Correlators                                   |                      | DVB Satellite Modulator C  | ore    | ORE          | 1.0        | *              | •       | •   | MemecCo                          | re  |        |     |
| DSP ProtoType & Development H                 | lardware             | Flexbus-4 Information      |        | LAGIC 243    | 1.0        |                |         |     | <ul> <li>Xilinx, Inc.</li> </ul> |     |        |     |
| Filters                                       |                      | HDLC1                      |        | LQQIC Q+S    | 1.0        |                | ÷.      |     | Xilinx, Inc.                     |     |        |     |
| Image Processing                              |                      | HDLC32                     |        | Red Contract | 1.0        |                | 1.1     |     | Xilinx, Inc.                     |     |        |     |
|                                               |                      | Interleaver/De-interleaver | Int    | Alliance _   | - 1.1      |                | 1       | *   | Xilinx, Inc.                     |     |        |     |
|                                               |                      | Noisy Transmission Cha     | nne    | CARE         | 1.0        | *              |         | ·   | TILAB                            |     |        |     |
| Generated Modules:                            |                      |                            |        |              |            |                |         |     |                                  | _   |        | _   |
| Component Name                                | 1                    | Core Name                  | ( v    | ersion       | Family     | Vend           | ior     | 1   | Generated                        | 1   |        |     |
| asyncfif31x16                                 | Asynchrono           | us FIFO                    | 3.0    |              | ~          | Xilinx, Inc.   |         | Jul | 24. 2001                         |     |        |     |
| asyncfif31x16 default                         | Asynchrono           | us FIFO                    | 3.0    |              | Ň          | Xilinx, Inc.   |         | Jul | 24.2001                          |     |        |     |
| dafir                                         | Distributed A        | Arithmetic FIR Filter      | 5.0    |              | N          | Xilinx, Inc.   |         | Jun | 25, 2001                         |     |        |     |
| distmem64x16_dec                              | Distributed Memory   |                            |        |              | N.         | Xilinx, Inc.   |         | Jul | 11,2001                          |     |        |     |
| distmem64x16_dec                              | Distributed Memory 4 |                            |        |              | V          | Xilinx, Inc.   |         | Jul | 12,2001                          |     |        |     |
| distmem64x16 sinale                           | Distributed N        | vlemorv                    | 4.1    |              | <b>\</b> _ | Xilinx. Inc.   |         | Jul | 11.2001                          |     |        | -   |
|                                               |                      |                            |        |              |            |                |         |     |                                  |     |        |     |
| Generating the UEO()) circulation current     | artfiloc             |                            |        |              |            |                |         |     |                                  |     |        | -   |
| Central of the vertice strainability solution |                      |                            |        |              |            |                |         |     |                                  | io. | 2      | 5   |
| <u>µ</u>                                      |                      |                            |        |              |            |                |         |     |                                  | - 4 |        | ••• |

ug002\_c2\_068b\_100901

Figure 2-129: Core Generator User Interface

## Smart-IP Technology

Smart-IP technology leverages Xilinx FPGA architectural features, such as look-up tables (LUTs), distributed RAM, segmented routing and floorplanning information, as well as relative location constraints and expert logic mapping to optimize the performance of every core instance in a given Xilinx FPGA design. In the context of Virtex-II cores, Smart-IP technology includes the use of the special high-performance Virtex-II architectural features, such as embedded 18x18 multipliers, block memory, shift register look-up tables (SRL16's), and special wide mux elements.

Smart-IP technology delivers:

- Physical layouts optimized for high performance
- Predictable high performance and efficient resource utilization
- Reduced power requirements through compact design and interconnect minimization
- Performance independent of device size
- Ability to use multiple cores without deterioration of performance
- Reduced compile time over competing architectures

# **CORE** Generator Design Flow

A block diagram of the CORE Generator design flow is shown in Figure 2-130.



Figure 2-130: CORE Generator Design Flow

### Note:

1. The outputs produced by the CORE Generator consist of an implementation Netlist and optional schematic symbol, HDL template files, and HDL simulation model wrapper files.

# **Core Types**

### Parameterized Cores

The CORE Generator System supplies a wide assortment of parameterized IP cores that can be customized to meet specific Virtex-II design needs and size constraints. See Figure 2-131. For each parameterized core, the CORE Generator System supplies:

• A customized EDIF implementation netlist (.EDN)

- A parameterized Verilog or VHDL behavioral simulation model (.V, .VHD) and corresponding wrapper file (also .V, .VHD)
- Verilog or VHDL templates (.VEO, .VHO)
- An ISE Foundation or Viewlogic® schematic symbol

The EDIF implementation netlist is used by the Xilinx tools to implement the core. The other design files generated depend on the Design Entry settings specified (target CAE vendor, and design flow type -- schematic or HDL). Schematic symbol files are generated when a schematic design flow is specified for the project.

Parameterized HDL simulation models are provided in two separate HDL simulation libraries, one for Verilog functional simulation support, and the other for VHDL functional simulation support. The libraries, which are included as part of the Xilinx installation, are in the following locations:

\$XILINX/verilog/src/XilinxCoreLib

\$XILINX/vhdl/src/XilinxCoreLib

| ogi <mark>C</mark> ÄRE                   |                                                  | Distributed Arithmetic FIR Filter                                                                                                                              |
|------------------------------------------|--------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                          |                                                  | Component Name:                                                                                                                                                |
|                                          |                                                  | Filter Type     Filter Type     G Single Rate FIR     Halfband     Hilbert Transform     Interpolated     Interpolation     Decimation     Decimating Halfband |
| DIN<br>LD_DIN<br>COEF_LD<br>LD_WE<br>CLK | DOUT_I<br>DOUT_Q<br>SEL_I<br>SEL_O<br>RDY<br>RFD | Filter Options         Number of Channels:       1 I Zero Packing Factor:         Sample Rate Change:       1 I Zero Valid Range: 18                           |
| — ND<br>                                 |                                                  | Subtrate       Number of Taps:       16       21024       Coefficients       Impulse Response       C Symmetric       Non Symmetric                            |
|                                          |                                                  | < Back Next > Page 1 of 2                                                                                                                                      |

ug002\_c2\_070a\_100501

#### Figure 2-131: Core Customization Window for a Parameterized Core

If using a compiled simulator, these libraries must be precompiled before performing a functional simulation of the cores. An analyze\_order file describing the required compile order of these models is included with each XilinxCoreLib library, one for Verilog (verilog\_analyze\_order) and one for VHDL (vhdl\_analyze\_order).

For an HDL design flow, Verilog and VHDL templates (.VEO and .VHO files) are also provided to facilitate the integration of the core into the design for the purposes of functional simulation, synthesis, and implementation. The Verilog (.V) and VHDL (.VHD) wrapper files are also generated. The wrapper files for a particular core are compiled like normal simulation models. They convey custom parameter values to the corresponding generic, parameterized behavioral model for that core in the XilinxCoreLib library. The custom parameter values are used to tailor the behavior of the customized core. The following is a sample VHO template:

```
component adder8
   port (
   a: IN std logic VECTOR(7 downto 0);
   b: IN std logic VECTOR(7 downto 0);
   c: IN std logic;
   ce: IN std_logic;
   ci: IN std logic;
   clr: IN std logic;
    s: OUT std logic VECTOR(8 downto 0));
end component;
-- Synplicity black box declaration
attribute black box : boolean;
attribute black_box of test: component is true;
-- COMP TAG END ----- End COMPONENT Declaration ------
-- The following code must appear in the VHDL architecture
-- body. Substitute your own instance name and net names.
----- Begin Cut here for INSTANTIATION Template ----- INST TAG
your instance name : adder8
   port map (
   a => a,
   b => b,
   C => C,
   ce => ce,
   ci => ci,
   clr => clr,
   s => s);
-- INST TAG END ----- End INSTANTIATION Template -----
-- You must compile the wrapper file test.vhd when simulating
-- the core, test. When compiling the wrapper file, be sure to
-- reference the XilinxCoreLib VHDL simulation library. For detailed
-- instructions, please refer to the "Core Generator Guide".
```

### **Fixed Netlist Cores**

The other type of Virtex-II core provided by the CORE Generator is the fixed netlist core. These are preset, non-parameterized designs that are shipped with the following:

- A fixed EDIF implementation netlist (as opposed to one that is customized on the fly)
- .VEO and .VHO templates
- Non-parameterized .V and .VHD behavioral simulation models
- Schematic symbol support

Examples include the fixed netlist Xilinx FFTs and most AllianceCORE products.

Since the HDL behavioral models for fixed netlist cores are not parameterized, the corresponding .VEO and .VHO template files are correspondingly simple. They do not need to pass customizing parameter values to a library behavioral model.

## Xilinx IP Solutions and the IP Center

The CORE Generator works in conjunction with the Xilinx IP Center on the world wide web to provide the latest IP and software upgrades. To make the most of this resource, Xilinx highly recommends that whenever starting a design, first do a quick search of the Xilinx IP Center (<u>www.xilinx.com/ipcenter</u>) to see whether a ready-made core solution is already available.

A complete catalog of Xilinx cores and IP tools resides on the IP Center, including:

- LogiCORE Products
- AllianceCORE Products
- Reference Designs
- XPERTS Partner Consultants
- Design Reuse Tools

When installing the CORE Generator software, the designer gains immediate access to dozens of cores supplied by the LogiCORE Program. In addition, data sheets are available for all AllianceCORE products, and additional, separately licensed, advanced function LogiCORE products are also available. New and updated Virtex-II IP for the CORE Generator can be downloaded from the IP Center and added to the CORE Generator catalog.

### LogiCORE Program

LogiCORE products are designed, sold, licensed, and supported by Xilinx. LogiCORE products include a wide selection of generic, parameterized functions, such as muxes, adders, multipliers, and memory cores which are bundled with the Xilinx CORE Generator software at no additional cost to licensed software customers. System-level cores, such as PCI, Reed-Solomon, ADPCM, HDLC, POS-PHY, and Color Space Converters are also available as optional, separately licensed products. Probably, the most common application of the CORE Generator is to use it to quickly generate Virtex-II block and distributed memories. A more detailed listing of available Virtex-II LogiCORE products is available in Table 2-65 and on the Xilinx IP Center website (<u>www.xilinx.com/ipcenter</u>).

Types of IP currently offered by the Xilinx LogiCORE program include:

- Basic Elements: logic gates, registers, multiplexers, adders, multipliers
- Communications and Networking: ADPCM modules, HDLC controllers, ATM building blocks, forward error correction modules, and POS-PHY Interfaces
- DSP and Video Image Processing: cores ranging from small building blocks (e.g., Time Skew Buffers) to larger system-level functions (e.g., FIR Filters and FFTs)
- System Logic: accumulators, adders, subtracters, complementers, multipliers, integrators, pipelined delay elements, single and dual-port distributed and block RAM, ROM, and synchronous and asynchronous FIFOs
- Standard Bus Interfaces: PCI 64/66 (64-bit, 66 MHz), 64/33 (64-bit, 33 MHz), and 32/33 (32-bit, 3 3MHz) Interfaces

### AllianceCORE Program

The AllianceCORE program is a cooperative effort between Xilinx and third-party IP developers to provide additional system-level IP cores optimized for Xilinx FPGAs. To ensure a high level of quality, AllianceCORE products are implemented and verified in a Xilinx device as part of the certification process.

Xilinx develops relationships with AllianceCORE partners who can complement the Xilinx LogiCORE product offering. Where Xilinx does not offer a LogiCORE for a particular function, Xilinx partners with an AllianceCORE partner to offer that function. A large percentage of Xilinx AllianceCORE partners focus on data and telecommunication applications, as well as processor and processor peripheral designs.

Together, Xilinx and the AllianceCORE partners are able to provide an extensive library of cores to accelerate the design process. AllianceCORE products include customizable cores which can be configured to exact needs, as well as fixed netlist cores targeted toward specific applications. In many cases, partners can provide cores customized to meet the specific design needs if the primary offerings do not fit the requirements. Additionally, source code versions of the cores are often available from the partners at additional cost for those who need maximum flexibility.

The library of Xilinx and AllianceCORE IP cores allows designers to leverage the expertise of experienced designers who are well-versed in optimizing designs for Virtex-II and other Xilinx architectures. This enables designers to obtain high performance and density in the target Virtex-II device with a faster time to market.

### **Reference Designs**

Xilinx offers two types of reference designs; application notes (XAPPs) developed by Xilinx, and reference designs developed through the Xilinx Reference Design Alliance Program. Both types are extremely valuable to customers looking for guidance when designing systems. Reference designs can often be used as starting points for implementing a broad spectrum of functions in Xilinx programmable logic.

Application notes developed by Xilinx usually include supporting design files. They are supplied free of charge, without technical support or warranty. To see currently available reference designs, visit <u>www.xilinx.com</u>.

Reference designs developed through the Xilinx Reference Design Alliance Program are developed, owned, and controlled by the partners in the program. The goal of the program is to form strategic engineering and marketing partnerships with other semiconductor manufacturers and design houses so as to assist in the development of high quality, multicomponent reference designs that incorporate Xilinx devices and demonstrate how they can operate at the system level with other specialized and general purpose semiconductors.

The reference designs in the Xilinx Reference Design Alliance Program are fully functional and applicable to a wide variety of digital electronic systems, including those used for networking, communications, video imaging, and DSP applications. Visit <u>www.xilinx.com</u> to see a list of designs currently available through this program.

### **XPERTS** Program

Xilinx established the XPERTS Program to provide customers with access to a worldwide network of certified design consultants proficient with Xilinx Platform FPGAs, software, and IP core integration. All XPERT members are certified and have extensive expertise and experience with Xilinx technology in various vertical applications, such as communications and networking, DSP, video and image processing, system I/O interfaces, and home networking.

XPERTS partners are an integral part of Xilinx strategy to provide customers with costefficient design solutions, while accelerating time to market. For more information on Xilinx XPERTS Program, visit <u>www.xilinx.com</u>.

### **Design Reuse Tools**

To facilitate the archiving and sharing of IP created by different individuals and workgroups within a company, Xilinx offers the IP Capture Tool. The IP Capture Tool helps to package design modules created by individual engineers in a standardized format so that they can be cataloged and distributed using the Xilinx CORE Generator. A core can take the form of synthesizable VHDL or Verilog code, or a fixed function netlist. Once it is packaged by the IP Capture Tool and installed into the CORE Generator, the "*captured*" core can be shared with other designers within a company through an internal network. The IP Capture Tool is supplied as a separate utility through the Xilinx IP Center. For more information, see the <u>www.xilinx.com/ipcenter</u> website.

# **CORE** Generator Summary

The CORE Generator delivers a complete catalog of IP including behavioral models, synthesis templates, and netlists with performance guaranteed by Xilinx Smart-IP technology. It is a repository for LogiCORE products from Xilinx, AllianceCORE products from Xilinx partners, and it supports Design Reuse for internally developed IP. In addition,

LogiCORE products are continuously updated to add support for new Xilinx architectures, such as Virtex-II. The most current IP updates are available from the Xilinx IP Center.

Utilizing the CORE Generator library of parameterizable cores, designed by Xilinx for Xilinx FPGAs, the designer can enjoy the advantages of design reuse, including faster time to market and lower cost solutions. For more information, visit the Xilinx IP Center www.xilinx.com/ipcenter website.

# Virtex-II IP Cores Support

Table 2-65 provides a partial listing of cores available for Virtex-II designs. For a complete catalog of Virtex-II IP, visit the Xilinx IP Center <u>www.xilinx.com/ipcenter</u> website.

| Eurotion                                         | Vendor    |              | Impler | Implementation Example |           | Kov Features                                          | Application                   |
|--------------------------------------------------|-----------|--------------|--------|------------------------|-----------|-------------------------------------------------------|-------------------------------|
| Function                                         | Name      | іг туре      | Occ    | MHz                    | Device    | Rey Features                                          | Examples                      |
| <b>Basic Elements</b>                            |           | -<br>-       |        |                        |           |                                                       | -                             |
| BUFE-based<br>Multiplexer Slice                  | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| BUFT-based<br>Multiplexer Slice                  | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| Binary Counter                                   | Xilinx    | LogiCORE     |        |                        |           | 2-256 bits output width                               |                               |
| Binary Decoder                                   | Xilinx    | LogiCORE     |        |                        |           | 2-256 bits output width                               |                               |
| Bit Bus Gate                                     | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| Bit Gate                                         | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| Bit Multiplexer                                  | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| Bus Gate                                         | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| Bus Multiplexer                                  | Xilinx    | LogiCORE     |        |                        |           | IO widths up to 256 bits                              |                               |
| Comparator                                       | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| FD-based Parallel<br>Register                    | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| FD-based Shift<br>Register                       | Xilinx    | LogiCORE     |        |                        |           | 1-64 bits wide                                        |                               |
| LD-based Parallel<br>Latch                       | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide                                       |                               |
| RAM-based Shift<br>Register                      | Xilinx    | LogiCORE     |        |                        |           | 1-256 bits wide, 1024<br>words deep                   |                               |
| Communication & N                                | etworking |              |        |                        |           |                                                       |                               |
| 3G FEC Package                                   | Xilinx    | LogiCORE     |        |                        |           | Viterbi Decoder, Turbo<br>Codec, Convolutional<br>Enc | 3G Wireless<br>Infrastructure |
| 3GPP Compliant<br>Turbo Convolutional<br>Decoder | Xilinx    | LogiCORE     | 80%    | 40                     | XC2V500   | 3GPP specs, 2 Mbps,<br>BER=10-6 for 1.5dB SNR         | 3G Wireless<br>Infrastructure |
| 3GPP Compliant<br>Turbo Convolutional<br>Encoder | Xilinx    | LogiCORE     | 65%    | 60                     | XC2V250   | Compliant w/ 3GPP,<br>puncturing                      | 3G Wireless<br>Infrastructure |
| 3GPP Turbo Decoder                               | SysOnChip | AllianceCORE | 87%    | 66                     | XC2V500-5 | 3GPP/UMTS compliant,<br>IMT-2000, 2Mbps data          | Error correction,<br>wireless |

Table 2-65: Virtex-II IP Cores Support

| _                                                          | Vendor   |              | Implen    | nentati | on Example          |                                                                                                 | Application                                                                                       |
|------------------------------------------------------------|----------|--------------|-----------|---------|---------------------|-------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------|
| Function                                                   | Name     | ІР Туре      | Occ       | MHz     | Device              | Key Features                                                                                    | Examples                                                                                          |
| 8b/10b Decoder                                             | Xilinx   | LogiCORE     | 1<br>BRAM | 100     | XC2V1000            | Industry std 8b/10b<br>en/decode for serial data<br>transmission                                | Physical layer of<br>Fiber Channel                                                                |
| 8b/10b Encoder                                             | Xilinx   | LogiCORE     | 1<br>BRAM | 100     | XC2V1000            | Industry std 8b/10b<br>en/decode for serial data<br>transmission                                | Physical layer of<br>Fiber Channel                                                                |
| ADPCM 1024<br>Channel                                      | Amphion  | AllianceCORE |           |         |                     | G.721, 723, 726, 726a, 727,<br>727a, u-law, a-law                                               | DECT, VOIP,<br>cordless telephony                                                                 |
| ADPCM 256 Channel                                          | Amphion  | AllianceCORE |           |         |                     | G.721, 723, 726, 726a, 727,<br>727a, u-law, a-law                                               | DECT, VOIP,<br>cordless telephony                                                                 |
| ADPCM 512 Channel                                          | Amphion  | AllianceCORE |           |         |                     |                                                                                                 |                                                                                                   |
| ADPCM 768 Channel                                          | Amphion  | AllianceCORE | 89%       | 50      | XC2V500-5           | G.721, 723, 726, 726a, 727,<br>727a, u-law, a-law                                               | DECT, VOIP,<br>cordless telephony                                                                 |
| ADPCM Speech<br>Codec, 32 Channel<br>(DO-DI-ADPCM32)       | Xilinx   | LogiCORE     | 62%       | 25      | XC2V500             | G.726, G.727, 32 duplex<br>channels                                                             | DECT, VOIP,<br>Wireless local<br>loop, DSLAM, PBX                                                 |
| ADPCM Speech<br>Codec, 64 Channel<br>(DO-DI-ADPCM64)       | Xilinx   | LogiCORE     | 61%       | 27      | XC2V500             | G.726, G.727, 64 duplex<br>channels                                                             | DECT, VOIP,<br>wireless local loop,<br>DSLAM, PBX                                                 |
| BOOST LITE<br>Bluetooth Baseband<br>Processor              | NewLogic | AllianceCORE | 73%       | 33%     | XC2V1000-4          | Compliant to Bluetooth<br>v1.1, BQB qualified<br>software for L2CAP,<br>LHP, HC1, voice support | Bluetooth<br>applications                                                                         |
| BOOST Lite<br>Bluetooth Baseband<br>Processor              | NewLogic | AllianceCORE | 73%       | 33%     | XC2V1000-4          | Compliant to Bluetooth<br>v1.1, BQB qualified<br>software for L2CAP,<br>LHP, HC1, voice support | Bluetooth<br>applications                                                                         |
| Convolutional<br>Encoder                                   | Xilinx   | LogiCORE     | 10%       | 26      | XC2V40              | k from 3 to 9, puncturing<br>from 2/3 to 12/13                                                  | 3G base stations,<br>broadcast, wireless<br>LAN, cable<br>modem, xDSL,<br>satellite com,<br>uwave |
| DVB-RCS Turbo<br>Decoder                                   | iCODING  | AllianceCORE | 54%       | 69      | XC2V2000-5          | DVB-RCS compliant,<br>9Mbps, data rate,<br>switchable code rates<br>and frame sizes             | Error correction,<br>wireless, DVB,<br>Satellite data link                                        |
| Flexbus 4 Interface<br>Core, 16-Channel<br>(DO-DI-FLX4C16) | Xilinx   | LogiCORE     | 31%       | 200     | XC2V3000<br>FG676-5 |                                                                                                 | Line card: terabit<br>routers & optical<br>switches                                               |
| Flexbus 4 Interface<br>Core, 4-Channel (DO-<br>DI-FLX4C4)  | Xilinx   | LogiCORE     | 27%       | 200     | XC2V1000<br>FG456-5 |                                                                                                 | Line card: terabit<br>routers & optical<br>switches                                               |
| Flexbus 4 Interface<br>Core, 1-Channel (DO-<br>DI-FLX4C1)  | Xilinx   | LogiCORE     | 12%       | 200     | XC2V1000<br>FG456-5 |                                                                                                 | Line card: terabit<br>routers & optical<br>switches                                               |
|                                                            |          |              |           |         |                     |                                                                                                 |                                                                                                   |

| Table 2-65: | Virtex-II IP Cores Support | (Continued) |
|-------------|----------------------------|-------------|
|-------------|----------------------------|-------------|

| Eunction                                                                                | Vendor  |              | Implen | nentati | on Example           | Kov Egaturos                                                                                                                    | Application                                                                                             |
|-----------------------------------------------------------------------------------------|---------|--------------|--------|---------|----------------------|---------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------|
| Function                                                                                | Name    | іг іуре      | Occ    | MHz     | Device               | Rey realules                                                                                                                    | Examples                                                                                                |
| HDLC Controller<br>Core, 32 Channels                                                    | Xilinx  | LogiCORE     | 34%    | 81      | XC2V250              | 32 full duplex, CRC-<br>16/32, 8/16-bit address<br>insertion/deletion                                                           | X.25, POS, cable<br>modems, frame<br>relay switches,<br>video confer. over<br>ISDN                      |
| HDLC Controller<br>Core, Single Channel                                                 | Xilinx  | LogiCORE     | 15%    | 115     | XC2V250              | 16/32-bit frame seq,<br>8/16-bit addr<br>insert/delete, flag/zerop<br>insert/detect                                             | X.25, POS, cable<br>modems, frame<br>relay switches,<br>video conf. over<br>ISDN                        |
| Interleaver/De-<br>interleaver                                                          | Xilinx  | LogiCORE     | 30%    | 187     | XC2V40               | Convolutional, width up<br>to 256 bits, 256 branches                                                                            | Broadcast, wireless<br>LAN, cable<br>modem, xDSL,<br>satellite<br>com,uwave nets,<br>digital TV         |
| PE-MACMII Dual<br>Speed 10/100 Mbps<br>Ethernet MAC                                     | Alcatel | AllianceCORE | 33%    | 60      | XC2V500-4            | 802.3 compliant,<br>Supports single &<br>multimode fiber optic<br>devices, M11 interfaces,<br>RMON and Etherstate<br>statistics | Networking,<br>Broadband, NIC,<br>SOHO, Home<br>networking,<br>storage, routers,<br>switches, printers, |
| POS-PHY Level 3<br>Link Layer Interface<br>Core, 48 Channel<br>(DO-DI-<br>POSL3LINK48A) | Xilinx  | LogiCORE     | 33%    | 104     | XC2V6000<br>FF1152-4 |                                                                                                                                 |                                                                                                         |
| POS-PHY L3 Link<br>Layer Interface, 16-Ch<br>(DO-DI-<br>POSL3LINK16)                    | Xilinx  | LogiCORE     | 40%    | 104     | XC2V1000<br>FG456-4  |                                                                                                                                 | Line card: terabit<br>routers & optical<br>switches                                                     |
| POS-PHY L3 Link<br>Layer Interface, 4-Ch<br>(DO-DI-<br>POSL3LINK4)                      | Xilinx  | LogiCORE     | 15%    | 104     | XC2V1000<br>FG456-4  |                                                                                                                                 | Line card: terabit<br>routers & optical<br>switches                                                     |
| POS-PHY L3 Link<br>Layer Interface, 2-Ch<br>(DO-DI-<br>POSL3LINK2)                      | Xilinx  | LogiCORE     | 55%    | 104     | XCV50E-8             |                                                                                                                                 | Line card: terabit<br>routers & optical<br>switches                                                     |
| POS-PHY L3 Link<br>Layer Interface,<br>Single Channel                                   | Xilinx  | LogiCORE     | 6%     | 104     | XC2V1000<br>FG456-4  |                                                                                                                                 |                                                                                                         |
| POS-PHY L4 Multi-<br>Channel Interface<br>(DO-DI-POSL4MC)                               | Xilinx  | LogiCORE     | 29%    | 104     | XC2V3000<br>FG676-5  |                                                                                                                                 |                                                                                                         |
| Reed-Solomon<br>Decoder                                                                 | Xilinx  | LogiCORE     | 40%    | 98      | XC2V250              | Std or custom coding, 3-<br>12 bit symbol width, up<br>to 4095 symbols                                                          | Broadcast, wireless<br>LAN, digital TV,<br>cable modem,<br>xDSL, satellite<br>com,uwave nets            |
| Reed-Solomon<br>Decoder                                                                 | TILAB   | AllianceCORE | 56%    | 61      | XC2V1000-5           | parameterizable, RTL<br>available                                                                                               | Error correction,<br>wireless, DSL                                                                      |

|                                              | Vendor    |              | Impler | ementation Example        |            | ementation Examp                                                                            |                                                                                                   |  | Application |
|----------------------------------------------|-----------|--------------|--------|---------------------------|------------|---------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------|--|-------------|
| Function                                     | Name      | ІР Туре      | Occ    | MHz                       | Device     | Key Features                                                                                | Examples                                                                                          |  |             |
| Reed-Solomon<br>Encoder                      | Xilinx    | LogiCORE     | 42%    | 180                       | XC2V40     | Std or cust coding, 3-12<br>bit width, up to 4095<br>symbols with 256 check<br>symb.        | Broadcast, wireless<br>LAN, digital TV,<br>cable modem,<br>xDSL, satellite<br>com,uwave nets      |  |             |
| SDLC Controller                              | CAST      | AllianceCORE | 38%    | 158                       | XC2V100-5  | Like Intel 8XC152 Global<br>Serial Channel, Serial<br>Comm., HDLC apps,<br>telecom          | Embedded<br>systems,<br>professionalaudio,<br>video                                               |  |             |
| SPEEDROUTER<br>Network Processor             | IP        | AllianceCORE | 64%    | 80<br>MHz,<br>2.5<br>Gbps | XC2V1500-5 | Solution requires<br>SPEEDAnalyzer ASIC,<br>2.5 Gbps fdx wire speed;<br>net processor (NPV) | Networking, edge<br>and access,<br>Switches and<br>routers                                        |  |             |
| Turbo Decoder -<br>3GPP                      | SysOnChip | AllianceCORE | 88%    | 65                        | XC2V2000-5 | 3GPP/UMTS compliant,<br>2Mbps data rate                                                     | Error correction,<br>wireless                                                                     |  |             |
| Turbo Encoder                                | TILAB     | AllianceCORE | 48%    | 120                       | XC2V80-5   | 3GPP/UMTS compliant,<br>upto 4 interleaver laws                                             | Error correction,<br>wireless                                                                     |  |             |
| TURBO_DEC Turbo<br>Decoder                   | TILAB     | AllianceCORE | 99%    | 65                        | XC2V2000-5 | 3GPP/UMTS compliant,<br>>2Mbps data rate                                                    | Error correction,<br>wireless                                                                     |  |             |
| Viterbi Decoder                              | Xilinx    | LogiCORE     | 80%    | 100                       | XC2V250    | Puncturing, serial & parallel architecture,                                                 | 3G base stations,<br>broadcast, wireless<br>LAN, cable<br>modem, xDSL,<br>satellite com,<br>uwave |  |             |
| Viterbi Decoder, IEEE<br>802-compatible      | Xilinx    | LogiCORE     | 70%    | 147                       | XC2V250    | Constraint length(k)=7,<br>G0=171, G1=133                                                   | L/MMDS, cable<br>modem, broadcast<br>equip, wireless<br>LAN, xDSL, sat<br>com, uwave nets         |  |             |
| Digital Signal Proces                        | sing      |              |        |                           |            | 1                                                                                           |                                                                                                   |  |             |
| 1024-Point Complex<br>FFT IFFT for Virtex-II | Xilinx    | LogiCORE     | 62%    | 41us,<br>100<br>MHz       | XC2V500    | 16 bit complex data, 2's<br>comp, forward and<br>inverse transform                          |                                                                                                   |  |             |
| 16-Point Complex<br>FFT IFFT for Virtex-II   | Xilinx    | LogiCORE     | 37%    | 123ns,<br>130<br>MHz      | XC2V500    | 16 bit complex data, 2's<br>comp, forward and<br>inverse transform                          |                                                                                                   |  |             |
| 256-Point Complex<br>FFT IFFT for Virtex-II  | Xilinx    | LogiCORE     | 54%    | 7.7us,<br>100<br>MHz      | XC2V500    | 16 bit complex data, 2's<br>comp, forward and<br>inverse transform                          |                                                                                                   |  |             |
| 32 Point Complex<br>FFT/IFFT                 | Xilinx    | LogiCORE     |        |                           |            |                                                                                             |                                                                                                   |  |             |
| 64-Point Complex<br>FFT IFFT for Virtex-II   | Xilinx    | LogiCORE     | 38%    | 1.9us,<br>100<br>MHz      | XC2V500    | 16 bit complex data, 2's<br>comp, forward and<br>inverse transform                          |                                                                                                   |  |             |
| Bit Correlator                               | Xilinx    | LogiCORE     |        |                           |            | 4096 taps, serial/parallel input, 4096 bits width                                           |                                                                                                   |  |             |
| Cascaded Integrator<br>Comb (CIC)            | Xilinx    | LogiCORE     |        |                           |            | 32 bits data width, rate change from 8 to 16384                                             |                                                                                                   |  |             |

| Eurotion                                          | Vendor  |              | Impler | nentatio | on Example | Key Feeturee                                                                                       | Application                    |
|---------------------------------------------------|---------|--------------|--------|----------|------------|----------------------------------------------------------------------------------------------------|--------------------------------|
| Function                                          | Name    | ір туре      | Occ    | MHz      | Device     | Key reatures                                                                                       | Examples                       |
| Direct Digital<br>Synthesizer                     | Xilinx  | LogiCORE     |        |          |            | 8-65K samples, 32-bits<br>output precision, phase<br>dithering/offset                              |                                |
| Distributed<br>Arithmetic FIR Filter              | Xilinx  | LogiCORE     |        |          |            | 32-bit input/coeff width,<br>1024 taps, 1-8 chan,<br>polyphase, online coeff<br>reload             |                                |
| GVA-300 Virtex-II<br>DSP Hardware<br>Accelerator  | GV      | AllianceCORE | NA     | NA       |            | 2 Virtex-II, Spartan-II<br>FPGAs, 1 CPLD, Matlab<br>I/F                                            | DSP prototyping                |
| LFSR, Linear<br>Feedback Shift<br>Register        | Xilinx  | LogiCORE     |        |          |            | 168 input widths,<br>SRL16/register<br>implementation                                              |                                |
| Math Functions                                    |         |              |        |          |            |                                                                                                    |                                |
| Accumulator                                       | Xilinx  | LogiCORE     |        |          |            | 1-256s bit wide                                                                                    |                                |
| Adder Subtracter                                  | Xilinx  | LogiCORE     |        |          |            | 1-256s bit wide                                                                                    |                                |
| DFP2INT Floating<br>Point to Integer<br>Converter | Digital | AllianceCORE | 39%    | 66       | XC2V250-5  | Full IEEE-754<br>compliance, 4 pipelines,<br>Single precision real<br>format support               | DSP, Math,<br>Arithmetic apps  |
| DFPADD Floating<br>Point Adder                    | Digital | AllianceCORE | 39%    | 66       | XC2V250-5  | Full IEEE-754<br>compliance, 4 pipelines,<br>Single precision real<br>format support               | DSP, Math,<br>Arithmetic apps  |
| DFPCOMP Floating<br>Point Comparator              | Digital | AllianceCORE | 16%    | 91       | XC2V80-5   | Full IEEE-754<br>compliance, 4 pipelines,<br>Single precision real<br>format support               | DSP, Math,<br>Arithmetic apps. |
| DFPDIV Floating<br>Point Divider                  | Digital | AllianceCORE | 99%    | 53       | XC2V250-5  | Full IEEE-754<br>compliance, 15 pipelines,<br>Single precision real<br>format support              | DSP, Math,<br>Arithmetic apps  |
| DFPMUL Floating<br>Point Multiplier               | Digital | AllianceCORE | 44%    | 74       | XC2V250-5  | Full IEEE-754<br>compliance, 7<br>pipelines,32x32 mult,<br>Single precision real<br>format support | DSP, Math,<br>Arithmetic apps. |
| DFPSQRT Floating<br>Point Square Root             | Digital | AllianceCORE | 39%    | 66       | XC2V250-5  | Full IEEE-754<br>compliance, 4 pipelines,<br>Single precision real<br>format support               | DSP, Math,<br>Arithmetic apps  |
| DINT2FP Integer to<br>Floating Point<br>Converter | Digital | AllianceCORE | 37%    | 73       | XC2V250-5  | Full IEEE-754<br>compliance, double<br>word input, 2 pipelines,<br>Single precision real<br>output | DSP, Math,<br>Arithmetic apps  |
| Multiply<br>Accumulator (MAC)                     | Xilinx  | LogiCORE     |        |          |            | Input width up to 32 bits,<br>65-bit accumulator,<br>truncation rounding                           |                                |

| Eurotion                                  | Vendor        |              | Implen | nentatio | on Example | Kov Footuroo                                                                                                     | Application                                      |
|-------------------------------------------|---------------|--------------|--------|----------|------------|------------------------------------------------------------------------------------------------------------------|--------------------------------------------------|
| Function                                  | Name          | ір туре      | Occ    | MHz      | Device     | Key Features                                                                                                     | Examples                                         |
| Multiply Generator                        | Xilinx        | LogiCORE     |        |          |            | 64-bit input data width,<br>constant, reloadable or<br>variable inputs,<br>parallel/sequential<br>implementation |                                                  |
| Pipelined Divider                         | Xilinx        | LogiCORE     |        |          |            | 32-bit input data width,<br>multiple clock per output                                                            |                                                  |
| Sine Cosine Look Up<br>Table              | Xilinx        | LogiCORE     |        |          |            | 3-10 bit in, 4-32 bit out,<br>distributed/block ROM                                                              |                                                  |
| Twos Complementer                         | Xilinx        | LogiCORE     |        |          |            | Input width up to 256<br>bits                                                                                    |                                                  |
| Memories & Storage                        | Elements      |              |        |          |            |                                                                                                                  |                                                  |
| Asynchronous FIFO                         | Xilinx        | LogiCORE     |        |          |            | 1-256 bits, 15-65535<br>words, DRAM or BRAM,<br>independent I/O clock<br>domains                                 |                                                  |
| Content Addressable<br>Memory (CAM)       | Xilinx        | LogiCORE     |        |          |            | 1-512 bits, 2-10K words,<br>SRL16                                                                                |                                                  |
| Distributed Memory                        | Xilinx        | LogiCORE     |        |          |            | 1-1024 bit, 16-65536<br>word,<br>RAM/ROM/SRL16, opt<br>output regs and<br>pipelining                             |                                                  |
| Dual-Port Block<br>Memory                 | Xilinx        | LogiCORE     |        |          |            | 1-256 bits, 2-13K words                                                                                          |                                                  |
| Single-Port Block<br>Memory               | Xilinx        | LogiCORE     |        |          |            | 1-256 bits, 2-128K words                                                                                         |                                                  |
| Synchronous FIFO                          | Xilinx        | LogiCORE     |        |          |            | 1-256 bits, 16-256 words,<br>distributed/block RAM                                                               |                                                  |
| Microprocessors, Cor                      | ntrollers & F | Peripherals  |        |          |            |                                                                                                                  |                                                  |
| 10/100 Ethernet<br>MAC                    | Xilinx        | LogiCORE     |        |          |            | Interfaces through OPB<br>to MicroBlaze                                                                          | Networking,<br>comm., processor<br>applications  |
| AX1610 16-bit RISC<br>Processor           | Loarant       | AllianceCORE | 12%    | 91       | XC2V500-5  | 44 opcode, 64-K word<br>data, program, Harvard<br>arch.                                                          | Control functions,<br>State mach,<br>Coprocessor |
| C165X<br>MicroController                  | CAST          | AllianceCORE | 60%    | 134      | XC2V80-5   | Microchip 16C5X PIC<br>like                                                                                      | Embedded<br>systems, telecom                     |
| C68000<br>Microprocessor                  | CAST          | AllianceCORE | 90%    | 32       | XC2V500-5  | MC68000 Compatible                                                                                               | Embedded<br>systems, pro<br>audio, video         |
| CPU FPGA (Virtex-II)<br>MicroEngine Cards | NMI           | AllianceCORE | NA     | NA       | NA         | Hitachi SH-3 CPU                                                                                                 | Embedded<br>systems                              |
| CZ80CPU<br>Microprocessor                 | CAST          | AllianceCORE | 55%    | 72       | XC2V500-5  | Zilog Z80 compatible, 8-<br>bit processor                                                                        | Embedded<br>systems,<br>Communications           |

| Function                                        | Vendor         | IP Type      | Implementation Example |       | on Example | Kov Foaturos                                                                                              | Application                                             |
|-------------------------------------------------|----------------|--------------|------------------------|-------|------------|-----------------------------------------------------------------------------------------------------------|---------------------------------------------------------|
| Function                                        | Name           | іг туре      | Occ                    | MHz   | Device     | Rey realures                                                                                              | Examples                                                |
| DDR SDRAM<br>Controller Core                    | Memec-<br>Core | AllianceCORE | 7%                     | 133   | XC2V1000-4 | DDR SDRAM burst<br>length support for 2,4,8<br>per access, supports data<br>16,32, 64, 72.                | Digital video,<br>embedded<br>computing ,<br>networking |
| DFPIC125X Fast RISC<br>MicroController          | Digital        | AllianceCORE | 49%                    | 126   | XC2V80-5   | PIC 12c4x like, 2X faster,<br>12-bit wide instruction<br>set, 33 instructions                             | Embedded<br>systems, telecom,<br>audio and video        |
| DFPIC1655X Fast<br>RISC MicroController         | Digital        | AllianceCORE | 79%                    | 140   | XC2V80-5   | S/W compatible with<br>PIC16C55X, 14-bit<br>instruction set, 35<br>instructions                           | Embedded<br>systems, telecom,<br>audio and video        |
| DFPIC165X Fast RISC<br>MicroController          | Digital        | AllianceCORE | 49%                    | 126   | XC2V80-5   | PIC 12c4x like, 2X faster,<br>12-bit wide instruction<br>set, 33 instructions                             | Embedded<br>systems, telecom,<br>audio and video        |
| DI2CM I2C Bus<br>Controller Master              | Digital        | AllianceCORE | 58%                    | 143   | XC2V50-5   | I2C-like, multi master,<br>fast/std. modes                                                                | Embedded<br>systems                                     |
| DI2CM I2C Bus<br>Controller Slave               | Digital        | AllianceCORE | 28%                    | 157   | XC2V50-5   | I2C-like, Slave                                                                                           | Embedded                                                |
| DI2CSB I2C Bus<br>Controller Slave Base         | Digital        | AllianceCORE | 15%                    | 187   | XC2V50-5   | I2C-like, Slave                                                                                           | Embedded<br>Systems                                     |
| DR8051 RISC<br>MicroController                  | Digital        | AllianceCORE | 68%                    | 73    | XC2V250-5  | 80C31 instruction set,<br>RISC architecture 6.7X<br>faster than standard 8051                             | Embedded<br>systems, telecom,<br>video                  |
| DR8051BASE RISC<br>MicroController              | Digital        | AllianceCORE | 46%                    | 80-90 | XC2V250-5  | 80C31 instruction set,<br>high speed multiplier,<br>RISC architecture 6.7X<br>faster than standard 8051   | Embedded<br>systems, telecom,<br>video                  |
| DR8052EX RISC<br>MicroController                | Digital        | AllianceCORE | 99%                    | 71    | XC2V250-5  | 80C31 instruction set,<br>high speed mult/div<br>,RISC 6.7X faster than<br>standard 8051                  | Embedded<br>systems, telecom,<br>video                  |
| e8254 Programmable<br>Interval<br>Timer/Counter | einfochips     | AllianceCORE | 1%                     | 175   | XC2V1000-5 | Three 8-bit parallel ports,<br>24 programmable IO<br>lines, 8-bit bidi data bus                           | Processor, I/O<br>interface                             |
| e8255 Peripheral<br>Interface                   | einfochips     | AllianceCORE | 1%                     | 175   | XC2V1000-5 | Three 8-bit parallel ports,<br>24 programmable IO<br>lines, 8-bit bidi data bus                           | Processor, I/O<br>interface                             |
| Flip805x-PS<br>Microprocessor                   | Dolphin        | AllianceCORE | 39%                    | 38    | XC2V1000-5 | Avg 8X faster & code<br>compatible v. legacy<br>8051, verification bus<br>monitor, SFR IF, DSP<br>focused | DSP, Telecom,<br>industrial, high<br>speed control      |
| IIC                                             | Xilinx         | LogiCORE     |                        |       |            | Interfaces through OPB<br>to MicroBlaze                                                                   | Networking, com, processor applic                       |
| LavaCORE<br>Configurable Java<br>Processor Core | Derivation     | AllianceCORE | 38%                    | 20    | XC2V1000-5 | 32b data/address<br>optional DES                                                                          | Internet appliance,<br>industrial control               |
| LavaCORE<br>Configurable Java<br>Processor Core | Derivation     | AllianceCORE | 38%                    | 20    | XC2V1000-5 | 32b data/address<br>optional DES                                                                          | Internet appliance,<br>industrial control               |

| Function                                                     | Vendor         |              | Implen | nentati | on Example          | Key Festures                                                                                                                                                                                        | Application                                                                     |
|--------------------------------------------------------------|----------------|--------------|--------|---------|---------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------|
| Function                                                     | Name           | ір туре      | Occ    | MHz     | Device              | Key Features                                                                                                                                                                                        | Examples                                                                        |
| Lightfoot 32-bit Java<br>Processor Core                      | Digital        | AllianceCORE | 33%    | 40      | XC2V1000-5          | 32bit data, 24 bit address,<br>3 Stage pipeline, Java/C<br>dev. tools                                                                                                                               | Internet appliance,<br>industrial control,<br>HAVi multimedia,<br>set top boxes |
| MicroBlaze Soft RISC<br>Processor                            | Xilinx         | LogiCORE     |        | 125     |                     | Soft RISC Processor,<br>small footprint                                                                                                                                                             | Networking,<br>communications                                                   |
| OPB Arbiter                                                  | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the<br>MicroBlaze<br>Development Kit                                                                                                                                                     | Processor<br>applications                                                       |
| OPB GPIO                                                     | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the<br>MicroBlaze<br>Development Kit                                                                                                                                                     | Processor<br>applications                                                       |
| OPB Interrupt<br>Controller                                  | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the<br>MicroBlaze<br>Development Kit                                                                                                                                                     | Processor<br>applications                                                       |
| OPB Memory<br>Interface (Flash,<br>SRAM)                     | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the<br>MicroBlaze<br>Development Kit                                                                                                                                                     | Processor<br>applications                                                       |
| OPB Timer/Counter                                            | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the<br>MicroBlaze<br>Development Kit                                                                                                                                                     | Processor<br>applications                                                       |
| OPB UART (16450,<br>16550)                                   | Xilinx         | LogiCORE     |        | 125     |                     | Interfaces through OPB<br>to MicroBlaze                                                                                                                                                             | Processor<br>applications                                                       |
| OPB UART Lite                                                | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the<br>MicroBlaze<br>Development Kit                                                                                                                                                     | Processor<br>applications                                                       |
| OPB WDT                                                      | Xilinx         | LogiCORE     |        | 125     |                     | Bundled in the Processo<br>MicroBlaze applicatio                                                                                                                                                    |                                                                                 |
| PF3100 PC/104-Plus<br>Reconfigurable<br>Module               | Derivation     | AllianceCORE | N/A    | N/A     | XC2V1000<br>FG256   | PC/104 & PC/104+<br>devlopment board                                                                                                                                                                | Internet appliance,<br>industrial control                                       |
| SPI                                                          | Xilinx         | LogiCORE     |        |         |                     | Interfaces through OPB<br>to MicroBlaze                                                                                                                                                             | Networking,<br>communications,<br>processor<br>applications                     |
| XF-UART<br>Asynchronous<br>Communications<br>Core            | Memec-<br>Core | AllianceCORE | 15%    | 50      | XCS20-4             | UART and baud rate generator                                                                                                                                                                        | Serial data<br>communication                                                    |
| Standard Bus Interfa                                         | ces            |              |        |         |                     |                                                                                                                                                                                                     |                                                                                 |
| PCI-X 64/100<br>Interface for Virtex-II<br>(DO-DI-PCIX64-VE) | Xilinx         | LogiCORE     | 30%    | 100     | XC2V1000<br>FG456-5 | PCI-X 1.0 comp, 64/32-<br>bit, 66 MHz PCI-X<br>initiator and target IF,<br>PCI 2.2 comp, 64/32-bit,<br>33 MHz PCI initiator and<br>target IF, 3.3 V PCI-X at<br>33-66 MHz, 3.3 V PCI at<br>0-33 MHz | Server,Embedded,<br>gb ethernet,U320<br>SCSI,Fibre<br>Ch,RAID<br>cntl,graphics  |

| Table 2-65: | Virtex-II IP Cores Support | (Continued) |
|-------------|----------------------------|-------------|
|             |                            | (           |

| <b>_</b>                                                        | Vendor          |              | Implementation Example |     | on Example          | ·· - ·                                                                                                      | Application                                                                                                                                                            |
|-----------------------------------------------------------------|-----------------|--------------|------------------------|-----|---------------------|-------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Function                                                        | Name            | ІР Туре      | Occ                    | MHz | Device              | Key Features                                                                                                | Examples                                                                                                                                                               |
| PCI32 Virtex Interface<br>Design Kit (DO-DI-<br>PCI32-DKT)      | Xilinx          | LogiCORE     | 6%                     | 66  | XC2V1000<br>FG456-5 | Includes PCI32 board,<br>drive development kit,<br>and customer education<br>3-day training class           |                                                                                                                                                                        |
| PCI32 Virtex<br>Interface, IP Only<br>(DO-DI-PCI32-IP)          | Xilinx          | LogiCORE     | 6%                     | 66  | XC2V1000<br>FG456-5 | v2.2 comp, assured PCI<br>timing, 3.3/5-V, 0-<br>waitstate, CPCI hot swap<br>friendly                       | PC add-in boards,<br>CPCI, Embedded                                                                                                                                    |
| PCI64 & PCI32,<br>IP Only<br>(DO-DI-PCI-AL)                     | Xilinx          | LogiCORE     | 6 - 7%                 | 66  | XC2V1000<br>FG456-5 | v2.2 comp, assured PCI<br>timing, 3.3/5-V, 0-<br>waitstate, CPCI hot swap<br>friendly                       | PC<br>boards,CPCI,Emb<br>edded,hiperf<br>video,gb ethernet                                                                                                             |
| PCI64 Virtex Interface<br>Design Kit (DO-DI-<br>PCI64-DKT)      | Xilinx          | LogiCORE     | 7%                     | 66  | XC2V1000<br>FG456-5 | v2.2 comp, assured PCI<br>timing, 3.3/5-V, 0-<br>waitstate, CPCI hot swap<br>friendly                       | PC boards, CPCI,<br>Embedded, hiperf<br>video, gb ethernet                                                                                                             |
| PCI64 Virtex<br>Interface, IP Only<br>(DO-DI-PCI64-IP)          | Xilinx          | LogiCORE     | 7%                     | 66  | XC2V1000<br>FG456-5 | v2.2 comp, assured PCI<br>timing, 3.3/5-V, 0-<br>waitstate, CPCI hot swap<br>friendly                       | PC<br>boards,CPCI,Emb<br>edded,hiperf<br>video,gb ethernet                                                                                                             |
| RapidIO 8-bit port<br>LP-LVDS Phy Layer<br>(DO-DI-RIO8-PHY)     | Xilinx          | LogiCORE     | 24%                    | 250 | XC2V1000<br>FG456-5 | RapidIO Interconnect<br>v1.1 compliant, verified<br>with Motorola's RapidIO<br>bus functional model<br>v1.4 | Routers, switches,<br>backplane, control<br>plane, data path,<br>embedded sys,<br>high speed<br>interface to<br>memory and<br>encryption<br>engines, high end<br>video |
| USB 1.1 Device<br>Controller                                    | Memec-<br>Core  | AllianceCORE | 21%                    | 12  | XC2V1000-5          | Compliant with USB1.1<br>spec., Supports VCI bus,<br>Performs CRC, Supports<br>1.5 Mbps & 12 Mbps           | Scanners, Printers,<br>Handhelds, Mass<br>Storage                                                                                                                      |
| Video & Image Proce                                             | ssing           |              |                        |     |                     |                                                                                                             |                                                                                                                                                                        |
| 1-D Discrete Cosine<br>Transform                                | Xilinx          | LogiCORE     |                        |     |                     | 8-24 bits for coeff & input, 8-64 pts                                                                       |                                                                                                                                                                        |
| 2-D DCT/IDCT<br>Forward/Inverse<br>Discrete Cosine<br>Transform | Xilinx          | LogiCORE     |                        |     |                     |                                                                                                             | image, video<br>phone, color laser<br>printers                                                                                                                         |
| FASTJPEG_BW<br>Decoder                                          | BARCO-<br>SILEX | AllianceCORE | 67%                    | 73  | XC2V1000-4          | Conforms to ISO/IEC<br>Baseline 10918-1, Gray-<br>Scale                                                     | Video editing,<br>digital camera,<br>scanners                                                                                                                          |
| FASTJPEG_C<br>Decoder                                           | BARCO-<br>SILEX | AllianceCORE | 78%                    | 56  | XC2V1000-4          | Conforms to ISO/IEC<br>Baseline 10918-1, color,<br>multi-scan, Gray-Scale                                   | Video editing,<br>digital camera,<br>scanners                                                                                                                          |



# Chapter 3

# Configuration

# Summary

This chapter covers the following topics:

- Introduction
- Configuration Solutions
- Master Serial Programming Mode
- Slave Serial Programming Mode
- Master SelectMAP Programming Mode
- Slave SelectMAP Programming Mode
- JTAG/ Boundary Scan Programming Mode
  - Boundary-Scan for Virtex-II Devices Using IEEE Standard 1149.1
  - Boundary-Scan for Virtex-II Devices Using IEEE Standard 1532
- Configuration Details
- Readback

# Introduction

Virtex-II devices are configured by loading application-specific configuration data into internal memory. Configuration is carried out using a subset of the device pins, some of which are dedicated, while others can be reused as general-purpose inputs and outputs after configuration is complete.

Depending on the system design, several configuration modes are selectable via mode pins. The mode pins M2, M1, and M0 are dedicated pins. An additional pin, HSWAP\_EN, is used in conjunction with the mode pins to select whether user I/O pins have pull-up resistors during configuration. By default, HSWAP\_EN is tied High (internal pull-up resistor), which shuts off pull-up resistors on the user I/O pins during configuration. When HSWAP\_EN is tied Low, the pull-up resistors are on and therefore, the user I/Os have pull-up resistors during configuration.

Other dedicated pins are:

- CCLK the configuration clock pin
- DONE configuration status pin
- TDI, TDO, TMS, TCK boundary-scan pins
- PROG\_B configuration reset pin

3

Depending on the configuration mode selected, CCLK can be an output generated by the Virtex-II FPGA or an input accepting externally generated clock data. For correct operation, these pins require a  $V_{CCAUX}$  of 3.3V to permit low-voltage transistor-to-transistor logic (LVTTL) operations.

All dual-function configuration pins are contained in banks 4 and 5. Bank 4 contains pins used in serial configuration modes, and banks 4 and 5 contain pins used for SelectMAP modes.

A persist option is available, which can be used to force pins to retain their configuration function even after device configuration is complete. If the persist option is not selected, then the configuration pins with the exception of CCLK, PROG\_B, and DONE can be used for user I/O in normal operation. The persist option does not apply to boundary-scan related pins. The persist feature is valuable in applications that employ partial reconfiguration, dynamic reconfiguration, or readback.

# **Configuration Modes**

Virtex-II supports the following configuration modes:

- Master-Serial
- Slave-Serial (default)
- Master SelectMAP
- Slave SelectMAP
- Boundary-Scan (IEEE 1532 and IEEE 1149)

Table 3-1 shows Virtex-II configuration mode pin settings.

| Configuration Mode <sup>1</sup> | M2 | M1 | MO | CCLK Direction | Data Width | Serial Dout <sup>2</sup> |
|---------------------------------|----|----|----|----------------|------------|--------------------------|
| Master Serial                   | 0  | 0  | 0  | Out            | 1          | Yes                      |
| Slave Serial                    | 1  | 1  | 1  | In             | 1          | Yes                      |
| Master SelectMAP                | 0  | 1  | 1  | Out            | 8          | No                       |
| Slave SelectMAP                 | 1  | 1  | 0  | In             | 8          | No                       |
| Boundary Scan                   | 1  | 0  | 1  | N/A            | 1          | No                       |

Table 3-1: Virtex-II Configuration Mode Pin Settings

#### Notes:

- 1. The HSWAP\_EN pin controls the pullups. Setting M2, M1, and M0 selects the configuration mode, while the HSWAP\_EN pin controls whether or not the pullups are used.
- 2. Daisy chaining is possible only in modes where Serial Dout is used. For example, in SelectMAP modes, the first device does NOT support daisy chaining of downstream devices.

Table 3-2 lists the total number of bits required to configure each device:

Table 3-2: Virtex-II Bitstream Lengths

| Device   | Total Number of<br>Configuration Bits<br>(including header) | Device   | Total Number of<br>Configuration Bits<br>(including header) |
|----------|-------------------------------------------------------------|----------|-------------------------------------------------------------|
| XC2V40   | 360,096                                                     | XC2V2000 | 7,492,000                                                   |
| XC2V80   | 635,296                                                     | XC2V3000 | 10,494,368                                                  |
| XC2V250  | 1,697,184                                                   | XC2V4000 | 15,659,936                                                  |
| XC2V500  | 2,761,888                                                   | XC2V6000 | 21,849,504                                                  |
| XC2V1000 | 4,082,592                                                   | XC2V8000 | 29,063,072                                                  |
| XC2V1500 | 5,659,296                                                   |          |                                                             |

# **Configuration Process and Flow**

The configuration process involves loading the configuration bitstream into the FPGA using the selected mode. There are four major phases in the configuration process:

- Clearing Configuration Memory
- Initialization
- Loading Configuration Data
- Device Startup

Figure 3-1 illustrates the configuration process flow.



Figure 3-1: Configuration Process

www.xilinx.com 1-800-255-7778

### Power Up

The V<sub>CCINT</sub> power pins must be supplied with a 1.5V source. (Refer to the <u>Virtex-II Data</u> <u>Sheet (DS031)</u> for DC characteristics.) The IOB voltage input for Bank 4 (V<sub>CCO\_4</sub>) and the auxiliary voltage input (V<sub>CCAUX</sub>) are also used as a logic input to the Power-On-Reset (POR) circuitry. Even if this bank is not being used, V<sub>CCO\_4</sub> must be connected to a 1.5V or greater source.

### **Clearing Configuration Memory**

In the memory clear phase, non-configuration I/O pins are 3-stated with optional pull-up resistors. The INIT\_B and DONE pins are driven Low by the FPGA, and the memory is cleared. After PROG\_B transitions High, memory is cleared twice and initialization can begin.

The INIT\_B pin transitions High when the clearing of configuration memory is complete. A logic Low on the PROG\_B input resets the configuration logic and holds the FPGA in the clear configuration memory state. When PROG\_B is released, the FPGA continues to hold INIT\_B Low until it has completed clearing all of the configuration memory. The minimum Low pulse time for PROG\_B is defined by the T<sub>PROGRAM</sub> timing parameter. There is no maximum value. The power-up timing of configuration signals is shown in Figure 3-2 and the corresponding timing characteristics are listed in Table 3-3.



Figure 3-2: Power-Up Timing Configuration Signals

| Table 3-3: | Power-Up | Timing | Characteristics |
|------------|----------|--------|-----------------|
|------------|----------|--------|-----------------|

| Description         | Symbol               | Value           | Units              |
|---------------------|----------------------|-----------------|--------------------|
| Program Latency     | T <sub>PL</sub>      | device-specific | 4 μs per frame max |
| Power-on-Reset      | T <sub>POR</sub>     | $T_{PL}$ +2     | ms, max            |
| CCLK (output) Dolay | Τ                    | 0.5             | μs, min            |
| CCLK (output) Delay | <sup>1</sup> ICCK    | 4.0             | μs, max            |
| Program Pulse Width | T <sub>PROGRAM</sub> | 300             | ns, min            |

#### Notes:

1. See Table 3-15 for number of frames by device.

www.xilinx.com

1-800-255-7778

### Initialization

For the initialization phase, the INIT\_B pin is released, the mode pins are sampled, the appropriate pins become active, and the configuration process begins. It is possible to delay configuration by externally holding INIT\_B Low.

### Delaying Configuration

The INIT\_B pin can also be held Low externally to delay configuration of the FPGA. The FPGA samples its mode pins on the rising edge of INIT\_B. After INIT\_B transitions to High, configuration can begin. No additional time-out or waiting periods are required, but configuration does not need to commence immediately after the transition of INIT\_B. The configuration logic does not begin processing data until the synchronization word from the bitstream is loaded.

### Loading Configuration Data

Once configuration begins, the target FPGA starts to receive data frames. Cyclic Redundancy Checking (CRC) is performed before and after the last data frame. CRC is also automatically checked after each block write to an internal data register (FDRI). If the CRC checks prove valid, the device start-up phase can begin.

If the CRC values do not match, INIT\_B is asserted Low to indicate that a CRC error has occurred, startup is aborted, and the FPGA does not become active.

To reconfigure the device, the PROG\_B pin should be asserted to reset the configuration logic. Recycling power also resets the FPGA for configuration. For more information on CRC calculation, see "Cyclic Redundancy Checking Algorithm" on page 295.

The details of loading configuration data in each of the five modes are discussed in the following sections:

- "Master Serial Programming Mode" on page 260
- "Master SelectMAP Programming Mode" on page 263
- "Slave Serial Programming Mode" on page 261
- "Slave SelectMAP Programming Mode" on page 265
- "JTAG/ Boundary Scan Programming Mode" on page 269

### Device Startup

Device startup is a transition phase from the configuration mode to normal programmed device operation. Although the order of the start-up events are user programmable via software, the default sequence of events is as follows:

Upon completion of the start-up sequence, the target FPGA is operational.

The Start-Up Sequencer is an 8-phase sequential state machine that counts from phase 0 to phase 7. (See Figure 3-3.)

The Start-Up Sequencer performs the following tasks:

- Release the DONE pin.
- Negate GTS, activating all of the I/Os.
- Assert GWE, allowing all RAMs and flip-flops to change state.
- Assert EOS. The End-Of-Start-Up flag is always set in phase 7. This is an internal flag that is not user accessible.

BitGen options control the order of the Start-Up Sequence. The default Start-Up Sequence is the bold line in Figure 3-3. The Start-Up Sequence can also be stalled at any phase until either DONE has been externally forced High, or a specified DCM or DCI has established LOCK. For details, see Appendix A: BitGen and PROMGen Switches and Options.

At the cycle selected for the DONE to be released, the sequencer always waits in that state until the DONE is externally released. However, this does not delay the GTS or GWE if they are selected to be released prior to DONE. Therefore, DONE is selected first in the sequence for default settings.



Figure 3-3: Default Start-Up Sequence

# **Configuration Pins**

Certain pins in the FPGA are designated for configuration and are listed in Table 3-4. Some pins are dedicated to the configuration function and others are dual-function pins that can be user I/O after configuration.

|               | 1            |                         |                                                                    |
|---------------|--------------|-------------------------|--------------------------------------------------------------------|
| Name          | Direction    | Driver Type             | Description                                                        |
| Dedicated Pir | is           |                         |                                                                    |
| CCLK          | Input/Output | Active                  | Configuration clock. Output in Master mode.                        |
| PROG_B        | Input        |                         | Asynchronous reset to configuration logic.                         |
| DONE          | Input/Output | Active/<br>Open-Drain   | Configuration status and start-up control.                         |
| M2, M1, M0    | Input        |                         | Configuration mode selection.                                      |
| HSWAP_EN      | Input        |                         | I/O pullups during configuration.                                  |
| TMS           | Input        |                         | Boundary Scan Mode Select.                                         |
| ТСК           | Input        |                         | Boundary Scan Clock.                                               |
| TDI           | Input        |                         | Boundary Scan Data Input.                                          |
| TDO           | Output       | Active                  | Boundary Scan Data Output.                                         |
| Dual Function | n Pins       |                         |                                                                    |
| DIN (D0)      | Input/Output | Active<br>Bidirectional | Serial configuration data<br>input/SelectMAP readback data output. |
| D1:D7         | Input/Output | Active<br>Bidirectional | SelectMAP configuration data input, readback data output.          |
| CS_B          | Input        |                         | Chip Select (SelectMAP mode only).                                 |
| RDWR_B        | Input        |                         | Active Low write select, read select<br>(SelectMAP mode only).     |
| BUSY/DOUT     | Output       | Active                  | Serial configuration data output for serial daisy-chains (active). |
| INIT_B        | Input/Output | Open-Drain              | Delay configuration, indicate configuration error.                 |

Table 3-4: Configuration Pins

# Mixed Voltage Environments

Virtex-II devices have separate voltage sources:

- V<sub>CCINT</sub> = 1.5V powers the internal circuitry.
- V<sub>CCAUX</sub> = 3.3V powers critical resources in the FPGA.
- V<sub>CCO</sub> (1.5, 1.8, 2.5, or 3.3V) powers the IOB circuitry.

SelectI/O-Ultra is separated into eight banks of I/O groups. Each bank can be configured with one of several I/O standards. Refer to the Design Considerations section for I/O banking rules and available I/O standards. Before and during configuration, all I/O banks are set for the LVTTL standard, which requires an output voltage ( $V_{CCO}$ ) of 3.3V for normal operation.

If  $V_{CCO}$  is less than 3.3V on banks 4 and 5, serial and SelectMAP configuration modes might have a lower frequency. (See Table 3-5).

| Configuration Mode | Pins Used                                     | V <sub>CCO_4</sub> | V <sub>CCO_5</sub> |
|--------------------|-----------------------------------------------|--------------------|--------------------|
| JTAG               | Dedicated Pins                                | not a concern      | not a concern      |
| Serial             | Dedicated Pins plus<br>DOUT, DIN, and<br>INIT | 3.3V               | not a concern      |
| SelectMAP          | Dedicated Pins plus<br>dual-function pins     | 3.3V               | 3.3V               |

Table 3-5: Configuration Modes and V<sub>CCO</sub> Voltages

#### Notes:

1. If less than 3.3V ( $V_{CCO_4/5} = 2.5V$ ), the configuration frequency might be as low as half of the typical frequency.

All dedicated configuration pins are powered by  $V_{CCAUX}$ . All dual-function configuration pins are located within banks 4 and 5. As described under Configuration Process and Flow, the  $V_{CCO_4}$  input voltage is used as a logic input to the power-on-reset (POR) circuitry.

For JTAG configuration mode, JTAG inputs are independent of  $V_{CCO}$  and work between 2.5V and 3.3V TTL levels (VIL max = .8V, VIH min = 2.0V). The JTAG input pins are 3.3V tolerant. The JTAG output (TDO) is an open-drain output and must be pulled up to the appropriate voltage level (typically 3.3V) through an external resistor. The value of the external pullup resistor depends on the capacitive loading on the TDO pin and the operating frequency, but it should not be less than 200 ohms. The optimal TDO pullup value can be determined through IBIS simulation.

For serial configuration mode, V<sub>CCO\_4</sub> pins require a 3.3V supply for output configuration pins to operate normally. In serial mode, all of the configuration pins are in bank 4.

For SelectMAP configuration mode,  $V_{CCO_4}$  and  $V_{CCO_5}$  pins require a 3.3V supply for output configuration pins to operate normally. In SelectMAP mode, all of the configuration pins are in banks 4 and 5.

If the Virtex-II device is being configured in serial or SelectMAP mode, and the desired I/O standard in banks 4 and 5 is for a voltage other than 3.3V, then  $V_{CCO_4}$  and  $V_{CCO_5}$  (SelectMAP only) must have 3.3V supplies at configuration, and they can be switched to the desired voltage after configuration is complete.
# **Configuration Solutions**

Several configuration solutions are available to support Virtex-II, each targeted to specific application requirements. Guidance and support (application notes, reference designs, and so forth) is also available for designers looking to develop and implement their own configuration solution for Virtex FPGAs.

# System Advanced Configuration Environment (System ACE™) Series

The System ACE series of configuration solutions offers a system-level configuration manager for designers using multiple FPGAs or FPGAs requiring multiple bitstreams. This solution combines standard industry Flash storage with Xilinx-designed configuration control. Features common to the entire System ACE family include:

- Support for multiple bitstreams
- Built-in support for embedded processors in FPGAs
- Support for reconfiguring, updating, or debugging systems over a network
- Built-in system interface
- Scalability (density) and re-useability (across many designs)
- Centralization of configuration control for reduced board space and simpler debugging
- Use of excess storage capacity for non-configuration, system storage

### System ACE CF

System ACE CF (CompactFlash<sup>TM</sup>) solution combines a standard CompactFlash Association (CFA) Type-I or Type-II memory module (CompactFlash or 1" disk drive) with a Xilinx-designed ACE Controller<sup>TM</sup> configuration control chip. See Figure 3-4.



UG002\_C4\_041\_091902

Figure 3-4: System ACE CompactFlash and Controller

The CompactFlash card stores an unlimited number of bitstreams and ranges in density from 128 Mb to 3 Gb. This card is capable of storing one large bitstream or several smaller bitstreams. If several bitstreams are used, the system can be set up so that individual bitstreams are callable as needed, allowing for dynamic reconfiguration of the Virtex-II device and other Xilinx FPGAs in the JTAG chain.

The ACE Controller drives bits through the FPGA JTAG chain and has three other ports:

- A port for interfacing with a microprocessor, a network, or a MultiLINX cable
- A port for interfacing with the CompactFlash card
- A port that provides access to the FPGA JTAG chain for FPGA testing or configuration via automatic test equipment or via desktop or third-party programmers

For further information on any System ACE product, visit www.xilinx.com.

### System ACE Multi-Package Module (MPM)

System ACE MPM is a multi-package module consisting of a packaged standard Flash from AMD, a packaged FPGA, and a packaged configuration PROM, all in a 388-pin BGA package. The Flash stores configuration and other data, while the FPGA acts as an advanced configuration controller and is configured by the PROM. This solution provides high density and high-speed configuration capability in a single package, helping to simplify the design and manufacturing process. It is available in 16-Mbit, 32-Mbit, and 64-Mbit densities.

### System ACE Soft Controller (SC)

System ACE SC is a downloadable version of the configuration controller found in System ACE MPM; versions are provided that support various standard Flash interfaces. System ACE SC provides all of the features of System MPM without the Single Package. It allows designers to use the Flash memory already in their system to store configuration data. The System ACE SC controller is available free of charge in the form of a PROM file that can be downloaded from the System ACE website. This pre-engineered solution is implemented by connecting up to four Flash chips on a board to an FPGA that will be used as a configuration controller and then downloading the controller file into a PROM. Figure 3-5 describes the controller for both System ACE MPM and System ACE SC.





System ACE MPM and System ACE SC have these unique features:

- High speed configuration up to 154 Mb/sec
- Support for both SelectMAP (8-bit) (see Figure 3-6) and Slave Serial (1-bit) (see Figure 3-7) configuration
- Configuration of multiple FPGAs in parallel
- Bitstream compression for increased storage capability
- Storage of up to 8 different bitstreams



Figure 3-6: SelectMAP (8-bit) Configuration





# **Configuration PROMs**

## Using XC18V00 PROMs

The XC18V00 family of Flash in-system programmable (ISP) configuration PROMs offers the flexibility of re-programmability and multiple package offerings, combined with both serial and SelectMAP FPGA configurability. This family is programmable using Xilinx iMPACT software and ranges in density from 256 Kb to 4 Mb; these PROMs can also be cascaded to support larger bitstreams.

The 18V00 family offers data throughput rates of up to 264 Mb/s. It is also capable of triggering FPGA reconfiguration via a JTAG command. The parts can be programmed via cable, HW-130, or standard third party programmers. The XC18V00 PROMs are available in SO20, PC20, VQ44, and PC44 packages. Refer to Appendix B: XC18V00 Series PROMs for the latest version of the XC18V00 PROMs data sheet, as well as package diagrams for the entire PROM family. See Table 3-6 to determine which PROMs go with which Virtex-II FPGAs.

### Using XC17V00 PROMs

The XC17V00 family of one-time programmable (OTP) PROMs provides a proven, lowcost, compact, and pre-engineered configuration solution. Ranging from 1 Mb to 16 Mb, this family is also the PROM density leader; it can also be daisy-chained to support larger bitstreams. This family supports serial configuration of Virtex-II FPGAs; in addition, the XC17V08 and XC17V16 support SelectMAP configuration modes.

The XC17V00 family can be used for stabilized designs that are in a high-volume production flow and/or for designs requiring a low-cost solution. XC17V00 PROMs can be programmed either by using the HW-130 or by using a variety of third-party programmers. The XC17V00 PROMs are available in VO8, SO20, PC20, VQ44, and PC44 packages. Data sheets for PROMs are available at <u>www.xilinx.com</u>. See Table 3-6 to determine which PROMs go with which Virtex-II FPGAs, and see Appendix B: XC18V00 Series PROMs for package diagrams.

# Flash PROMs With a CPLD Configuration Controller

Some designers prefer to leverage existing Flash memory in their system to store the configuration bitstreams. A small CPLD-based configuration controller can provide the mechanism to access the bitstreams in the FLASH and deliver them quickly to Virtex-II devices. The following application notes describe the details for a serial or SelectMAP configuration architecture using FLASH memories and CPLDs:

• XAPP079: *Configuring* Xilinx *FPGAs Using an XC9500 CPLD and Parallel PROM* (available on <u>www.xilinx.com</u>) describes an architecture that configures a chain of Virtex-II devices using Master-Serial mode. See Figure 3-8 for an example of FPGA configuration using a CPLD and a parallel PROM.



Figure 3-8: Configuring Virtex-II Using a CPLD and Parallel PROM

• XAPP137: *Configuring Virtex FPGAs From Parallel EPROMs With a CPLD* (available on <u>www.xilinx.com</u>) describes an architecture that configures one or more Virtex-II devices using the Slave SelectMAP mode. See Figure 3-9 for an example of FPGA configuration using a CPLD and a parallel EPROM.





# **Embedded Solutions**

### Using an Embedded Microcontroller

XAPP058: Xilinx *In-System Programming Using an Embedded Microcontroller* (available on <u>www.xilinx.com</u>) describes a compact and robust process that (re)configures Virtex-II devices directly from a microprocessor through the JTAG test port of the Virtex-II device. The process additionally supports (re)configuration of XC18V00 ISP PROMs and CPLDs that reside on the JTAG scan chain. Portable, reference C-code is provided with the application note for rapid implementation.

## Using IEEE Standard 1532

Systems that implement an IEEE Standard 1532 player can configure Virtex-II devices. Users need a 1532 BSDL file and a 1532 configuration data file. 1532 BSDL files for Xilinx devices and information on the Xilinx J DRIVE 1532 configuration engine are available at <a href="http://www.support.xilinx.com">http://www.support.xilinx.com</a>.

# **PROM Selection Guide**

Use Table 3-6 to determine which PROMs go with which Virtex-II FPGAs.

|           | Approximate                         | PROM Family       |                | PROM Package     |      |                  |      |                  |  |
|-----------|-------------------------------------|-------------------|----------------|------------------|------|------------------|------|------------------|--|
| Device    | Length <sup>(3)</sup><br>(Megabits) | 18Vxx             | 17Vxx          | V08              | SO20 | PC20             | PC44 | VQ44             |  |
| XCV2V40   | 0.36                                | 18V01             | 17V01          | x <sup>(1)</sup> | x    | x                |      | x <sup>(2)</sup> |  |
| XCV2V80   | 0.64                                | 18V01             | 17V01          | x <sup>(1)</sup> | x    | x                |      | x <sup>(2)</sup> |  |
| XCV2V250  | 1.7                                 | 18V02             | 17V02          |                  |      | x <sup>(1)</sup> | x    | x                |  |
| XCV2V500  | 2.8                                 | 18V04             | 17V04          |                  |      | x <sup>(1)</sup> | x    | x                |  |
| XCV2V1000 | 4.1                                 | 18V04             | 17V04          |                  |      | x <sup>(1)</sup> | x    | x                |  |
| XCV2V1500 | 5.7                                 | 18V04<br>18V02    | 17V08          |                  |      |                  | x    | x                |  |
| XCV2V2000 | 7.5                                 | 2, 18V04          | 17V08          |                  |      |                  | x    | x                |  |
| XCV2V3000 | 10.5                                | 3, 18V04          | 17V16          |                  |      |                  | x    | x                |  |
| XCV2V4000 | 15.7                                | 4, 18V04          | 17V16          |                  |      |                  | x    | x                |  |
| XCV2V6000 | 21.8                                | 5, 18V04<br>18V02 | 17V16<br>17V08 |                  |      |                  | x    | x                |  |
| XCV2V8000 | 29.0                                | 7, 18V04          | 2, 17V16       |                  |      |                  | x    | x                |  |

Table 3-6: Using Virtex-II Devices With PROMs

#### Notes:

1. 17Vxx only

2. 18Vxx only

<sup>3.</sup> Different versions of software produce different bit counts and in some cases can reduce the PROM size requirement. To obtain the most accurate information, visit the Xilinx Answers Search Data Base and reference record 12326 at <u>www.support.xilinx.com</u>.

# Software Support and Data Files

This section provides information on Xilinx device programming software and configuration-related data files.

### **iMPACT** Software

For programming Virtex-II and other Xilinx devices with a personal computer, Xilinx provides iMPACT software as a part of the ISE software package. A free version of iMPACT software is also available through the WebPACK software suite. More information on WebPACK is available at <u>http://www.support.xilinx.com</u>.

### **Programming Cables**

iMPACT software supports several Xilinx programming cables that are compatible with Virtex-II devices, including the Parallel Cable III, Parallel Cable IV, and MultiLINX cables. For more information on these cables, or to order programming cables online, visit <a href="http://www.support.xilinx.com">http://www.support.xilinx.com</a>.

## Boundary Scan Interconnect Testing for Virtex-II Devices

Virtex-II devices support the EXTEST, INTEST, and SAMPLE/PRELOAD instructions required for Boundary Scan interconnect tests. Xilinx does not provide direct support for Boundary Scan software, although several third party suppliers offer Boundary Scan test equipment that is compatible with Virtex-II devices. A list of third-parties offering Boundary Scan test equipment is available online at <u>http://www.support.xilinx.com</u>.

Customers seeking to perform interconnect tests on a Virtex-II device using third-party boundary scan tester must have a Boundary Scan Description Language (BSDL) file for the Virtex-II device. BSDL files for all Xilinx devices are provided with the ISE software installation, and are also available online.

## In-System Programming Data Files

Many third party JTAG configuration solutions and in-system configuration solutions require an SVF (Serial Vector Format) or STAPL (Standard Test and Programming Language) file. The SVF and STAPL file formats are used to convey Boundary Scan instructions in a generic format. Customers requiring an SVF or STAPL file can use iMPACT software to generate these files. For more information on SVF, STAPL, and In-System Programming (ISP), see the following resources online:

- The iMPACT Software Manual is included on the software manuals web page.
- In-System Programming details are contained in Xilinx Application Note 058.

# Master Serial Programming Mode

In serial configuration mode, the FPGA is configured by loading one bit per CCLK cycle. In Master Serial mode, the FPGA drives the CCLK pin. In Slave Serial mode, the FPGAs CCLK pin is driven by an external source. In both serial configuration modes, the MSB of each data byte is always written to the DIN pin first.

The Master Serial mode is designed so the FPGA can be configured from a Serial PROM, Figure 3-10. The speed of the CCLK is selectable by BitGen options; see Appendix A: BitGen and PROMGen Switches and Options. Be sure to select a CCLK speed supported by the PROM.

Figure 3-10 shows a Master Serial FPGA configuring from a PROM.



Figure 3-10: Master Serial Mode Circuit Diagram

#### Notes:

1. If the Virtex-II device has not selected the DriveDONE option, then an external pull-up resistor of  $330\Omega$  should be added to the DONE pin. This pull-up resistor is not needed if DriveDONE = Yes.



Figure 3-11: Master Serial Configuration Clocking Sequence

#### Notes:

1. For Master configurations, the CCLK does not transition until after initialization, as indicated by the arrow.

# **Slave Serial Programming Mode**

In serial configuration mode, the FPGA is configured by loading one bit per CCLK cycle. In Slave Serial mode, the FPGAs CCLK pin is driven by an external source. In both serial configuration modes, the MSB of each data byte is always written to the DIN pin first.

The Slave Serial configuration mode allows for FPGAs to be configured from other logic devices, such as microprocessors, or in a daisy-chain fashion. Figure 3-12 shows a Master Serial FPGA configuring from a PROM with a Slave Serial FPGA in a daisy-chain with the Master.

# Daisy-Chain Configuration

Virtex-II FPGAs can be used in a daisy-chain configuration only with XC4000X, SpartanXL, Spartan-II or other Virtex FPGAs. For serial daisy chains consisting of both 4k and Virtex devices, it is recommended that all Virtex-E, Virtex-II, and Virtex-II Pro devices be grouped at the beginning of the serial daisy chain, with the 4000X and Spartan devices following. For a serial daisy chain consisting only of Spartan-II or Spartan-E and Virtex, Virtex-E, Virtex-II, or Virtex-II Pro devices, there are no restrictions on the order of the devices in the chain. However, there are restrictions on the total number of bits that an FPGA can pass to the downstream FPGAs in the chain. If a Virtex-II FPGA is placed as the Master and a non-Virtex-II FPGA is placed as a slave, select a configuration CCLK speed supported by *all* devices in the chain.

The separate bitstreams for the FPGAs in a daisy-chain must be combined into a single PROM file, by using either iMPACT software or the PROMGen utility (see Appendix A: BitGen and PROMGen Switches and Options). Separate .bit files can *not* be simply concatenated together to form a daisy-chain bitstream.



Figure 3-12: Master/Slave Serial Mode Circuit Diagram

#### Notes:

1. If none of the devices have been selected to DriveDONE, then an external pull-up resistor of 330  $\Omega$  should be added to the common DONE line. This pull-up resistor is not needed if DriveDONE = Yes. If used, DriveDONE should be selected only for the last device in the configuration chain.

The first device in the chain is the first to be configured. No data is passed onto the DOUT pin until all the data frames, start-up command, and CRC check have been loaded. CRC checks only include the data for the current device, not for any others in the chain. After finishing the first stream, data for the next device is loaded. The data for the downstream device appears on DOUT typically about 80 CCLK cycles after being loaded into DIN. This is due to internal packet processing. Each daisy-chained bitstream carries its own

synchronization word. Nothing of the first bitstream is passed to the next device in the chain other than the daisy-chained configuration data.

The DONE\_cycle must be set before GTS, or during the same cycle to guarantee each Virtex-II device to move to the operation state when all the DONE pins have been released. When daisy-chaining multiple devices, either set the last device in the chain to DriveDONE, or add external pull-up resistors to counteract the combined capacitive loading on DONE. If non-Virtex devices are included in the daisy-chain, it is important to set their bitstreams to SyncToDONE with BitGen options. For more information on Virtex BitGen options, see Appendix A: BitGen and PROMGen Switches and Options.



Figure 3-13: Serial Configuration Clocking Sequence

#### Notes:

1. For Slave configurations, a free running CCLK can be used, as shown in Figure 3-13.

|      | •                                                        |                                      | •         |          |
|------|----------------------------------------------------------|--------------------------------------|-----------|----------|
|      | Description                                              | Symbol                               | Values    | Units    |
|      | DIN setup/hold, slave mode                               | T <sub>DCC</sub> /T <sub>CCD</sub>   | 5.0/0.0   | ns, min  |
|      | DIN setup/hold, master mode                              | T <sub>DSCK</sub> /T <sub>SCKD</sub> | 5.0/0.0   | ns, min  |
| CCLK | DOUT                                                     | T <sub>CCO</sub>                     | 12.0      | ns, max  |
|      | High time                                                | T <sub>CCH</sub>                     | 5.0       | ns, min  |
|      | Low time                                                 | T <sub>CCL</sub>                     | 5.0       | ns, min  |
|      | Maximum Frequency                                        | F <sub>CC_SERIAL</sub>               | 66        | MHz, max |
|      | Frequency Tolerance, master mode with respect to nominal |                                      | +45% -30% |          |

Table 3-7: Master/Slave Serial Mode Programming Switching

# Master SelectMAP Programming Mode

The SelectMAP mode provides an 8-bit bidirectional data bus interface to the Virtex-II configuration logic that can be used for both configuration and readback. Virtex-II devices can not be serially daisy-chained when the SelectMAP interface is used. However, they can be connected in a parallel-chain as shown in Figure 3-16. The DATA pins (D0:D7), CCLK, RDWR\_B, BUSY, PROG\_B, DONE, and INIT\_B can be connected in common between all of the devices. CS\_B inputs should be kept separate so each device can be accessed individually. If all devices are to be configured with the same bitstream, readback is not being used, and CCLK is less than F<sub>CC</sub>\_SelectMAP, the CS\_B pins can be connected to a common line so the devices are configured simultaneously.



Figure 3-14: Virtex-II Interfaced With an 18V00 PROM

#### Notes:

1. If none of the Virtex-II devices have been selected to DriveDONE, add an external 330  $\Omega$  pull-up resistor to the common DONE line. This pull-up resistor is not needed if DriveDONE is selected. If used, DriveDONE should be selected only for the last device in the configuration chain.

The following pins are involved in Master SelectMAP configuration mode:

# DATA Pins (D[0:7])

The D0 through D7 pins function as a bidirectional data bus in the SelectMAP mode. Configuration data is written to the bus, and readback data is read from the bus. The bus direction is controlled by the RDWR\_B signal. see "Configuration Details" on page 287. The D0 pin is considered the MSB of each byte.

## RDWR\_B

When asserted Low, the RDWR\_B signal indicates that data is being written to the data bus. When High, the RDWR\_B signal indicates that data is being read from the data bus.

# CS\_B

The Chip Select input (CS\_B) enables the SelectMAP data bus. To write or read data onto or from the bus, the CS\_B signal must be asserted Low. When CS\_B is High, Virtex-II devices do not drive onto or read from the bus.

# CCLK

The CCLK pin is a clock output in the Master SelectMAP interface. It synchronizes all loading and reading of the data bus for configuration and readback. The CCLK pin is driven by the FPGA.

## Data Loading

To load data in the Master SelectMAP mode, a data byte is loaded on every rising CCLK edge as shown in Figure 3-15. If the CCLK frequency is less than  $F_{CC}$  SelectMAP, this can be done without handshaking. For frequencies above  $F_{CC}$  SelectMAP, the BUSY signal must be monitored. If BUSY is High, the current byte must be reloaded when BUSY is Low.

The first byte can be loaded on the first rising CCLK edge that INIT\_B is High, and when both CS\_B and RDWR\_B are asserted Low. CS\_B and RDWR\_B can be asserted anytime before or after INIT\_B has gone High. However, the SelectMAP interface is not active until after INIT\_B has gone High. The order of CS\_B and RDWR\_B does not matter, but RDWR\_B must be asserted throughout configuration. If RDWR\_B is de-asserted before all data has been loaded, the FPGA aborts the operation. To complete configuration, the FPGA must be reset by PROG\_B and reconfigured with the entire stream. For applications that need to de-assert RDWR\_B between bytes, see "Controlled CCLK" on page 268.



Figure 3-15: Data Loading in SelectMAP

# Slave SelectMAP Programming Mode

The SelectMAP mode provides an 8-bit bidirectional data bus interface to the Virtex-II configuration logic that can be used for both configuration and readback. Virtex-II devices can not be serially daisy-chained when the SelectMAP interface is used. However, they can be connected in a parallel-chain as shown in Figure 3-16. The DATA pins (D0:D7), CCLK, RDWR\_B, BUSY, PROG\_B, DONE, and INIT\_B can be connected in common between all of the devices. CS\_B inputs should be kept separate so each device can be accessed individually. If all devices are to be configured with the same bitstream, readback is not being used, and CCLK is less than F<sub>CC</sub>\_SelectMAP, the CS\_B pins can be connected to a common line so the devices are configured simultaneously.

Although Figure 3-16 does not show a control module for the SelectMAP interface, the SelectMAP interface is typically driven by a processor, micro controller, or some other logic device such as an FPGA or a CPLD.



Figure 3-16: Slave SelectMAP Mode Circuit Diagram

#### Notes:

1. If none of the Virtex-II devices have been selected to DriveDONE, add an external 330  $\Omega$  pull-up resistor to the common DONE line. This pull-up resistor is not needed if DriveDONE = Yes. If used, DriveDONE should be selected only for the last device in the configuration chain.

The following pins are involved in Slave SelectMAP configuration mode:

# DATA Pins (D[0:7])

The D0 through D7 pins function as a bidirectional data bus in the SelectMAP mode. Configuration data is written to the bus, and readback data is read from the bus. The bus direction is controlled by the RDWR\_B signal. see "Configuration Details" on page 287.. The D0 pin is considered the MSB of each byte.

# RDWR\_B

When asserted Low, the RDWR\_B signal indicates that data is being written to the data bus. When asserted High, the RDWR\_B signal indicates that data is being read from the data bus.

# CS\_B

The Chip Select input (CS\_B) enables the SelectMAP data bus. To write or read data onto or from the bus, the CS\_B signal must be asserted Low. When CS\_B is High, Virtex-II devices do not drive onto or read from the bus.

# BUSY

When CS\_B is asserted, the BUSY output indicates when the FPGA can accept another byte. If BUSY is Low, the FPGA reads the data bus on the next rising CCLK edge where both CS\_B and RDWR\_B are asserted Low. If BUSY is High, the current byte is ignored and must be reloaded on the next rising CCLK edge when BUSY is Low. When CS\_B is *not* asserted, BUSY is 3-stated.

BUSY is only necessary for CCLK frequencies above  $F_{CC}$  SelectMAP. For frequencies at or below  $F_{CC}$  SelectMAP, BUSY is ignored, see "Data Loading" on page 264. For parallel chains, as shown in Figure 3-16, where the same bitstream is to be loaded into multiple devices simultaneously, BUSY should not be used. Thus, the maximum CCLK frequency for such an application must be less than  $F_{CC}$  SelectMAP.

# CCLK

Unlike the Master SelectMAP mode of configuration, the CCLK pin is an input in the Slave SelectMAP mode interface. The CCLK signal synchronizes all loading and reading of the data bus for configuration and readback. Additionally, the CCLK drives internal configuration circuitry. The CCLK can be driven either by a free running oscillator or an externally-generated signal.

Several scenarios exist when configuring the FPGA in SelectMAP mode, depending on the source of CCLK.

# Free-Running CCLK

A free-running oscillator can be used to drive Virtex-II CCLK pins. For applications that can provide a continuous stream of configuration data, refer to the timing diagram discussed in "Data Loading" on page 264. For applications that cannot provide a continuous data stream, missing the clock edges, refer to the timing diagram discussed in "Non-Contiguous Data Strobe" on page 267. An alternative to a free-running CCLK is discussed in "Controlled CCLK" on page 268.

## Express-Style Loading

In express-style loading, a data byte is loaded on every rising CCLK edge as shown in Figure 3-17. If the CCLK frequency is less than  $F_{CC}$  SelectMAP, this can be done without handshaking. For frequencies above  $F_{CC}$  SelectMAP, the BUSY signal must be monitored. If BUSY is High, the current byte must be reloaded when BUSY is Low.

The first byte can be loaded on the first rising CCLK edge that INIT\_B is High, and when both CS\_B and RDWR\_B are asserted Low. CS\_B and RDWR\_B can be asserted anytime before or after INIT\_B has gone High. However, the SelectMAP interface is not active until after INIT\_B has gone High. The order of CS\_B and RDWR\_B does not matter, but RDWR\_B must be asserted throughout configuration. If RDWR\_B is de-asserted before all data has been loaded, the FPGA aborts the operation. To complete configuration, the FPGA must be reset by PROG\_B and reconfigured with the entire stream.



For applications that need to de-assert RDWR\_B between bytes, see "Controlled CCLK" on page 268.

Figure 3-17: "Express Style" Continuous Data Loading in SelectMAP





### Non-Contiguous Data Strobe

In applications where multiple clock cycles might be required to access the configuration data before each byte can be loaded into the SelectMAP interface, data might not be ready for each consecutive CCLK edge. In such a case, the CS\_B signal can be de-asserted until the next data byte is valid on the DATA[0:7] pins. This is demonstrated in Figure 3-18. While CS\_B is High, the SelectMAP interface does not expect any data and ignores all CCLK transitions. However, RDWR\_B must continue to be asserted while CS\_B is asserted. If RDWR\_B is High during a positive CCLK transition while CS\_B is asserted, the FPGA aborts the operation. For applications that need to de-assert the RDWR\_B signal without de-asserting CS\_B, see "Controlled CCLK".

## Controlled CCLK

Some applications require that RDWR\_B be de-asserted between the loading of configuration data bytes asynchronously from the CS\_B. Typically, this would be due to the RDWR\_B signal being a common connection to other devices on the board, such as memory storage elements. In such a case, driving CCLK as a controlled signal instead of a free-running oscillator makes this type of operation possible. In Figure 3-19, the CCLK, CS\_B, and RDWR\_B are asserted Low while a data byte becomes active. Once the CCLK has gone High, the data is loaded. RDWR\_B can be de-asserted and re-asserted as many times as necessary, just as long as it is Low before the next rising CCLK edge.



Figure 3-19: Controlling CCLK for RDWR\_B De-Assertion

|      | Description                         | Symbol                                   | Value   | Units    |
|------|-------------------------------------|------------------------------------------|---------|----------|
| CCLK | D <sub>0-7</sub> Setup/Hold         | $T_{SMDCC}/T_{SMCCD}$                    | 5.0/0.0 | ns, min  |
|      | CS_B Setup/Hold                     | T <sub>SMCSCC</sub> /T <sub>SMCCCS</sub> | 7.0/0.0 | ns, min  |
|      | RDWR_B Setup/Hold                   | T <sub>SMCCW</sub> /T <sub>SMWCC</sub>   | 7.0/0.0 | ns, min  |
|      | BUSY Propagation Delay              | T <sub>SMCKBY</sub>                      | 12.0    | ns, max  |
|      | Maximum Frequency                   | F <sub>CC</sub> _SelectMAP               | 66      | MHz, max |
|      | Maximum Frequency with no handshake | F <sub>CCNH</sub>                        | 66      | MHz, max |

Table 3-8: SelectMAP Write Timing Characteristics

# JTAG/ Boundary Scan Programming Mode

# Introduction

Virtex-II devices support the new IEEE 1532 standard for In-System Configuration (ISC), based on the IEEE 1149.1 standard. The IEEE 1149.1 Test Access Port and Boundary-Scan Architecture is commonly referred to as JTAG. JTAG is an acronym for the Joint Test Action Group, the technical subcommittee initially responsible for developing the standard. This standard provides a means to assure the integrity of individual components and the interconnections between them at the board level. With increasingly dense multi-layer PC boards, and more sophisticated surface mounting techniques, boundary-scan testing is becoming widely used as an important debugging standard.

Devices containing boundary-scan logic can send data out on I/O pins in order to test connections between devices at the board level. The circuitry can also be used to send signals internally to test the device specific behavior. These tests are commonly used to detect opens and shorts at both the board and device level.

In addition to testing, boundary-scan offers the flexibility for a device to have its own set of user-defined instructions. The added common vendor specific instructions, such as configure and verify, have increased the popularity of boundary-scan testing and functionality.

# Boundary-Scan for Virtex-II Devices Using IEEE Standard 1149.1

The Virtex-II family is fully compliant with the IEEE Standard 1149.1 Test Access Port and Boundary-Scan Architecture. The architecture includes all mandatory elements defined in the IEEE 1149.1 Standard. These elements include the Test Access Port (TAP), the TAP controller, the instruction register, the instruction decoder, the boundary-scan register, and the bypass register. The Virtex-II family also supports some optional instructions; the 32-bit identification register, and a configuration register in full compliance with the standard. Outlined in the following sections are the details of the JTAG architecture for Virtex-II devices.

## Test Access Port

The Virtex-II TAP contains four mandatory dedicated pins as specified by the protocol (Table 3-9).

| Pin | Description      |
|-----|------------------|
| TDI | Test Data In     |
| TDO | Test Data Out    |
| TMS | Test Mode Select |
| ТСК | Test Clock       |

 Table 3-9:
 Virtex-II TAP Controller Pins

There are three input pins and one output pin to control the 1149.1 boundary-scan TAP controller. There are optional control pins, such as TRST (Test Reset) and enable pins, which might be found on devices from other manufacturers. It is important to be aware of these optional signals when interfacing Xilinx devices with parts from different vendors, because they might need to be driven.

The TAP controller is a 16-state state machine shown in Figure 3-20. The four mandatory TAP pins are outlined below.

• TMS - This pin determines the sequence of states through the TAP controller on the rising edge of TCK. TMS has an internal resistive pull-up to provide a logic High if the pin is not driven.

- TCK This pin is the JTAG test clock. It sequences the TAP controller and the JTAG registers in the Virtex-II devices.
- TDI This pin is the serial input to all JTAG instruction and data registers. The state of the TAP controller and the current instruction held in the instruction register determine which register is fed by the TDI pin for a specific operation. TDI has an internal resistive pull-up to provide a logic High to the system if the pin is not driven. TDI is applied into the JTAG registers on the rising edge of TCK.
- TDO This pin is the serial output for all JTAG instruction and data registers. The state of the TAP controller and the current instruction held in the instruction register determine which register (instruction or data) feeds TDO for a specific operation. TDO changes state on the falling edge of TCK and is only active during the shifting of instructions or data through the device. This pin is 3-stated at all other times.

#### Notes:

As specified by the IEEE Standard, the TMS and TDI pins all have internal pull-up resistors. These internal pull-up resistors of 50-150 k $\Omega$  are active, regardless of the mode selected.

For JTAG configuration mode, JTAG inputs are independent of  $V_{CCO}$  and work between 2.5V and 3.3V TTL levels (VIL max = .8V, VIH min = 2.0V). The JTAG input pins are 3.3V tolerant. The JTAG output (TDO) is an open-drain output and must be pulled up to the appropriate voltage level (typically 3.3V) through an external resistor. The value of the external pullup resistor depends on the capacitive loading on the TDO pin and the operating frequency, but it should not be less than 200 ohms. The optimal TDO pullup value can be determined through IBIS simulation.

### **TAP** Controller

Figure 3-20 diagrams a 16-state finite state machine. The four TAP pins control how data is scanned into the various registers. The state of the TMS pin at the rising edge of TCK determines the sequence of state transitions. There are two main sequences, one for shifting data into the data register and the other for shifting an instruction into the instruction register.



NOTE: The value shown adjacent to each state transition in this figure represents the signal present at TMS at the time of a rising edge at TCK.

Figure 3-20: State Diagram for the TAP Controller

### **Boundary-Scan Instruction Set**

To determine the operation to be invoked, an instruction is loaded into the Instruction Register (IR). The Instruction Register is 6 bits long in Virtex-II devices to support the new IEEE Standard 1532 for In-System Configurable (ISC) devices. Table 3-10 lists the available instructions for Virtex-II devices.

| Boundary Scan<br>Command | Binary Code<br>(5:0) | Description                                             |
|--------------------------|----------------------|---------------------------------------------------------|
| EXTEST                   | 000000               | Enables boundary-scan EXTEST operation                  |
| SAMPLE                   | 000001               | Enables boundary-scan SAMPLE operation                  |
| USER1                    | 000010               | Access user-defined register 1                          |
| USER2                    | 000011               | Access user-defined register 2                          |
| CFG_OUT                  | 000100               | Access the configuration bus for readback               |
| CFG_IN                   | 000101               | Access the configuration bus for configuration          |
| INTEST                   | 000111               | Enables boundary-scan INTEST operation                  |
| USERCODE                 | 001000               | Enables shifting out user code                          |
| IDCODE                   | 001001               | Enables shifting out of ID code                         |
| HIGHZ                    | 001010               | 3-states output pins while enabling the bypass register |
| JSTART                   | 001100               | Clocks the start-up sequence when StartClk is TCK       |
| JSHUTDOWN                | 001101               | Clocks the shutdown sequence                            |
| BYPASS                   | 111111               | Enables BYPASS                                          |
| JPROG_B                  | 001011               | Equivalent to and has the same affect as PROG_B         |
| RESERVED                 | All other codes      | Xilinx reserved instructions                            |

Table 3-10: Virtex-II Boundary Scan Instructions

The mandatory IEEE 1149.1 commands are supported in Virtex-II devices, as well as several Xilinx vendor-specific commands. Virtex-II devices have a powerful command set. The EXTEST, INTEST, SAMPLE/PRELOAD, BYPASS, IDCODE, USERCODE, and HIGHZ instructions are all included. The TAP also supports two internal user-defined registers (USER1 and USER2) and configuration/readback of the device. The Virtex-II boundary-scan operations are independent of mode selection. The boundary-scan mode in Virtex-II devices overrides other mode selections. For this reason, boundary-scan instructions using the boundary-scan register (SAMPLE/PRELOAD, INTEST, EXTEST) must not be performed during configuration. All instructions except USER1 and USER2 are available before a Virtex-II device is configured. After configuration, all instructions are available.

JSTART and JSHUTDOWN are instructions specific to the Virtex-II architecture and configuration flow. As described in Table 3-10, the JSTART and JSHUTDOWN instructions clock the startup sequence when the appropriate BitGen option is selected. The instruction does not work correctly without the correct BitGen option selected.

bitgen -g startupclk:jtagclk designName.ncd

For details on the standard boundary-scan instructions EXTEST, INTEST, and BYPASS, refer to the IEEE Standard. The user-defined registers (USER1/USER2) are described in "USER1, USER2 Registers" on page 275.

## Boundary-Scan Architecture

Virtex-II device registers include all registers required by the IEEE 1149.1 Standard. In addition to the standard registers, the family contains optional registers for simplified testing and verification (Table 3-11).

| Register Name               | Register<br>Length | Description                                                                           |
|-----------------------------|--------------------|---------------------------------------------------------------------------------------|
| Instruction register        | 6 bits             | Holds current instruction OPCODE and captures internal device status.                 |
| Boundary scan register      | 3 bits per I/O     | Controls and observes input, output, and output enable.                               |
| Bypass register             | 1 bit              | Device bypass.                                                                        |
| Identification register     | 32 bits            | Captures device ID.                                                                   |
| JTAG configuration register | 64 bits            | Allows access to the configuration bus when using the CFG_IN or CFG_OUT instructions. |
| USERCODE register           | 32 bits            | Captures user-programmable code                                                       |

Table 3-11: Virtex-II JTAG Registers

# Boundary-Scan Register

The test primary data register is the boundary-scan register. Boundary-scan operation is independent of individual IOB configurations. Each IOB, bonded or un-bonded, starts as bidirectional with 3-state control. Later, it can be configured to be an input, output, or 3-state only. Therefore, three data register bits are provided per IOB (Figure 3-21).

When conducting a data register (DR) operation, the DR captures data in a parallel fashion during the CAPTURE-DR state. The data is then shifted out and replaced by new data during the SHIFT-DR state. For each bit of the DR, an update latch is used to hold the input data stable during the next SHIFT-DR state. The data is then latched during the UPDATE-DR state when TCK is Low.

The update latch is opened each time the TAP Controller enters the UPDATE-DR state. Care is necessary when exercising an INTEST or EXTEST to ensure that the proper data has been latched before exercising the command. This is typically accomplished by using the SAMPLE/PRELOAD instruction.

Consider internal pull-up and pull-down resistors when developing test vectors for testing opens and shorts. The boundary-scan mode determines if the IOB has a pull-up resistor. Figure 3-21 is a representation of Virtex-II Boundary-Scan Architecture.



Figure 3-21: Virtex Series Boundary Scan Logic

## **Bit Sequence**

The order in each non-TAP IOB is described in this section. The input is first, then the output, and finally the 3-state IOB control. The 3-state IOB control is closest to the TDO. The input-only pins contribute only the input bit to the boundary-scan I/O data register. The bit sequence of the device is obtainable from the "Boundary-Scan Description Language Files" (BSDL files) for the Virtex family. These files can be obtained from the Xilinx software download area. The bit sequence is independent of the design. It always has the same bit order and the same number of bits.

### **Bypass Register**

The other standard data register is the single flip-flop BYPASS register. It passes data serially from the TDI pin to the TDO pin during a bypass instruction. This register is initialized to zero when the TAP controller is in the CAPTURE-DR state.

#### Instruction Register

The instruction register is a 6-bit register that loads the OPCODE necessary for the Virtex-II boundary-scan instruction set. This register loads the current OPCODE and captures internal device status.

### Configuration Register (Boundary-Scan)

The configuration register is a 64-bit register. This register allows access to the configuration bus and readback operations.

### Identification Register

Virtex devices have a 32-bit identification register, commonly referred to as the IDCODE register. This register is based upon IEEE Standard 1149.1 and allows easy identification of the part being tested or programmed via boundary scan.

## Virtex-II Identification Register

The Virtex-II JTAG ID Code register has the following format.

```
3322 222222 211111111 11000000000
1098 7654321 098765432 109876543210 bit positions(00 to 31)
vvvv:fffffff:aaaaaaaaaa:ccccccccccl
```

#### where

- v is the revision code and
- f is the 7-bit family code =  $0001000 0 \times 08$
- a is the number of array rows in the part expressed in 9 bits.

| XC2V40   | = | 8 =   | 0x08  |
|----------|---|-------|-------|
| XC2V80   | = | 16 =  | 0x010 |
| XC2V250  | = | 24 =  | 0x018 |
| XC2V500  | = | 32 =  | 0x020 |
| XC2V1000 | = | 40 =  | 0x028 |
| XC2V1500 | = | 48 =  | 0x030 |
| XC2V2000 | = | 56 =  | 0x038 |
| XC2V3000 | = | 64 =  | 0x040 |
| XC2V4000 | = | 80 =  | 0x050 |
| XC2V6000 | = | 96 =  | 0x060 |
| XC2V8000 | = | 112 = | 0x070 |

c is the company code = 00001001001 = 0x049\*

\*Since the last bit of the JTAG IDCODE is always one, the last three hex digits appear as 0x093.

|         | vvvv | ffff | fff | a | aaaa | aaaa | CCCC | CCCC | CCCC |
|---------|------|------|-----|---|------|------|------|------|------|
| XC2V250 |      | 0001 | 000 | 0 | 0001 | 1000 | 0000 | 1001 | 0011 |
|         | v    | 1    | 0   |   | 1    | 8    | 0    | 9    | 3    |
| XC2V500 | v    | 1    | 0   |   | 2    | 0    | 0    | 9    | 3    |

ID Codes assigned to Virtex-II FPGAs are shown in Table 3-12.

Table 3-12: Virtex-II Device ID Codes

| FPGA     | IDCODE    |
|----------|-----------|
| XC2V40   | v01008093 |
| XC2V80   | v01010093 |
| XC2V250  | v01018093 |
| XC2V500  | v01020093 |
| XC2V000  | v01028093 |
| XC2V1500 | v01030093 |
| XC2V2000 | v01038093 |
| XC2V3000 | v01040093 |
| XC2V4000 | v01050093 |
| XC2V6000 | v01060093 |
| XC2V8000 | v01070093 |

#### Notes:

1. The "v" in the IDCODE is the revision code field.

### **USERCODE** Register

USERCODE is supported in the Virtex family as well. This register allows a user to specify a design-specific identification code. The USERCODE can be programmed into the device and read back for verification at a later time. The USERCODE is embedded into the bitstream during bitstream generation (bitgen -g UserID option) and is valid only after configuration.

### **USER1, USER2 Registers**

The USER1 and USER2 registers are only valid after configuration. These two registers must be defined by the user within the design. These registers can be accessed after they are defined by the TAP pins.

The BSCAN\_VIRTEX2 library macro is required when creating these registers. This symbol is only required for driving internal scan chains (USER1 and USER2). The BSCAN\_VIRTEX2 macro provides two user pins (SEL1 and SEL2) for determining usage of USER1 or USER2 instructions respectively. For these instructions, two corresponding pins (TDO1 and TDO2) allow user scan data to be shifted out of TDO. In addition, there are individual clock pins (DRCK1 and DRCK2) for each user register. There is a common input pin (TDI) and shared output pins that represent the state of the TAP controller (RESET, SHIFT, and UPDATE). Unlike earlier FPGA families that required the BSCAN macro to dedicate TAP pins for boundary scan, Virtex-II TAP pins are dedicated and do not require the BSCAN\_VIRTEX2 macro for normal boundary-scan instructions or operations.

Note that these are user-defined registers. The example (Figure 3-22) is one of many implementations. For HDL, the BSCAN\_VIRTEX2 macro needs to be instantiated in the design.



Figure 3-22: BSCAN\_VIRTEX2 (Example Usage)

# Using Boundary Scan in Virtex-II Devices

Characterization data for some of the most commonly requested timing parameters shown in Figure 3-23 is listed in Table 3-13.



x139\_05\_020300

#### Figure 3-23: Virtex-II Boundary Scan Port Timing Waveforms

#### Table 3-13: Boundary-Scan Port Timing Specifications

| Symbol              | Parameter                            | Value | Units    |
|---------------------|--------------------------------------|-------|----------|
| T <sub>TAPTCK</sub> | TMS and TDI setup time before TCK    | 4.0   | ns, min  |
| T <sub>TCKTAP</sub> | TMS and TDI hold times after TCK     | 2.0   | ns, min  |
| T <sub>TCKTDO</sub> | TCK falling edge to TDO output valid | 11.0  | ns, min  |
| F <sub>TCK</sub>    | Maximum TCK clock frequency          | 33.0  | MHz, max |

For further information on the Startup sequence, bitstream, and internal configuration registers referenced here, refer to "Readback" on page 296.

### Configuring Through Boundary-Scan

One of the most common boundary-scan vendor-specific instructions is the configure instruction. An individual Virtex-II device is configured via JTAG on power-up using TAP. If the Virtex-II device is configured on power-up, it is advisable to the mode pins to the boundary-scan configuration mode settings; 101 (M2 = 1, M1 = 0, M0 = 1).

Configuration flow for Virtex-II device configuration with JTAG is shown in Figure 3-24. The sections that follow describe how the Virtex-II device can be configured as a single device via boundary-scan or as part of a multiple-device scan chain.

A configured device can be reconfigured by toggling the TAP and entering a CFG\_IN instruction after pulsing the PROG\_B pin or issuing the shut-down sequence. (Refer to "Power Up" on page 248). For additional details on power-up or the start-up sequence in Virtex-II devices, see "Device Startup" on page 250.

Users seeking to implement a Virtex-II JTAG configuration algorithm are advised to use the SVF-based flow provided in Xilinx Application Note 058 (available on <u>www.xilinx.com</u>).



Figure 3-24: Device Configuration Flow Diagram

#### Single Device Configuration

Configure a Virtex-II part as a single device via boundary-scan operations as follows. Ensure that the bitstream is generated with the JTAG clock option.

bitgen -g startupclk:jtagclk designName.ncd

Also, when using iMPACT software, verify that the most current version is being used.

Table 3-14 describes the TAP controller commands required to configure a Virtex-II device. Refer to Figure 3-20 for TAP controller states. These TAP controller commands are issued automatically if configuring the part with the iMPACT software.

|     | TAD Controller Step Description                                                                                                      | Set & Hold                        |     | # of Clocks              |  |
|-----|--------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------|-----|--------------------------|--|
|     |                                                                                                                                      | TDI                               | TMS | тск                      |  |
| 1.  | On power-up, place a logic "one" on the TMS and clock the TCK five times. This ensures starting in the TLR (Test-Logic-Reset) state. | X                                 | 1   | 5                        |  |
| 2.  | Move into the RTI state.                                                                                                             | X                                 | 0   | 1                        |  |
| 3.  | Move into the SELECT-IR state.                                                                                                       | X                                 | 1   | 2                        |  |
| 4.  | Enter the SHIFT-IR state.                                                                                                            | X                                 | 0   | 2                        |  |
| 5.  | Start loading the CFG_IN instruction, LSB first:                                                                                     | 000101                            | 0   | 9                        |  |
| 6.  | Load the MSB of CFG_IN instruction when exiting SHIFT-IR, as defined in the IEEE standard.                                           | 1                                 | 1   | 1                        |  |
| 7.  | Enter the SELECT-DR state.                                                                                                           | X                                 | 1   | 2                        |  |
| 8.  | Enter the SHIFT-DR state.                                                                                                            | X                                 | 0   | 2                        |  |
| 9.  | Shift in the Virtex-II bitstream. Bit <sub>n</sub> (MSB) is the first bit in the bitstream <sup><math>(1)</math></sup> .             | bit <sub>1</sub> bit <sub>n</sub> | 0   | (bits in<br>bitstream)-1 |  |
| 10. | Shift in the last bit of the bitstream. $Bit_0$ (LSB) shifts on the transition to EXIT1-DR.                                          | bit <sub>0</sub>                  | 1   | 1                        |  |
| 11. | Enter UPDATE-DR state.                                                                                                               | X                                 | 1   | 1                        |  |
| 12. | Reset TAP by clocking five 1's on TMS                                                                                                | X                                 | 1   | 5                        |  |
| 13. | Enter the SELECT-IR state.                                                                                                           | X                                 | 1   | 2                        |  |
| 14. | Move to the SHIFT-IR state.                                                                                                          | X                                 | 0   | 2                        |  |
| 15. | Start loading the JSTART instruction. The JSTART instruction initializes the startup sequence.                                       | 01100                             | 0   | 5                        |  |
| 16. | Load the last bit of the JSTART instruction.                                                                                         | 0                                 | 1   | 1                        |  |
| 17. | Move to the UPDATE-IR state.                                                                                                         | X                                 | 1   | 1                        |  |
| 18. | Move to RTI and clock the STARTUP sequence by applying a minimum of 12 clock cycles to the TCK.                                      | X                                 | 0   | Š12                      |  |
| 19. | Move to the TLR state. The device is now functional.                                                                                 | X                                 | 1   | 3                        |  |

#### Notes:

1. In the Configuration Register, data is shifted in from the right (TDI) most significant bit, to the left, least significant bit (D0 ... D7).

#### **Multiple Device Configuration**

It is possible to configure multiple Virtex-II devices in a chain. The devices in the JTAG chain are configured one at a time. The multiple device configuration steps can be applied to any size chain. Ensure the bitstream is generated with the JTAG clock option.

bitgen -g startupclk:jtagclk designName.ncd

Refer to the State Diagram in Figure 3-20 for the following TAP controller steps.

- 1. On power-up, place a logic "one" on the TMS and clock the TCK five times. This ensures starting in the TLR (Test-Logic-Reset) state.
- 2. Load the CFG\_IN instruction into the target device (and BYPASS in all other devices). Go through RTI (RUN-TEST/IDLE).
- 3. Load in the configuration bitstream per step 7 through step 11 in Table 3-14.
- 4. Repeat step 2 and step 3 for each device.
- 5. Reset all TAPs by clocking five 1's on TMS.
- 6. Load the JSTART command into all devices.
- Go to RTI and clock TCK 12 times. All devices are active at this point.



Figure 3-25: Boundary Scan Chain of Devices

#### Notes:

1. PROG\_B pin should be deasserted during JTAG operation.

## Reconfiguring Through Boundary Scan

The ability of Virtex-II devices to perform partial reconfiguration is the reason that the configuration memory is not cleared when reconfiguring the device. When reconfiguring a chain of devices, refer to step 3 in Table 3-14. There are two methods to reconfigure Virtex-II devices without possible internal contention. The first method is to pulse the PROG\_B pin which resets the internal configuration memory. The alternate method is to perform a shutdown sequence, placing the device in a safe state. The following shutdown sequence includes using internal registers. (For details on internal registers, refer to "Readback" on page 296.)

- 1. Load the CFG\_IN instruction.
- 2. In SHIFT-DR state, load the synchronization word followed by the Reset CRC Register (RCRC) command.

- 3. Load JSHUTDOWN.
- 4. Go to RTI and clock TCK at least 12 times to clock the shutdown sequence.
- 5. Proceed to SHIFT-IR state and load the CFG\_IN instruction again.
- 6. Go to SHIFT-DR state and load the configuration bits. Make sure the configuration bits contain AGHIGH command, which asserts the global signal GHIGH\_B. This prevents contention while writing configuration data.

- 7. When all configuration bits have been loaded, reset the TAP by clocking five 1's on TMS.
- 8. Go to SHIFT-IR state and load the JSTART instruction.
- 9. Go to RTI and clock TCK at least 12 times to clock the startup sequence.
- 10. Go to TLR state to complete the reconfiguration process.
- 11. Reset the DCMs after reconfiguration through JTAG.

### **Debugging Configuration**

To verify successful configuration, there are several options. Some of the most helpful verification steps include using TAP pins and the readback command. Using the Virtex-II TAP controller and status pins is discussed first.

When using TAP controller pins, TDO is driven only in the SHIFT-DR and SHIFT-IR state. If the output of the TDO can be changed via an external pull-up resistor, the TAP is not in SHIFT-IR or SHIFT-DR. If the TAP can be controlled precisely, use this to test the application.

In JTAG configuration, the status pin (DONE) functions the same as in the other configuration modes. The DONE pin can be monitored to determine if a bitstream has been completely loaded into the device. If DONE is Low, the entire bitstream has not been sent or the start-up sequence is not finished. If DONE is High, the entire bitstream has been received correctly. The INIT\_B pin functions similar to a normal INIT\_B but does not indicate a configuration error in boundary-scan configuration.

In addition to external pin monitoring, an internal test can be conducted. The second method includes the following steps to capture the internal device status register contents:

- 1. Move the TAP to TLR state.
- 2. Go to SHIFT-IR state and load in the CFG\_IN instruction.
- 3. Go to SHIFT-DR state and shift in the following 64-bit pattern with the MSB (left-most bit), shifted in first.

```
      1111
      1111
      1111
      1111
      1111
      1111
      1111
      -> Dummy word

      1010
      1001
      1001
      0101
      0101
      0110
      0110
      -> Synchronization word

      0010
      1000
      0000
      0000
      1110
      0000
      0010
      -> Read STATus Register <sup>1</sup>)

      0000
      0000
      0000
      0000
      0000
      0000
      -> flush pipe

      0000
      0000
      0000
      0000
      0000
      0000
      -> flush pipe

      0000
      0000
      0000
      0000
      0000
      -> flush pipe
```

1. Since the JTAG readback shift register is 64-bit long, two 32-bit words are needed to fill the shift register.

- 4. After shifting in the pattern, load the CFG\_OUT instruction in the SHIFT-IR state.
- 5. Move to SHIFT-DR state and clock TCK 32 times while reading TDO. The data seen on TDO is the content of the status register. The last bit out is a one if a CRC error occurred. If successful, it should read as follows.

Notes:

1. MMM is the mode pins value.

2. Assuming that the device is in normal operation mode.

Since the read status activity causes the crc\_error status to be asserted, it is important to clear the crc\_error status to ensure normal device operation. This can be done by writing the precalculated CRC value to the CRC register or writing an RCRC command.

- 6. Go to SHIFT-IR state and load the CFG\_IN instruction again.
- 7. Move to SHIFT-DR state and shift in the following bit pattern:

 0011
 0000
 0000
 1000
 0000
 0001 -> Header: Write to CMD register

 0000
 0000
 0000
 0000
 0000
 0111 -> RCRC command

 0000
 0000
 0000
 0000
 0000
 0000 -> flush pipe

 0000
 0000
 0000
 0000
 0000
 0000 -> flush pipe

8. Put the TAP in TLR state when finished.

The device status register also gives the status of the DONE and INIT\_B signals. For information on the status register, refer to Figure 3-30.





ISC Modal States

Figure 3-26: ISC Modal States

Once the device is powered up, it goes to an Unprogrammed state. The I/Os are all either 3-stated or pulled up. When ISC\_ENABLE is successfully executed, the ISC\_Enabled signal is asserted, and the device moves to ISC\_Accessed state. When the device moves to ISC\_Accessed state from Operational state, the shutdown sequence is executed. The I/Os are all either 3-stated or pulled up.

The StartUp sequence is executed when in the ISC\_Accessed state. At the end of the StartUp Sequence, ISC\_Enabled is cleared and the device moves to ISC\_Complete. The minimum clock cycle requirement is the number of clock cycles required to complete the StartUp sequence. At the completion of the minimum required clock cycles, ISC\_Enabled is deasserted.

Whether the StartUp sequence is successful or not is determined by CRC or configuration error status from the configuration processor. If the startup is completed, ISC\_Done is asserted; otherwise, ISC\_Done stays Low. The I/Os are either 3-stated or pulled up.

When ISC\_Done is set in ISC\_Complete state, the device moves to the Operational state. Otherwise, if ISC\_Done is clear, the device moves to an Unprogrammed state. However, if the TAP controller goes to TLR state while the device is in ISC\_Accessed state and if ISC\_Done is set, then the device moves to the Operational state. However, the I/O is not active yet because the Startup sequence has not been performed. The Startup sequence has to be performed in the Operational state to bring the I/O active.

## Clocking Startup and Shutdown Sequence (JTAG Version)

There are three clock sources for Startup and Shutdown sequence, CCLK, UserCLK, and JTAGCLK. Clock selection is set by BitGen. The Startup sequence is executed in ISC\_Accessed state. When it is clocked by JTAGCLK, the Startup sequence receives the JTAGCLK in TAP Run/Test Idle state while ISC\_DISABLE is the current JTAG instruction. The number of clock cycles in Run/Test Idle state for successful completion of ISC\_DISABLE is determined by the number of clock cycles needed to complete the Startup sequence.

When UserCLK or CCLK is used to clock the Startup sequence, the user should know how many JTAGCLK cycles should be spent in Run/Test Idle to successfully complete the Startup sequence.

The Shutdown sequence is executed when the device transitions from an Operational to ISC\_Accessed state. Shutdown is done while executing the ISC\_ENABLE instruction. When the Shutdown sequence is clocked using JTAGCLK, the clock is supplied in the Run/Test Idle state of the ISC\_ENABLE instruction. The number of clock cycles in Run/Test Idle is determined by the number of clock cycles needed to complete the Shutdown sequence.

When the Shutdown sequence is clocked by CCLK or UserCLK, the user is responsible for knowing how many JTAGCLK cycles in Run/Test Idle are needed to complete the Shutdown sequence.

#### Notes:

- 1. It has been decided that when configuring the device through JTAG, the startup and shutdown clock should come from TCK, regardless of the selection in BitGen.
- 2. In IEEE 1532 configuration mode, Startup and Shutdown clock source is always TCK.

3

# Configuration Flows Using JTAG



Figure 3-27: IEEE 1532 Configuration Flow



Figure 3-28: Signal Diagram for Successful First Time ISC Configuration



Figure 3-29: Signal Diagram for Successful ISC Partial and Full Reconfiguration

# **Configuration Details**

This section provides a bit-level understanding of the configuration stream. For the purpose of debugging, designing embedded readback operations, or otherwise complex styles of configuring multiple FPGAs, the Virtex-II bitstream, internal configuration logic, and internal processing of configuration data are described here.

### Data Frames

The internal configuration memory is partitioned into segments called "Frames." The portions of the bitstream that actually get written to the configuration memory are "Data Frames." The number and size of frames varies with device size as shown in Table 3-15. The total number of configuration bits for a particular device is calculated by multiplying the number of frames by the number of bits per frame, and then adding the total number of bits needed to perform the *Configuration Register Writes* shown in Table 3-15.

Table 3-15: Virtex-II Configuration Data Frames and Programming Times

| Device   | No. of<br>Frames | Frame<br>Length in<br>Bits | Configuration<br>Bits | Total No. of<br>Bits (including<br>header) | Approx.<br>SelectMAP<br>Download Time<br>(50 MHz) ms | Approx. Serial<br>Download Time<br>(50 MHz) ms | Approx. JTAG<br>Download Time<br>(33 MHz) ms |
|----------|------------------|----------------------------|-----------------------|--------------------------------------------|------------------------------------------------------|------------------------------------------------|----------------------------------------------|
| XC2V40   | 404              | 832                        | 360,096               | 339,040                                    | 0.84                                                 | 6.72                                           | 10.19                                        |
| XC2V80   | 404              | 1472                       | 635,296               | 598,880                                    | 1.49                                                 | 11.89                                          | 18.02                                        |
| XC2V250  | 752              | 2112                       | 1,697,184             | 1,593,696                                  | 3.97                                                 | 31.76                                          | 48.13                                        |
| XC2V500  | 928              | 2752                       | 2,761,888             | 2,560,608                                  | 6.38                                                 | 51.08                                          | 77.39                                        |
| XC2V1000 | 1104             | 3392                       | 4,082,592             | 3,752,800                                  | 9.36                                                 | 74.90                                          | 113.48                                       |
| XC2V1500 | 1280             | 4032                       | 5,659,296             | 5,170,272                                  | 12.90                                                | 103.22                                         | 156.39                                       |
| XC2V2000 | 1456             | 4672                       | 7,492,000             | 6,813,024                                  | 17.01                                                | 136.05                                         | 206.13                                       |
| XC2V3000 | 1804             | 5312                       | 10,494,368            | 9,594,720                                  | 23.96                                                | 191.66                                         | 290.39                                       |
| XC2V4000 | 2156             | 6592                       | 15,659,936            | 14,226,784                                 | 35.53                                                | 284.25                                         | 430.68                                       |
| XC2V6000 | 2508             | 7872                       | 21,849,504            | 19,759,968                                 | 49.36                                                | 394.86                                         | 598.27                                       |
| XC2V8000 | 2860             | 9152                       | 29,063,072            | 26,194,272                                 | 65.44                                                | 523.49                                         | 793.17                                       |

## **Configuration Registers**

The Virtex-II configuration logic was designed so that an external source can have complete control over all configuration functions by accessing and loading addressed internal configuration registers over a common configuration bus. The internal configuration registers that are used for configuration and readback are listed in Table 3-16. All configuration data, except the synchronization word and dummy words, is written to internal configuration registers.

Table 3-16: Internal Configuration Registers

| Symbol | Register Name                                            | Address |
|--------|----------------------------------------------------------|---------|
| CRC    | CRC Register                                             | 00000   |
| FAR    | Frame Address Register                                   | 00001   |
| FDRI   | Frame Data Input Register (Write Configuration Data)     | 00010   |
| FDRO   | Frame Data Output Register (Readback Configuration Data) | 00011   |
| CMD    | Command Register                                         | 00100   |

| Symbol | Register Name                                 | Address |
|--------|-----------------------------------------------|---------|
| CTL    | Control Register                              | 00101   |
| MASK   | Masking Register for CTL                      | 00110   |
| STAT   | Status Register                               | 00111   |
| LOUT   | Legacy Output Register (DOUT for daisy chain) | 01000   |
| COR    | Configuration Option Register                 | 01001   |
| MFWR   | Multiple Frame Write                          | 01010   |
| FLR    | Frame Length Register                         | 01011   |
| IDCODE | Product ID Code Register                      | 01110   |

#### Table 3-16: Internal Configuration Registers

#### Command Register (CMD)

Commands shown in Table 3-17 are executed by loading the binary code into the CMD register.

| Symbol   | Command                                   | Binary Code |
|----------|-------------------------------------------|-------------|
| WCFG     | Write Configuration Data                  | 0001        |
| MFWR     | Multi-Frame Write                         | 0010        |
| DGHIGH   | De-asserts GHIGH                          | 0011        |
| RCFG     | Read Configuration Data                   | 0100        |
| START    | Begin STARTUP Sequence                    | 0101        |
| RCAP     | Reset CAPTURE (after Single-Shot Capture) | 0110        |
| RCRC     | Reset CRC Register                        | 0111        |
| AGHIGH   | Assert GHIGH                              | 1000        |
| SWITCH   | Switch CCLK Frequency                     | 1001        |
| GRESTORE | Pulse GRESTORE Signal                     | 1010        |
| SHUTDOWN | Begin SHUTDOWN Sequence                   | 1011        |
| GCAPTURE | Pulse GCAPTURE Signal (one shot)          | 1100        |
| DESYNCH  | Forces realignment to 32 bits             | 1101        |

Table 3-17: CMD Register Commands

### Frame Length Register (FLR)

The FLR is used to indicate the frame size to the internal configuration logic. This allows the internal configuration logic to be identical for all Virtex-II devices. The value loaded into this register is the number of actual configuration words that get loaded into the configuration memory frames.
### Configuration Option Register (COR)

The COR is loaded with the user selected options from bitstream generation. See Appendix A: BitGen and PROMGen Switches and Options.

| Name         | Description                                                       | Bits  |
|--------------|-------------------------------------------------------------------|-------|
| CRC_BYPASS   | Does not check against updated CRC value.                         | 29    |
| SHUT_RST_DCI | DCI resets if SHUTDOWN and AGHIGH are performed.                  | 27    |
| SHUT_RST_DCM | DCM resets if SHUTDOWN and AGHIGH are performed.                  | 26    |
| DONE_PIPE    | Add pipeline stage to DONEIN.                                     | 25    |
| DRIVE_DONE   | DONE pin is an active driver, not open drain.                     | 24    |
| SINGLE       | Readback capture is one shot.                                     | 23    |
| OSCFSEL      | Select CCLK frequency in Master Serial Mode.                      | 22:17 |
| SSCLKSRC     | Select STARTUP block clock source.                                | 16:15 |
| DONE_CYCLE   | Startup cycle when DONE is asserted/de-asserted.                  | 14:12 |
| MATCH_CYCLE  | Stall in this Startup cycle until DCI match signals are asserted. | 11:9  |
| LOCK_CYCLE   | Stall in this Startup cycle until DCM signals are asserted.       | 8:6   |
| GTS_CYCLE    | Startup cycle when GTS_CFG_B is de-asserted.                      | 5:3   |
| GWE_CYCLE    | Startup cycle when GWE is asserted.                               | 2:0   |

Table 3-18: Configuration Option Register

### Control Register (CTL)

The CTL controls internal functions such as *Security* and *Port Persistence*.

Table 3-19: Control Register

| Name      | Description                                                                      | Bits |
|-----------|----------------------------------------------------------------------------------|------|
| SBITS     | Security level.                                                                  | 4:5  |
| PERSIST   | Configuration ports remain after configuration.                                  | 3    |
| Reserved  | For internal use.                                                                | 2:1  |
| GTS_USR_B | Active Low global 3-state I/Os. Turns off pullups if GTS_CFG_B is also asserted. | 0    |

### Mask Register (MASK)

The MASK is a safety mechanism that controls which bits of the CTL register can be reloaded. Prior to loading new data into the CTL register, each bit must be independently enabled by its corresponding bit in the MASK register. Any CTL bit not selected by the MASK register is ignored when reloading the CTL register.

### Frame Address Register (FAR)

The FAR sets the starting frame address for the next configuration data input write cycle.

### Frame Data Register Input (FDRI)

The FDRI is the input stage for configuration data frames to be stored in the configuration memory. Starting with the frame address specified in the FAR, the FDRI writes its contents to the configuration memory frames. The FDRI automatically increments the frame

address after writing each frame for the number of frames specified in the FDRI write command. This is detailed in the next section.

### CRC Register (CRC)

The CRC is loaded with a CRC value that is embedded in the bitstream and compared against an internally calculated CRC value. Resetting the CRC register and circuitry is controlled by the CMD register.

### Frame Data Register Output (FDRO)

FDRO is an output stage for reading frame data from the configuration memory during readback. This works the same as the FDRI but with data flowing in the other direction.

### Legacy Data Output Register (LOUT)

LOUT is pipeline data to be sent out the DOUT pin for serially daisy-chained configuration data output.

### Status Register (STAT)

The STAT register contains bits that indicate the state of the device. Such bits include the status of error pins, global signals, the DCM, and DCI. This register is read-only and can be read using the JTAG or SelectMAP port for debugging purposes.

|   |   |    |    |    |    |    |    |    |    |    |    |    |    |    |    |    | RESERVED | RESERVED | <b>ID_ERROR</b> | DONE |    |    | MODE |   | GHIGH_B | GWE | GTS_CFG_B | IN_ERROR | DCI_MATCH | DCM_LOCK | RESERVED | CRC_ERROR |
|---|---|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----|----------|----------|-----------------|------|----|----|------|---|---------|-----|-----------|----------|-----------|----------|----------|-----------|
| 3 | 1 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 | 15       | 14       | 13              | 12   | 11 | 10 | 9    | 8 | 7       | 6   | 5         | 4        | 3         | 2        | 1        | 0         |
| ( | 0 | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | 0  | х        | х        | х               | х    | х  | х  | х    | х | х       | х   | х         | х        | х         | х        | х        | х         |

Figure 3-30: Status Register Fields

### Table 3-20: Status Register

| Name      | Description                                     | Bit Location |
|-----------|-------------------------------------------------|--------------|
| ID_ERROR  | IDCODE not validated while trying to write FDRI | 13           |
| DONE      | DONEIN input form DONE pin                      | 12           |
| INIT_B    | Value of CFG_RDY (INIT_B)                       | 11           |
| MODE      | Value or MODE pins (M2, M1, M0)                 | 10:8         |
| GHIGH_B   | Status of GHIGH                                 | 7            |
| GWE       | Status of GWE                                   | 6            |
| GTS_CFG_B | Status of GTS_CFG_B                             | 5            |
| IN_ERROR  | Legacy input error                              | 4            |
| DCI_MATCH | DCI matched                                     | 3            |
| DCM_LOCK  | DCM matched                                     | 2            |
| Reserved  | For internal use                                | 1            |
| CRC_ERROR | CRC error                                       | 0            |

## Configuration Data Processing Flow

The complete (standard) reconfiguration of a Virtex-II device follows the internal flow shown in Figure 3-31. All associated configuration commands are listed in Table 3-21.



Figure 3-31: Internal Configuration Processing Flow

| Туре                 | Number of 32-bit Words |  |  |  |  |
|----------------------|------------------------|--|--|--|--|
| Command Set 1        |                        |  |  |  |  |
| Dummy words          | 1                      |  |  |  |  |
| Synchronization word | 1                      |  |  |  |  |
| Write CMD (RCRC)     | 2                      |  |  |  |  |
| Write FLR            | 2                      |  |  |  |  |
| Write COR            | 2                      |  |  |  |  |
| Write ID             | 2                      |  |  |  |  |
| Write MASK           | 2                      |  |  |  |  |
| Write CMD (SWITCH)   | 2                      |  |  |  |  |
| Command Set 2        |                        |  |  |  |  |
| Write FAR            | 2                      |  |  |  |  |
| Write CMD (WCFG)     | 2                      |  |  |  |  |
| Write FDRI           | part size dependent    |  |  |  |  |
| Write CMD (DGHIGH)   | 2                      |  |  |  |  |
| Command Set 3        |                        |  |  |  |  |
| Write COR            | 2                      |  |  |  |  |
| Write CMD (START)    | 2                      |  |  |  |  |
| Write CTL            | 2                      |  |  |  |  |
| Write CRC            | 2                      |  |  |  |  |
| Write CMD (DESYNCH)  |                        |  |  |  |  |
| Dummy words          | 4                      |  |  |  |  |
| TOTAL                | 40                     |  |  |  |  |

#### Table 3-21: Configuration Register Writes

The first command set prepares the internal configuration logic for the loading of the data frames. The internal configuration logic is first initialized with several CCLK cycles represented by dummy words, then it is synchronized to recognize the 32-bit word boundaries by the synchronization word. The CRC register and circuitry must then be reset by writing the RCRC command to the CMD register. The frame length size for the device being configured is then loaded into the FLR register. The configuration options are loaded into the COR. The CCLK frequency selected is specified in the COR; however, to switch to that frequency the SWITCH command must be loaded into the CMD register. The ID register is written to ensure that the correct bitstream is being used. Now the data frames can be loaded.

The second command set loads the configuration data frames. First, a WCFG (Write Configuration) command is loaded into the CMD register activating the circuitry that writes the data loaded into the FDRI into the configuration memory cells. To load a set of data frames, the starting address for the first frame is first loaded to the FAR, followed by a write command, and then by the data frames to the FDRI. The FDRI write command also specifies the amount of data that is to follow in terms of the number of 32-bit words that comprise the data frames being written. When all but the last frame has been loaded, an initial CRC checksum is loaded into the CRC register. The De-assert GHIGH (DGHIGH) is loaded into the CMD register.

The third command set initializes the Start-Up Sequence and finishes CRC checking. After all the data frames have been loaded, the START command is loaded into the CMD register,

followed by any internal control data to CTL, the final CRC value into the CRC register, and the DESYNCH command to the CMD register. The four dummy words at the end are flushed through the system to provide the finishing CCLK cycles to activate the FPGA.

### Standard Bitstream

Virtex-II devices have the ability to be only partially re-configured or read back. The standard bitstream, currently generated by BitGen, follows the format shown in Table 3-22, Table 3-23, and Table 3-24. *This format assumes D0 is considered the MSB*. It is divided into three tables to follow the three command sets described in the previous subsection. Table 3-22 shows the first set of commands in the bitstream that prepare the configuration logic for rewriting the memory frames. All commands are described as 32-bit words, since configuration data is internally processed from a common 32-bit bus.

| <b>e</b> 1                                        |  |  |  |  |  |  |
|---------------------------------------------------|--|--|--|--|--|--|
| Data Type                                         |  |  |  |  |  |  |
| Dummy word                                        |  |  |  |  |  |  |
| Synchronization word                              |  |  |  |  |  |  |
| Packet Header: Write to CMD register              |  |  |  |  |  |  |
| Packet Data: RCRC                                 |  |  |  |  |  |  |
| Packet Header: Write to FLR register              |  |  |  |  |  |  |
| Packet Data: Frame Length                         |  |  |  |  |  |  |
| Packet Header: Write to COR                       |  |  |  |  |  |  |
| Packet Data: Configuration options (user defined) |  |  |  |  |  |  |
| Packet Header: Write to ID register               |  |  |  |  |  |  |
| Packet Data: IDCODE                               |  |  |  |  |  |  |
| Packet Header: Write to CMD register              |  |  |  |  |  |  |
| Packet Data: SWITCH                               |  |  |  |  |  |  |
| Packet Header: Write to CMD register              |  |  |  |  |  |  |
| Packet Data: WCFG                                 |  |  |  |  |  |  |
|                                                   |  |  |  |  |  |  |

Table 3-22: Bitstream Header and Configuration Options

From Table 3-22, the first dummy word pads the front of the bitstream to provide the clock cycles necessary for initialization of the configuration logic. No actual processing takes place until the synchronization word is loaded. Since the Virtex-II configuration logic processes data as 32-bit words, but can be configured from a serial or 8-bit source, the synchronization word is used to define the 32-bit word boundaries. That is, the first bit after the synchronization word is the first bit of the next 32-bit word, and so on.

After synchronization, all data (register writes and frame data) are encapsulated in *packets*. There are two kinds of packets, Header and Data. A header packet has two types: Type 1 and Type 2. Type 1 Packet Headers are used for register writes. A combination of Type 1 and Type Packet Headers are used for frame data writes. A Type 1 Packet Header, shown in Figure 3-32, is always a single 32-bit word that describes the header type, whether it is a read/write function to a specific configuration register address (see Table 3-16) as the destination, and how many 32-bit words are in the following Packet Data portion. A Type 1 Packet Data portion can contain anywhere from 0 to 2,047 32-bit data words.

| Packet Header | Туре  | Operation<br>(Write/Read) | Register Address<br>(Destination) | Byte Address | Word Count<br>(32-bit Words) |
|---------------|-------|---------------------------|-----------------------------------|--------------|------------------------------|
| Bits[31:0]    | 31:29 | 28:27                     | 26:13                             | 12:11        | 10:0                         |
| Type 1        | 001   | 10 / 01                   | XXXXXXXXXXXXXXXX                  | XX           | XXXXXXXXXXXXX                |

X138\_10\_082599

Figure 3-32: Type 1 Packet Header

| Packet Header | Туре  | Operation (Write/Read) | Word Count (32-bit Words)               |
|---------------|-------|------------------------|-----------------------------------------|
| Bits[31:0]    | 31:29 | 28:27                  | 26:0                                    |
| Type 2        | 010   | 10 / 01                | XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX |

X138\_11\_082599

### Figure 3-33: Type 2 Packet Header

The first packet header in Table 3-22 is a Type 1 packet header that specifies writing one data word to the CMD register. The following packet data is a data word specifying a reset of the CRC register (compare the data field of Table 3-22 to the binary codes of Table 3-17).

The second packet header in Table 3-22 loads the frame size into the FLR.

The third packet header loads the configuration options into the COR register. The binary description of this register is not documented. Following this is a similar write of the SWITCH command to the CMD register which selects the CCLK frequency specified in the COR. Finally, the WCFG command is loaded into the CMD register so that the loading of frame data can commence.

The fourth packet header writes to the ID register. This ensures the correct bitstream for the correct Virtex-II family member.

Table 3-23 shows the packets that load all of the data frames, starting with a Type 1 packet header to load the starting frame address, which is always 0h.

| Data Type                                                                                                              |
|------------------------------------------------------------------------------------------------------------------------|
| Packet Header: Write to FAR register                                                                                   |
| Packet Data: Starting frame address                                                                                    |
| Packet Header: Write to FDRI                                                                                           |
| Packet Header Type 2: Data words                                                                                       |
| <b>Packet Data:</b> Configuration data frames in 32-bit words. Total number of words specified in Type 2 Packet Header |
| Packet Data: CRC value                                                                                                 |
| Packet Header: Write to CMD register                                                                                   |
| Packet Data: GRESTORE                                                                                                  |
| Packet Header: Write to CMD register                                                                                   |
| Packet Data: DGHIGH                                                                                                    |
| Packet Header: NO OP                                                                                                   |
| Packet Data: one frame of NO OP                                                                                        |

### Table 3-23: Bitstream Data Frames and CRC Sequence

The loading of data frames requires a combination of Type 1 and Type 2 packet headers. Type 2 packet headers must always be preceded by a Type 1 packet header. The Type 2 packet data can be up to 67,108,863 data words in size.

The Type 2 packet header, shown in Figure 3-33, differs slightly from a Type 1 packet header in that there is no Register Address or Byte Address fields.

To write a set of data frames to the configuration memory, after the starting frame address has been loaded into the FAR, a Type 1 packet header issues a write command to the FDRI, followed by a Type 2 packet *header* specifying the number of data words to be loaded, and then followed by the actual frame data as Type 2 packet *data*. Writing data frames might require a Type 1/Type 2 packet header combination, or a Type 1 only. This depends on the amount of data being written.

Table 3-24 shows the packets needed to issue the start-up operations and load the final CRC check. The FPGA does not go active until after the final CRC is loaded. The number of clock cycles required to complete the start-up sequence depends on the BitGen options selected. Completion of the configuration process requires 8 to 16 clock cycles after the DESYNCH command. The DESYNCH command forces realignment to 32-bit boundaries and, therefore, a synchronization word is needed.

| Data Type                            |
|--------------------------------------|
| Packet Header: Write to CMD register |
| Packet Data: START                   |
| Packet Header: Write to MASK         |
| Packet Data: CTL mask                |
| Packet Header: Write to CTL          |
| Packet Data: Control commands        |
| Packet Header: Write to CRC          |
| Packet Data: CRC value               |
| Packet Header: Write to CMD          |
| Packet Data: DESYNCH command         |
| Dummy word                           |
| Dummy word                           |
| Dummy word                           |
| Dummy word                           |

| Table 3-24: | Bitstream | Final (  | CRC and | Start-Ur | o Seo | luence |
|-------------|-----------|----------|---------|----------|-------|--------|
|             | Ditaticam | i inai v |         |          |       | uchicc |

Typically, DONE is released within the first seven CCLK cycles after the final CRC value is loaded, but the rest of the dummy data at the end of the stream should continue to be loaded. The FPGA needs the additional clock cycles to finish internal processing, but this is not a concern when a free-running oscillator is used for CCLK. In serial mode, this requires only 16 bits (two bytes), but in SelectMAP mode, this requires 16 bytes of dummy words at the end of the bitstream. Since the intended configuration mode to be used is unknown by Bitgen, four 32-bit dummy words (16 bytes) are always placed at the end of the bitstream.

## Cyclic Redundancy Checking Algorithm

Virtex-II configuration uses a standard 16-bit CRC checksum algorithm to verify bitstream integrity during configuration. The 16-bit CRC polynomial is shown below.

 $CRC-16 = X^{16} + X^{15} + X^2 + 1$ 

The algorithm is implemented by shifting the data stream into a 16-bit shift register, shown in Figure 3-34. Register Bit(0) receives an XOR of the incoming data and the output of Bit(15). Bit(2) receives an XOR of the input to Bit(0) and the output of Bit(1). Bit(15) receives an XOR of the input to Bit(0) and the output of Bit(14).

A CRC Reset resets all the CRC registers to zero. As data is shifted into the CRC circuitry, a CRC calculation accumulates in the registers. When the CRC value is loaded into the CRC calculation register, the ending CRC checksum is loaded into the CRC Register. The value loaded into the CRC Register should be zero; otherwise, the configuration failed CRC check.



Figure 3-34: Serial 16-bit CRC Circuitry

Not all of the configuration stream is loaded into the CRC circuitry. Only data that is written to one of the registers shown in Table 3-21 is included. For each 32-bit word that is written to one of the registers (Table 3-21), the address code for the register and the 32-bit data word is shifted LSB first into the CRC calculation circuitry, see Figure 3-34. When multiple 32-bit words are written to the same register, the same address is loaded after each word. All other data in the configuration stream is ignored and does not affect the CRC checksum.

This description is a model that can be used to generate an identical CRC value. The actual circuitry in the device is a slightly more complex Parallel CRC circuit that produces the same result.

# Readback

Readback is the process of reading all the data in the internal configuration memory. This can be used to verify that the current configuration data is correct and to read the current state of all internal CLB and IOB registers as well as the current LUT RAM and block RAM values.

Readback is only available through the SelectMAP and Boundary Scan interfaces. This discussion covers the use of the SelectMAP interface for performing readback. For information on using the Boundary Scan interface for readback see "Readback When Using Boundary Scan" on page 297.

## Readback Verification and Capture

Readback verification is used to verify the validity of the stored configuration data. This is most commonly used in space-based applications where exposure to radiation might alter the data stored in the configuration memory cells.

Readback capture is used to list the states of all the internal flip-flops. This can be used for hardware debugging and functional verification. When Capture is initiated, the internal register states are loaded into unused spaces in the configuration memory which can be extracted after a readback of the configuration memory.

While both *Verify* and *Capture* can be performed in one readback, each require slightly different preparation and post processing.

The BUSY pin should be monitored to detect the availability of valid readback data. Several cycles pass between the time the readback command is written and valid data is presented. Readback data is indicated by the BUSY pin going Low.

## Preparing for Readback in Design Entry

If only a readback verification is to be performed, there are no additional steps at the time of design entry. However, if readback capture is to be used, the Virtex-II library primitive CAPTURE\_VIRTEX2 must be instantiated in the user design as shown in Figure 3-35.

The CAPTURE\_VIRTEX2 component is used in the FPGA design to control when the logic states of all the registers are captured into configuration memory. The CLK pin can be driven by any clock source that would synchronize Capture to the changing logic states of the registers. The CAP pin is an enable control. When CAP is asserted, the register states are captured in memory on the next CLK rising edge.

Capture can be performed in two ways: single-shot or continuous. In continuous capture, the CAP line is held High until the desired capture event occurs causing CAP to go Low. See Figure 3-35. Continuous capture does not require a readback operation to reset the CAPTURE block. In single-shot capture, the CAP line is pulsed once, and subsequent pulses are ignored until a readback operation has been performed. Captured data is read using the same process as a normal readback.



Figure 3-35: Readback CAPTURE\_VIRTEX2 Library Primitive

## Enabling Readback in the Software

Since readback is performed through the SelectMAP interface after configuration, the configuration ports must continue to be active by setting the persistence switch in BitGen. Additionally, a readback bit file, which contains the commands to execute a readback and a bitmap for data verification, can optionally be generated by setting the readback option in BitGen. An example of the BitGen command line is shown below.

bitgen -w -l -m -g readback -g persist:yes...

The **-w** option overwrites existing output. The **-l** option generates a *Logic Allocation* file. The **-m** option generates a *Mask* file. The **-g readback** option generates a *readback bit* file, and the **-g persist:yes** option keeps the SelectMAP interface active after configuration. For more information on BitGen options, see Appendix A: BitGen and PROMGen Switches and Options.

## Readback When Using Boundary Scan

### **Regular Readback Flow**

It is highly recommended to perform shutdown before reading back bitstream to ensure normal operation. The Shutdown Sequence can be executed by loading the JSHUTDOWN instruction and spending at least 12 TCK cycles in RTI TAP controller state. CRC\_ERROR status and configuration error (CFGERR) must be cleared after readback by issuing Reset CRC bitstream command or writing the correct CRC value to CRC register.



Figure 3-36: Regular Readback Flow

## IEEE 1532 Readback Flow

In IEEE 1532 readback mode, full chip shutdown is performed when ISC\_ENABLE is executed. At the end of readback, CRC Error status must be cleared by issuing Reset CRC command or writing the correct CRC value to CRC register. ISC\_DISABLE cannot be executed correctly unless the CRC error status is cleared.



Figure 3-37: IEEE 1532 Readback Flow

www.xilinx.com 1-800-255-7778

# Using ChipScope ILA

The ChipScope ILA functional verification tool is currently sold separately through the Xilinx web site. This program uses a combination of PC software and instantiated soft cores to capture states of internal signals. This data is read out of the JTAG USER1 scan chain using the MultiLINX cable or a parallel cable. ChipScope ILA supports only the Virtex architecture and allows for the functional verification and debugging of an FPGA configured design.

ChipScope ILA supports the high speed USB interface to the MultiLINX cable set on Windows 98/2000 platforms and the RS232 connection on Windows 95/98/2000/NT platforms. UNIX support is not available. More details are available under ChipScope ILA at: <u>www.xilinx.com</u>



# Chapter 4

# **PCB** Design Considerations

# **Summary**

This chapter covers the following topics:

- Pinout Information
- Pinout Diagrams
- Package Specifications
- Flip-Chip Packages
- Thermal Data
- Printed Circuit Board Considerations
- Board Routability Guidelines
- Power Consumption
- IBIS Models
- BSDL and Boundary Scan Models

4

# **Pinout Information**

# Introduction

This section describes the pinouts for Virtex-II devices in the following packages:

- CS144: wire-bond chip-scale ball grid array (BGA) of 0.80 mm pitch
- FG256, FG456, and FG676: wire-bond fine-pitch BGA of 1.00 mm pitch
- . FF896, FF1152, FF1517: flip-chip fine-pitch BGA of 1.00 mm pitch
- BG575 and BG728: wire-bond BGA of 1.27 mm pitch
- BF957: flip-chip BGA of 1.27 mm pitch

All of the devices supported in a particular package are pinout compatible and are listed in the same table (one table per package). In addition, the FG456 and FG676 packages are compatible, as are the FF896 and FF1152 packages. Pins that are not available for the smallest devices are listed in right-hand columns.

Each device is split into eight I/O banks to allow for flexibility in the choice of I/O standards (see the Virtex-II Data Sheet). Global pins, including JTAG, configuration, and power/ground pins, are listed at the end of each table. Table 4-1 provides definitions for all pin types.

The FG256 pinouts (Table 4-2) is included as an example. All Virtex-II pinout tables are available on the distribution CD-ROM, or on the www.xilinx.com website.

# **Pin Definitions**

Table 4-1 provides a description of each pin type listed in Virtex-II pinout tables.

Table 4-1: Virtex-II Pin Definitions

| Pin Name                                | Direction    | Description                                                                                                                                                                |
|-----------------------------------------|--------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| User I/O Pins                           |              |                                                                                                                                                                            |
| IO_LXXY_#                               | Input/Output | All user I/O pins are capable of differential signalling and can implement LVDS, ULVDS, BLVDS, LVPECL, or LDT pairs. Each user I/O is labeled " <b>IO_LXXY_#</b> ", where: |
|                                         |              | IO indicates a user I/O pin.                                                                                                                                               |
|                                         |              | <b>LXXY</b> indicates a differential pair, with <b>XX</b> a unique pair in the bank and <b>Y</b> = $P/N$ for the positive and negative sides of the differential pair.     |
|                                         |              | # indicates the bank number (0 through 7)                                                                                                                                  |
| <b>Dual-Function Pins</b>               |              |                                                                                                                                                                            |
| IO_LXXY_#/ZZZ                           |              | The dual-function pins are labelled " <b>IO_LXXY_#/ZZZ</b> ", where <b>ZZZ</b> can be one of the following pins:                                                           |
|                                         |              | Per Bank - VRP, VRN, or VREF                                                                                                                                               |
|                                         |              | Globally - GCLKX(S/P), BUSY/DOUT, INIT_B, DIN/D0 – D7, RDWR_B, or CS_B                                                                                                     |
| With /ZZZ:                              |              |                                                                                                                                                                            |
| DIN / D0, D1, D2,<br>D3, D4, D5, D6, D7 | Input/Output | In SelectMAP mode, D0 through D7 are configuration data pins. These pins become user I/Os after configuration, unless the SelectMAP port is retained.                      |
|                                         |              | In bit-serial modes, DIN (D0) is the single-data input. This pin becomes a user I/O after configuration.                                                                   |
| CS_B                                    | Input        | In SelectMAP mode, this is the active-low Chip Select signal. The pin becomes a user I/O after configuration, unless the SelectMAP port is retained.                       |
| RDWR_B                                  | Input        | In SelectMAP mode, this is the active-low Write Enable signal. The pin becomes a user I/O after configuration, unless the SelectMAP port is retained.                      |

1-800-255-7778

| Pin Name                    | Direction                     | Description                                                                                                                                                                                                                                                                           |
|-----------------------------|-------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| BUSY/DOUT                   | Output                        | In SelectMAP mode, BUSY controls the rate at which configuration data is loaded. The pin becomes a user I/O after configuration, unless the SelectMAP port is retained.                                                                                                               |
|                             |                               | In bit-serial modes, DOUT provides preamble and configuration data to down-<br>stream devices in a daisy-chain. The pin becomes a user I/O after configuration.                                                                                                                       |
| INIT_B                      | Bidirectional<br>(open-drain) | When Low, this pin indicates that the configuration memory is being cleared. When held Low, the start of configuration is delayed. During configuration, a Low on this output indicates that a configuration data error has occurred. The pin becomes a user I/O after configuration. |
| GCLKx (S/P)                 | Input                         | These are clock input pins that connect to Global Clock Buffers. These pins become regular user I/Os when not needed for clocks.                                                                                                                                                      |
| VRP                         | Input                         | This pin is for the DCI voltage reference resistor of P transistor (per bank).                                                                                                                                                                                                        |
| VRN                         | Input                         | This pin is for the DCI voltage reference resistor of N transistor (per bank).                                                                                                                                                                                                        |
| ALT_VRP                     | Input                         | This is the alternative pin for the DCI voltage reference resistor of P transistor.                                                                                                                                                                                                   |
| ALT_VRN                     | Input                         | This is the alternative pin for the DCI voltage reference resistor of N transistor.                                                                                                                                                                                                   |
| V <sub>REF</sub>            | Input                         | These are input threshold voltage pins. They become user I/Os when an external threshold voltage is not needed (per bank).                                                                                                                                                            |
| Dedicated Pins <sup>1</sup> |                               |                                                                                                                                                                                                                                                                                       |
| CCLK                        | Input/Output                  | Configuration clock. Output in Master mode or Input in Slave mode.                                                                                                                                                                                                                    |
| PROG_B                      | Input                         | Active Low asynchronous reset to configuration logic. This pin has a permanent weak pull-up resistor.                                                                                                                                                                                 |
| DONE                        | Input/Output                  | DONE is a bidirectional signal with an optional internal pull-up resistor. As an output, this pin indicates completion of the configuration process. As an input, a Low level on DONE can be configured to delay the start-up sequence.                                               |
| M2, M1, M0                  | Input                         | Configuration mode selection.                                                                                                                                                                                                                                                         |
| HSWAP_EN                    | Input                         | Enable I/O pullups during configuration.                                                                                                                                                                                                                                              |
| ТСК                         | Input                         | Boundary Scan Clock.                                                                                                                                                                                                                                                                  |
| TDI                         | Input                         | Boundary Scan Data Input.                                                                                                                                                                                                                                                             |
| TDO                         | Output                        | Boundary Scan Data Output.                                                                                                                                                                                                                                                            |
| TMS                         | Input                         | Boundary Scan Mode Select.                                                                                                                                                                                                                                                            |
| PWRDWN_B                    | Input                         | Active Low power down pin. PWRDWN_B should always be pulled High,<br>which is its default value. Driving this pin Low can affect device operation and<br>configuration. PWRDWN_B does not require an external pull-up or pull-down.                                                   |
| Other Pins                  |                               |                                                                                                                                                                                                                                                                                       |
| DXN, DXP                    | N/A                           | Temperature-sensing diode pins (Anode: DXP, Cathode: DXN).                                                                                                                                                                                                                            |
| V <sub>BATT</sub>           | Input                         | Decryptor key memory backup supply. (Do not connect if battery is not used.)                                                                                                                                                                                                          |
| RSVD                        | N/A                           | Reserved pin - do not connect.                                                                                                                                                                                                                                                        |
| V <sub>CCO</sub>            | Input                         | Power-supply pins for the output drivers (per bank).                                                                                                                                                                                                                                  |
| V <sub>CCAUX</sub>          | Input                         | Power-supply pins for auxiliary circuits.                                                                                                                                                                                                                                             |
| V <sub>CCINT</sub>          | Input                         | Power-supply pins for the internal core logic.                                                                                                                                                                                                                                        |
| GND                         | Input                         | Ground.                                                                                                                                                                                                                                                                               |

### Table 4-1: Virtex-II Pin Definitions (Continued)

### Notes:

1. All dedicated pins (JTAG and configuration) are powered by  $V_{CCAUX}$  (independent of the bank  $V_{CCO}$  voltage).

4

# FG256 Fine-Pitch BGA Package

As shown in Table 4-2, XC2V40, XC2V80, XC2V250, XC2V500, and XC2V1000 Virtex-II devices are available in the FG256 fine-pitch BGA package. Pins in the XC2V250, XC2V500, and XC2V1000 devices are the same. The No Connect column shows pin differences for the XC2V40 and XC2V80 devices.

The FG256 pinout information (Table 4-2) is included as an example. All Virtex-II pinout tables are available on the distribution CD-ROM, or on the web (at <u>http://www.xilinx.com</u>).

| Bank | Pin Description  | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |
|------|------------------|---------------|-------------------------|-------------------------|
| 0    | IO_L01N_0        | C4            |                         |                         |
| 0    | IO_L01P_0        | B4            |                         |                         |
| 0    | IO_L02N_0        | D5            |                         |                         |
| 0    | IO_L02P_0        | C5            |                         |                         |
| 0    | IO_L03N_0/VRP_0  | B5            |                         |                         |
| 0    | IO_L03P_0/VRN_0  | A5            |                         |                         |
| 0    | IO_L04N_0/VREF_0 | D6            | NC                      | NC                      |
| 0    | IO_L04P_0        | C6            | NC                      | NC                      |
| 0    | IO_L05N_0        | B6            | NC                      | NC                      |
| 0    | IO_L05P_0        | A6            | NC                      | NC                      |
| 0    | IO_L92N_0        | E6            | NC                      | NC                      |
| 0    | IO_L92P_0        | E7            | NC                      | NC                      |
| 0    | IO_L93N_0        | D7            | NC                      | NC                      |
| 0    | IO_L93P_0        | C7            | NC                      | NC                      |
| 0    | IO_L94N_0/VREF_0 | B7            |                         |                         |
| 0    | IO_L94P_0        | A7            |                         |                         |
| 0    | IO_L95N_0/GCLK7P | D8            |                         |                         |
| 0    | IO_L95P_0/GCLK6S | C8            |                         |                         |
| 0    | IO_L96N_0/GCLK5P | B8            |                         |                         |
| 0    | IO_L96P_0/GCLK4S | A8            |                         |                         |
|      |                  |               |                         |                         |
| 1    | IO_L96N_1/GCLK3P | A9            |                         |                         |
| 1    | IO_L96P_1/GCLK2S | B9            |                         |                         |
| 1    | IO_L95N_1/GCLK1P | C9            |                         |                         |
| 1    | IO_L95P_1/GCLK0S | D9            |                         |                         |
| 1    | IO_L94N_1        | A10           |                         |                         |
| 1    | IO_L94P_1/VREF_1 | B10           |                         |                         |
| 1    | IO_L93N_1        | C10           | NC                      | NC                      |
| 1    | IO_L93P_1        | D10           | NC                      | NC                      |
| 1    | IO_L92N_1        | E10           | NC                      | NC                      |
| 1    | IO_L92P_1        | E11           | NC                      | NC                      |
| 1    | IO_L05N_1        | A11           | NC                      | NC                      |
| 1    | IO_L05P_1        | B11           | NC                      | NC                      |

Table 4-2: FG256 BGA — XC2V40, XC2V80, XC2V250, XC2V500, and XC2V1000

| Table 4-2: | <i>-2:</i> <b>FG256 BGA</b> — XC2V40, XC2V80, XC2V250, XC2V500, and XC2V1000 |               |                         |                         |  |
|------------|------------------------------------------------------------------------------|---------------|-------------------------|-------------------------|--|
| Bank       | Pin Description                                                              | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |  |
| 1          | IO_L04N_1                                                                    | C11           | NC                      | NC                      |  |
| 1          | IO_L04P_1/VREF_1                                                             | D11           | NC                      | NC                      |  |
| 1          | IO_L03N_1/VRP_1                                                              | A12           |                         |                         |  |
| 1          | IO_L03P_1/VRN_1                                                              | B12           |                         |                         |  |
| 1          | IO_L02N_1                                                                    | C12           |                         |                         |  |
| 1          | IO_L02P_1                                                                    | D12           |                         |                         |  |
| 1          | IO_L01N_1                                                                    | B13           |                         |                         |  |
| 1          | IO_L01P_1                                                                    | C13           |                         |                         |  |
|            |                                                                              |               |                         |                         |  |
| 2          | IO_L01N_2                                                                    | C16           |                         |                         |  |
| 2          | IO_L01P_2                                                                    | D16           |                         |                         |  |
| 2          | IO_L02N_2/VRP_2                                                              | D14           |                         |                         |  |
| 2          | IO_L02P_2/VRN_2                                                              | D15           |                         |                         |  |
| 2          | IO_L03N_2                                                                    | E13           |                         |                         |  |
| 2          | IO_L03P_2/VREF_2                                                             | E14           |                         |                         |  |
| 2          | IO_L04N_2                                                                    | E15           | NC                      |                         |  |
| 2          | IO_L04P_2                                                                    | E16           | NC                      |                         |  |
| 2          | IO_L06N_2                                                                    | F13           | NC                      |                         |  |
| 2          | IO_L06P_2                                                                    | F14           | NC                      |                         |  |
| 2          | IO_L43N_2                                                                    | F15           | NC                      | NC                      |  |
| 2          | IO_L43P_2                                                                    | F16           | NC                      | NC                      |  |
| 2          | IO_L45N_2                                                                    | F12           | NC                      | NC                      |  |
| 2          | IO_L45P_2/VREF_2                                                             | G12           | NC                      | NC                      |  |
| 2          | IO_L91N_2                                                                    | G13           | NC                      |                         |  |
| 2          | IO_L91P_2                                                                    | G14           | NC                      |                         |  |
| 2          | IO_L93N_2                                                                    | G15           | NC                      |                         |  |
| 2          | IO_L93P_2/VREF_2                                                             | G16           | NC                      |                         |  |
| 2          | IO_L94N_2                                                                    | H13           |                         |                         |  |
| 2          | IO_L94P_2                                                                    | H14           |                         |                         |  |
| 2          | IO_L96N_2                                                                    | H15           |                         |                         |  |
| 2          | IO_L96P_2                                                                    | H16           |                         |                         |  |
|            |                                                                              |               |                         |                         |  |
| 3          | IO_L96N_3                                                                    | J16           |                         |                         |  |
| 3          | IO_L96P_3                                                                    | J15           |                         |                         |  |
| 3          | IO_L94N_3                                                                    | J14           |                         |                         |  |
| 3          | IO_L94P_3                                                                    | J13           |                         |                         |  |
| 3          | IO_L93N_3/VREF_3                                                             | K16           | NC                      |                         |  |
| 3          | IO_L93P_3                                                                    | K15           | NC                      |                         |  |
| 3          | IO_L91N_3                                                                    | K14           | NC                      |                         |  |

|      | 2. T G250 DGA — XC2 V40, XC2 V60, XC2 V250, XC2 V500, and XC2 V1000 |               |                         |                         |  |
|------|---------------------------------------------------------------------|---------------|-------------------------|-------------------------|--|
| Bank | Pin Description                                                     | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |  |
| 3    | IO_L91P_3                                                           | K13           | NC                      |                         |  |
| 3    | IO_L45N_3/VREF_3                                                    | K12           | NC                      | NC                      |  |
| 3    | IO_L45P_3                                                           | L12           | NC                      | NC                      |  |
| 3    | IO_L43N_3                                                           | L16           | NC                      | NC                      |  |
| 3    | IO_L43P_3                                                           | L15           | NC                      | NC                      |  |
| 3    | IO_L06N_3                                                           | L14           | NC                      |                         |  |
| 3    | IO_L06P_3                                                           | L13           | NC                      |                         |  |
| 3    | IO_L04N_3                                                           | M16           | NC                      |                         |  |
| 3    | IO_L04P_3                                                           | M15           | NC                      |                         |  |
| 3    | IO_L03N_3/VREF_3                                                    | M14           |                         |                         |  |
| 3    | IO_L03P_3                                                           | M13           |                         |                         |  |
| 3    | IO_L02N_3/VRP_3                                                     | N15           |                         |                         |  |
| 3    | IO_L02P_3/VRN_3                                                     | N14           |                         |                         |  |
| 3    | IO_L01N_3                                                           | N16           |                         |                         |  |
| 3    | IO_L01P_3                                                           | P16           |                         |                         |  |
| t    |                                                                     |               |                         |                         |  |
| 4    | IO_L01N_4/DOUT                                                      | T14           |                         |                         |  |
| 4    | IO_L01P_4/INIT_B                                                    | T13           |                         |                         |  |
| 4    | IO_L02N_4/D0                                                        | P13           |                         |                         |  |
| 4    | IO_L02P_4/D1                                                        | R13           |                         |                         |  |
| 4    | IO_L03N_4/D2/ALT_VRP_4                                              | N12           |                         |                         |  |
| 4    | IO_L03P_4/D3/ALT_VRN_4                                              | P12           |                         |                         |  |
| 4    | IO_L04N_4/VREF_4                                                    | R12           | NC                      | NC                      |  |
| 4    | IO_L04P_4                                                           | T12           | NC                      | NC                      |  |
| 4    | IO_L05N_4/VRP_4                                                     | N11           | NC                      | NC                      |  |
| 4    | IO_L05P_4/VRN_4                                                     | P11           | NC                      | NC                      |  |
| 4    | IO_L91N_4/VREF_4                                                    | R11           | NC                      | NC                      |  |
| 4    | IO_L91P_4                                                           | T11           | NC                      | NC                      |  |
| 4    | IO_L92N_4                                                           | M11           | NC                      | NC                      |  |
| 4    | IO_L92P_4                                                           | M10           | NC                      | NC                      |  |
| 4    | IO_L93N_4                                                           | N10           | NC                      | NC                      |  |
| 4    | IO_L93P_4                                                           | P10           | NC                      | NC                      |  |
| 4    | IO_L94N_4/VREF_4                                                    | R10           |                         |                         |  |
| 4    | IO_L94P_4                                                           | T10           |                         |                         |  |
| 4    | IO_L95N_4/GCLK3S                                                    | N9            |                         |                         |  |
| 4    | IO_L95P_4/GCLK2P                                                    | P9            |                         |                         |  |
| 4    | IO_L96N_4/GCLK1S                                                    | R9            |                         |                         |  |
| 4    | IO_L96P_4/GCLK0P                                                    | T9            |                         |                         |  |
|      |                                                                     |               |                         |                         |  |

Table A-2. EG256 BGA XC2V/0 XC2V80 XC2V250 XC2V500 and XC2V1000

| Bank | Pin Description        | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |
|------|------------------------|---------------|-------------------------|-------------------------|
| 5    | IO_L96N_5/GCLK7S       | T8            |                         |                         |
| 5    | IO_L96P_5/GCLK6P       | R8            |                         |                         |
| 5    | IO_L95N_5/GCLK5S       | P8            |                         |                         |
| 5    | IO_L95P_5/GCLK4P       | N8            |                         |                         |
| 5    | IO_L94N_5              | T7            |                         |                         |
| 5    | IO_L94P_5/VREF_5       | R7            |                         |                         |
| 5    | IO_L93N_5              | P7            | NC                      | NC                      |
| 5    | IO_L93P_5              | N7            | NC                      | NC                      |
| 5    | IO_L92N_5              | M7            | NC                      | NC                      |
| 5    | IO_L92P_5              | M6            | NC                      | NC                      |
| 5    | IO_L91N_5              | T6            | NC                      | NC                      |
| 5    | IO_L91P_5/VREF_5       | R6            | NC                      | NC                      |
| 5    | IO_L05N_5/VRP_5        | P6            | NC                      | NC                      |
| 5    | IO_L05P_5/VRN_5        | N6            | NC                      | NC                      |
| 5    | IO_L04N_5              | T5            | NC                      | NC                      |
| 5    | IO_L04P_5/VREF_5       | R5            | NC                      | NC                      |
| 5    | IO_L03N_5/D4/ALT_VRP_5 | P5            |                         |                         |
| 5    | IO_L03P_5/D5/ALT_VRN_5 | N5            |                         |                         |
| 5    | IO_L02N_5/D6           | R4            |                         |                         |
| 5    | IO_L02P_5/D7           | P4            |                         |                         |
| 5    | IO_L01N_5/RDWR_B       | T4            |                         |                         |
| 5    | IO_L01P_5/CS_B         | T3            |                         |                         |
|      |                        |               |                         |                         |
| 6    | IO_L01P_6              | P1            |                         |                         |
| 6    | IO_L01N_6              | N1            |                         |                         |
| 6    | IO_L02P_6/VRN_6        | N3            |                         |                         |
| 6    | IO_L02N_6/VRP_6        | N2            |                         |                         |
| 6    | IO_L03P_6              | M4            |                         |                         |
| 6    | IO_L03N_6/VREF_6       | M3            |                         |                         |
| 6    | IO_L04P_6              | M2            | NC                      |                         |
| 6    | IO_L04N_6              | M1            | NC                      |                         |
| 6    | IO_L06P_6              | L4            | NC                      |                         |
| 6    | IO_L06N_6              | L3            | NC                      |                         |
| 6    | IO_L43P_6              | L2            | NC                      | NC                      |
| 6    | IO_L43N_6              | L1            | NC                      | NC                      |
| 6    | IO_L45P_6              | L5            | NC                      | NC                      |
| 6    | IO_L45N_6/VREF_6       | K5            | NC                      | NC                      |
| 6    | IO_L91P_6              | K4            | NC                      |                         |
| 6    | IO_L91N_6              | К3            | NC                      |                         |

| <i>iable 4-2:</i> | 4-2: FG256 BGA — XC2V40, XC2V80, XC2V250, XC2V500, and XC2V1000 |               |                         |                         |  |
|-------------------|-----------------------------------------------------------------|---------------|-------------------------|-------------------------|--|
| Bank              | Pin Description                                                 | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |  |
| 6                 | IO_L93P_6                                                       | K2            | NC                      |                         |  |
| 6                 | IO_L93N_6/VREF_6                                                | K1            | NC                      |                         |  |
| 6                 | IO_L94P_6                                                       | J4            |                         |                         |  |
| 6                 | IO_L94N_6                                                       | J3            |                         |                         |  |
| 6                 | IO_L96P_6                                                       | J2            |                         |                         |  |
| 6                 | IO_L96N_6                                                       | J1            |                         |                         |  |
|                   |                                                                 |               |                         |                         |  |
| 7                 | IO_L96P_7                                                       | H1            |                         |                         |  |
| 7                 | IO_L96N_7                                                       | H2            |                         |                         |  |
| 7                 | IO_L94P_7                                                       | H3            |                         |                         |  |
| 7                 | IO_L94N_7                                                       | H4            |                         |                         |  |
| 7                 | IO_L93P_7/VREF_7                                                | G1            | NC                      |                         |  |
| 7                 | IO_L93N_7                                                       | G2            | NC                      |                         |  |
| 7                 | IO_L91P_7                                                       | G3            | NC                      |                         |  |
| 7                 | IO_L91N_7                                                       | G4            | NC                      |                         |  |
| 7                 | IO_L45P_7/VREF_7                                                | G5            | NC                      | NC                      |  |
| 7                 | IO_L45N_7                                                       | F5            | NC                      | NC                      |  |
| 7                 | IO_L43P_7                                                       | F1            | NC                      | NC                      |  |
| 7                 | IO_L43N_7                                                       | F2            | NC                      | NC                      |  |
| 7                 | IO_L06P_7                                                       | F3            | NC                      |                         |  |
| 7                 | IO_L06N_7                                                       | F4            | NC                      |                         |  |
| 7                 | IO_L04P_7                                                       | E1            | NC                      |                         |  |
| 7                 | IO_L04N_7                                                       | E2            | NC                      |                         |  |
| 7                 | IO_L03P_7/VREF_7                                                | E3            |                         |                         |  |
| 7                 | IO_L03N_7                                                       | E4            |                         |                         |  |
| 7                 | IO_L02P_7/VRN_7                                                 | D2            |                         |                         |  |
| 7                 | IO_L02N_7/VRP_7                                                 | D3            |                         |                         |  |
| 7                 | IO_L01P_7                                                       | D1            |                         |                         |  |
| 7                 | IO_L01N_7                                                       | C1            |                         |                         |  |
| · · · ·           |                                                                 |               |                         |                         |  |
| 0                 | VCCO_0                                                          | F8            |                         |                         |  |
| 0                 | VCCO_0                                                          | F7            |                         |                         |  |
| 0                 | VCCO_0                                                          | E8            |                         |                         |  |
| 1                 | VCCO_1                                                          | F10           |                         |                         |  |
| 1                 | VCCO_1                                                          | F9            |                         |                         |  |
| 1                 | VCCO_1                                                          | E9            |                         |                         |  |
| 2                 | VCCO_2                                                          | H12           |                         |                         |  |
| 2                 | VCCO_2                                                          | H11           |                         |                         |  |
| 2                 | VCCO_2                                                          | G11           |                         |                         |  |

Table ECOSE BCA VC2V/0 VC2V90 VC2V250 VC2V500 10

| Bank | Pin Description | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |
|------|-----------------|---------------|-------------------------|-------------------------|
| 3    | VCCO_3          | K11           |                         |                         |
| 3    | VCCO_3          | J12           |                         |                         |
| 3    | VCCO_3          | J11           |                         |                         |
| 4    | VCCO_4          | M9            |                         |                         |
| 4    | VCCO_4          | L10           |                         |                         |
| 4    | VCCO_4          | L9            |                         |                         |
| 5    | VCCO_5          | M8            |                         |                         |
| 5    | VCCO_5          | L8            |                         |                         |
| 5    | VCCO_5          | L7            |                         |                         |
| 6    | VCCO_6          | K6            |                         |                         |
| 6    | VCCO_6          | J6            |                         |                         |
| 6    | VCCO_6          | J5            |                         |                         |
| 7    | VCCO_7          | H6            |                         |                         |
| 7    | VCCO_7          | H5            |                         |                         |
| 7    | VCCO_7          | G6            |                         |                         |
|      |                 |               |                         | 1                       |
| NA   | CCLK            | P15           |                         |                         |
| NA   | PROG_B          | A2            |                         |                         |
| NA   | DONE            | R14           |                         |                         |
| NA   | M0              | T2            |                         |                         |
| NA   | M1              | P2            |                         |                         |
| NA   | M2              | R3            |                         |                         |
| NA   | HSWAP_EN        | B3            |                         |                         |
| NA   | ТСК             | A15           |                         |                         |
| NA   | TDI             | C2            |                         |                         |
| NA   | TDO             | C15           |                         |                         |
| NA   | TMS             | B14           |                         |                         |
| NA   | PWRDWN_B        | T15           |                         |                         |
| NA   | RSVD            | A4            |                         |                         |
| NA   | RSVD            | A3            |                         |                         |
| NA   | VBATT           | A14           |                         |                         |
| NA   | RSVD            | A13           |                         |                         |
|      |                 |               |                         |                         |
| NA   | VCCAUX          | R16           |                         |                         |
| NA   | VCCAUX          | R1            |                         |                         |
| NA   | VCCAUX          | B16           |                         |                         |
| NA   | VCCAUX          | B1            |                         |                         |
| NA   | VCCINT          | N13           |                         |                         |
| NA   | VCCINT          | N4            |                         |                         |

| Bank | Pin Description | Pin<br>Number | No Connect<br>in XC2V40 | No Connect<br>in XC2V80 |
|------|-----------------|---------------|-------------------------|-------------------------|
| NA   | VCCINT          | M12           |                         |                         |
| NA   | VCCINT          | M5            |                         |                         |
| NA   | VCCINT          | E12           |                         |                         |
| NA   | VCCINT          | E5            |                         |                         |
| NA   | VCCINT          | D13           |                         |                         |
| NA   | VCCINT          | D4            |                         |                         |
|      |                 |               |                         |                         |
| NA   | GND             | T16           |                         |                         |
| NA   | GND             | T1            |                         |                         |
| NA   | GND             | R15           |                         |                         |
| NA   | GND             | R2            |                         |                         |
| NA   | GND             | P14           |                         |                         |
| NA   | GND             | P3            |                         |                         |
| NA   | GND             | L11           |                         |                         |
| NA   | GND             | L6            |                         |                         |
| NA   | GND             | K10           |                         |                         |
| NA   | GND             | К9            |                         |                         |
| NA   | GND             | K8            |                         |                         |
| NA   | GND             | K7            |                         |                         |
| NA   | GND             | J10           |                         |                         |
| NA   | GND             | J9            |                         |                         |
| NA   | GND             | J8            |                         |                         |
| NA   | GND             | J7            |                         |                         |
| NA   | GND             | H10           |                         |                         |
| NA   | GND             | H9            |                         |                         |
| NA   | GND             | H8            |                         |                         |
| NA   | GND             | H7            |                         |                         |
| NA   | GND             | G10           |                         |                         |
| NA   | GND             | G9            |                         |                         |
| NA   | GND             | G8            |                         |                         |
| NA   | GND             | G7            |                         |                         |
| NA   | GND             | F11           |                         |                         |
| NA   | GND             | F6            |                         |                         |
| NA   | GND             | C14           |                         |                         |
| NA   | GND             | C3            |                         |                         |
| NA   | GND             | B15           |                         |                         |
| NA   | GND             | B2            |                         |                         |
| NA   | GND             | A16           |                         |                         |
| NA   | GND             | A1            |                         |                         |

Table 4-2. FG256 BGA XC2V40, XC2V80, XC2V250, XC2V500, and XC2V1000

# **Pinout Diagrams**

This section contains pinout diagrams for the largest device in each of the following Virtex-II packages:

- "CS144 Chip-Scale BGA Composite Pinout Diagram (XC2V250)" on page 312
- "FG256 Fine-Pitch BGA Composite Pinout Diagram (XC2V1000)" on page 313
  - FG256 Bank Information
  - FG256 Dedicated Pins
- "FG456 Fine-Pitch BGA Composite Pinout Diagram (XC2V1000)" on page 317
  - FG456 Bank Information
  - FG456 Dedicated Pins
- "FG676 Fine-Pitch BGA Composite Pinout Diagram (XC2V3000)" on page 321
  - FG676 Bank Information
  - FG676 Dedicated Pins
- "BG575 Standard BGA Composite Pinout Diagram (XC2V2000)" on page 325
  - BG575 Bank Information
  - BG575 Dedicated Pins
- "BG728 Standard BGA Composite Pinout Diagram (XC2V3000)" on page 329
  - BG728 Bank Information
  - BG728 Dedicated Pins
- "FF896 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V2000)" on page 333
  - FF896 Bank Information
  - FF896 Dedicated Pins
- "FF1152 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V8000)" on page 337
  - FF1152 Bank Information
  - FF1152 Dedicated Pins
- "FF1517 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V8000)" on page 341
  - FF1517 Bank Information
  - FF1517 Dedicated Pins
- "BF957 Flip-Chip BGA Composite Pinout Diagram (XC2V6000)" on page 345
  - BF957 Bank Information
  - BF957 Dedicated Pins
- "FG456 FG676 Pinout Compatibility Diagram" on page 348
- "FF896 FF1152 Pinout Compatibility Diagram" on page 349

# CS144 Chip-Scale BGA Composite Pinout Diagram (XC2V250)



Figure 4-1: CS144 Chip-Scale BGA Composite Pinout Diagram (XC2V250)

# FG256 Fine-Pitch BGA Composite Pinout Diagram (XC2V1000)



| User       | r I/O Pins      |     | Dedicated Pins |   |                    |
|------------|-----------------|-----|----------------|---|--------------------|
| 0          | IO_LXXY_#       | С   | CCLK           |   |                    |
| Dual       | -Purpose Pins:  | Р   | PROG_B         |   |                    |
| •          | DIN/D0-D7       | D   | DONE           | Ħ | VBATT              |
| •          | CS_B            | 210 | M2, M1, M0     | R | RSVD               |
| ۲          | RDWR_B          | H   | HSWAP_EN       |   | vcco               |
| igodot     | BUSY/DOUT       | K   | тск            |   | VCCAUX             |
| ●          | INIT_B          |     | TDI            |   | VCCINT             |
| $\bigcirc$ | GCLKx (P)       | O   | TDO            |   | GND                |
| $\oplus$   | GCLKx (S)       | Μ   | TMS            | n | NO CONNECT         |
| $\ominus$  | VRP             | W   | PWRDWN_B       |   |                    |
| $\oplus$   | VRN             |     |                |   |                    |
| $\otimes$  | VREF            |     |                |   |                    |
| Triple     | e-Purpose Pins: |     |                |   |                    |
| €          | D2, D4/ALT_VRP  |     |                |   |                    |
| $\oplus$   | D3, D5/ALT_VRN  |     |                |   |                    |
|            |                 |     |                |   | ua002 c4 47 031501 |

Figure 4-2: FG256 Fine-Pitch BGA Composite Pinout Diagram (XC2V1000)

4

## FG256 Bank Information



| Use        | er I/O Pins      | Dedicated P | lins               |
|------------|------------------|-------------|--------------------|
| 0          | IO_LXXY_#        |             |                    |
| Dua        | al-Purpose Pins: |             |                    |
| ۲          | DIN/D0-D7        |             |                    |
| 0          | CS_B             |             |                    |
| ۲          | RDWR_B           |             | VCCO               |
| igodot     | BUSY/DOUT        |             |                    |
| ●          | INIT_B           |             |                    |
| $\bigcirc$ | GCLKx (P)        |             |                    |
| $\oplus$   | GCLKx (S)        |             |                    |
| $\ominus$  | VRP              |             |                    |
| $\oplus$   | VRN              |             |                    |
| $\otimes$  | VREF             |             |                    |
| Trip       | le-Purpose Pins: |             |                    |
| €          | D2, D4/ALT_VRP   |             |                    |
| $\oplus$   | D3, D5/ALT_VRN   |             |                    |
|            |                  |             | ug002_c4_47b_03150 |



## FG256 Dedicated Pins

FG256 - Top View



| User I/O Pins | Dedicated Pins |            |                |            |
|---------------|----------------|------------|----------------|------------|
|               | С              | CCLK       |                |            |
|               | Р              | PROG_B     |                |            |
|               | D              | DONE       | $\blacksquare$ | VBATT      |
|               | 210            | M2, M1, M0 | R              | RSVD       |
|               | H              | HSWAP_EN   |                |            |
|               | K              | тск        |                | VCCAUX     |
|               |                | TDI        |                | VCCINT     |
|               | O              | TDO        |                | GND        |
|               | Μ              | TMS        | n              | NO CONNECT |
|               | W              | PWRDWN_B   |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                |            |

ug002\_c4\_47c\_120400

Figure 4-4: FG256 Dedicated Pins

## FG456 Fine-Pitch BGA Composite Pinout Diagram (XC2V1000)



Figure 4-5: FG456 Fine-Pitch BGA Composite Pinout Diagram (XC2V1000)

4

## FG456 Bank Information



Figure 4-6: FG456 Bank Information

FG456 Dedicated Pins



Ι

0

M

TDI TDO

TMS

PWRDWN\_B

ug002\_c4\_48c\_120400

n

VCCINT

NO CONNECT

GND

Figure 4-7: FG456 Dedicated Pins

www.xilinx.com 1-800-255-7778

# FG676 Fine-Pitch BGA Composite Pinout Diagram (XC2V3000)



ug002\_c4\_49\_031501

### Figure 4-8: FG676 Fine-Pitch BGA Composite Pinout Diagram (XC2V3000)

www.xilinx.com 1-800-255-7778

## FG676 Bank Information



ug002\_c4\_49b\_031501



www.xilinx.com

1-800-255-7778

## FG676 Dedicated Pins



| User I/O Pins |     | Dedicated P | Pins |            |
|---------------|-----|-------------|------|------------|
|               | С   | CCLK        | Ν    | DXN        |
|               | Р   | PROG_B      | Α    | DXP        |
|               | D   | DONE        | E    | VBATT      |
|               | 210 | M2, M1, M0  | R    | RSVD       |
|               | H   | HSWAP_EN    |      |            |
|               | K   | тск         |      | VCCAUX     |
|               |     | TDI         |      | VCCINT     |
|               | O   | TDO         |      | GND        |
|               | Μ   | TMS         | n    | NO CONNECT |
|               | W   | PWRDWN_B    |      |            |
|               |     |             |      |            |
|               |     |             |      |            |
|               |     |             |      |            |
|               |     |             |      |            |
|               |     |             |      |            |

ug002\_c4\_49c\_120400

Figure 4-10: FG676 Dedicated Pins
### BG575 Standard BGA Composite Pinout Diagram (XC2V2000)



ug002\_c4\_50\_031501

#### Figure 4-11: BG575 Standard BGA Composite Pinout Diagram (XC2V2000)

www.xilinx.com 1-800-255-7778

#### BG575 Bank Information



ug002\_c4\_50b\_031501

Figure 4-12: BG575 Bank Information

www.xilinx.com

1-800-255-7778

#### **BG575 Dedicated Pins**



BG575 - Top View

| User I/O Pins | Dedicated Pins |            |   |            |  |  |
|---------------|----------------|------------|---|------------|--|--|
|               | C              | CCLK       | N | DXN        |  |  |
|               | Р              | PROG_B     | A | DXP        |  |  |
|               | D              | DONE       |   | VBATT      |  |  |
|               | 210            | M2, M1, M0 | R | RSVD       |  |  |
|               | H              | HSWAP_EN   |   |            |  |  |
|               | K              | тск        |   | VCCAUX     |  |  |
|               |                | TDI        |   | VCCINT     |  |  |
|               | O              | TDO        |   | GND        |  |  |
|               | Μ              | TMS        | n | NO CONNECT |  |  |
|               | W              | PWRDWN_B   |   |            |  |  |
|               |                |            |   |            |  |  |
|               |                |            |   |            |  |  |
|               |                |            |   |            |  |  |
|               |                |            |   |            |  |  |
|               |                |            |   |            |  |  |

ug002\_c4\_50c\_120400

#### Figure 4-13: BG575 Dedicated Pins

4

# BG728 Standard BGA Composite Pinout Diagram (XC2V3000)



#### Figure 4-14: BG728 Standard BGA Composite Pinout Diagram (XC2V3000)

www.xilinx.com 1-800-255-7778

#### BG728 Bank Information



Figure 4-15: BG728 Bank Information

#### **BG728 Dedicated Pins**



BG728 - Top View

| User I/O Pins | Dedicated Pins |            |   |            |  |
|---------------|----------------|------------|---|------------|--|
|               | С              | CCLK       | Ν | DXN        |  |
|               | Р              | PROG_B     | Α | DXP        |  |
|               | D              | DONE       | E | VBATT      |  |
|               | 210            | M2, M1, M0 | R | RSVD       |  |
|               | Η              | HSWAP_EN   |   |            |  |
|               | K              | тск        |   | VCCAUX     |  |
|               |                | TDI        |   | VCCINT     |  |
|               | O              | TDO        |   | GND        |  |
|               | Μ              | TMS        | n | NO CONNECT |  |
|               | W              | PWRDWN_B   |   |            |  |
|               |                |            |   |            |  |
|               |                |            |   |            |  |
|               |                |            |   |            |  |
|               |                |            |   |            |  |
|               |                |            |   |            |  |

ug002\_c4\_51c\_120400

#### Figure 4-16: BG728 Dedicated Pins

4

# FF896 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V2000)



ug002\_c4\_52\_031501

*Figure 4-17:* **FF896 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V2000)** 

#### FF896 Bank Information



ug002\_c4\_52b\_031501



#### FF896 Dedicated Pins



| User I/O Pins | Dedicated Pins |            |                |            |  |
|---------------|----------------|------------|----------------|------------|--|
|               | C              | CCLK       | Ν              | DXN        |  |
|               | Р              | PROG_B     | Α              | DXP        |  |
|               | D              | DONE       | $\blacksquare$ | VBATT      |  |
|               | 210            | M2, M1, M0 | R              | RSVD       |  |
|               | H              | HSWAP_EN   |                |            |  |
|               | K              | тск        |                | VCCAUX     |  |
|               |                | TDI        |                | VCCINT     |  |
|               | O              | TDO        |                | GND        |  |
|               | Μ              | TMS        | n              | NO CONNECT |  |
|               | W              | PWRDWN_B   |                |            |  |
|               |                |            |                |            |  |
|               |                |            |                |            |  |
|               |                |            |                |            |  |
|               |                |            |                |            |  |
|               |                |            |                |            |  |

ug002\_c4\_52c\_120400

#### Figure 4-19: FF896 Dedicated Pins

# FF1152 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V8000)



ug002 c4 53 031501

Figure 4-20: FF1152 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V8000)

VRN

VREF

Triple-Purpose Pins: D2, D4/ALT\_VRP

D3, D5/ALT\_VRN

 $\oplus$ 

 $\otimes$ 

 $\odot$ 

 $\oplus$ 





| Dua        | I-Purpose Pins:<br>DIN/D0-D7 |      |
|------------|------------------------------|------|
| •          | CS_B                         |      |
| 8          | RDWR_B                       | VCCO |
| igodot     | BUSY/DOUT                    |      |
| O          | INIT_B                       |      |
| $\bigcirc$ | GCLKx (P)                    |      |
| $\oplus$   | GCLKx (S)                    |      |
| $\ominus$  | VRP                          |      |
| $\oplus$   | VRN                          |      |
| $\otimes$  | VREF                         |      |
| Tripl      | e-Purpose Pins:              |      |
| $\odot$    | D2, D4/ALT_VRP               |      |
| Ð          | D3, D5/ALT_VRN               |      |

ug002\_c4\_53b\_031501

Figure 4-21: FF1152 Bank Information

www.xilinx.com

1-800-255-7778

### FF1152 Dedicated Pins



| User I/O Pins | Dedicated Pins |            |                |            |
|---------------|----------------|------------|----------------|------------|
|               | C              | CCLK       | Ν              | DXN        |
|               | Ρ              | PROG_B     | Α              | DXP        |
|               | D              | DONE       | $\blacksquare$ | VBATT      |
|               | 210            | M2, M1, M0 | R              | RSVD       |
|               | Η              | HSWAP_EN   |                |            |
|               | K              | тск        |                | VCCAUX     |
|               |                | TDI        |                | VCCINT     |
|               | 0              | TDO        |                | GND        |
|               | Μ              | TMS        | n              | NO CONNECT |
|               | W              | PWRDWN_B   |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                |            |
|               |                |            |                | ua         |

Figure 4-22: FF1152 Dedicated Pins

# XILINX<sup>®</sup>

# FF1517 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V8000)



Figure 4-23: FF1517 Flip-Chip Fine-Pitch BGA Composite Pinout Diagram (XC2V8000)

FF1517 Bank Information



Figure 4-24: FF1517 Bank Information

www.xilinx.com

1-800-255-7778

### FF1517 Dedicated Pins





# BF957 Flip-Chip BGA Composite Pinout Diagram (XC2V6000)



Figure 4-26: BF957 Flip-Chip BGA Composite Pinout Diagram (XC2V6000)

#### **BF957 Bank Information**





#### **BF957 Dedicated Pins**



| User I/O Pins |     | Dedicated Pins |   |            |  |
|---------------|-----|----------------|---|------------|--|
|               | C   | CCLK           | N | DXN        |  |
|               | Р   | PROG_B         | Α | DXP        |  |
|               | D   | DONE           | E | VBATT      |  |
|               | 210 | M2, M1, M0     | R | RSVD       |  |
|               | H   | HSWAP_EN       |   |            |  |
|               | K   | тск            |   | VCCAUX     |  |
|               |     | TDI            |   | VCCINT     |  |
|               | O   | TDO            |   | GND        |  |
|               | Μ   | TMS            | n | NO CONNECT |  |
|               | W   | PWRDWN_B       |   |            |  |
|               |     |                |   |            |  |
|               |     |                |   |            |  |
|               |     |                |   |            |  |
|               |     |                |   |            |  |
|               |     |                |   |            |  |
|               |     |                |   | ug002_c4_5 |  |

Figure 4-28: BF957 Dedicated Pins

### FG456 - FG676 Pinout Compatibility Diagram



**Note:** FF456 and FG676 are pinout compatible with with the exception of the LVDS pairs. I/O  $V_{REF}$  pins in FG676 are user I/O pins in FG456. In addition, some user I/O pins are not in the same bank (see  $\bigcirc$  lines). VRP (V7) and VRN (V6) in Bank 5 and VRP (W17) and VRN (Y17) in Bank 4 are only user I/Os in FG676.



ug002\_c4\_56\_080601

#### Figure 4-29: FG456 - FG676 Pinout Compatibility Diagram

#### FF1152/FF896 Pinout Compatibility Diagram FF896 $1 \ {}^{2} \ {}^{3} \ {}^{4} \ {}^{5} \ {}^{6} \ {}^{7} \ {}^{8} \ {}^{9} \ {}^{10} 11 \ {}^{12} 13 \ {}^{14} 15 \ {}^{16} 17 \ {}^{18} 19 \ {}^{20} 21 \ {}^{22} 23 \ {}^{24} 25 \ {}^{26} 27 \ {}^{28} 29 \ {}^{30}$ FF1152 R C В DOC В EÓČ С С D D Е )()F G $\otimes \square$ F F G G н н κ $\mathcal{O}\mathcal{O}$ κ $\supset \otimes$ L Μ 00 М Ν R Ν R R U w U V Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y Y $\otimes$ Y v $\supset \bigcirc AA W$ DO AB Υ $\bigcirc \bigcirc$ ac aa DO AD AB AF AC OO AF AD

FF896 - FF1152 Pinout Compatibility Diagram

Note: FF896 is pinout compatible with the FF1152 except for LVDS pairs. Also, in Bank 4, VRP/VRN pins are not compatible: for FF896, VRP is in AC10 and VRN is in AC11, and for FF1152, VRP is in AK9 and VRN is in AJ8. If DCI is not used in Bank 4, or is used with ALT\_VRP or ALT\_VRN, then the user I/Os are compatible.

| Use                          | er I/O Pins      |     | Dedicated F | Pins      |            |            |                  |
|------------------------------|------------------|-----|-------------|-----------|------------|------------|------------------|
| 0                            | IO_LXXY_#        | C   | CCLK        | N         | DXN        | Correspond | ling Pinouts     |
| Dua                          | al-Purpose Pins: | Р   | PROG_B      | Α         | DXP        | ГГООС      |                  |
| •                            | DIN/D0-D7        | D   | DONE        | $\square$ | VBATT      | FF090      | FF1152           |
|                              | CS_B             | 210 | M2, M1, M0  | R         | RSVD       | A2         | C4               |
|                              | RDWR_B           | H   | HSWAP_EN    |           | VCCO       |            |                  |
|                              | BUSY/DOUT        | K   | тск         |           | VCCAUX     | -          | •                |
|                              | INIT_B           |     | TDI         |           | VCCINT     | •          | •                |
| $\circ$                      | GCLKx (P)        | O   | TDO         |           | GND        | •          | •                |
| $\square$                    | GCLKx (S)        | Μ   | TMS         | n         | NO CONNECT |            |                  |
| $\Theta$                     | VRP              | M   | PWRDWN_B    |           |            | AK29       | AM31             |
| $\oplus$                     | VRN              |     |             |           |            |            |                  |
| $\otimes$                    | VREF             |     |             |           |            |            |                  |
| Trip                         | le-Purpose Pins: |     |             |           |            |            |                  |
| $\overline{\mathbf{\Theta}}$ | D2, D4/ALT_VRP   |     |             |           |            |            |                  |
| Ð                            | D3, D5/ALT_VRN   |     |             |           |            |            | ug002_c4_55_0329 |

Figure 4-30: FF896 - FF1152 Pinout Compatibility Diagram

# **Package Specifications**

This section contains specifications for the following Virtex-II packages:

- "CS144 Chip-Scale BGA Package (0.80 mm Pitch)" on page 352
- "FG256 Fine-Pitch BGA Package (1.00 mm Pitch)" on page 353
- "FG456 Fine-Pitch BGA Package (1.00 mm Pitch)" on page 354
- "FG676 Fine-Pitch BGA Package (1.00 mm Pitch)" on page 355
- "BG575 Standard BGA Package (1.27 mm Pitch)" on page 356
- "BG728 Standard BGA Package (1.27 mm Pitch)" on page 357
- "FF896 Flip-Chip Fine-Pitch BGA Package (1.00 mm Pitch)" on page 358
- "FF1152 Flip-Chip Fine-Pitch BGA Package (1.00 mm Pitch)" on page 359
- "FF1517 Flip-Chip Fine-Pitch BGA Package (1.00 mm Pitch)" on page 360
- "BF957 Flip-Chip BGA Package (1.27 mm Pitch)" on page 361

# CS144 Chip-Scale BGA Package (0.80 mm Pitch)





### FG256 Fine-Pitch BGA Package (1.00 mm Pitch)

#### BOTTOM VIEW

<u>TOP VIEW</u>





# FG456 Fine-Pitch BGA Package (1.00 mm Pitch)





FG676 Fine-Pitch BGA Package (1.00 mm Pitch)



Figure 4-34: FG676 Fine-Pitch BGA Package

www.xilinx.com 1-800-255-7778

### BG575 Standard BGA Package (1.27 mm Pitch)



www.xilinx.com

1-800-255-7778

# XILINX®





# FF896 Flip-Chip Fine-Pitch BGA Package (1.00 mm Pitch)





## FF1152 Flip-Chip Fine-Pitch BGA Package (1.00 mm Pitch)



Figure 4-38: FF1152 Flip-Chip Fine-Pitch BGA Package

4

# FF1517 Flip-Chip Fine-Pitch BGA Package (1.00 mm Pitch)



Figure 4-39: FF1517 Flip-Chip Fine-Pitch BGA Package
# BF957 Flip-Chip BGA Package (1.27 mm Pitch)



Figure 4-40: BF957 Flip-Chip BGA Package

www.xilinx.com 1-800-255-7778 4

# **Flip-Chip Packages**

As silicon devices become more integrated with smaller feature sizes as well as increased functionality and performance, packaging technology is also evolving to take advantage of these silicon advancements. Flip-chip packaging is the latest packaging option introduced by Xilinx to meet the demand for high I/O count and high performance required by today's advanced applications.

Flip-chip packaging interconnect technology replaces peripheral bond pads of traditional wire-bond interconnect technology with area array interconnect at the die/substrate interface.

The area array pads contain wettable metallization for solders (either eutectic or highlead), where a controlled amount of solder is deposited either by plating or screenprinting. These parts are then reflowed to yield bumped dies with relatively uniform solder bumps spread over the surface of the device. Unlike traditional packaging in which the die is attached to the substrate face up and the connection is made by using wire, the bumped die in a flip-chip package is flipped over and placed face down, with the conductive bumps connecting directly to the matching metal pads on the ceramic or organic laminate substrate. The solder material at molten stage is self-aligning and produces good joints even if the chip is placed offset on the substrate.

Flip-chip packages are assembled on high-density, multi-layer ceramic or organic laminate substrates. Since flip-chip bump pads are in area array configuration, very fine lines and geometry on the substrates are required to be able to successfully route the signals from the die to the periphery of the substrates. Multi-layer build-up structures offer this layout flexibility on flip-chip packages, and they provide improvements in power distribution and signal transmission characteristics.

# Advantages of Flip-Chip Technology

Flip-chip interconnections in combination with the advanced multi-layer laminated substrates provide superior performance over traditional wire-bond packaging. Benefits include:

- Easy access to core power/ground and shorter interconnects, resulting in better electrical performance
- Better noise control since the inductance of flip-chip interconnect is lower
- Excellent thermal performance due to direct heatsinking to backside of the die
- Higher I/O density since bond pads are in area array format
- Smaller size

# **Thermal Data**

# Thermal Considerations

Due to the variety of applications in which Virtex-II FPGA devices are likely to be used, it is traditionally a challenge to predict power requirements, and thus the thermal management needs, of a particular application. Virtex-II devices in general are characterized by high I/O counts and very high user gate counts. The attributes that make the devices popular with users also give the devices the potential of being clocked fast, which results in high power consumption. Because of this high heat-generating potential, the Virtex-II package offering includes medium and high power capable packaging options (see Table 4-3).

Table 4-3 shows junction-to-ambient, junction-to-case, and junction-to-board thermal resistance parameters and estimated power consumption for Virtex-II packages. These values were derived using typical thermal management assumptions, stated in the table. Table 4-3 provides only an example and is not intended as a maximum power dissipation specification.

| Package                | Lead<br>Pitch<br>(mm) | Junction to<br>Ambient<br>Theta-J <sub>A</sub> Range<br>°C/Watt in Air | Junction<br>to Case<br>Theta-J <sub>C</sub><br>Typical<br>°C/Watt | Junction to<br>Board<br>Psi-J <sub>B</sub><br>("Theta-J <sub>B</sub> ")<br>Typical °C/Watt | Max Power<br>Bare Pkg<br>(Watts)<br>T <sub>A</sub> = 50 °C<br>T <sub>JMAX</sub> = 100 °C | Power With Heatsink (Watts)<br>Theta-SA = 1.5 °C/Watt<br>Theta-cs = 0.1 °C/Watt<br>$T_A = 50^\circ C$<br>$T_J = 100^\circ C$ |
|------------------------|-----------------------|------------------------------------------------------------------------|-------------------------------------------------------------------|--------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------|
| CS144 Flex Based 12x12 | 0.8                   | 32 - 36                                                                | 1                                                                 | 20                                                                                         | 1.5                                                                                      | N/A <sup>(1)</sup>                                                                                                           |
| FG256 2- 4L PCB 17x17  | 1.0                   | 30 -35                                                                 | 3.5                                                               | 19                                                                                         | 1.5                                                                                      | N/A <sup>(1)</sup>                                                                                                           |
| FG456 4L PCB 23x23     | 1.0                   | 15 - 28                                                                | 2.0                                                               | 11                                                                                         | 2.4                                                                                      | N/A <sup>(1)</sup>                                                                                                           |
| FG676 4L PCB 27x27     | 1.0                   | 14 -22                                                                 | 1.8                                                               | 9                                                                                          | 2.8                                                                                      | 15                                                                                                                           |
| BG575 4L PCB 31x31     | 1.27                  | 13 - 20                                                                | 1.6                                                               | 7                                                                                          | 3.1                                                                                      | 16                                                                                                                           |
| BG728 4L PCB 35x35     | 1.27                  | 12 -20                                                                 | 1.5                                                               | 6                                                                                          | 3.3                                                                                      | 16                                                                                                                           |
| BF957 40x40 Flip-Chip  | 1.27                  | 8 - 13                                                                 | 0.7                                                               | 3                                                                                          | 5.0                                                                                      | 22                                                                                                                           |
| FF896 31x31 Flip-Chip  | 1.0                   | 9 - 14                                                                 | 0.8                                                               | 4                                                                                          | 4.5                                                                                      | 21                                                                                                                           |
| FF1152 35x35 Flip-Chip | 1.0                   | 8 - 13                                                                 | 0.8                                                               | 4                                                                                          | 4.5                                                                                      | 21                                                                                                                           |
| FF1517 40x40 Flip-Chip | 1.0                   | 8 - 12                                                                 | 0.7                                                               | 3                                                                                          | 5.0                                                                                      | 22                                                                                                                           |

Table 4-3: Thermal Data for Virtex-II Packages

#### Notes:

1. The heat sink used in this example is not mechanically compatible with the CS144, FG256, and FG456 packages.

Virtex-II packages can be grouped into three broad performance categories: low, medium, and high, based on their power handling capabilities. All of the packages can use external thermal enhancements, which can range from simple airflow to schemes that can include passive as well as active heatsinks. This is particularly true for high-performance flip-chip packages where system designers have the option to further enhance the packages to handle in excess of 25 watts, with arrangements that take system physical constraints into consideration. Table 4-4 shows simple but incremental power management schemes that can be brought to bear on flip-chip packages.

Table 4-4: Virtex-II Flip-Chip Thermal Management

| Power                       | Technique                                           | Description                                                                                 |
|-----------------------------|-----------------------------------------------------|---------------------------------------------------------------------------------------------|
| Low End<br>(1 - 6 watts)    | Bare package with moderate air<br>8 - 12 °C/Watt    | Bare package. Package can be<br>used with moderate airflow<br>within a system.              |
| Mid Range<br>(4 - 10 watts) | Passive heatsink with air<br>5 - 10 ° <b>C/Watt</b> | Package is used with various<br>forms of passive heatsinks and<br>heat spreader techniques. |
| High End<br>(8 - 25 watts)  | Active heatsink<br>2 - 3 °C/Watt or better          | Package is used with active<br>heatsinks, TEC, and board-<br>level heat spreader techniques |

# **Thermal Management Options**

The following are thermal management options to consider:

- For moderate power dissipation (2 to 6 watts), the use of passive heatsinks and heatspreaders attached with thermally conductive double-sided tapes or retainers can offer quick thermal solutions.
- The use of lightweight finned external passive heatsinks can be effective for dissipating up to 10 watts. If implemented with forced air as well, the benefit can be a 40% to 50% increase in heat handling efficiency over bare packages. The more efficient external heatsinks tend to be tall and heavy. To help protect component joints from bulky heatsink induced stresses, the use of spring loaded pins or clips that transfer the mounting stress to a circuit board is advisable. The diagonals of some of these heatsinks can be designed with extensions to allow direct connections to the board.
- Flip-chip packages: All flip-chip packages are thermally enhanced BGAs with die facing down. They are offered with exposed metal heatsink at the top. These high-end thermal packages lend themselves to the application of external heatsinks (passive or active) for further heat removal efficiency. Again, precaution should be taken to prevent component damage when a bulky heatsink is attached.
- Active heatsinks can include a simple heatsink incorporating a mini fan or even a Peltier Thermoelectric Cooler (TECs) with a fan to blow away any heat generated. Any considerations to apply TEC in heat management should require consultation with experts in using the device, since these devices can be reversed and cause damage to the components. Also, condensation can be an issue.
- Molded packages (FG456, FG676, BG575, BG728, and so forth) with or without exposed metal at the top can also use heatsinks at the top for further heat removal. These BGA packages are similar in construction to those used in Graphics cards in PC applications, and heatsinks used for those applications can easily be used for these packages, as well. In this case, the Junction-to-Case resistance is the limiting consideration.
- Outside the package itself, the board on which the package sits can have a significant impact on thermal performance. Board designs can be implemented to take advantage of a board's ability to spread heat. The effect of the board is dependent on its size and how it conducts heat. Board size, the level of copper traces on it, and the number of buried copper planes all lower the junction-to-ambient thermal resistance for packages mounted on the board.

The junction-to-board thermal resistance for Virtex-II packages are given in Table 4-3. A standard JEDEC type board was used for obtaining the data. Users need to be aware that a direct heat path to the board from a component also exposes the component to the effect of other heat sources - particularly if the board is not cooled effectively. An otherwise cooler component might be heated by other heat contributing components on the board.

# **Printed Circuit Board Considerations**

# Layout Considerations

The PC board is no longer just a means to hold ICs in place. At today's high clock rates and fast signal transitions, the PC board performs a vital function in feeding stable supply voltages to the IC and in maintaining signal integrity between devices.

## VCC and Ground Planes

Since CMOS power consumption is dynamic, it is a non-trivial task to assure stable supply voltages at the device pins and to minimize ground differentials. A multi-layer PC board is a must, with four layers for the simplest circuits, 6 to 12 layers for typical boards. Ground and V<sub>CC</sub> must each be distributed in complete layers with few holes. Slots in these layers would cause an unacceptable inductive voltage drop, when the supply current changes at a rate of 1 A/ns, or even faster. Besides an uninterrupted ground plane, Virtex-II devices require one plane for V<sub>CCINT</sub> (1.5 V) plus one plane for V<sub>CCAUX</sub> (3.3 V). V<sub>CCO</sub> can be distributed on wide signal traces with sufficient bypass capacitors.

Beyond low resistance and inductance, ground and V<sub>CC</sub> planes combined can also provide a small degree of V<sub>CC</sub> decoupling. The capacitance between two planes is ~100 pF/inch<sup>2</sup> or ~15 pF/cm<sup>2</sup>, assuming 10 mil (0.25 mm) spacing with FR4 epoxy.

# V<sub>CC</sub> Decoupling

Fast changing Icc transitions must be supplied by local decoupling capacitors, placed very closely to the V<sub>CC</sub> device pins or balls. These capacitors must have sufficient capacitance to supply Icc for a few ns and must have low intrinsic resistance and inductance. X7R or NPO ceramic surface-mounted capacitors of 0.01 to 0.1  $\mu$ F, one per V<sub>CC</sub> device pin, are appropriate. 0.1  $\mu$ F can supply 1A for 2ns with a 20 mV voltage droop.

 $1A \bullet 2ns = 2$  nanocoulomb =  $100 \text{ nF} \bullet 0.02 \text{ V}$ 

Low impedance at >100 MHz is important, but capacitance variation with temperature is acceptable. These small capacitors are the first-line source for Icc, and they must be placed very close to the  $V_{CC}$  pins. A half-inch or 10 mm trace represents an inductance of several nanohenries, defeating the purpose of the decoupling capacitor. Backing up this local decoupling is one tantalum capacitor of 10 to 100  $\mu$ F, able to supply multiple amperes for about 100 ns.

Finally, each board needs a power-supply decoupling electrolytic capacitor of 1000 to  $10,000 \,\mu\text{F}$  able to supply even more current for a portion of the supply switching period. As described below, larger capacitors inevitably have higher series resistance and inductance, which is the reason for the above-mentioned hierarchy of supply decoupling. As a general rule, multiple capacitors in parallel always offer lower resistance and inductance than any single capacitor.

## **Decoupling Capacitors**

The ideal decoupling capacitor would present a short circuit to ground for all ac signals. A real capacitor combines a given amount of capacitance with unavoidable parasitics, a small series resistance and inductance. At low frequencies, the composite impedance is capacitive, i.e., it decreases with increasing frequency. At high frequencies, it is inductive and increases with frequency, making the decoupling ineffective. In-between, there is the LC resonant frequency, where the capacitor looks like a small resistor.

Different technologies provide different trade-offs between desirable features like small size and high capacitance, and undesirable features like series resistance and inductance. Electrolytic and tantalum capacitors offer the largest capacitance in a given physical size, but also have the highest inductance. This makes them useful for decoupling low frequencies and storing large amounts of charge, but useless for high frequency decoupling. Surface-mount ceramic capacitors, on the other hand, offer the lowest

inductance and the best high-frequency performance, but offer only a small amount of capacitance, less than a microfarad.

**Figure 4-41** shows the frequency-dependent impedance and resistance of a typical electrolytic capacitor of 1500 µF, while Figure 4-42 and Figure 4-43 show the equivalent data for ceramic bypass capacitors of 33,000 and 3,300 pF, respectively. Note that the resonant frequency for the small ceramic bypass capacitor at 100 MHz is 10,000 times higher than the resonance frequency of the large electrolytic capacitor at 10 KHz. For more technical information on decoupling capacitors, see the manufacturers' websites.







Figure 4-42: 33000 pF X7R Component Frequency Response Curve



Figure 4-43: 3300 pF X7R Component Frequency Response Curve

## Transmission Line Reflections and Terminations

A PC board trace must be analyzed as a transmission line. Its series resistance and parallel conductance can generally be ignored, but series inductance and parallel capacitance per unit length are important parameters. Any signal transition (rising or falling edge) travels along the trace at a speed determined by the incremental inductance and capacitance.

For an outer-layer trace (air on one side) the propagation delay is 140 ps/inch, or 55 ps/cm. For an inner-layer trace (FR4 with  $\epsilon$ =4.5 on both sides), the propagation delay is 180 ps/inch, or 70 ps/cm.

The voltage-to-current ratio at any point along the transmission line is called the characteristic impedance  $Z_0$ . It is determined by w/d, the ratio of trace width w to the distance d above the ground or  $V_{CC}$  plane.

For an outer layer trace (microstrip),

 $Z_0=50 \Omega$  when w = 2d (e.g., w = 12 mil, d = 6 mil),

 $Z_0=75 \Omega$  when w = d (e.g., both 6 mil = 0.15 mm).

For an inner layer trace between two ground or V<sub>CC</sub> planes (stripline),

 $Z_0=50$ Ω when w = 0.6•d (e.g., w = 5 mil, d = 8 mil),

 $Z_0=75 \Omega$  when w = 0.25•d (impractical).

Most signal traces fall into the range of 40 to 80  $\Omega$ .

A slow transition treats a short narrow trace as a lumped capacitance of about 2 pF per inch (0.8 pF per cm). However, if the trace is so long, or the signal transition is so fast that the potential echo from the far end arrives after the end of the transition, then the trace must be analyzed as a transmission line.

In this case, the driver sees the trace not as a lumped capacitance, but rather as a pure resistance of  $Z_0$ . The signal transition then travels along the trace at the speed mentioned above. At any trace-impedance discontinuity all or part of the signal is reflected back to the origin. If the far end is resistively terminated with  $R=Z_0$ , then there is no reflection. If, however, the end is open, or loaded with only a CMOS input, then the transition doubles in amplitude, and this new wave travels back to the driver, where it may be reflected again, resulting in the familiar ringing. Such ringing has a serious impact on signal integrity, reduces noise margins, and can lead to malfunction, especially if an asynchronous signal or

4

a clock signal crosses the input threshold voltage unpredictably. Two alternate ways to avoid reflections and ensure signal integrity are parallel termination and series termination.

#### **Parallel Termination**

Reflections from the far end of the transmission line are avoided if the far end is loaded with a resistor equal to  $Z_0$ . A popular variation uses two resistors, one to  $V_{CC}$ , one to ground, as the Thevenin equivalent of  $Z_0$ . This reduces the load current for one signal level, while increasing it for the other. Parallel termination causes dc power consumption which can be eliminated by inserting a capacitor between the terminating resistor and ground. The value of this capacitor is determined as follows:

Signal transition time << RC << signal level duration

For example, 50  $\Omega$  • 120 pF for a 2 ns transition every 20 ns. See Figure 4-44.



Figure 4-44: Parallel Termination

#### Series Termination

While parallel termination eliminates reflections, series termination relies on the reflection from the far end to achieve a full-amplitude signal. For series termination, the driver impedance is adjusted to equal  $Z_0$ , thus driving a half-amplitude signal onto the transmission line. At the unterminated far end, the reflection creates a full-amplitude signal, which then travels back to the driver where it gets absorbed, since the output impedance equals  $Z_0$ . See Figure 4-45.



*Figure 4-45:* **Series Termination** 

Series termination dissipates no dc power, but the half-amplitude round-trip delay signal means that there must be no additional loads along the line. Series termination is ideal (and only meaningful) for single-source-single-destination interconnects.

Virtex-II devices offer digitally controlled output impedance drivers and digitallycontrolled input termination, thus eliminating the need for any external termination resistors. This feature is extremely valuable with high pin-count, high density packages.

These PC board considerations apply to all modern systems with fast current and voltage transitions, irrespective of the actual clock frequency. The designer of relatively slow systems is more likely caught off-guard by the inherent speed of modern CMOS ICs, where di/dt is measured in A/ns, dV/dt is measured in V/ns, and input flip-flops can react to 1 ns pulses, that are invisible on mid-range oscilloscopes. Powerful tools like HyperLynx can analyze signal integrity on the PC board and can often be amortized by one eliminated board-respin.

## JTAG Configuration and Test Signals

Poor signal integrity and limitations of devices in a JTAG scan chain can reduce the maximum JTAG test clock (TCK) rate and reliability of JTAG-based configuration and test procedures. The JTAG TCK and test mode (TMS) signals must be buffered, distributed, and routed with the same care as any clock signal especially for long JTAG scan chains. The devices in a JTAG scan chain should be ordered such that the connections from the TDO of one device to the TDI of the next device are minimized. When high-speed JTAG-based configuration for the Virtex-II devices is required, devices with lower-specified maximum TCK rates can be placed in a separate JTAG scan chain.

#### Crosstalk

Crosstalk can happen when two signals are routed closely together. Current through one of the traces creates a magnetic field that induces current on the neighboring trace, or the voltage on the trace couples capacitively to its neighbor. Crosstalk can be accurately modeled with signal integrity software, but two easy to remember rules of thumb are:

- Crosstalk falls off with the square of increasing distance between the traces.
- Crosstalk also falls off with the square of decreasing distance to a ground plane.

Peak Crosstalk Voltage = 
$$\frac{DV}{1 + (D/H)^2}$$

where

DV is the voltage swing

D is the distance between traces (center to center)

H is the spacing above the ground plane

#### Example:

3.3V swing, and two stripline traces 50 mils apart and 50 mils above the ground plane.

Peak Crosstalk Voltage =  $(3.3 \text{ V})/(1 + (0.05/0.05)^2) = 1.65 \text{ V}$ 

This can cause a false transition on the neighboring trace. Separating the trace by an additional 50 mils is significantly better:

Peak Crosstalk Voltage =  $(3.3 \text{ V})/(1 + (0.1/0.05)^2) = 0.66 \text{ V}$ 

## Signal Routing to and from Package Pins

Signal escaping (traces leaving the pin/ball area) can be quite difficult for the large FG and flip-chip packages. The number of signal layers required to escape all the pins depends on the PCB design rules. The thinner the traces, the more signals per layer can be routed, and the fewer layers are needed. The thinner traces have higher characteristic impedance, so choose an impedance plan that makes sense, and then be consistent. Traces from 40 to 80 ohms are common.

If only one signal can be escaped between two pads, only two rows of pins can be escaped per layer. For FG packages (1.0mm pitch) one signal of width 5 mils (0.13mm) can be

escaped between two pads, assuming a space constraint equal to the trace width. For a discussion of signal routing specific to Virtex-II devices, see <u>www.xilinx.com</u> for currently available application notes.

As packages are able to handle more I/Os with a minimum increase in size, the signal integrity of those signals must be considered, regardless of clock frequency. Especially with the largest packages, precise PCB layer stackup is required. Parameters such as board material, trace width, pad type, and stackup must be defined based on simulation, and the fabrication drawings must be marked with "precise layer stackup" and the stackup specified. A number of board-level signal integrity simulators exist, and careful attention to PCB design rules creates a robust design with low EMI and high signal reliability.

# **Board Routability Guidelines**

# **Board-Level BGA Routing Challenges**

Xilinx ball grid array (BGA) wire-bond and flip-chip packages contain a matrix of solder balls (see Figure 4-46). These packages are made of multilayer BT substrates. Signal balls are in a perimeter format. Power and ground pins are grouped together appropriately.



Figure 4-46: Fine-Pitch BGA Pin Assignments

The number of layers required for effective routing of these packages is dictated by the layout of pins in each package. If several other technologies and components are already present on the board, the system cost is factored with every added board layer. The intent of a board designer is to optimize the number of layers required to route these packages, considering both cost and performance. This section provides guidelines for minimizing required board layers for routing BGA products using standard PCB technologies (5 mils-wide lines and spaces or 6 mils-wide lines and spaces).

For high performance and other system needs, designers can use premium technologies with finer lines/spaces on the board. The pin assignment and pin grouping scheme in BGA packages enables efficient routing of the board with an optimum number of required board layers.

# **Board Routing Strategy**

The diameter of a land pad on the component side is provided by Xilinx. This information is required prior to the start of board layout when designing the board pads to match component-side land geometry. Typical values for these land pads are described in Figure 4-47 and summarized in Table 4-5.



x157\_02\_120500

#### Figure 4-47: Suggested Board Layout of Soldered Pads for BGA Packages

| Table 1 5. | Summor  | / of <sup>-</sup> | Typical | Lond | Dod | Values | (mm) |
|------------|---------|-------------------|---------|------|-----|--------|------|
| Table 4-5. | Summary | / 01              | rypicar | Lanu | Pau | values | (mm) |

| Land Pad Characteristics                       | CS144   | FG256   | FG456          | FG676   | BG575   | BG728   | FF896   | FF1152  | FF1517  | BF957   |
|------------------------------------------------|---------|---------|----------------|---------|---------|---------|---------|---------|---------|---------|
| Component Land Pad Diameter (SMD) <sup>4</sup> | 0.35    | 0.45    | 0.45           | 0.45    | 0.61    | 0.61    | 0.48    | 0.48    | 0.48    | 0.61    |
| Solder Land (L) Diameter                       | 0.33    | 0.40    | 0.40           | 0.40    | 0.56    | 0.56    | 0.45    | 0.45    | 0.45    | 0.56    |
| Opening in Solder Mask (M) Diameter            | 0.44    | 0.50    | 0.50           | 0.50    | 0.66    | 0.66    | 0.55    | 0.55    | 0.55    | 0.66    |
| Solder (Ball) Land Pitch (e)                   | 0.80    | 1.00    | 1.00           | 1.00    | 1.27    | 1.27    | 1.00    | 1.00    | 1.00    | 1.27    |
| Line Width Between Via and Land (w)            | 0.130   | 0.130   | 0.130          | 0.130   | 0.203   | 0.203   | 0.130   | 0.130   | 0.130   | 0.203   |
| Distance Between Via and Land (D)              | 0.56    | 0.70    | 0.70           | 0.70    | 0.90    | 0.90    | 0.70    | 0.70    | 0.70    | 0.90    |
| Via Land (VL) Diameter                         | 0.51    | 0.61    | 0.61           | 0.61    | 0.65    | 0.65    | 0.61    | 0.61    | 0.61    | 0.65    |
| Through Hole (VH), Diameter                    | 0.250   | 0.300   | 0.300          | 0.300   | 0.356   | 0.356   | 0.300   | 0.300   | 0.300   | 0.356   |
| Pad Array                                      | -       | Full    | Full           | Full    | Full    | Full    | Full    | Full    | Full    | Full    |
| Matrix or External Row                         | 13 x 13 | 16 x 16 | 22 x 22        | 26 x 26 | 24 x 24 | 27 x 27 | 30 x 30 | 34 x 34 | 39 x 39 | 31 x 31 |
| Periphery Rows                                 | 4       | -       | 7 <sup>3</sup> | -       | -       | -       | -       | -       | -       | -       |

#### Notes:

- 1. Dimension in millimeters.
- 2. 3 x 3 matrix for illustration only, one land pad shown with via connection.
- 3. FG456 package has solder balls in the center in addition to the periphery rows of balls.
- 4. Component land pad diameter refers to the pad opening on the component side (solder-mask defined).

For Xilinx BGA packages, non-solder-mask defined (NSMD) pads on the board are suggested. This allows a clearance between the land metal (diameter L) and the solder mask opening (diameter M) as shown in Figure 4-47. The space between the NSMD pad and the solder mask, as well as the actual signal trace widths, depend on the capability of the PCB vendor. The cost of the PCB is higher when the line width and spaces are smaller.

Selection of pad types and sizes determines the available space between adjacent balls for signal escape. Based on PCB capability, the number of lines that can share the available space is described in Figure 4-48. Based on geometrical considerations, if one signal escapes between adjacent balls, then two signal rows can be routed on a single metal layer. This is illustrated in Figure 4-48 as routing with one line/channel, either at 6 mils-wide lines and spaces or 5 mils-wide lines and spaces. Using this suggested routing scheme, a minimum of eight PCB layers are required to route 10 signal rows in a package.

A slightly lower trace width can be used by the inner signal rows routed in internal layers than the width used in top and bottom external or exposed traces. Depending on the signal being handled, the practice of "necking down" a trace in the critical space between the BGA balls is allowable. Changes in width over very short distances can cause small impedance changes. Validate these issues with the board vendor and signal integrity engineers responsible for the design.



#### Figure 4-48: FG676 PC Board Layout/Land Pattern

**Figure 4-48** describes a board-level layout strategy for a Xilinx 1.0 mm pitch FG676 package. Detail A in Figure 4-48 describes the opening geometry for the Land Pad and the Solder Mask. Routing with 5 mils-wide lines or spaces allows one signal per channel (between the balls). For successful routing, eight-row deep signal traces require six PCB layers. Figure 4-49 shows the suggested schematic of layers for the six-layer routing scheme. Using premium board technology, such as Microvia Technology (allowing up to 4 mils-wide lines and spaces), efficient routing is possible with a reduced number of board layers. A grouping scheme for power, ground, control, and I/O pins, might also enable efficient routing.



Figure 4-49: Six-Layer Routing Scheme

**Figure 4-50** through **Figure 4-67** show suggested layer-by-layer board routing for each Virtex-II package, including flip-chip packages. These drawings assume a standard PCB technology of 5 mils-wide lines and spaces. Table 4-6 lists the layer-by-layer routing examples provided. More details are contained in XAPP157, which is available on the web at <u>www.xilinx.com/xapp/xapp157.pdf</u>, as is a full-color (PDF) version of this document.

| Package | Standard Routing                           | Routing With LVDS Pairs                    |
|---------|--------------------------------------------|--------------------------------------------|
| FG256   | Top and bottom layers                      | Top and bottom layers                      |
| FG456   | Top, 2nd, and bottom layers                | Top, 2nd, and bottom layers                |
| FG676   | Top, 2nd, 3rd, and bottom layers           | Top, 2nd, 3rd, and bottom layers           |
| BG575   | Top, 2nd, and bottom layers                | Top, 2nd, and bottom layers                |
| BG728   | Top, 2nd, 3rd, and bottom layers           | Top, 2nd, 3rd, and bottom layers           |
| FF896   | Top, 2nd, 3rd, and bottom layers           | Top, 2nd, 3rd, and bottom layers           |
| FF1152  | Top, 2nd, 3rd, 4th, and bottom layers      | Top, 2nd, 3rd, 4th, and bottom layers      |
| FF1517  | Top, 2nd, 3rd, 4th, 5th, and bottom layers | Top, 2nd, 3rd, 4th, 5th, and bottom layers |
| BF957   | Top, 2nd, 3rd, and bottom layers           | Top, 2nd, 3rd, and bottom layers           |

Table 4-6: Layer-By-Layer Board Routing Examples



Figure 4-50: FG256 Standard Routing



Figure 4-51: FG256 Routing With LVDS Pairs

4



Figure 4-52: FG456 Standard Routing

FG456: STANDARD ROUTING



Figure 4-53: FG456 Routing With LVDS Pairs

UG002 (v1.5) 2 December 2002 Virtex-II Platform FPGA User Guide

FG456: ROUTING WITH LVDS PAIR

www.xilinx.com 1-800-255-7778



Figure 4-54: FG676 Standard Routing

| <u>COMPONENT ATRIBUTE:</u><br>1) Ball diameter 0.6 mm<br>2) Pad opening 0.45 mm Solder Mask Defined. | NOTES ON BOARD:<br>1) Solder land diameter 0.4 mm Non Solder Mask Defined.<br>2) Via diameter 0.3 mm on 0.61 mm diameter Via Land.<br>3) Top and bottom layer signal trace width 0.127 mm.<br>4) Inner layer signal trace width 0.110 mm. |
|------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| FG676: ROUTING MITH LVDS PAIR                                                                        |                                                                                                                                                                                                                                           |
|                                                                                                      |                                                                                                                                                                                                                                           |

Figure 4-55: FG676 Routing With LVDS Pairs

<u>www.xilinx.com</u> 1-800-255-7778 4



Figure 4-56: BG575 Standard Routing

Virtex-II Platform FPGA User Guide

# BG575: STANDARD ROUTING

1-800-255-7778

BG575: ROUTING WITH LVDS PAIR



Figure 4-57: BG575 Routing With LVDS Pairs

www.xilinx.com 1-800-255-7778 J2002\_c4\_r\_bg575lvdspair\_031301



Figure 4-58: BG728 Standard Routing



Figure 4-59: BG728 Routing With LVDS Pairs

383

4





<u>www.xilinx.com</u> 1-800-255-7778

|                          | <u>Layer 3</u>   | COMPONENT ATTRIBUTE:<br>2) Pad opening 0.48 mm Solder Mask Defined.<br>NOTES ON BOARD:<br>1) Solder land diameter 0.45 mm Non Solder Mask Defined.<br>2) Via diameter 0.3 mm on 0.61 mm diameter Via Land.<br>3) Top and bottom layer signal trace width 0.127 mm.<br>4) Inner layer signal trace width 0.110 mm. | ug002_c4_r_ff1152_091902 |
|--------------------------|------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------|
| FF1152: STANDARD ROUTING | <u>Layer 2</u>   |                                                                                                                                                                                                                                                                                                                   | <u>Bottom Layer</u>      |
|                          | <u>Top Layer</u> |                                                                                                                                                                                                                                                                                                                   | Layer 4                  |



www.xilinx.com 1-800-255-7778

|                                | Layer 3        | COMPONENT ATTRIBUTE:<br>2) Pad opening 0.48 mm Solder Mask Defined.<br>NOTES ON BOARD:<br>1) Solder land diameter 0.45 mm Non Solder Mask Defined.<br>2) Via diameter 0.3 mm on 0.61 mm diameter Via Land.<br>3) Top and bottom layer signal trace width 0.127 mm.<br>4) Inner layer signal trace width 0.110 mm. | ug002_c4_r_ff1152wdspair_091902 |
|--------------------------------|----------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------|
| FF1152: ROUTING WITH LVDS PAIR | <u>Layer 2</u> |                                                                                                                                                                                                                                                                                                                   | <u>Bottom Layer</u>             |
|                                | Top Layer      |                                                                                                                                                                                                                                                                                                                   | Layer 4                         |

Figure 4-63: FF1152 Routing With LVDS Pairs

<u>www.xilinx.com</u> 1-800-255-7778



Figure 4-64: FF1517 Standard Routing



Figure 4-65: FF1517 Routing With LVDS Pairs

www.xilinx.com 1-800-255-7778

# XILINX<sup>®</sup>



Figure 4-66: BF957 Standard Routing

UG002 (v1.5) 2 December 2002 Virtex-II Platform FPGA User Guide



<u>www.xilinx.com</u> 1-800-255-7778 4

# **Power Consumption**

The Virtex-II power estimator worksheet estimates power consumption for a Virtex-II design before it is downloaded. It considers the design resource usage, toggle rates, I/O power, and many other factors in the estimation. The formulas used for calculations in the program are based on test design measurements.

Xilinx provides two versions of the power estimator, an Excel 97 version that works with Microsoft Office 97 software, and a CGI version for use with web browsers. They are identical in terms of estimations and data entries.

This section explains how to use the Power Estimator Worksheet to calculate estimated power consumption for Virtex-II designs. Since this is an estimation tool, results may not match precisely with what is measured on the board.

The power estimator consists of six categories: CLB (configurable logic block) logic power, dedicated non-multiplier power, dedicated registered multiplier power, block SelectRAM power, DCM (digital clock management), input/output power, and the results. To estimate power with the worksheet, a designer must determine how to group portions of the design into modules, what resources each module contains, the respective clock frequencies, and average toggle rates.

#### Note:

1. The Virtex-II power estimation is still under development. The table entries in this section may be different from the entries in the released version of the power estimation tool,

# **CLB Logic Power**

Table 4-7 shows the data entries required for the CLB Logic Power section in the Power Estimator. This section estimates the power consumption of the CLBs for a Virtex-II design. In this section, users need to partition designs into modules, specify area utilization, and toggle rates.

|               |                    |               |                        |                   | LUT       |                               |                   |
|---------------|--------------------|---------------|------------------------|-------------------|-----------|-------------------------------|-------------------|
| Module        | Frequency<br>(MHz) | CLB<br>Slices | Flip-Flops/<br>Latches | Shift<br>Register | SelectRAM | Average<br>Toggle Rate<br>(%) | Routing<br>Amount |
| User Module 1 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 2 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 3 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 4 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 5 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 6 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 7 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |
| User Module 8 | 0                  | 0             | 0                      | 0                 | 0         | 0%                            | Medium            |

#### Table 4-7: CLB Logic Power

## Modules

Modules are portions of a design. A designer could treat the entire design as one module and calculate its toggle rate. However, estimating power this way is not as accurate as when the design is divided into multiple modules. Generally, with more modules the estimate is better. The Virtex-II power estimator allows designs to be partitioned into a maximum of eight modules. Determining how to partition the design into modules depends on user preference. Three partitioning approaches are presented below as guidelines.

#### Grouping by Hierarchy

If a design contains hierarchical components at the top level, these components may be separated or grouped together to represent modules.

#### Grouping by Clocks

If a design has several different clocks, the logic associated with each clock should be treated as a module. For accuracy, it is recommended that each module contains only one clock.

#### Grouping by Functionality

For a design with sub-components that perform different functions, each sub-component can be considered as a module. For example, a microprocessor can be thought of as three main modules: an ALU, a Register File, and a Control System.

#### Frequency (MHz)

Frequency is the clock speed for the module. Again, it is strongly recommended that each module contains only one clock.

#### **CLB** Slices

This involves the total CLB usage of a module. This number is available from the synthesis report in a specific synthesis tool. For a more accurate result, MAP only this module in Xilinx Foundation software, and take the numbers from the map.mrp file. The map.mrp file is the output resource usage file produced by running the MAP program in the Xilinx Foundation software.

For schematic-based designs, obtaining this number is slightly more difficult. Designers can either estimate CLB usage based on the design structure or MAP the module and read the numbers from the map.mrp file.

## Flip Flops or Latches

The total number of flip-flop and latch elements used for each module can be obtained from the synthesis report, the map.mrp file, or by adding up the registers from the schematics.

## Shift Register LUTs

This is the total number of SRL16 elements used in each module.

## SelectRAM LUTs

This is the total number of LUTs used as Distributed Select RAM components. For Virtex-II devices, one 16 x 1 synchronous RAM is equivalent to one LUT, and one 16 x 1 dual-port RAM is equivalent to two LUTs (split between two slices).

## Average Toggle Rate (%)

The toggle rate describes how often the output changes with respect to the input clock, usually between 6% and 12% for a typical module. Functional simulation is required to accurately calculate the toggle rate. Designers need to simulate all the flip-flop outputs in each module with regard to the clock, and calculate how often the flip-flop outputs change in relation to the clock.

Measuring the toggle rate becomes a more complex and a time-consuming process as module size increases. A toggle flip-flop has a 100% toggle rate, an 8-bit counter has 28%, and 16-bit counter has 14%.

Figure 4-68 is an example of how to calculate the toggle rate for a 4-bit counter.



Figure 4-68: Output Waveform of a 4-bit Counter

**Figure 4-68** shows the simulation wave form of a 4-bit counter. D0 stands for the LSB of the count, and D3 stands for the MSB. The toggle rate of D0 is 100% because D0 changes after every clock cycle. The toggle rate of D1 is 50% because D1 changes after every two clock cycles. The toggle rate of D2 is 25% because D2 changes after every four clock cycles. The toggle rate of D3 is 12.5% because D3 changes after every eight clock cycles. In this example, the average toggle rate of a 4-bit counter derived in the following equation is 46.875%.

$$\frac{(100+50+25+12.5)}{4} = 46.875$$

## **Routing Amount**

There are three levels concerning the amount of routing to be used: low, medium, and high. The routing level is determined by the primary logic type of the module. Typical data path logic typically requires a low routing usage, random logic calls for a medium level, and control logic needs a high level.

Each designer needs to determine the routing that is most appropriate for each module.

Routing, which is determined by the type of logic in the module, is divided into three levels: low, medium, and high. Each designer needs to determine the routing that is most appropriate for each module.

- 1. Typical data path logic, which uses combinatorial logic such as multiplexers, adders, AND gates, and OR gates, usually requires a low routing usage. This also applies to any other signals that have one or two fanouts between structures.
- 2. Random logic, such as decoders, encoders, or any logic that has three to five fanouts, calls for a medium level of routing usage.
- 3. Control logic is typically logic with high fanout signals (excluding clocks) such as clock enables or reset signals. Control logic used in state machines also belongs to this category.

## Block SelectRAM Power

Table 4-8 shows the data entries required for the Block SelectRAM Power section. This section is used to specify how many block RAMs are used and to determine their estimated power consumption. Before doing the calculation, designers can either treat all the RAMB16 cells as one module or break them down into smaller modules. RAMB16 is the base name for the Virtex-II Block SelectRAM component.

### RAMB16 Cells

This is total number of Block Select RAMs (RAMB16 cells) used in each module.

### Port A Frequency (MHz)

This is the frequency on the CLKA pin.

#### Port A Width

This is data width of DIA and DOA busses.

#### Port A Enable Rate (%)

This specifies how often ENA is enabled with respect to the clock. For a typical design, the rate may be 100% because the enable could be enabled all the time. For a FIFO design, the rate could be approximately 50% due to bursting of data into and out of the RAM.

#### Port B Frequency (MHz)

This is the frequency on the CLKB pin.

#### Port B Width

This is the data width of DIB and DOB busses.

## Port B Enable Rate (%)

This specifies how often ENB is enabled with respect to the clock.

#### Table 4-8: Block SelectRAM Power

|                  | RAMB16 |                    | Port A |                    | Port B             |       |                    |  |
|------------------|--------|--------------------|--------|--------------------|--------------------|-------|--------------------|--|
| Module           | Cells  | Frequency<br>(Mhz) | Width  | Enable<br>Rate (%) | Frequency<br>(MHz) | Width | Enable<br>Rate (%) |  |
| User<br>Module 1 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 2 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 3 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 4 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 5 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 6 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 7 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |
| User<br>Module 8 | 0      | 0                  | 0      | 0                  | 0%                 | 0     | 0%                 |  |

## **Digital Clock Management Power**

Table 4-9 shows the data entries required for the DCM Power section and is used to estimate how much power DCMs consume. Only the clock input frequencies to the CLKIN pin needs to be entered.

| Module      | Clock Input Frequency (MHz) |
|-------------|-----------------------------|
| User DCM 1  | 0                           |
| User DCM 2  | 0                           |
| User DCM 3  | 0                           |
| User DCM 4  | 0                           |
| User DCM 5  | 0                           |
| User DCM 6  | 0                           |
| User DCM 7  | 0                           |
| User DCM 8  | 0                           |
| User DCM 9  | 0                           |
| User DCM 10 | 0                           |
| User DCM 11 | 0                           |
| User DCM 12 | 0                           |

#### Table 4-9: Clock Delay Locked Loop Power

## Non-Registered Multiplier Power

The data entries for the Non-Registered Multiplier Power section are shown in Table 4-10. These entries are used to estimate Non-Registered Multiplier power consumption.

| Module        | Mult18x18 Cell | Port A Width | Port B Width |
|---------------|----------------|--------------|--------------|
| User Module 1 | 0              | 0            | 0            |
| User Module 2 | 0              | 0            | 0            |
| User Module 3 | 0              | 0            | 0            |
| User Module 4 | 0              | 0            | 0            |
| User Module 5 | 0              | 0            | 0            |
| User Module 6 | 0              | 0            | 0            |
| User Module 7 | 0              | 0            | 0            |
| User Module 8 | 0              | 0            | 0            |

 Table 4-10:
 Data Entries for Non-Registered Multiplier Power

#### Multi18x18 Cell

Multi18x18 cell is the total number of Multipliers used in each module.

#### Port A Width

Port A width is the data width of A busses.

#### Port B Width

Port B width is the data width of B busses.
## **Registered Multiplier Power**

Data entries for the Registered Multiplier Power section are shown in Table 4-11. They are used to estimate Registered Multiplier power consumption.

## Frequency

This is the frequency that the Multipliers operate at.

## Multi18x18 Cell

Multi18x18 cell is the total number of Multipliers used in each module.

### Port A Width

Port A width is the data width of A busses.

## Port B Width

Port B width is the data width of B busses.

## Average Toggle Rate

This is the toggle rate for the multiplier modules. This number can be obtained in the same way as obtaining the Average Toggle Rate in the CLB logic power section.

| Module        | Frequency<br>(MHz) | Mult18x18<br>Cell | Port A<br>Width | Port B<br>Width | Average<br>Toggle Rate |
|---------------|--------------------|-------------------|-----------------|-----------------|------------------------|
| User Module 1 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 2 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 3 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 4 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 5 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 6 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 7 | 0                  | 0                 | 0               | 0               | 0                      |
| User Module 8 | 0                  | 0                 | 0               | 0               | 0                      |

 Table 4-11:
 Data Entries for Registered Multiplier Power

## Input/Output Power

Table 4-12 shows the data entries for the Input/Output Power section used to estimate the power dissipation of the Inputs and Outputs. I/Os should be grouped into modules based on their I/O standard type. If the entire design has only one I/O standard type, all of the I/Os can be treated as one module. However, separating the I/Os into smaller modules makes it easier to obtain more accurate results.

## Frequency (MHz)

This is the frequency of the module.

## I/O Standard Type

This is the type of I/Os used in the module. Each module can have only one I/O standard type. I/O power is strongly influenced by the I/O standard used.

## Inputs

This is the total number of the input buffers in each module.

## Outputs

This is the total number of the output buffers in each module.

## Average Output Toggle Rate (%)

This number can be obtained in the same way as obtaining the Average Toggle rate in the CLB Logic Power section.

## Average Output Load (pF)

This specifies the average capacitive load on the outputs.

| Module        | Frequency<br>(MHz) | l/O<br>Standard<br>Type | Inputs | Outputs | Average<br>Output Toggle<br>Rate (%) | Average<br>Output<br>Load (pF) |
|---------------|--------------------|-------------------------|--------|---------|--------------------------------------|--------------------------------|
| User Module 1 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 2 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 3 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 4 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 5 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 6 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 7 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |
| User Module 8 | 0                  | LVTTL_12                | 0      | 0       | 0%                                   | 0                              |

Table 4-12: Data Entries for Input/Output Power

## Results

The results section of the power estimator are shown in Table 4-13. The four sections of the power estimator program independently estimate power consumption, and the results are displayed at the end of each section.

The total design power consumption is the summation of those, and is displayed at the very top of the program.

 Table 4-13:
 Power Estimator Results

| Таі     | rget    | Estimated Design Power Values (mW) |                          |                        |                        |                        |                      |
|---------|---------|------------------------------------|--------------------------|------------------------|------------------------|------------------------|----------------------|
| Device  | Package | Total Power                        | V <sub>CCINT</sub> 1.5 V | V <sub>CCO</sub> 3.3 V | V <sub>CCO</sub> 2.5 V | V <sub>CCO</sub> 1.5 V | Output Sink<br>Power |
| XC2V500 | FG256   | 0                                  | 0                        | 0                      | 0                      | 0                      | 0                    |

## **Target Device**

This refers to the target Virtex-II device size.

Note: No checking is done to verify that the module entries fit into the amount of resources available in the selected devices.

## Target Package

This refers to the package of the device.

Note: No checking is done to verify that the selected device-package combination is valid.

## **Estimated Total Power**

This section displays the total power consumption of the design. It is the summation of CLB Logic power, Block Select RAM power, Multiplier power, DCM power, and Input/Output power.

## Estimated V<sub>CCINT</sub> 1.5V Power

This section displays the total power consumption from the core supply voltage ( $V_{CCINT}$ ). It does not include the power consumption from the input and output source voltage ( $V_{CCO}$ ).

## Estimated V<sub>CCAUX</sub> 3.3V Power

This section displays the power consumption from auxiliary circuits.

## Estimated V<sub>CCO</sub> 3.3V Power

This section displays the V<sub>CCO</sub> power consumption of 3.3 V applications. The I/O standards that use 3.3V V<sub>CCO</sub> are LVTTL, LVCMOS33 PCI, SSTL3 Class I and II, and AGP2X.

## Estimated V<sub>CCO</sub> 2.5V Power

This section displays the  $V_{CCO}$  power consumption of 2.5 V applications. The supported I/O standards are LVCMOS25 and SSTL2 Class I and II.

## Estimated V<sub>CCO</sub> 1.5V Power

This section displays the  $V_{CCO}$  power consumption of 1.5 V applications. The supported I/O standards are LVCMOS15, and HSTL Class I, II, III, and IV.

## Estimated Output Sink Power

This section displays the power consumption when sinking current to ground. The supported I/O standards are GTL and GTL+.

## **IBIS Models**

The need for higher system performance leads to faster output transitions. Signals with fast transitions cannot be considered purely digital; it is important to understand their analog behavior for signal integrity analysis.

To simulate the signal integrity on printed circuit boards (PCB) accurately and solve design problems before the PCB is fabricated, models of the I/O characteristics are required. SPICE models are most frequently used for this purpose. A manufacturer's SPICE models, however, contain proprietary circuit-level information. Therefore, simpler models are devised to extract SPICE parameters for the proprietary information to remain protected. One such standard is the I/O Buffer Information Specification (IBIS) format originally suggested by Intel.

In the early 1990's, the IBIS Open Forum was formed and the first IBIS specification was written to promote tool independent I/O models for system signal integrity analysis.

IBIS is now the ANSI/EIA-656 and IEC 62014-1 standard. IBIS accurately describes the signal behavior of the interconnections without disclosing the actual technology and circuitry used to implement the I/O. The standard is basically a black-box approach to protecting proprietary information.

## **Using IBIS Models**

IBIS models are used by designers for system-level analysis of signal integrity issues, such as the evaluation and matching of loads to drivers for ringing and ground bounce, examining effects of cross talk, and predicting RFI/EMI. It is useful in that complete designs can be simulated and evaluated before additional costs are incurred for PCB fabrication and assembly time.

IBIS models consist of look-up tables that predict the I/V characteristics and dV/dt of integrated circuit inputs and outputs when combined with the PCB wiring. The predictions are performed for the typical case, minimum case (weak transistors, low  $V_{CC}$ , hot temperatures), and maximum case (strong transistors, high  $V_{CC}$ , cold temperatures). IBIS models have limitations in that they do not contain internal delay modeling and are limited in package modeling. IBIS models contain package parasitic information for simulation of ground bounce. Although the data is available within the model file, not all simulators are able to use the data to simulate ground bounce. Simulation results may not agree with the actual results due to package, die, and PCB ground plane modeling problems. Similarly, because simultaneous switching outputs (SSOs) are also difficult to model, only a first approximation is provided to the designer.

## **IBIS** Generation

IBIS is generated either from SPICE simulations, or actual measurements of final devices. IBIS models that are derived from measurements do not have process corner information, unlike IBIS models that are derived from SPICE simulations. The measurements are of only a few parts, and the extremes of production are not represented by such a method. SPICE is a transistor model based on detailed equations using device geometry, and properties of materials. A SPICE netlist of the CMOS buffer is required for V/I and dV/dt curve simulations. These SPICE simulations are then converted to IBIS format/syntax.

## Advantages of IBIS

SPICE requires a greater knowledge of the internal workings of the circuits being modeled, and as such, errors may be made in simulation indicating a problem when there is none. IBIS models are easy to use, and because many of the decisions required for simulation parameters have been organized. IBIS simulations are faster compared to SPICE simulations, because IBIS does not contain circuit details. The voltage/current/time information provided in the IBIS model is only for the external nodes of the building block, making IBIS ideal for system-level interconnects design. Although IBIS models are not as accurate as SPICE models, they are entirely adequate for system-level analysis.

## **IBIS File Structure**

An IBIS file contains two sections, the header and the model data for each component. One IBIS file can describe several devices. The following is the contents list in a typical IBIS file:

- IBIS Version
- File Name
- File Revision
- Component
- Package R/L/C
- Pin name, model, R/L/C
- Model (i.e., 3-state)
- Temperature Range (typical, minimum, and maximum)
- Voltage Range (typical, minimum, and maximum)
- Pull-Up Reference
- Pull-Down Reference
- Power Clamp Reference
- Ground Clamp Reference
- V/I Tables for:
  - Pullup
  - Pulldown
  - Power Clamp
  - Ground Clamp
- Rise and Fall dV/dt for minimum, typical, and maximum conditions (driving 50 ohms)
- Package Model (optional) XXXX.pkg with RLC sections.

## IBIS I/V and dV/dt Curves

A digital buffer can be measured in receive (3-state mode) and drive mode. IBIS I/V curves are based on the data of both these modes. The transition between modes is achieved by phasing in/out the difference between the driver and the receiver models, while keeping the receiver model constantly in the circuit.

The I/V curve range required by the IBIS specification is  $-V_{CC}$  to (2 x  $V_{CC}$ ). This wide voltage range exists because the theoretical maximum overshoot due to a full reflection is twice the signal swing. The ground clamp I/V curve must be specified over the range –  $V_{CC}$  to  $V_{CC}$ , and the power clamp I/V curve must be specified from  $V_{CC}$  to (2 x  $V_{CC}$ ).

The three supported conditions for the IBIS buffer models are typical values (required), minimum values (optional), and maximum values (optional). For CMOS buffers, the minimum condition is defined as high temperature and low supply voltage, and the maximum condition is defined as low temperature and high supply voltage.

An IBIS model of a digital buffer has four I/V curves:

- The pull-down I/V curve contains the mode data for the driver driving low. The origin of the curve is at 0 V for CMOS buffers.
- The pull-up I/V curve contains the mode data for the driver driving high. The origin of the curve is at the supply voltage (V<sub>CC</sub> or V<sub>DD</sub>).
- The ground clamp I/V curve contains receive (3-state) mode data, with the origin of the curve at 0 V for CMOS buffers.

• The power clamp I/V curve contains receive (3-state) mode data, with the origin of the curve at the supply voltage (V<sub>CC</sub> or V<sub>DD</sub>). For 3.3 V buffers that are 5 V tolerant, the power clamp is referenced to 5 V while the pullup is referenced to 3.3 V.

## Ramp and dV/dt Curves

The Ramp keyword contains information on how fast the pull-up and pull-down transistors turn on/off. The dV/dt curves give the same information, while including the effects of die capacitance (C\_comp). C\_comp is the total die capacitance as seen at the die pad, excluding the package capacitance.

dV/dt curves describe the transient characteristics of a buffer more accurately than ramps. A minimum of four dV/dt curves is required to describe a CMOS buffer: pull-down ON, pull-up OFF, pull-down OFF, and pull-up ON. dV/dt curves incorporate the clock-to-out delay, and the length of the dV/dt curve corresponds to the clock speed at which the buffer is used. Each dV/dt curve has t = 0, where the pulse crosses the input threshold.

## **IBIS Simulations**



Figure 4-69: Unterminated Example

## 



Figure 4-71: Parallel Termination Example

www.xilinx.com 1-800-255-7778

## **IBIS Simulators**

Several different IBIS simulators are available today, and each simulator provides different results. An overshoot or undershoot of  $\pm 10\%$  of the measured result is tolerable. Differences between the model and measurements occur, because not all parameters are modeled. Simulators for IBIS models are provided by the following vendors:

- Cadence
- Avanti Corporation
- Hyperlynx
- Mentor
- Microsim
- Intusoft
- Veribest
- Viewlogic

## Xilinx IBIS Advantages

Xilinx provides preliminary IBIS files before working silicon has been verified (before tape out), as well as updated versions of IBIS files after the ICs are verified. Preliminary IBIS files are generated from SPICE models before working silicon has been verified. After the IC (device) is verified, appropriate changes are made to the existing IBIS files. These IBIS files are available at the following web site:

http://www.xilinx.com/support/sw ibis.htm

## **IBIS Reference Web Site**

http://www.eia.org/eig/ibis/ibis.htm

## **BSDL and Boundary Scan Models**

Boundary scan is a technique that is used to improve the testability of ICs. With Virtex-II devices, registers are placed on I/Os that are connected together as a long shift register. Each register can be used to either save or force the state of the I/O. There are additional registers for accessing test modes.

The most common application for boundary scan is testing for continuity of the IC to the board. Some packages make visual inspection of solder joints impossible, e.g. BGA. The large number of I/Os available requires the use of such packages, and also increases the importance of testing. A large number of I/Os also means a long scan chain.

Test software is available to support testing with boundary scan. The software requires a description of the boundary scan implementation of the IC. The IEEE 1149.1 specification provides a language description for Boundary Scan Description Language (BSDL). Boundary scan test software accepts BSDL descriptions.

The IEEE 1149.1 spec also defines a 4 to 5 pin interface known as the JTAG interface. IEEE 1532 is a capability extension of IEEE 1149.1.

## **BSDL** Files

Preliminary BSDL files are provided from the IC Design Process. Final BSDL files have been verified by an external third party test and verification vendor. The following are Virtex-II BDSL file names.

| Virtex-II BSDL File Names |                     |  |  |  |  |
|---------------------------|---------------------|--|--|--|--|
| XC2V40_CS144.BSD          | XC2V2000_FG676.BSD  |  |  |  |  |
| XC2V40_FG256.BSD          | XC2V2000_FF896.BSD  |  |  |  |  |
| XC2V80_CS144.BSD          | XC2V2000_BG575.BSD  |  |  |  |  |
| XC2V80_FG256.BSD          | XC2V2000_BF957.BSD  |  |  |  |  |
| XC2V250_CS144.BSD         | XC2V3000_FG676.BSD  |  |  |  |  |
| XC2V250_FG256.BSD         | XC2V3000_FF1152.BSD |  |  |  |  |
| XC2V250_FG456.BSD         | XC2V3000_BG728.BSD  |  |  |  |  |
| XC2V500_FG256.BSD         | XC2V3000_BF957.BSD  |  |  |  |  |
| XC2V500_FG456.BSD         | XC2V4000_FF1152.BSD |  |  |  |  |
| XC2V1000_FG256.BSD        | XC2V4000_FF1517.BSD |  |  |  |  |
| XC2V1000_FG456.BSD        | XC2V4000_BF957.BSD  |  |  |  |  |
| XC2V1000_FF896.BSD        | XC2V6000_FF1152.BSD |  |  |  |  |
| XC2V1000_BG575.BSD        | XC2V6000_FF1517.BSD |  |  |  |  |
| XC2V1500_FG676.BSD        | XC2V6000 _BF957.BSD |  |  |  |  |
| XC2V1500_FF896.BSD        | XC2V8000_FF1152.BSD |  |  |  |  |
| XC2V1500_BG575.BSD        | XC2V8000_FF1517.BSD |  |  |  |  |



## Appendix A

# BitGen and PROMGen Switches and Options

## **Using BitGen**

BitGen produces a bitstream for Xilinx device configuration. After the design has been completely routed, it is necessary to configure the device so that it can execute the desired function. The Xilinx bitstream necessary to configure the device is generated with BitGen. BitGen takes a fully routed NCD (Circuit Description) file as its input and produces a configuration bitstream—a binary file with a .bit extension.

The BIT file contains all of the configuration information from the NCD file defining the internal logic and interconnections of the FPGA, plus device-specific information from other files associated with the target device. The binary data in the BIT file can then be downloaded into the FPGA memory cells, or it can be used to create a PROM file (see Figure A-1).



X9227



В

## **BitGen Syntax**

The following syntax creates a bitstream from your NCD file.

bitgen [options] infile[.ncd] [outfile] [pcf\_file]

options is one or more of the options listed in the "BitGen Options" on page 409.

*Infile* is the name of the NCD design for which you want to create the bitstream. You can specify only one design file, and it must be the first file specified on the command line.

You do not have to use an extension. If you do not, **.ncd** is assumed. If you do use an extension, it must be **.ncd**.

*Outfile* is the name of the output file. If you do not specify an output file name, BitGen creates one in the same directory as the input file. If you specify -l on the command line, the extension is .ll (see -l command line option). If you specify -m (see -m command line option), the extension is .msk. If you specify -b, the extension is .rbt. Otherwise the extension is .bit. If you do not specify an extension, BitGen appends one according to the aforementioned rules. If you do include an extension, it must also conform to the rules.

*Pcf\_file* is the name of a physical constraints (PCF) file. BitGen uses this file to determine which nets in the design are critical for tiedown, which is not available for Virtex families. BitGen automatically reads the .pcf file by default. If the physical constraints file is the second file specified on the command line, it must have a .pcf extension. If it is the third file specified, the extension is optional; .pcf is assumed. If a .pcf file name is specified, it must exist, otherwise the input design name with a .pcf extension is read if that file exists.

A report file containing all BitGen's output is automatically created under the same directory as the output file. The report file has the same root name as the output file with a .bgn extension.

## **BitGen Files**

This section describes input files that BitGen requires and output files that BitGen generates.

### Input Files

Input to BitGen consists of the following files.

- NCD file—a physical description of the design mapped, placed and routed in the target device. The NCD file must be fully routed.
- PCF—an optional user-modifiable ASCII Physical Constraints File. If you specify a PCF file on the BitGen command line, BitGen uses this file to determine which nets in the design are critical for tiedown (not used for Virtex families).

### **Output Files**

Output from BitGen consists of the following files.

- BIT file—a binary file with a .bit extension. The BIT file contains all of the configuration information from the NCD file defining the internal logic and interconnections of the FPGA, plus device-specific information from other files associated with the target device. The binary data in the BIT file can then be downloaded into the FPGA memory cells, or it can be used to create a PROM file (see "Using PROMGen" on page 413).
- RBT file—an optional "rawbits" file with an .rbt extension. The rawbits file is ASCII ones and zeros representing the data in the bitstream file. If you enter a -b option on the BitGen command line, an RBT file is produced in addition to the binary BIT file (see "-b (Create Rawbits File)" on page 409).
- LL file—an optional ASCII logic allocation file with a .ll extension. The logic allocation file indicates the bitstream position of latches, flip-flops, and IOB inputs and outputs. A .ll file is produced if you enter a -l option on the BitGen command line (see "-l (Create a Logic Allocation File)" on page 413).

- MSK file—an optional mask file with an .msk extension. This file is used to compare relevant bit locations for executing a readback of configuration data contained in an operating FPGA. A MSK file is produced if you enter a -m option on the BitGen command line (see "-m (Generate a Mask File)" on page 413).
- BGN file—a report file containing information about the BitGen run.
- DRC file—a Design Rule Check (DRC) file for the design. A DRC runs and the DRC file is produced unless you enter a -d option on the BitGen command line (see "-d (Do Not Run DRC)" on page 409).

## **BitGen Options**

Following is a description of command line options and how they affect BitGen behavior.

## -b (Create Rawbits File)

Create a "rawbits" (*file\_name*.rbt) file. The rawbits file consists of ASCII ones and zeros representing the data in the bitstream file.

If you are using a microprocessor to configure a single FPGA, you can include the rawbits file in the source code as a text file to represent the configuration data. The sequence of characters in the rawbits file is the same as the sequence of bits written into the FPGA.

## -d (Do Not Run DRC)

Do not run DRC (Design Rule Check). Without the -d option, BitGen runs a DRC and saves the DRC results in two output files: the BitGen report file (*file\_name*.bgn) and the DRC file (*file\_name*.drc). If you enter the -d option, no DRC information appears in the report file and no DRC file is produced.

Running DRC before a bitstream is produced detects any errors that could cause the FPGA to malfunction. If DRC does not detect any errors, BitGen produces a bitstream file (unless you use the -j option described in the "-j (No BIT File)" on page 413).

## -f (Execute Commands File)

### -f command\_file

The -f option executes the command line arguments in the specified *command\_file*.

## -g (Set Configuration)

### -g option:setting

The -g option specifies the startup timing and other bitstream options for Xilinx FPGAs. The settings for the -g option depend on the design's architecture. These options have the following syntax.

### Compress

Enable bitstream compression using multiple frame writes (MFW).

### Readback

This allows the user to perform Readback by the creating the necessary bitstream (**.rbb** file).

## CRC

Virtex-II allows the user to enable or disable the CRC checking. If CRC checking is disabled, a CBC (Constant Bit Check) is used instead.

Settings: Enable, Disable

Default: Enable

В

#### DebugBitstream

This option creates a modified bitstream which loads each frame individually, and places an LOUT write after each, for debugging purposes. This option should be used only in Master or Slave Serial downloads.

Settings: Yes, No

Default: No

#### ConfigRate

Virtex-II devices use an internal oscillator to generate CCLK when configuring in Master SelectMAP or Master Serial modes. This option sets the CCLK rate in MHz.

Settings: 4,5,6,7,8,10,13,15,20,26,30,34,41,45,51,55,60,130

Default: 4

## StartupClk

The last few cycles of configuration is called the startup sequence. The startup sequence can be clocked by CCLK signal, a User clock (connected to the STARTUP block), or TCK (the JTAG clock).

Settings: CCLK, UserClk, JTAGClk

Default: CCLK

#### PowerdownStatus

This options allows the user to choose whether the DONE pin is used as the PowerDown pin after configuration.

Settings: Enable, Disable

Default: Enable

#### DCMShutdown

If the DCMShutdown option is enabled, the DCM resets if the SHUTDOWN and AGHIGH commands are performed.

Settings: Enable, Disable

Default: Enable

#### CclkPin

This option selects an internal pullup on the CCLK pin.

Settings: Pullnone, Pullup

Default: Pullup

#### DonePin

This option selects an internal pullup on the DONE pin.

Settings: Pullnone, Pullup

Default: Pullup

#### M0Pin

This option selects an internal pullup or pulldown on the M0 (Mode 0) pin.

Settings: Pullnone, Pullup, Pulldown

Default: Pullup

#### M1Pin

This option selects an internal pullup or pulldown on the M1 (Mode 1) pin. Settings: Pullnone, Pullup, Pulldown Default: Pullup

### M2Pin

5

This option selects an internal pullup or pulldown on the M2 (Mode 2) pin. Settings: Pullnone, Pullup, Pulldown

Default: Pullup

## ProgPin

This options selects an internal pullup on the PROGRAM pin.

Settings: Pullnone, Pullup

Default: Pullup

### TckPin

This option selects an internal pullup or pulldown on the TCK (JTAG Clock) pin.

Settings: Pullnone, Pullup, Pulldown

Default: Pullup

#### TdiPin

This option selects an internal pullup or pulldown on the TDI (JTAG Input) pin.

Settings: Pullnone, Pullup, Pulldown

Default: Pullup

#### TdoPin

This option selects an internal pullup or pulldown on the TDO (JTAG Output) pin.

Settings: Pullnone, Pullup, Pulldown

Default: Pullnone

#### TmsPin

This option selects an internal pullup or pulldown on the TMS (JTAG Mode Select) pin.

Settings: Pullnone, Pullup, Pulldown

Default: Pullup

#### UnusedPin

This option selects an internal pullup or pulldown on all unused I/Os.

Settings: Pullnone, Pullup, Pulldown

Default: Pulldown

### GWE\_cycle

Selects the startup phase that asserts the internal write enable to flip-flops, LUT RAMs, shift registers, and BRAMs. Before the startup phase both BRAM writing and reading are disabled. The Done setting asserts GWE when the DoneIn signal is high. DoneIn is either the value of the DONE pin or a delayed version if DonePipe=Yes. The Keep setting is used to keep the current value of the GWE signal.

Settings: 1, 2, 3, 4, 5, 6, Done, Keep

Default: 6

### GTS\_cycle

Selects the startup phase that releases the internal 3-state control to the I/O buffers. The Done setting releases GTSA when the DoneIn signal is high. DoneIn is either the value of the DONE pin or a delayed version if DonePipe=Yes. The Keep setting is used to keep the current value of the GTS signal.

Settings: 1, 2, 3, 4, 5, 6, Done, Keep Default: 5 Selects the startup phase to wait until DCM locks are asserted.

Settings: 0, 1, 2, 3, 4, 5, 6, NoWait Default: NoWait

MATCH\_cycle

Selects the startup phase to wait until DCI locks are asserted.

Settings: 0, 1, 2, 3, 4, 5, 6, NoWait

Default: NoWait

#### DONE\_cycle

Selects the startup phase that activates the FPGA DONE signal. DONE is delayed when DonePipe=Yes.

Settings: 1, 2, 3, 4, 5, 6

Default: 4

#### Persist

This option is needed for Readback and Partial Reconfiguration using the configuration pins. If Persist=Yes, all the configuration pins used retain their function. Which configuration pins are persisted is determined by the mode pin settings. If a serial mode is chosen, the persisted pins would be INIT, DOUT, and DIN. If a SelectMAP mode is chosen, the persisted pins would be INIT, BUSY, D0-D7, CS, and WRITE.

Settings: Yes, No

Default: No

#### DriveDone

This option actively drives the DONE pin high as opposed to an open-drain driver. Take care when setting DriveDone=Yes in daisy chain applications.

Settings: Yes, No

Default: No

#### DonePipe

This option is intended for use with FPGAs being set up in a high-speed daisy chain configuration. When set to Yes, the FPGA waits on the DONE pin, and waits for the first StartupClk edge before moving to the Done state.

Settings: Yes, No

Default: No

#### Security

This options selects the level of bitstream security. Selecting Level 1 disables Readback, and selecting Level 2 disables Readback and reconfiguration.

Settings: Level1, Level2, None

Default: None

#### UserID

The user can enter up to an 8-digit hexadecimal code (32-bit value) in the UserID register. You can use the register to identify implementation or design revisions.

Settings: <any 8-digit hex string>

Default: 0xFFFFFFFF

## -h or -help (Command Usage)

#### -h architecture

Displays a usage message for BitGen. The usage message displays all available options for BitGen operating on the specified *architecture*.

## -j (No BIT File)

Do not create a bitstream file (.bit file). This option is generally used when you want to generate a report without producing a bitstream. For example, if you wanted to run DRC without producing a bitstream file, you would use the -j option.

Note: The .msk or .rbt files might still be created.

## -I (Create a Logic Allocation File)

This option creates an ASCII logic allocation file (*design*.**ll**) for the selected design. The logic allocation file indicates the bitstream position of latches, flip-flops, and IOB inputs and outputs.

In some applications, you may want to observe the contents of the FPGA internal registers at different times. The file created by the -l option helps you identify which bits in the current bitstream represent outputs of flip-flops and latches. Bits are referenced by frame and bit number within the frame.

The Hardware Debugger uses the **design.ll** file to locate signal values inside a readback bitstream.

## -m (Generate a Mask File)

Creates a mask file. This file is used to compare relevant bit locations for executing a readback of configuration data contained in an operating FPGA.

## -w (Overwrite Existing Output File)

Enables you to overwrite an existing BIT, LL, MSK, or RBT output file.

## **Using PROMGen**

The PROMGen program is compatible with the following families.

• Virtex/Virtex-E/Virtex-II

PROMGen formats a BitGen-generated configuration bitstream (BIT) file into a PROM format file (Figure A-2).

The PROM file contains configuration data for the FPGA device. PROMGen converts a BIT file into one of three PROM formats: MCS-86 (Intel), EXORMAX (Motorola), or TEKHEX (Tektronix). It can also generate a Hex file format.



There are two functionally equivalent versions of PROMGen. There is a stand-alone version you can access from an operating system prompt. You can also access an interactive version, called the PROM File Formatter, from inside the Design Manager for Alliance or the Project Manager in Foundation. This chapter describes the stand-alone version; the interactive version is described in the *PROM File Formatter Guide*.

You can also use PROMGen to concatenate bitstream files to daisy-chain FPGAs.

Note: If the destination PROM is one of the Xilinx Serial PROMs, you are using a Xilinx PROM Programmer, and the FPGAs are not being daisy-chained, it is not necessary to make a PROM file. See the *Hardware User Guide* for more information about daisy-chained designs

## PROMGen Syntax

Use the following syntax to start PROMGen from the operating system prompt:

promgen [options]

*Options* can be any number of the options listed in"PROMGen Options" on page 415. Separate multiple options with spaces.

## **PROMGen Files**

This section describes the PROMGen input and output files.

#### Input Files

The input to PROMGEN consists of BIT files— one or more bitstream files. BIT files contain configuration data for an FPGA design.

#### Output Files

Output from PROMGEN consists of the following files.

- PROM files—The file or files containing the PROM configuration information. Depending on the PROM file format used by the PROM programmer, you can output a TEK, MCS, or EXO file. If you are using a microprocessor to configure your devices, you can output a HEX file, containing a hexadecimal representation of the bitstream.
- PRM file—The PRM file is a PROM image file. It contains a memory map of the output PROM file. The file has a **.prm** extension.

## Bit Swapping in PROM Files

PROMGen produces a PROM file in which the bits within a byte are swapped compared to the bits in the input BIT file. Bit swapping (also called "bit mirroring") reverses the bits within each byte, as shown in Figure A-3.



Figure A-3: Bit Swapping

In a bitstream contained in a BIT file, the Least Significant Bit (LSB) is always on the left side of a byte. But when a PROM programmer or a microprocessor reads a data byte, it identifies the LSB on the right side of the byte. In order for the PROM programmer or microprocessor to read the bitstream correctly, the bits in each byte must first be swapped so they are read in the correct order.

In this release of the Xilinx Development System, the bits are swapped for all of the PROM formats: MCS, EXO, and TEK. For a HEX file output, bit swapping is on by default, but it can be turned off by entering a -b PROMGen option that is available only for HEX file format.

## **PROMGen Options**

This section describes the options that are available for the PROMGen command.

### -b (Disable Bit Swapping—HEX Format Only)

This option only applies if the -p option specifies a HEX file for the output of PROMGen. By default (no -b option), bits in the HEX file are swapped compared to bits in the input BIT files. If you enter a -b option, the bits are not swapped. Bit swapping is described in "Bit Swapping in PROM Files" on page 415.

#### -c (Checksum)

#### promgen -c

The -c option generates a checksum value appearing in the .prm file. This value should match the checksum in the prom programmer. Use this option to verify that correct data was programmed into the prom.

### -d (Load Downward)

promgen -d hexaddress0 filename filename...

This option loads one or more BIT files from the starting address in a downward direction. Specifying several files after this option causes the files to be concatenated in a daisy chain. You can specify multiple -d options to load files at different addresses. You must specify this option immediately before the input bitstream file.

The multiple file syntax is as follows:

promgen -d hexaddress0 filename filename...

В

promgen -d hexaddress1 filename -d hexaddress2 filename...

#### -f (Execute Commands File)

```
-f command_file
```

The -f option executes the command line arguments in the specified *command\_file*.

#### -help (Command Help)

This option displays help that describes the PROMGen options.

#### I option (Disable Length Count)

#### promgen -1

The -l option disables the length counter in the FPGA bitstream. It is valid only for 4000EX, 4000XL, 4000XLA, 4000XV, and SpartanXL Devices. Use this option when chaining together bitstreams exceeding the 24 bit limit imposed by the length counter.

#### -n (Add BIT Flles)

-n file1[.bit] file2[.bit]...

This option loads one or more BIT files up or down from the next available address following the previous load. The first -n option *must* follow a -u or -d option because -n does not establish a direction. Files specified with this option are not daisy-chained to previous files. Files are loaded in the direction established by the nearest prior -u, -d, or -n option.

The following syntax shows how to specify multiple files. When you specify multiple files, PROMGen daisy-chains the files.

promgen -d hexaddress file0 -n file1 file2...

The following syntax when using multiple -n options prevents the files from being daisychained:

promgen -d hexaddress file0 -n file1 -n file2...

#### -o (Output File Name)

-o file1[.ext] file2[.ext]...

This option specifies the output file name of a PROM if it is different from the default. If you do not specify an output file name, the PROM file has the same name as the first BIT file loaded.

*ext* is the extension for the applicable PROM format.

Multiple file names may be specified to split the information into multiple files. If only one name is supplied for split PROM files (by you or by default), the output PROM files are named *file\_#.ext*, where *file* is the base name, *#* is 0, 1, etc., and *ext* is the extension for the applicable PROM format.

promgen -d hexaddress file0 -o filename

#### -p (PROM Format)

-p {mcs | exo | tek | hex}

This option sets the PROM format to one of the following: MCS (Intel MCS86), EXO (Motorola EXORMAX), TEK (Tektronix TEKHEX). The option may also produce a HEX file, which is a hexadecimal representation of the configuration bitstream used for microprocessor downloads. If specified, the -p option must precede any -u, -d, or -n options. The default format is MCS.

5

## -r (Load PROM File)

-r promfile

This option reads an existing PROM file as input instead of a BIT file. All of the PROMGen output options may be used, so the -r option can be used for splitting an existing PROM file into multiple PROM files or for converting an existing PROM file to another format.

### -s (PROM Size)

-s promsize1 promsize2...

This option sets the PROM size in kilobytes. The PROM size must be a power of 2. The default value is 64 kilobytes. The -s option must precede any -u, -d, or -n options.

Multiple *promsize* entries for the -s option indicates the PROM will be split into multiple PROM files.

Note: PROMGen PROM sizes are specified in bytes. *The Programmable Logic Data Book* specifies PROM sizes in bits for Xilinx serial PROMs (see -x option).

### -u (Load Upward)

-u hexaddress0 filename1 filename2...

This option loads one or more BIT files from the starting address in an upward direction. When you specify several files after this option, PROMGen concatenates the files in a daisy chain. You can load files at different addresses by specifying multiple -u options.

This option must be specified immediately before the input bitstream file.

## -x (Specify Xilinx PROM)

-x xilinx\_prom1 xilinx\_prom2...

The -x option specifies one or more Xilinx serial PROMs for which the PROM files are targeted. Use this option instead of the -s option if you know the Xilinx PROMs to use.

Multiple xilinx\_*prom* entries for the -x option indicates the PROM will be split into multiple PROM files.

## **Examples**

To load the file test.bit up from address 0x0000 in MCS format, enter the following information at the command line.

promgen -u 0 test

To daisy-chain the files test1.bit and test2.bit up from address 0x0000 and the files test3.bit and test4.bit from address 0x4000 while using a 32K PROM and the Motorola EXORmax format, enter the following information at the command line.

```
promgen -s 32 -p exo -u 00 test1 test2 -u 4000 test3 test4
```

To load the file test.bit into the PROM programmer in a downward direction starting at address 0x400, using a Xilinx XC1718D PROM, enter the following information at the command line.

```
promgen -x xc1718d -d 0x400 test
```

To specify a PROM file name that is different from the default file name enter the following information at the command line.

promgen options filename -o newfilename

В

:



## Appendix B

## **XC18V00 Series PROMs**

This appendix contains package specifications for the XC18V00 Series of In-System Programmable Configuration PROMs, as well as the XC18V00 Series product specification (DS026). The latest version of this information is available online (at <u>www.xilinx.com</u>).

## **PROM Package Specifications**

This section contains specifications for the following Virtex-II packages:

- PC20-84 Specification
- SO20 Specification
- VQ44 Specification

C



#### NDTES:

- 1. ALL DIMENSIONS AND TOLERANCES CONFORM TO ANSI Y14.5M-1982.
- 2. DIMENSIONS 'D1' AND 'E1' DO NOT INCLUDE MOLD FLASH OR PROTRUSIONS, MOLD FLASH OR PROTRUSIONS SHALL NOT EXCEED .010 PER SIDE.
- 3. 'N' IS NUMBER DF TERMINALS.
- 4. CONFORM TO JEDEC MO-047
- 5. TOP OF PACKAGE MAY BE SMALLER THAN BOTTOM BY .010".

20, 28, 44, 68 and 84-PIN PLCC (PC20 THRU PC84)

UG002\_app\_01\_111600

2

#### Figure B-1: PC20-84 Specification

## SO20 Specification

2





UG002\_app\_02\_111600

#### Figure B-2: SO20 Specification

www.xilinx.com 1-800-255-7778 С

## VQ44 Specification



| 0.15 | NOIES: |  |
|------|--------|--|
|      |        |  |

- 1. ALL DIMENSIONS AND TOLERANCES CONFORM TO ANSI Y14.5M-1982.
- 2. DIMENSIONS D1 AND E1 DO NOT INCLUDE MOLD PROTRUSION. ALLOWABLE MOLD PROTRUSION SHALL NOT EXCEED 0.25mm PER SIDE.
- 3. THE TOP OF PACKAGE MAY BE SMALLER THAN THE BOTTOM OF PACKAGE BY 0.15mm.

## 44, 64, 100-PIN PLASTIC VERY THIN QFP (VQ44, VQ64, VQ100)

0.05

0.95

0.09 - -----

0.45

the

×

0.10

1.00

16.00 BSC.

14.00 BSC.

0.17 0.22 0.27

0.50 BSC.

the

×

100

JEDEC MS-026-AED

0.60 0.75

1.05

0.20

0.08

0.08

UG002\_app\_04\_111600

#### Figure B-3: VQ44 Specification

0.05

0.95

0.09

0.45

z

z

 $A_1$ 

Az

D/E

 $D_1/E_1$ 

b

с

e

L

CCC

ddd

Ν

REF.

z

1.00

12.00 BSC

10.00 BSC

0.80 BSC.

×

0.60

ž

z

44

JEDEC MS-026-ACB

0.30 0.37

0.15

1.05

0.45

0.20

0.75

0.10

0.20

0.05

0.95

0.17

0.09

0.45

×

×

0.10

1.00

12.00 BSC.

10.00 BSC

×.

0.50 BSC

0.60

the

z

64

JEDEC MS-026-ACD

0.22 0.27

0.15

1.05

0.20

0.75

0.08

0.08

# 

#### DS026 (v3.9) November 18, 2002

## XC18V00 Series In-System Programmable Configuration PROMs

#### Product Specification

## **Features**

- In-system programmable 3.3V PROMs for configuration of Xilinx FPGAs
  - Endurance of 20,000 program/erase cycles
  - Program/erase over full commercial/industrial voltage and temperature range
- IEEE Std 1149.1 boundary-scan (JTAG) support
- Simple interface to the FPGA
- Cascadable for storing longer or multiple bitstreams
- Low-power advanced CMOS FLASH process

- Dual configuration modes
  - Serial Slow/Fast configuration (up to 33 MHz)
  - Parallel (up to 264 Mb/s at 33 MHz)
- 5V tolerant I/O pins accept 5V, 3.3V and 2.5V signals
- 3.3V or 2.5V output capability
- Available in PC20, SO20, PC44 and VQ44 packages
- Design support using the Xilinx Alliance and Foundation series software packages.
- JTAG command initiation of standard FPGA configuration

## Description

Xilinx introduces the XC18V00 series of in-system programmable configuration PROMs (Figure 1). Initial devices in this 3.3V family are a 4-megabit, a 2-megabit, a 1-megabit, a 512-Kbit, and a 256-Kbit PROM that provide an easy-to-use, cost-effective method for re-programming and storing large Xilinx FPGA or CPLD configuration bitstreams.

When the FPGA is in Master Serial mode, it generates a configuration clock that drives the PROM. A short access time after  $\overline{CE}$  and OE are enabled, data is available on the PROM DATA (D0) pin that is connected to the FPGA D<sub>IN</sub> pin. New data is available a short access time after each rising clock edge. The FPGA generates the appropriate number of clock pulses to complete the configuration. When the FPGA is in Slave Serial mode, the PROM and the FPGA are clocked by an external clock.

When the FPGA is in Master-SelectMAP mode, the FPGA generates a configuration clock that drives the PROM. When the FPGA is in Slave-Parallel or Slave-SelectMAP Mode, an external oscillator generates the configuration clock that drives the PROM and the FPGA. After  $\overline{CE}$  and OE are enabled, data is available on the PROMs DATA (D0-D7) pins. New data is available a short access time after each rising clock edge. The data is clocked into the FPGA on the following rising edge of the CCLK. Neither Slave-Parallel nor SelectMAP utilize a Length Count, so a free-running oscillator can be used in the Slave-Parallel or Slave-SelecMAP modes.

Multiple devices can be concatenated by using the  $\overline{CEO}$  output to drive the  $\overline{CE}$  input of the following device. The clock inputs and the DATA outputs of all PROMs in this chain are interconnected. All devices are compatible and can be cascaded with other members of the family or with the XC17V00 one-time programmable Serial PROM family.

© 2001, 2002 Xilinx, Inc. All rights reserved. All Xilinx trademarks, registered trademarks, patents, and further disclaimers are as listed at <a href="http://www.xilinx.com/legal.htm">http://www.xilinx.com/legal.htm</a>. All other trademarks and registered trademarks are the property of their respective owners. All specifications are subject to change without notice.

NOTICE OF DISCLAIMER: Xilinx is providing this design, code, or information "as is." By providing the design, code, or information as one possible implementation of this feature, application, or standard, Xilinx makes no representation that this implementation is free from any claims of infringement. You are responsible for obtaining any rights you may require for your implementation. Xilinx expressly disclaims any warranty whatsoever with respect to the adequacy of the implementation, including but not limited to any warranties or representations that this implementation is free from any any implied warranties of merchantability or fitness for a particular purpose.

www.xilinx.com 1-800-255-7778



Figure 1: XC18V00 Series Block Diagram

## **Pinout and Pin Description**

Pins not listed are "no connects."

#### Table 1: Pin Names and Descriptions

| Pin<br>Name | Boundary<br>Scan<br>Order | Function         | Pin Description                                                         | 44-pin<br>VQFP | 44-pin<br>PLCC | 20-pin<br>SOIC &<br>PLCC |
|-------------|---------------------------|------------------|-------------------------------------------------------------------------|----------------|----------------|--------------------------|
| D0          | 4                         | DATA OUT         | D0 is the DATA output pin to provide data for                           | 40             | 2              | 1                        |
|             | 3                         | OUTPUT<br>ENABLE | configuring an FPGA in serial mode.                                     |                |                |                          |
| D1          | 6                         | DATA OUT         | D0-D7 are the output pins to provide parallel                           | 29             | 35             | 16                       |
|             | 5                         | OUTPUT<br>ENABLE | data for configuring a Xilinx FPGA in<br>Slave-Parallel/SelectMap mode. |                |                |                          |
| D2          | 2                         | DATA OUT         |                                                                         | 42             | 4              | 2                        |
|             | 1                         | OUTPUT<br>ENABLE |                                                                         |                |                |                          |
| D3          | 8                         | DATA OUT         |                                                                         | 27             | 33             | 15                       |
|             | 7                         | OUTPUT<br>ENABLE |                                                                         |                |                |                          |
| D4          | 24                        | DATA OUT         |                                                                         | 9              | 15             | 7(1)                     |
|             | 23                        | OUTPUT<br>ENABLE |                                                                         |                |                |                          |
| D5          | 10                        | DATA OUT         |                                                                         | 25             | 31             | 14                       |
|             | 9                         | OUTPUT<br>ENABLE |                                                                         |                |                |                          |
| D6          | 17                        | DATA OUT         |                                                                         | 14             | 20             | 9                        |
|             | 16                        | OUTPUT<br>ENABLE |                                                                         |                |                |                          |
| D7          | 14                        | DATA OUT         |                                                                         | 19             | 25             | 12                       |
|             | 13                        | OUTPUT<br>ENABLE |                                                                         |                |                |                          |

| Pin<br>Name     | Boundary<br>Scan<br>Order | Function         | Pin Description                                                                                                                                                                                                                          | 44-pin<br>VQFP    | 44-pin<br>PLCC    | 20-pin<br>SOIC &<br>PLCC |
|-----------------|---------------------------|------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------|-------------------|--------------------------|
| CLK             | 0                         | DATA IN          | Each rising edge on the CLK input increments the internal address counter if both $\overline{CE}$ is Low and OE/RESET is High.                                                                                                           | 43                | 5                 | 3                        |
|                 | 20                        | DATA IN          | When Low, this input holds the address                                                                                                                                                                                                   | 13                | 19                | 8                        |
| nese i          | 19                        | DATA OUT         | high-impedance state. This is a bidirectional                                                                                                                                                                                            |                   |                   |                          |
|                 | 18                        | OUTPUT<br>ENABLE | PROM is reset. Polarity is NOT programmable.                                                                                                                                                                                             |                   |                   |                          |
| CE              | 15                        | DATA IN          | When $\overline{CE}$ is High, the device is put into<br>low-power standby mode, the address<br>counter is reset, and the DATA pins are put in<br>a high-impedance state.                                                                 | 15                | 21                | 10                       |
| CF              | 22                        | DATA OUT         | Allows JTAG CONFIG instruction to initiate                                                                                                                                                                                               | 10                | 16                | 7(1)                     |
|                 | 21                        | OUTPUT<br>ENABLE | FPGA. This is an open-drain output that is pulsed Low by the JTAG CONFIG command.                                                                                                                                                        |                   |                   |                          |
| CEO             | 12                        | DATA OUT         | Chip Enable Output (CEO) is connected to                                                                                                                                                                                                 | 21                | 27                | 13                       |
|                 | 11                        | OUTPUT<br>ENABLE | This output is Low when CE is Low and<br>OE/RESET input is High, AND the internal<br>address counter has been incremented<br>beyond its Terminal Count (TC) value. CEO<br>returns to High when OE/RESET goes Low or<br>CE goes High.     |                   |                   |                          |
| GND             |                           |                  | GND is the ground connection.                                                                                                                                                                                                            | 6, 18, 28 &<br>41 | 3, 12, 24<br>& 34 | 11                       |
| TMS             |                           | MODE<br>SELECT   | The state of TMS on the rising edge of TCK determines the state transitions at the Test Access Port (TAP) controller. TMS has an internal 50K ohm resistive pull-up on it to provide a logic "1" to the device if the pin is not driven. | 5                 | 11                | 5                        |
| ТСК             |                           | CLOCK            | This pin is the JTAG test clock. It sequences<br>the TAP controller and all the JTAG test and<br>programming electronics.                                                                                                                | 7                 | 13                | 6                        |
| TDI             |                           | data in          | This pin is the serial input to all JTAG<br>instruction and data registers. TDI has an<br>internal 50K ohm resistive pull-up on it to<br>provide a logic "1" to the system if the pin is<br>not driven.                                  | 3                 | 9                 | 4                        |
| TDO             |                           | DATA OUT         | This pin is the serial output for all JTAG instruction and data registers. TDO has an internal 50K ohm resistive pull-up on it to provide a logic "1" to the system if the pin is not driven.                                            | 31                | 37                | 17                       |
| V <sub>CC</sub> |                           |                  | Positive 3.3V supply voltage for internal logic and input buffers.                                                                                                                                                                       | 17, 35 &<br>38    | 23, 41 &<br>44    | 18 & 20                  |

## Table 1: Pin Names and Descriptions (Continued)

| Pin<br>Name      | Boundary<br>Scan<br>Order | Function | Pin Description                                                               | 44-pin<br>VQFP                                                             | 44-pin<br>PLCC                                                               | 20-pin<br>SOIC &<br>PLCC |
|------------------|---------------------------|----------|-------------------------------------------------------------------------------|----------------------------------------------------------------------------|------------------------------------------------------------------------------|--------------------------|
| V <sub>CCO</sub> |                           |          | Positive 3.3V or 2.5V supply voltage connected to the output voltage drivers. | 8, 16, 26 &<br>36                                                          | 14, 22,<br>32 & 42                                                           | 19                       |
| NC               |                           |          | No connects.                                                                  | 1, 2, 4,<br>11, 12, 20,<br>22, 23, 24,<br>30, 32, 33,<br>34, 37, 39,<br>44 | 1, 6, 7, 8,<br>10, 17,<br>18, 26,<br>28, 29,<br>30, 36,<br>38, 39,<br>40, 43 |                          |

#### Table 1: Pin Names and Descriptions (Continued)

#### Notes:

1. By default, pin 7 is the D4 pin in the 20-pin packages. However, CF --> D4 programming option can be set to override the default and route the CF function to pin 7 in the Serial mode.

## **Pinout Diagrams**







## Xilinx FPGAs and Compatible PROMs

Table 2 provides a list of Xilinx FPGAs and compatiblePROMs.

| Device   | Configuration<br>Bits | XC18V00<br>Solution        |
|----------|-----------------------|----------------------------|
| XC2VP2   | 1,305,440             | XC18V02                    |
| XC2VP4   | 3,006,560             | XC18V04                    |
| XC2VP7   | 4,485,472             | XC18V04 +<br>XC18V512      |
| XC2VP20  | 8,214,624             | 2 of XC18V04               |
| XC2VP30  | 11,364,608            | 3 of XC18V04               |
| XC2VP40  | 15,563,264            | 4 of XC18V04               |
| XC2VP50  | 19,021,472            | 5 of XC18V04               |
| XC2VP70  | 25,604,096            | 6 of XC18V04 +<br>XC18V512 |
| XC2VP100 | 33,645,312            | 8 of XC18V04 +<br>XC18V256 |
| XC2VP125 | 42,782,208            | 10 of XC18V04 +<br>XC18V01 |
| XC2V40   | 360,096               | XC18V512                   |
| XC2V80   | 635,296               | XC18V01                    |
| XC2V250  | 1,697,184             | XC18V02                    |
| XC2V500  | 2,761,888             | XC18V04                    |
| XC2V1000 | 4,082,592             | XC18V04                    |
| XC2V1500 | 5,659,296             | XC18V04<br>+ XC18V02       |
| XC2V2000 | 7,492,000             | 2 of XC18V04               |
| XC2V3000 | 10,494,368            | 3 of XC18V04               |

| Tahle | 2. | Xiliny | <b>FPG</b> As | and | Com   | natihle | PROMs       |
|-------|----|--------|---------------|-----|-------|---------|-------------|
| Table | ۷. |        | I F GAS       | anu | COIII | patible | F I I O WIS |

## Table 2: Xilinx FPGAs and Compatible PROMs

| Device   | Configuration<br>Bits | XC18V00<br>Solution       |
|----------|-----------------------|---------------------------|
| XC2V4000 | 15,659,936            | 4 of XC18V04              |
| XC2V6000 | 21,849,504            | 5 of XC18V04 +<br>XC18V02 |
| XC2V8000 | 29,063,072            | 7 of XC18V04              |
| XCV50    | 559,200               | XC18V01                   |
| XCV100   | 781,216               | XC18V01                   |
| XCV150   | 1,040,096             | XC18V01                   |
| XCV200   | 1,335,840             | XC18V02                   |
| XCV300   | 1,751,808             | XC18V02                   |
| XCV400   | 2,546,048             | XC18V04                   |
| XCV600   | 3,607,968             | XC18V04                   |
| XCV800   | 4,715,616             | XC18V04 +<br>XC18V512     |
| XCV1000  | 6,127,744             | XC18V04 +<br>XC18V02      |
| XCV50E   | 630,048               | XC18V01                   |
| XCV100E  | 863,840               | XC18V01                   |
| XCV200E  | 1,442,016             | XC18V02                   |
| XCV300E  | 1,875,648             | XC18V02                   |
| XCV400E  | 2,693,440             | XC18V04                   |
| XCV405E  | 3,430,400             | XC18V04                   |
| XCV600E  | 3,961,632             | XC18V04                   |
| XCV812E  | 6,519,648             | 2 of XC18V04              |
| XCV1000E | 6,587,520             | 2 of XC18V04              |

|          | •                     | -                   |  |  |
|----------|-----------------------|---------------------|--|--|
| Device   | Configuration<br>Bits | XC18V00<br>Solution |  |  |
| XCV1600E | 8,308,992             | 2 of XC18V04        |  |  |
| XCV2000E | 10,159,648            | 3 of XC18V04        |  |  |
| XCV2600E | 12,922,336            | 4 of XC18V04        |  |  |
| XCV3200E | 16,283,712            | 4 of XC18V04        |  |  |
| XC2S15   | 197,696               | XC18V256            |  |  |
| XC2S30   | 336,768               | XC18V512            |  |  |
| XC2S50   | 559,200               | XC18V01             |  |  |
| XC2S100  | 781,216               | XC18V01             |  |  |
| XC2S150  | 1,040,096             | XC18V01             |  |  |
| XC2S200  | 1,335,840             | XC18V02             |  |  |
| XC2S50E  | 630,048               | XC18V01             |  |  |
| XC2S100E | 863,840               | XC18V01             |  |  |
| XC2S150E | 1,134,496             | XC18V02             |  |  |
| XC2S200E | 1,442,016             | XC18V02             |  |  |
| XC2S300E | 1,875,648             | XC18V02             |  |  |
| XC2S400E | 2,693,440             | XC18V04             |  |  |
| XC2S600E | 3,961,632             | XC18V04             |  |  |

#### Table 2: Xilinx FPGAs and Compatible PROMs

## Capacity

| Devices  | Configuration Bits |
|----------|--------------------|
| XC18V04  | 4,194,304          |
| XC18V02  | 2,097,152          |
| XC18V01  | 1,048,576          |
| XC18V512 | 524,288            |
| XC18V256 | 262,144            |

## **In-System Programming**

In-System Programmable PROMs can be programmed individually, or two or more can be daisy-chained together and programmed in-system via the standard 4-pin JTAG protocol as shown in Figure 2. In-system programming offers quick and efficient design iterations and eliminates unnecessary package handling or socketing of devices. The Xilinx development system provides the programming data sequence using either Xilinx iMPACT software and a download cable, a third-party JTAG development system, a JTAG-compatible board tester, or a simple microprocessor

interface that emulates the JTAG instruction sequence. The iMPACT software also outputs serial vector format (SVF) files for use with any tools that accept SVF format and with automatic test equipment.

All outputs are held in a high-impedance state or held at clamp levels during in-system programming.

## **OE/RESET**

The ISP programming algorithm requires issuance of a reset that causes OE to go Low.

## **External Programming**

Xilinx reprogrammable PROMs can also be programmed by the Xilinx HW-130 or a third-party device programmer. This provides the added flexibility of using pre-programmed devices with an in-system programmable option for future enhancements and design changes.

## **Reliability and Endurance**

Xilinx in-system programmable products provide a guaranteed endurance level of 20,000 in-system program/erase cycles and a minimum data retention of 20 years. Each device meets all functional, performance, and data retention specifications within this endurance limit.

## **Design Security**

The Xilinx in-system programmable PROM devices incorporate advanced data security features to fully protect the programming data against unauthorized reading via JTAG. Table 3 shows the security setting available.

The read security bit can be set by the user to prevent the internal programming pattern from being read or copied via JTAG. When set, it allows device erase. Erasing the entire device is the only way to reset the read security bit.

#### Table 3: Data Security Options

| Default = Reset       | Set                     |
|-----------------------|-------------------------|
| Read Allowed          | Read Inhibited via JTAG |
| Program/Erase Allowed | Program/Erase Allowed   |
| Verify Allowed        | Verify Inhibited        |





## IEEE 1149.1 Boundary-Scan (JTAG)

The XC18V00 family is fully compliant with the IEEE Std. 1149.1 Boundary-Scan, also known as JTAG. A Test Access Port (TAP) and registers are provided to support all required boundary scan instructions, as well as many of the optional instructions specified by IEEE Std. 1149.1. In addition, the JTAG interface is used to implement in-system programming (ISP) to facilitate configuration, erasure, and verification operations on the XC18V00 device.

Table 4 lists the required and optional boundary-scan instructions supported in the XC18V00. Refer to the IEEE Std. 1149.1 specification for a complete description of boundary-scan architecture and the required and optional instructions.

## **Instruction Register**

The Instruction Register (IR) for the XC18V00 is eight bits wide and is connected between TDI and TDO during an instruction scan sequence. In preparation for an instruction scan sequence, the instruction register is parallel loaded with a fixed instruction capture pattern. This pattern is shifted out onto TDO (LSB first), while an instruction is shifted into the instruction register from TDI. The detailed composition of the instruction capture pattern is illustrated in Figure 3.

The ISP Status field, IR(4), contains logic "1" if the device is currently in ISP mode; otherwise, it contains logic "0". The Security field, IR(3), contains logic "1" if the device has been programmed with the security option turned on; otherwise, it contains logic "0".

| Boundary-Scan<br>Command      | Binary<br>Code [7:0]  | Description                                                     |  |  |  |  |
|-------------------------------|-----------------------|-----------------------------------------------------------------|--|--|--|--|
| Required Instruct             | Required Instructions |                                                                 |  |  |  |  |
| BYPASS                        | 11111111              | Enables BYPASS                                                  |  |  |  |  |
| SAMPLE/<br>PRELOAD            | 00000001              | Enables boundary-scan<br>SAMPLE/PRELOAD operation               |  |  |  |  |
| EXTEST                        | 00000000              | Enables boundary-scan<br>EXTEST operation                       |  |  |  |  |
| <b>Optional Instructi</b>     | ons                   |                                                                 |  |  |  |  |
| CLAMP                         | 11111010              | Enables boundary-scan<br>CLAMP operation                        |  |  |  |  |
| HIGHZ                         | 11111100              | all outputs in high-impedance state simultaneously              |  |  |  |  |
| IDCODE                        | 11111110              | Enables shifting out<br>32-bit IDCODE                           |  |  |  |  |
| USERCODE                      | 11111101              | Enables shifting out<br>32-bit USERCODE                         |  |  |  |  |
| XC18V00 Specific Instructions |                       |                                                                 |  |  |  |  |
| CONFIG                        | 11101110              | Initiates FPGA configuration by pulsing $\overline{CF}$ pin Low |  |  |  |  |

#### Table 4: Boundary Scan Instructions

|       | IR[7:5] | IR[4]         | IR[3]    | IR[2] | IR[1:0] |       |
|-------|---------|---------------|----------|-------|---------|-------|
| TDI-> | 000     | ISP<br>Status | Security | 0     | 01      | ->TDO |

#### Notes:

1. IR(1:0) = 01 is specified by IEEE Std. 1149.1

Figure 3: Instruction Register Values Loaded into IR as Part of an Instruction Scan Sequence

## **Boundary Scan Register**

The boundary-scan register is used to control and observe the state of the device pins during the EXTEST, SAM-PLE/PRELOAD, and CLAMP instructions. Each output pin on the XC18V00 has two register stages that contribute to the boundary-scan register, while each input pin only has one register stage.

For each output pin, the register stage nearest to TDI controls and observes the output state, and the second stage closest to TDO controls and observes the High-Z enable state of the pin.

For each input pin, the register stage controls and observes the input state of the pin.

## **Identification Registers**

The IDCODE is a fixed, vendor-assigned value that is used to electrically identify the manufacturer and type of the device being addressed. The IDCODE register is 32 bits wide. The IDCODE register can be shifted out for examination by using the IDCODE instruction. The IDCODE is available to any other system component via JTAG.

The IDCODE register has the following binary format:

vvvv:ffff:ffff:aaaa:aaaa:cccc:ccccl

#### where

v = the die version number

f = the family code (50h for XC18V00 family)

a = the ISP PROM product ID (26h for the XC18V04)

c = the company code (49h for Xilinx)

**Note**: The LSB of the IDCODE register is always read as logic "1" as defined by IEEE Std. 1149.1.

 Table 5 lists the IDCODE register values for the XC18V00 devices.

| Table 5: IDCODES As | signed to XC18V00 Devices |
|---------------------|---------------------------|
|---------------------|---------------------------|

| ISP-PROM | IDCODE    |
|----------|-----------|
| XC18V01  | 05024093h |
| XC18V02  | 05025093h |
| XC18V04  | 05026093h |
| XC18V256 | 05022093h |
| XC18V512 | 05023093h |

The USERCODE instruction gives access to a 32-bit user programmable scratch pad typically used to supply information about the device's programmed contents. By using the USERCODE instruction, a user-programmable identification code can be shifted out for examination. This code is loaded into the USERCODE register during programming of the XC18V00 device. If the device is blank or was not loaded during programming, the USERCODE register contains FFFFFFFh.

## **XC18V00 TAP Characteristics**

The XC18V00 family performs both in-system programming and IEEE 1149.1 boundary-scan (JTAG) testing via a single 4-wire Test Access Port (TAP). This simplifies system designs and allows standard Automatic Test Equipment to perform both functions. The AC characteristics of the XC18V00 TAP are described as follows.

## **TAP** Timing

Figure 4 shows the timing relationships of the TAP signals. These TAP timing characteristics are identical for both boundary-scan and ISP operations.



Figure 4: Test Access Port Timing

## **TAP AC Parameters**

Table 6 shows the timing parameters for the TAP waveforms shown in Figure 4

| Symbol              | Parameter                             | Min | Max | Units |
|---------------------|---------------------------------------|-----|-----|-------|
| T <sub>CKMIN1</sub> | TCK minimum clock period              | 100 | -   | ns    |
| T <sub>CKMIN2</sub> | TCK minimum clock period, Bypass Mode | 50  | -   | ns    |
| T <sub>MSS</sub>    | TMS setup time                        | 10  | -   | ns    |
| T <sub>MSH</sub>    | TMS hold time                         | 25  | -   | ns    |
| T <sub>DIS</sub>    | TDI setup time                        | 10  | -   | ns    |
| T <sub>DIH</sub>    | TDI hold time                         | 25  | -   | ns    |
| T <sub>DOV</sub>    | TDO valid delay                       | -   | 25  | ns    |

| Table | 6· | Test Acce | ess Port   | Timina | Parameters |
|-------|----|-----------|------------|--------|------------|
| Table | υ. | ICSLACCO  | 533 F UI L | inning | rarameters |

## **Connecting Configuration PROMs**

Connecting the FPGA device with the configuration PROM (see Figure 5 and Figure 6).

- The DATA output(s) of the PROM(s) drives the D<sub>IN</sub> input of the lead FPGA device.
- The Master FPGA CCLK output drives the CLK input(s) of the PROM(s) (in Master-Serial and Master-SelectMAP modes only).
- The CEO output of a PROM drives the CE input of the next PROM in a daisy chain (if any).
- The OE/RESET pins of all PROMs are connected to the INIT pins of all FPGA devices. This connection assures that the PROM address counter is reset before the start of any (re)configuration, even when a reconfiguration is initiated by a V<sub>CC</sub> glitch.
- The PROM CE input can be driven from the DONE pin. The CE input of the first (or only) PROM can be driven by the DONE output of all target FPGA devices, provided that DONE is not permanently grounded. CE can also be permanently tied Low, but this keeps the DATA output active and causes an unnecessary supply current of 10 mA maximum.
- Slave-Parallel/SelectMap mode is similar to slave serial mode. The DATA is clocked out of the PROM one byte per CCLK instead of one bit per CCLK cycle. See FPGA data sheets for special configuration requirements.

## **Initiating FPGA Configuration**

The XC18V00 devices incorporate a pin named  $\overline{CF}$  that is controllable through the JTAG CONFIG instruction. Executing the CONFIG instruction through JTAG pulses the  $\overline{CF}$  low for 300-500 ns, which resets the FPGA and initiates configuration.

The  $\overline{CF}$  pin must be connected to the  $\overline{PROGRAM}$  pin on the FPGA(s) to use this feature.

The iMPACT software can also issue a JTAG CONFIG command to initiate FPGA configuration through the "Load FPGA" setting.

The 20-pin packages do not have a dedicated  $\overline{CF}$  pin. For 20-pin packages, the CF --> D4 setting can be used to route the  $\overline{CF}$  pin function to pin 7 only if the parallel output mode is *not* used.

## **Selecting Configuration Modes**

The XC18V00 accommodates serial and parallel methods of configuration. The configuration modes are selectable through a user control register in the XC18V00 device. This control register is accessible through JTAG, and is set using the "Parallel mode" setting on the Xilinx iMPACT software. Serial output is the default configuration mode.

## **Master Serial Mode Summary**

The I/O and logic functions of the Configurable Logic Block (CLB) and their associated interconnections are established by a configuration program. The program is loaded either automatically upon power up, or on command, depending on the state of the three FPGA mode pins. In Master Serial mode, the FPGA automatically loads the configuration program from an external memory. Xilinx PROMs are designed to accommodate the Master Serial mode.

Upon power-up or reconfiguration, an FPGA enters the Master Serial mode whenever all three of the FPGA mode-select pins are Low (M0=0, M1=0, M2=0). Data is read from the PROM sequentially on a single data line. Synchronization is provided by the rising edge of the temporary signal CCLK, which is generated by the FPGA during configuration.

Master Serial Mode provides a simple configuration interface. Only a serial data line, a clock line, and two control lines are required to configure an FPGA. Data from the PROM is read sequentially, accessed via the internal address and bit counters which are incremented on every valid rising edge of CCLK. If the user-programmable, dual-function  $D_{IN}$  pin on the FPGA is used only for configuration, it must still be held at a defined level during normal operation. The Xilinx FPGA families take care of this automatically with an on-chip pull-up resistor.

## **Cascading Configuration PROMs**

For multiple FPGAs configured as a serial daisy-chain, or a single FPGA requiring larger configuration memories in a serial or SelectMAP configuration mode, cascaded PROMs provide additional memory (Figure 5). Multiple XC18V00 devices can be concatenated by using the  $\overline{CEO}$  output to drive the  $\overline{CE}$  input of the downstream device. The clock inputs and the data outputs of all XC18V00 devices in the chain are interconnected. After the last data from the first PROM is read, the next clock signal to the PROM asserts its  $\overline{CEO}$  output Low and drives its DATA line to a high-impedance state. The second PROM recognizes the Low level on its  $\overline{CE}$  input and enables its DATA output. See Figure 7.

After configuration is complete, address counters of all cascaded PROMs are reset if the PROM  $OE/\overline{RESET}$  pin goes Low or  $\overline{CE}$  goes High.


(1) For Mode pin connections and DONE pin pullup value, refer to appropriate FPGA data sheet.

DS026\_08\_090502





\* For Mode pin connections, refer to appropriate FPGA data sheet. \*\*Master/Slave Serial Mode does not require D[1:7].

DS026\_08\_090502

Figure 6: Configuring Multiple Virtex-II Devices with Identical Patterns in Master/Slave or Serial/SelectMAP Modes



(1) For Mode pin connections and Done pullup value, refer to the appropriate FPGA data sheet.

#### (a) Master Serial Mode



(1)  $\overline{\text{CS}}$  and  $\overline{\text{WRITE}}$  must be either driven Low or pulled down externally. One option is shown.

(2) For Mode pin connections and Done pullup value, refer to the appropriate FPGA data sheet.

(3) External oscillator required for Virtex/Virtex-E SelectMAP or Virtex-II/Virtex-II Pro Slave-SelectMAP modes.

#### (b) Virtex/Virtex-E/Virtex-II/Virtex-II Pro SelectMAP Mode



DS026\_05\_090502



### **Reset Activation**

On power up, OE/RESET is held low until the XC18V00 is active (1 ms). OE/RESET is connected to an external resistor to pull OE/RESET HIGH releasing the FPGA INIT and allowing configuration to begin. If the power drops below 2.0V, the PROM resets. OE/RESET polarity is *not* programmable. See Figure 8 for power-up requirements.



Figure 8: V<sub>CC</sub> Power-Up Requirements

Table 7: Truth Table for PROM Control Inputs

### **Standby Mode**

The PROM enters a low-power standby mode whenever  $\overline{CE}$  is asserted High. The address is reset. The output remains in a high-impedance state regardless of the state of the OE input. JTAG pins TMS, TDI and TDO can be in a high-impedance state or High.

#### **5V Tolerant I/Os**

The I/Os on each re-programmable PROM are fully 5V tolerant even through the core power supply is 3.3V. This allows 5V CMOS signals to connect directly to the PROM inputs without damage. In addition, the 3.3V V<sub>CC</sub> power supply can be applied before or after 5V signals are applied to the I/Os. In mixed 5V/3.3V/2.5V systems, the user pins, the core power supply (V<sub>CC</sub>), and the output power supply (V<sub>CCO</sub>) can have power applied in any order. This makes the PROM devices immune to power supply sequencing issues.

#### **Customer Control Bits**

The XC18V00 PROMs have various control bits accessible by the customer. These can be set after the array has been programmed using "Skip User Array" in Xilinx iMPACT software. See Table 7.

| Control Ir | nputs |                                                                                         | Outputs          |             |                   |
|------------|-------|-----------------------------------------------------------------------------------------|------------------|-------------|-------------------|
| OE/RESET   | CE    | Internal Address                                                                        | DATA             | CEO         | Icc               |
| High       | Low   | If address $\leq TC^{(1)}$ : increment<br>If address > TC <sup>(1)</sup> : don't change | Active<br>High-Z | High<br>Low | Active<br>Reduced |
| Low        | Low   | Held reset                                                                              | High-Z           | High        | Active            |
| High       | High  | Held reset                                                                              | High-Z           | High        | Standby           |
| Low        | High  | Held reset                                                                              | High-Z           | High        | Standby           |

#### Notes:

1. TC = Terminal Count = highest address value. TC + 1 = address 0.

#### Absolute Maximum Ratings<sup>(1,2)</sup>

| Symbol           | Description                                    | Value        | Units |
|------------------|------------------------------------------------|--------------|-------|
| V <sub>CC</sub>  | Supply voltage relative to GND                 | -0.5 to +4.0 | V     |
| V <sub>IN</sub>  | Input voltage with respect to GND              | -0.5 to +5.5 | V     |
| V <sub>TS</sub>  | Voltage applied to High-Z output               | -0.5 to +5.5 | V     |
| T <sub>STG</sub> | Storage temperature (ambient)                  | -65 to +150  | °C    |
| T <sub>SOL</sub> | Maximum soldering temperature (10s @ 1/16 in.) | +260         | °C    |
| TJ               | Junction temperature                           | +150         | °C    |

#### Notes:

1. Maximum DC undershoot below GND must be limited to either 0.5V or 10 mA, whichever is easier to achieve. During transitions, the device pins can undershoot to -2.0V or overshoot to +7.0V, provided this over- or undershoot lasts less then 10 ns and with the forcing current being limited to 200 mA.

2. Stresses beyond those listed under Absolute Maximum Ratings might cause permanent damage to the device. These are stress ratings only, and functional operation of the device at these or any other conditions beyond those listed under Operating Conditions is not implied. Exposure to Absolute Maximum Ratings conditions for extended periods of time might affect device reliability.

## **Recommended Operating Conditions**

| Symbol             | Parameter                                                         |            | Min | Max              | Units |
|--------------------|-------------------------------------------------------------------|------------|-----|------------------|-------|
| V <sub>CCINT</sub> | Internal voltage supply ( $T_A = 0^{\circ}C$ to +70°C) Commercial |            |     | 3.6              | V     |
|                    | Internal voltage supply ( $T_A = -40^{\circ}C$ to +85°C)          | Industrial | 3.0 | 3.6              | V     |
| V <sub>CCO</sub>   | Supply voltage for output drivers for 3.3V operation              |            |     | 3.6              | V     |
|                    | Supply voltage for output drivers for 2.5V operation              |            |     | 2.7              | V     |
| V <sub>IL</sub>    | Low-level input voltage                                           | 0          | 0.8 | V                |       |
| V <sub>IH</sub>    | High-level input voltage                                          |            |     | 5.5              | V     |
| V <sub>O</sub>     | Output voltage                                                    |            |     | V <sub>CCO</sub> | V     |
| T <sub>VCC</sub>   | $V_{CC}$ rise time from 0V to nominal voltage <sup>(1)</sup>      | 1          | 50  | ms               |       |

#### Notes:

 At power up, the device requires the V<sub>CC</sub> power supply to monotonically rise from 0V to nominal voltage within the specified V<sub>CC</sub> rise time. If the power supply cannot meet this requirement, then the device might not perform power-on-reset properly. See Figure 8.

#### **Quality and Reliability Characteristics**

| Symbol           | Description                      | Min    | Мах | Units  |
|------------------|----------------------------------|--------|-----|--------|
| T <sub>DR</sub>  | Data retention                   | 20     | -   | Years  |
| N <sub>PE</sub>  | Program/erase cycles (Endurance) | 20,000 | -   | Cycles |
| V <sub>ESD</sub> | Electrostatic discharge (ESD)    | 2,000  | -   | Volts  |

#### **DC Characteristics Over Operating Conditions**

| Symbol          | Parameter                                  | Test Conditions           | Min                  | Max | Units |
|-----------------|--------------------------------------------|---------------------------|----------------------|-----|-------|
| V <sub>OH</sub> | High-level output voltage for 3.3V outputs | I <sub>OH</sub> = -4 mA   | 2.4                  | -   | V     |
|                 | High-level output voltage for 2.5V outputs | I <sub>OH</sub> = -500 μA | 90% V <sub>CCO</sub> | -   | V     |

| Symbol                                  | Parameter                                 | Test Conditions                                                   | Min  | Max | Units |
|-----------------------------------------|-------------------------------------------|-------------------------------------------------------------------|------|-----|-------|
| V <sub>OL</sub>                         | Low-level output voltage for 3.3V outputs | I <sub>OL</sub> = 8 mA                                            | -    | 0.4 | V     |
|                                         | Low-level output voltage for 2.5V outputs | I <sub>OL</sub> = 500 μA                                          | -    | 0.4 | V     |
| I <sub>CC</sub>                         | Supply current, active mode 25 MHz        |                                                                   | -    | 25  | mA    |
| I <sub>CCS</sub>                        | Supply current, standby mode              |                                                                   | -    | 10  | mA    |
| I <sub>ILJ</sub>                        | JTAG pins TMS, TDI, and TDO               | V <sub>CC =</sub> MAX<br>V <sub>IN</sub> = GND                    | -100 | -   | μA    |
| IL                                      | Input leakage current                     | V <sub>CC</sub> = Max<br>V <sub>IN</sub> = GND or V <sub>CC</sub> | -10  | 10  | μA    |
| IIH                                     | Input and output High-Z leakage current   | V <sub>CC</sub> = Max<br>V <sub>IN</sub> = GND or V <sub>CC</sub> | -10  | 10  | μA    |
| C <sub>IN</sub> and<br>C <sub>OUT</sub> | Input and output capacitance              | V <sub>IN</sub> = GND<br>f = 1.0 MHz                              | -    | 10  | pF    |

## AC Characteristics Over Operating Conditions for XC18V04 and XC18V02



| Symbol           | Description                                                        | Min | Max | Units |
|------------------|--------------------------------------------------------------------|-----|-----|-------|
| T <sub>OE</sub>  | OE/RESET to data delay                                             | -   | 10  | ns    |
| T <sub>CE</sub>  | CE to data delay                                                   | -   | 20  | ns    |
| T <sub>CAC</sub> | CLK to data delay                                                  | -   | 20  | ns    |
| Т <sub>ОН</sub>  | Data hold from $\overline{CE}$ , $OE/\overline{RESET}$ , or $CLK$  | 0   | -   | ns    |
| T <sub>DF</sub>  | $\overline{CE}$ or OE/RESET to data float delay <sup>(2)</sup>     | -   | 25  | ns    |
| T <sub>CYC</sub> | Clock periods                                                      | 50  | -   | ns    |
| T <sub>LC</sub>  | CLK Low time <sup>(3)</sup>                                        | 10  | -   | ns    |
| T <sub>HC</sub>  | CLK High time <sup>(3)</sup>                                       | 10  | -   | ns    |
| T <sub>SCE</sub> | CE setup time to CLK (guarantees proper counting) <sup>(3)</sup>   | 25  | -   | ns    |
| T <sub>HCE</sub> | T <sub>HCE</sub> CE High time (guarantees counters are reset) 20 - |     | ns  |       |
| T <sub>HOE</sub> | OE/RESET hold time (guarantees counters are reset)                 | 25  | -   | ns    |

#### Notes:

- AC test load = 50 pF. 1.
- Float delays are measured with 5 pF AC loads. Transition is measured at ±200 mV from steady state active levels. 2.
- З. Guaranteed by design, not tested.
- 4. All AC parameters are measured with  $V_{\text{IL}}$  = 0.0V and  $V_{\text{IH}}$  = 3.0V.
- 5.
- If T<sub>HCE</sub> High < 2  $\mu$ s, T<sub>CE</sub> = 2  $\mu$ s. If T<sub>HCE</sub> Low < 2  $\mu$ s, T<sub>OE</sub> = 2  $\mu$ s. 6.

#### AC Characteristics Over Operating Conditions for XC18V01, XC18V512, and XC18V256



| Symbol           | Description                                                      | Min | Max | Units |
|------------------|------------------------------------------------------------------|-----|-----|-------|
| T <sub>OE</sub>  | OE/RESET to data delay                                           | -   | 10  | ns    |
| T <sub>CE</sub>  | CE to data delay                                                 | -   | 15  | ns    |
| T <sub>CAC</sub> | CLK to data delay                                                | -   | 15  | ns    |
| Т <sub>ОН</sub>  | Data hold from $\overline{CE}$ , OE/RESET, or CLK                | 0   | -   | ns    |
| T <sub>DF</sub>  | $\overline{CE}$ or OE/RESET to data float delay <sup>(2)</sup>   | -   | 25  | ns    |
| T <sub>CYC</sub> | Clock periods                                                    | 30  | -   | ns    |
| T <sub>LC</sub>  | CLK Low time <sup>(3)</sup>                                      | 10  | -   | ns    |
| T <sub>HC</sub>  | CLK High time <sup>(3)</sup>                                     | 10  | -   | ns    |
| T <sub>SCE</sub> | CE setup time to CLK (guarantees proper counting) <sup>(3)</sup> | 20  | -   | ns    |
| T <sub>HCE</sub> | CE High time (guarantees counters are reset)                     | 20  | -   | ns    |
| T <sub>HOE</sub> | OE/RESET hold time (guarantees counters are reset)               | 20  | -   | ns    |

#### Notes:

- 1. AC test load = 50 pF.
- 2. Float delays are measured with 5 pF AC loads. Transition is measured at ±200 mV from steady state active levels.
- 3. Guaranteed by design, not tested.
- All AC parameters are measured with  $V_{\text{IL}}$  = 0.0V and  $V_{\text{IH}}$  = 3.0V. 4.
- If T<sub>HCE</sub> High < 2  $\mu$ s, T<sub>CE</sub> = 2  $\mu$ s. If T<sub>HOE</sub> High < 2  $\mu$ s, T<sub>OE</sub> = 2  $\mu$ s. 5.
- 6.

# AC Characteristics Over Operating Conditions When Cascading for XC18V04 and XC18V02



DS026\_07\_020300

| Symbol           | Description                              | Min | Мах | Units |
|------------------|------------------------------------------|-----|-----|-------|
| T <sub>CDF</sub> | CLK to data float delay <sup>(2,3)</sup> | -   | 25  | ns    |
| т <sub>оск</sub> | CLK to CEO delay <sup>(3)</sup>          | -   | 20  | ns    |
| T <sub>OCE</sub> | CE to CEO delay <sup>(3)</sup>           | -   | 20  | ns    |
| T <sub>OOE</sub> | OE/RESET to CEO delay <sup>(3)</sup>     | -   | 20  | ns    |

#### Notes:

1. AC test load = 50 pF.

2. Float delays are measured with 5 pF AC loads. Transition is measured at ±200 mV from steady state active levels.

3. Guaranteed by design, not tested.

4. All AC parameters are measured with  $V_{\rm IL}$  = 0.0V and  $V_{\rm IH}$  = 3.0V.

## AC Characteristics Over Operating Conditions When Cascading for XC18V01, XC18V512, and XC18V256



DS026\_07\_020300

| Symbol           | Description                                                    | Min | Мах | Units |
|------------------|----------------------------------------------------------------|-----|-----|-------|
| T <sub>CDF</sub> | CLK to data float delay <sup>(2,3)</sup>                       | -   | 25  | ns    |
| Т <sub>ОСК</sub> | CLK to CEO delay <sup>(3)</sup>                                | -   | 20  | ns    |
| T <sub>OCE</sub> | CE to CEO delay <sup>(3)</sup>                                 | -   | 20  | ns    |
| T <sub>OOE</sub> | $OE/\overline{RESET}$ to $\overline{CEO}$ delay <sup>(3)</sup> | -   | 20  | ns    |

#### Notes:

- 1. AC test load = 50 pF.
- 2. Float delays are measured with 5 pF AC loads. Transition is measured at ±200 mV from steady state active levels.
- 3. Guaranteed by design, not tested.
- 4. All AC parameters are measured with  $V_{\rm IL}$  = 0.0V and  $V_{\rm IH}$  = 3.0V.

#### **Ordering Information**



#### Notes:

- 1. XC18V04 and XC18V02 only.
- 2. XC18V01, XC18V512, and XC18V256 only.

#### **Valid Ordering Combinations**

| XC18V04VQ44C | XC18V02VQ44C | XC18V01VQ44C | XC18V512VQ44C | XC18V256VQ44C |
|--------------|--------------|--------------|---------------|---------------|
| XC18V04PC44C | XC18V02PC44C | XC18V01PC20C | XC18V512PC20C | XC18V256PC20C |
|              |              | XC18V01SO20C | XC18V512SO20C | XC18V256SO20C |
| XC18V04VQ44I | XC18V02VQ44I | XC18V01VQ44I | XC18V512VQ44I | XC18V256VQ44I |
| XC18V04PC44I | XC18V02PC44I | XC18V01PC20I | XC18V512PC20I | XC18V256PC20I |
|              |              | XC18V01SO20I | XC18V512SO20I | XC18V256SO20I |

#### **Marking Information**



#### 20-pin Package<sup>(1)</sup>

Due to the small size of the commercial serial PROM packages, the complete ordering part number cannot be marked on the package. The XC prefix is deleted and the package code is simplified. Device marking is as follows:



1. XC18V01, XC18V512, and XC18V256 only.

## **Revision History**

The following table shows the revision history for this document.

| Date     | Version | Revision                                                                                                                                                                                                                                                                                                                        |
|----------|---------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| 2/9/99   | 1.0     | First publication of this early access specification                                                                                                                                                                                                                                                                            |
| 8/23/99  | 1.1     | Edited text, changed marking, added CF and parallel load                                                                                                                                                                                                                                                                        |
| 9/1/99   | 1.2     | Corrected JTAG order, Security and Endurance data.                                                                                                                                                                                                                                                                              |
| 9/16/99  | 1.3     | Corrected SelectMAP diagram, control inputs, reset polarity. Added JTAG and $\overline{CF}$ description, 256 Kbit and 128 Kbit devices.                                                                                                                                                                                         |
| 01/20/00 | 2.0     | Added Q44 Package, changed XC18xx to XC18Vxx                                                                                                                                                                                                                                                                                    |
| 02/18/00 | 2.1     | Updated JTAG configuration, AC and DC characteristics                                                                                                                                                                                                                                                                           |
| 04/04/00 | 2.2     | Removed stand alone resistor on INIT pin in Figure 5. Added Virtex-E and EM parts to FPGA table.                                                                                                                                                                                                                                |
| 06/29/00 | 2.3     | Removed XC18V128 and updated format. Added AC characteristics for XC18V01, XC18V512, and XC18V256 densities.                                                                                                                                                                                                                    |
| 11/13/00 | 2.4     | Features: changed 264 MHz to 264 Mb/s at 33 MHz; AC Spec.: $T_{SCE}$ units to ns, $T_{HCE}$ CE High time units to $\mu$ s. Removed Standby Mode statement: "The lower power standby modes available on some XC18V00 devices are set by the user in the programming software". Changed 10,000 cycles endurance to 20,000 cycles. |
| 01/15/01 | 2.5     | Updated Figures 5 and 6, added 4.7 resistors. Identification registers: changes ISP PROM product ID from 06h to 26h.                                                                                                                                                                                                            |
| 04/04/01 | 2.6     | Updated Figure 6, Virtex SelectMAP mode; added XC2V products to Compatible PROM table; changed Endurance from 10,000 cycles, 10 years to 20,000, 20 years;                                                                                                                                                                      |
| 04/30/01 | 2.7     | Updated Figure 6: removed Virtex-E in Note 2, fixed SelectMAP mode connections.<br>Under AC Characteristics Over Operating Conditions for XC18V04 and XC18V02, changed T <sub>SCE</sub> from 25 ms to 25 ns.                                                                                                                    |
| 06/11/01 | 2.8     | AC Characteristics Over Operating Conditions for XC18V01, XC18V512, and XC18V256. Changed Min values for $T_{SCE}$ from 20 ms to 20 ns and for $T_{HCE}$ from 2 ms to 2 $\mu$ s.                                                                                                                                                |
| 09/28/01 | 2.9     | Changed the boundary scan order for the CEO pin in Table 1, updated the configuration bits values in the table under Xilinx FPGAs and Compatible PROMs, and added information to the Recommended Operating Conditions table.                                                                                                    |
| 11/12/01 | 3.0     | Updated for Spartan-IIE FPGA family.                                                                                                                                                                                                                                                                                            |
| 12/06/01 | 3.1     | Changed Figure 7(c).                                                                                                                                                                                                                                                                                                            |
| 02/27/02 | 3.2     | Updated Table 2 and Figure 6 for the Virtex-II Pro family of devices.                                                                                                                                                                                                                                                           |
| 03/15/02 | 3.3     | Updated Xilinx software and modified Figure 6 and Figure 7.                                                                                                                                                                                                                                                                     |
| 03/27/02 | 3.4     | Made changes to pages 1-3, 5, 7-11, 13, 14, and 18. Added new Figure 8 and Figure 9.                                                                                                                                                                                                                                            |
| 06/14/02 | 3.5     | Made additions and changes to Table 2.                                                                                                                                                                                                                                                                                          |
| 07/24/02 | 3.6     | Changed last bullet under Connecting Configuration PROMs, page 9.                                                                                                                                                                                                                                                               |
| 09/06/02 | 3.7     | Multiple minor changes throughout, plus the addition of <b>Pinout Diagrams</b> , page 4 and the deletion of Figure 9.                                                                                                                                                                                                           |
| 10/31/02 | 3.8     | Made minor change on Figure 7 (b) and changed orientation of SO20 diagram on page 5.                                                                                                                                                                                                                                            |
| 11/18/02 | 3.9     | Added XC2S400E and XC2S600E to Table 2.                                                                                                                                                                                                                                                                                         |



## Appendix C

## Glossary

#### AQL

Acceptable quality level. The relative number of devices, expressed in parts-per-million (ppm), that might not meet specification or might be defective. Typical values are around 10 ppm,

#### ASIC

Application-specific integrated circuit, also called a gate array.

#### asynchronous

Logic that is not synchronized by a clock. Asynchronous designs can be faster than synchronous ones, but are more sensitive to parametric changes, and are thus less robust.

#### ATM

Asynchronous transfer mode. A very-high-speed (megahertz to gigahertz) connectionoriented bit-serial protocol for transmitting data and real-time voice and video in fixedlength packets (480byte payload, 5-byte header).

#### back annotation

Automatically attaching timing values to the entered design format after the design has been placed and routed in a field-programmable gate array (FPGA).

#### behavioral language

Top-down description from an even higher level than VDHL.

#### bitstream

The bitstream is a binary representation of an implemented FPGA design. The bitstream is generated by Xilinx bit generation tools (BitGen and Makebits) and is denoted with the **.bit** extension. For information on creating BIT files, refer to the *Hardware Debugger Reference/User Guide*.

#### block RAM

An 18-Kbit block of random access memory (RAM) inside the Virtex-II device. Dual-port and synchronous operation are desirable.

#### block SelectRAM

Fully-synchronous, dual-port memories in the Virtex-II FPGAs. Each of these memories contain 18 x 1024 (18,432) bits. The organization of each memory is configurable. Block SelectRAM resources complement smaller, distributed, LUT-based SelectRAM resources.

#### Boundary Scan interface

One of the configuration interfaces on the Virtex device. This is a bit-serial interface. The Boundary Scan interface is also known as the JTAG port. Also see *SelectMAP interface*.

#### capture data

The flip-flop and pad data saved from the logic cells and I/O blocks into the bitstream for readback. Use the CAPTURE\_VIRTEX primitive in your HDL code to specify the trigger and clock for the capture operation.

#### compiler

Software that converts a higher-language description into a lower-level representation. For FPGAs, the complete partition, place, and process.

#### configurable logic block (CLB)

Xilinx-specific name for a block of logic surrounded by routing resources. The functional elements for constructing logic circuits. The Virtex-II CLB is made up of four slices, and each slice contains two Logic Cells.

#### configuration file

The internally stored file that controls the FPGA sot hat it performs the desired logic function. Also, the act of loading an FPGA with that file. That is, the process of programming Xilinx SRAM-based FPGAs with a bitstream.

#### configuration bitstream

Configuration commands with configuration data.

#### configuration clock (CCLK)

During configuration, the configuration clock (CCLK) is an output in Master modes or in the Asynchronous Peripheral mode but is an input in Slave, Synchronous Peripheral, Express, and SelectMAP/Slave Serial modes. After configuration, CCLK has a weak pullup and can be selected as the readback clock.

#### configuration commands

Instructions for the Virtex-II device. There are two classes of Configuration Command — Major and Minor. The Major Commands read and write data to configuration registers in the Virtex-II device. The Minor commands instruct the Virtex-II configuration logic to perform specific functions.

#### configuration data

Bits that directly define the state of programmable logic. These are written to a Virtex-II device in a configuration bitstream, and read as readback data from a Virtex-II device.

#### configuration frame

The configuration bits in a Virtex-II device are organized in columns. A column of CLBs with the I/O blocks above and below the CLBs contain 48 frames of configuration bits. The smallest number of bits that can be read or written through the configuration interfaces is one frame.

#### configuration interface

A logical interface on the Virtex-II device through which configuration commands and data can be read and written. A interface consists of one or more physical device pins.

#### configuration readback

The operation of reading configuration data (also known as readback data) from a Virtex-II device.

#### constraints

Performance requirements imposed on the design, usually in the form of maximum allowable delay, or the required operating frequency.

## CS pin

The  $\overline{CS}$  pin is the Chip Enable pin for Virtex-II devices. It is used only in SelectMAP mode. When  $\overline{CS}$  is asserted (Low) the device examines data on the Data bus. When  $\overline{CS}$  is deasserted (High), all CCLK transitions are ignored.

#### DataFrame

A DataFrame is a block of configuration data. A configuration bit-stream contains many such frames, each with a start bit and stop bits. Also see *configuration frame*.

#### device pin

One of the electrical connections on the package containing the Virtex-II device.

#### digital signal processing (DSP)

The manipulation of analog data that has been sampled and converted into a digital representation. Examples are filtering, convolution, Fast-Fourier-Transform, and so on.

#### **DIN** pin

During serial configuration, the DIN pin is the serial configuration data input receiving data on the rising edge of CCLK. During parallel configuration, DIN is the D0 input. After configuration, DIN is a user-programmable I/O pin.

#### DONE pin

The DONE pin on a Xilinx FPGA is a bidirectional signal with an optional internal pull-up resistor. As an output, it indicates the completion of the configuration process. As an input, a low level on DONE can be configured to delay the global logic initialization and the enabling of outputs.

#### DOUT pin

During configuration in any mode except Express and SelectMAP, the DOUT pin is the serial configuration data output that can drive the DIN pin of daisy-chained slave FPGAs. DOUT data changes on the rising edge of CCLK, one-and-a-half CCLK periods after it is received at the DIN pin (in Master Serial Mode only).

#### DOUT/BUSY pin

For Virtex-II devices, the DOUT/BUSY pin has a dual purpose, depending on device mode. When the device is in Serial mode, this pin functions as DOUT. When the device is in SelectMAP/Slave Parallel mode, this pin functions as a handshaking signal. If BUSY is asserted (High) on a rising edge of CCLK, the data is not seen on the data bus, and should be held until the data is accepted.

#### dynamic random access memory (DRAM)

A low-cost read-write memory where data is stored on capacitors and must be refreshed periodically. DRAMs are usually addressed by a sequence of two addresses, row address, and column address, which makes them slower and more difficult to use than SRAMs. Also see *SRAM*.

#### electronic data interchange format (EDIF)

Industry standard for specifying a logic design in text (ASCII) form.

#### electrostatic discharge (ESD)

High-voltage discharge can rupture the input transistor gate oxide. ESD-protection diodes divert the current to the supply leads.

#### failure in time (FIT)

Describes the number of device failures statistically expected for a certain number of device-hours. Expressed as failures per one billion (10<sup>9</sup>) device hours. Device temperature must be specified. Mean time between failure (MTBF) can be calculated from FIT. 10 FITs are good; 100 FITs are bad.

#### first-in first-out (FIFO)

FIFO memory where data is stored in the incoming sequence and is read out in the same sequence. Input and output can be asynchronous to each other. A FIFO needs no external addresses, although all modern FIFOs are implemented internally with RAMs driven by circular read and write counters.

#### flash

Non-volatile programmable technology, and alternative to electrically-erasable programmable read-only memory (EEPROM) technology. The memory content can be erased by an electrical signal. This allows in-system programmability and eliminates the need for ultraviolet light and quartz windows in the package.

#### flip-flop

Single-bit storage cell that samples its data input at the active (rising or falling) clock edge, and then presents the new state on its Q output after that clock edge, holding it there until after the next active clock edge.

#### frame

Also see *configuration frame*.

#### field programmable gate array (FPGA)

An integrated circuit that contains configurable (programmable) logic blocks and configurable interconnect between these blocks. Xilinx FPGAs are SRAM- based programmable logic devices (PLDs).

#### function generator

Also called a look-up table (LUT), with N inputs and one output. Can implement any logic function of its N inputs. N can be between 3 and 6; 4-input function generators are most popular.

#### gate

Smallest logic element with several inputs and one output. The AND gate output is High when all inputs are High. The OR gate output is High when at least one input is High. The NAND gate output is Low when all inputs are High. A 2-input NAND gate is used as the measurement unit for gate array complexity.

#### gate array

ASIC where transistors are predefined, and only the interconnect pattern is customized for the individual application.

#### graphical user interface (GUI)

The way of representing the computer output on the screen as graphics, pictures, icons, and windows. Pioneered by Xerox and the Apple Macintosh, now universally adopted, e.g., by Windows95 and others.

#### HDL

Hardware Description Language.

#### HardWire

Xilinx name for a low-cost derivative of an FPGA, where the configuration is fixed, but functionality and footprint are identical with the original FPGA-based design.

#### HDC pin

The High during configuration (HDC) pin is driven High until the I/Os become active in the Startup sequence. It is available as a control output indicating that configuration is not yet complete. After configuration, HDC is a user-programmable I/O pin.

#### hierarchical design

Design description in multiple layers, from the highest (overview) to he lowest (circuit details). An alternative is flat design, where everything is described at the same level of detail.

#### **INIT** pin

The INIT pin is a quadruple function signal. Before and during configuration, INIT is a bidirectional signal. A 1 - 10 k $\Omega$  external pull-up resistor is recommended. As an active-Low open-drain output, INIT is held Low during power stabilization and internal clearing of the configuration memory. As an active-Low input, it can be used to hold the FPGA in the internal WAIT state before the start of configuration. During configuration, a Low on this output indicates that a configuration data error has occurred. After the I/O become active in the Startup sequence, INIT becomes a user-programmable I/O.

#### intellectual property (IP)

In the legal sense, patents, copyrights, and trade secrets. In integrated circuits (ICs), predefined large functions, called "cores," that help the user complete a large design faster.

#### JTAG

Joint Test Action Group. Previous name for IEEE 1149.1 boundary scan, a method for testing boards and integrated circuits. Also see *Parallel Cable III*.

#### LogiBLOX

Library of logic modules, often with user-definable parameters, like data width. Similar to LPM.

#### logic cell (LC)

Metric for FPGA density. The basic building block of the Virtex-II CLB. An LC includes a 4-input function generator, carry logic, and a storage element.

#### LDC pin

Low during configuration (LDC) is driven Low until the I/Os become active in the Startup sequence. It is available as a control output indicating that configuration isn't complete. After configuration, LDC is a user-programmable I/O pin.

#### LPM

Library of Parametrized Modules. Library of logic modules, often with user-definable parameters, like data width. Similar to LogiBLOX.

#### LUT

Look-up table, also called a function generator with N inputs and one output. Can implement any logic function of its N inputs. N is between 3 and 6; most popular are 4-input LUTs.

#### LUT SelectRAM

Shallow RAM structure implemented in CLB look-up tables (LUTs). Also see *block SelectRAM*.

#### mapping

Process of assigning portions of the logic design to the physical chip resources (CLBs). With FPGAs, mapping is more demanding and more important a process than with gate arrays. Also see *synthesis*.

#### **MTBF**

Mean Time Between Failure. The statistically relevant up-time between equipment failure. Also see *failure in time (FIT)*.

#### MultiLINX cable

The MultiLINX cable provides many complex functions and can be loaded with new firmware as it becomes available. It can be connected to the host computer in two ways: via a Serial port or a USB port. The MultiLINX cable is supported by the Hardware Debugger software for Slave Serial and SelectMAP/Slave Parallel programming (as appropriate), as well as readback/verify. It is also supported by the JTAG programmer software for JTAG programming of both CPLDs and FPGAs.

#### netlist

Textual description of logic and interconnects. Also see XNF file and electronic data *interchange format (EDIF).* 

#### NRE

Non-Recurring Engineering charges. Start-up cost for the creation of an ASIC, gate array, or HardWire. Pays for layout, masks, and test development. FPGAs and CPLD do not require NRE.

#### optimization

Design change to improve performance. Also see synthesis.

#### pad

Pad bits are extra bits used to make the total number of bits in a frame an integral multiple of 32, the number of bits in a configuration word. A pad word is an extra word used at the end of a configuration frame for pipelining. A pad frame is an extra configuration frame used at the beginning of a configuration readback and at the end of a configuration write for pipelining.

#### Parallel Cable III

The Xilinx Parallel Cable III (model DLC5) is a serial download cable. The Parallel cable uses a serial 25-pin interface to the parallel port of a host computer and two 6-pin headers for flying-wire connectors to a target board. The Parallel cable is supported by the Hardware Debugger software for performing Slave Serial configuration of FPGAs only. The Parallel cable is also supported by the JTAG Programmer software for performing Slave Serial and Boundary Scan configuration of FPGAs, and Boundary Scan programming of CPLDs. For more information on using the Parallel cable, refer to Chapter 8 or this guide, the Hardware Debugger Reference/Users Guide, and the JTAG Programmer Guide.

#### partitioning

In FPGAs, the process of dividing the logic into subfunctions that can later be placed into individual CLBs. Partitioning precedes placement.

#### PCI

Peripheral Component Interface. Synchronous bus standard characterized by short range, light loading, low cost, and high performance. \_\_?-MHz PCI can support data byte transfers up to \_\_\_\_? megabytes per second (Mb/s) on \_\_? parallel data lines (including parity) and a common clock.

#### PCMCIA

Personal Computer Memory Card Interface Association. Physical and electrical standard for small plug-in boards for portable computers.

#### pin-locking

Rigidly defining and maintaining the functionality and timing requirements of device pins while the internal logic is still being designed or modified. Pin-locking has become important, since circuit board fabrication times are longer than PLD design implementation times.

D

### PIP

Programmable Interconnect Point. In Xilinx FPGAs, a point where two signal lines can be connected, as determined by the device configuration.

#### placement

In FPGAs, the process of assigning specific parts of the design to specific locations (CLBs) on the chip. Usually done automatically. Also see *partitioning*.

#### PLD

Programmable Logic Device. Generic name for all programmable logic: PALs, CPLDs, and FPGAs.

#### preamble

The Preamble is a 4-bit binary sentinel ("0010"b) used to indicate the beginning of the LengthCount in the Header portion of the bitstream. At the beginning of configuration, FPGAs ignore all data prior to the preamble but counts the number of data bits preceding the preamble, and the LengthCount counter increments for every rising CCLK edge, even the ones proceeding the preamble.

#### programmable interconnect point

See **PIP**.

#### **PROGRAM** pin

The PROGRAM pin is an active-Low input that forces clearing of the FPGA configuration memory and is used to initiate a configuration cycle. While PROGRAM is held Low, the FPGA drives INIT Low and continues to clear the configuration memory. When PROGRAM goes High, the FPGA finishes the current clear cycle, executes another complete clear cycle, goes into a WAIT state, and releases INIT.

#### readback

Initiating a readback causes the configuration memory to become accessible to be serially clocked out and read from the device, or (byte-wide in SelectMAP/Slave Parallel modes). The configuration memory contains the configuration data, facilitating a Read-Verification of the data. The configuration memory can also contain the CLB output logic states facilitating a Read-Capture of the internal logic states.Read-Verification and Read-Capture are used by the Hardware Debugger for hardware verification. For information on the readback specification and timing, refer to *The Programmable Logic Data Book*. For information on using the readback function in the Implementation Software, refer to the *Development System Reference Guide*. For information on using the *Reference/User Guide*. For information on connecting the XChecker cable for readback, refer to the *Hardware Users Guide*.

#### readback data

Configuration data read from a Virtex-II device. The data is organized as configuration frames.

#### routing

The interconnection or the process of creating the desired interconnection of logic cells to make them perform the desired function. Routing follows after partitioning and placement.

#### schematic

Graphic representation of a logic design in the form of interconnected gates, flip-flops, and larger blocks. Older and more visually intuitive alternative to the increasingly more popular equation-based or high-level language textual description of a logic design.

#### SelectMAP interface

One of the configuration interfaces on the Virtex-II device. This is a byte-serial interface. The pins in the SelectMAP interface can be used as user I/O after configuration has been completed or remain configured as a configuration interface.

#### SelectRAM

Xilinx-specific name for RAM implemented in CLBs.

#### simulation

Computer modeling of logic and (sometimes) timing behavior of logic driven by simulation inputs (stimuli or vectors).

#### slice

A subdivision of the Virtex-II CLB. There are four vertical slices in each Virtex-II CLB. Each slice contains two Logic Cells.

#### SRAM

Static random access memory. Read-Write memory with data stored in latches. Faster than DRAM and with simpler timing requirements, but smaller in size and about four times more expensive than DRAM of the same capacity.

#### static timing

Detailed description of on-chip logic and interconnect delays.

#### submicron

The smallest feature size is usually expressed in micron ( $\mu$  = millionth of a meter, or a thousandth of a millimeter). The state of the art is moving from 0.35 $\mu$  to 0.25 $\mu$  and soon may reach 0.18 $\mu$ . The wavelength of visible light is 0.4 $\mu$  to 0.8 $\mu$ . 25.4 $\mu$  = 1 mil, a thousandth of an inch.

#### synchronous

Circuitry that changes state only in response to a common clock, as opposed to asynchronous circuitry that responds to a multitude of derived signals. Synchronous circuits are easier to design, debug, modify, and better tolerate parameter changes and speed upgrades than asynchronous circuits.

#### sync word

A 32-bit word with a value that is used to synchronize the configuration logic.

#### synthesis

Optimization process of adapting a logic design to the logic resources available on the chip, like look-up tables, Longline, and dedicated carry. Synthesis precedes mapping.

#### TBUFs

Buffers with a 3-state option, where the output can be made inactive. Used for multiplexing different data sources onto a common bus. The pulldown-only option can use the bus as a "wired AND" function.

2

#### timing

Relating to delays, performance, or speed.

#### timing driven

A design or layout method that takes performance requirements into consideration.

#### UART

Universal asynchronous receiver/transmitter. An 8-bit parallel-to-serial and serial-toparallel converter, combined with parity and start-detect circuitry, and sometimes even FIFO buffers. Used widely in asynchronous serial communications interface, e.g., modems.

#### USB

Universal Serial Bus, A low-cost, low-speed, self-clocking bit-serial bus (1.5 MHz and 12 MHz) using four wires ( $V_{CC}$ , ground, differential data) to daisy-chain up to 128 devices.

#### VME

Older bus standard, popular with MC68000-based industrial computers.

## WRITE pin

The WRITE pin is an input to Virtex-II devices in the SelectMAP/Slave Parallel mode, indicating to the device which direction data is flowing on the Data bus. When WRITE is asserted (Low), data is entering the device (configuration). When WRITE is de-asserted (High), data is leaving the device (readback). If WRITE changes state when the device isn't expecting it, an abort occurs. For more information on the WRITE pin, refer to *The Programmable Logic Data Book* and "Design Considerations" on page 47.

#### XChecker cable

The Xilinx XChecker Cable (model DLC4) is a serial download cable. The XChecker uses a serial 9-pin interface to the communication port of a host computer and two 8-pin headers for flying-wire connectors to a target board. The XChecker cable is supported by the Hardware Debugger software for performing Slave Serial configuration and readback of FPGAs. The XChecker cable is also supported by the JTAG Programmer software for performing Slave Serial and Boundary Scan configuration of FPGAs, and Boundary Scan programming of CPLDs. For more information on using the XChecker cable refer to the *Hardware Users Guide* and the *Hardware Debugger Reference/Users Guide*.

#### XNF file

Xilinx-proprietary description format for a logic design. Alternative is EDIF.

## **Numerics**

3-state output buffer 163

## Α

additional resources 7 addressing scheme 103 AGP-2X 187 AQL defined 23 ASIC defined 23 asynchronous defined 23 asynchronous transfer mode 23 ATM defined 23 available products XC1700D 20

## В

back annotation defined 23 banks I/O 302 behavioral language defined 23 **BF957** composite pinout diagram 345 routing with LVDS pairs 391 standard routing 390 BG575 composite pinout diagram 325 routing with LVDS pairs 381 standard routing 380 BG728 composite pinout diagram 329 routing with LVDS pairs 383 standard routing 382 BGN files 409 bidirectional LVDS 221 bidirectional signals 165 BIT files description 408 disabling 413 loading downward 415 loading up or down 416 loading upward 417 bit swapping description 415 disabling 415 BitGen -b option 409 -d option 409

description 407 disabling DRC 409 DRC file 409 encryption options 227 -g option 409 to 412 -h option 413 input files 408 -j option 413 -l option 413 -m option 413 options 409 output files 408 PCF files 408 persistence switch 297 readback option 297 standard bitstream 293 syntax 408 -w option 413 bitstreams configuration 293 data frames 294 defined 23 encryption 226 loading encrypted 229 standard 293 block RAM defined 23 block SelectRAM timing model 21 timing parameters 21 Block SelectRAM Power section 395 board routability 370 board-level layout strategy 372 Boundary Scan interface defined 24 mode 246 models 405 Boundary Scan Description Language (BSDL) 405 BSDL files 405 buffers 3-state output 163 bidirectional LVDS 221 global clock 48 LDT 222 output 162 BUFGCE 60

## С

capacitors decoupling 365 capture operation 24 cascadable shift registers 123 CCLK defined 24

characteristics land pads 371 checksum 415 chip enable pin 25 ChipScope ILA 300 classification and export considerations 227 CLB / slice timing model 12 CLB Logic Power section 392 CLBs defined 24 clearing configuration memory 248 CLK 83 CLK2X 75 CLKDV 76 CLKFB 71 CLKIN 71 clock de-skew 67 clocks 48 buffer input 51 distribution 48 forwarding 211 global buffers 48 global networks 48 input clock tolerances 43 multiplexer waveform 59, 60 multiplexers 48 output clock precision 44 phase shifting 78 resources 48 command register (CMD) 288 commands file, executing 416 compiler defined 24 configurable logic block (CLB) 24 configuration 245 bitstream 287, 24 bitstream header 293 Boundary Scan mode 246 clearing memory 248 commands 24 data 24 data frames 287 data processing flow 291 file 24 frame 25 -g option 409 to 412 interface 25 internal processing 287 JTAG 369 logic 287 Master SelectMAP mode 246 Master Serial mode 246 mode pins 245 modes 245, 246

## 

option register (COR) 289 process 247 readback 25 register writes 292 Slave SelectMAP mode 246 Slave Serial mode 246 configuration registers 287 CMD 288 COR 289 CRC 290 CTL 289 FAR 289 **FDRI 289** FDRO 290 FLR 288 LOUT 290 **MASK 289** STAT 290 writes 292 conflict resolution 99 constraining placement 128 constraints 25 content-addressable memory (CAM) 123 control register (CTL) 289 control signals 67 conventions typographical 8 CRC register (CRC) 290 CRC sequence 294 crosstalk 369 CS pin 25 CS144 composite pinout diagram 312 cyclic redundancy checking (CRC) 295

## D

Data Encryption Standard (DES) 226 data frames 294 data sheets XC18V00 Series PROMs 419 DataFrame defined 25 DCI 188 I/O buffer library 194 I/O standards 202 software support 194 DCM Power section 396 **DCMs** clock de-skew 67 control signals 67 frequency synthesis 67 miscellaneous timing parameters 44 operating frequency ranges 41 phase shifting 67,78 port signals 71 timing model 41 timing parameters 42 waveforms 93

DDR input 203 output 204 output with 3-state control 206 SDRAM 211 debugging using ChipScope ILA 300 decoupling capacitors 365 dedicated pins 245, 303 **DES 226** de-skew circuit 70 **DESYNCH** command 295 device pin 25 differential signaling 222 Digital Clock Manager (DCM) 67 Digital Controlled Impedance (DCI) 188 DIN pin 25 distributed SelectRAM 96 DLLs characteristics 70 source clock input 71 DONE pin 25 double data rate (DDR) 203 DOUT/BUSY pin 26 DRC disabling for BitGen 409 DRC file 409 dynamic random access memory (DRAM) 26 dynamic read operations 124

## Ε

EDIF defined 26 electronic data interchange format (EDIF) 26 electrostatic discharge (ESD) 26 embedded multipliers 150 timing model 24 timing parameters 24 encryption BitGen options 227 bitstream 226 export considerations 227

## F

failure in time (FIT) 26 FDDRCPE 210 FDDRRSE 210 FF1152 composite pinout diagram 337 pinout compatibility diagram 349 routing with LVDS pairs 387 standard routing 386 FF1517 composite pinout diagram 341 routing with LVDS pairs 389 standard routing 388

FF896 composite pinout diagram 333 pinout compatibility diagram 349 routing with LVDS pairs 385 standard routing 384 FG256 bank information 314 composite pinout diagram 313 pinouts 304 routing with LVDS pairs 375 standard routing 374 FG456 composite pinout diagram 317 pinout compatibility diagram 348 routing with LVDS pairs 377 standard routing 376 FG676 composite pinout diagram 321 pinout compatibility diagram 348 routing with LVDS pairs 379 standard routing 378 field programmable gate array (FPGA) 26 fine phase adjustment 79 FIT defined 26 flash defined 26 flip-chip advantages 362 flip-chip packages 362 flip-flops defined 26 FPGA defined 26 frame defined 26 frame address register (FAR) 289 Frame Data Register Input (FDRI) 289 Frame Data Register Output (FDRO) 290 frame length register (FLR) 288 frequency synthesis 67 fully synchronous shift registers 129

## G

global clock buffers 48 global clock nets 48 global clocks input to output timing parameters 38 setup and hold timing parameters 39 Glossary 23 graphical user interface (GUI) 27 GTL 174 GTL+ 174 GUI defined 27

## Η

Hardware Description Language (HDL) 27
HDC pin 27
hierarchical design defined 27
High during configuration (HDC) 27
HSTL\_I 175, 178
HSTL\_II 176, 178
HSTL\_III 176, 179
HSTL\_IV 177, 180

I/O banks 302 I/O Buffer Information Specification (IBIS) 400 I/O standards DCI 202 I/Os single-ended standards 158 IBIS 400 advantages 400 file structure 401 generation 400 I/V and dV/dt curves 401 models 400 ramp keyword 402 simulations 402 simulators 404 IEEE 1149.1 405 IEEE 1532 298, 405 INIT pin 27 input clock tolerances timing parameters 43 input DDR 203 input files BitGen 408 PROMGen 414 Input/Output Power section 397 intellectual property (IP) 27 IOBs 3-state timing parameters 34 input timing parameters 28 output timing parameters 31 timing model 27 IOBUF 165 **IOSTANDARD** attribute 196 IP defined 27

#### J

JTAG 369 defined 28

## Κ

keys 229 creating 227

#### 

land pad characteristics 371 land pads 371 layout strategy 372 LDC pin 28 LDT 222 buffers 222 legacy data output register (LOUT) 290 legacy support 72 Lightning Data Transport (LDT) 222 LL files 408, 413 loading 229 locked output 71 LogiBLOX defined 28 logic allocation file 413 logic cell (LC) 28 look-up table (LUT) 28 Low during configuration (LDC) 28 low voltage differential signaling (LVDS) 217 low voltage positive emitter-coupled logic (LVPECL) 222 LPM defined 28 LUTs 123 defined 28 LVCMOS15 185 LVCMOS18 185 LVCMOS25 186 LVCMOS33 186 LVDS 217 3-state buffer termination 221 bidirectional 221 primitives 217 receiver termination 218 transmitter termination 219 LVPECL 222 to ?? primitives 222 receiver termination 224 transmitter termination 225 **LVTTL 184** 

#### Μ

mapping 28 mask file 413 MASK register (MASK) 289 Master SelectMAP mode 246 Master Serial mode 246 memory clearing 248 modes configuration 246 Boundary Scan 246 Master SelectMAP 246 Slave SelectMAP 246 Slave Serial 246 NO\_CHANGE 98 READ\_FIRST 98 WRITE\_FIRST 97 MSK files 409 MTBF defined 28 MultiLINX cable 28 multiplexers 133 clocks 48 large 133 wide-input 137 multipliers embedded 150

## Ν

National Institute of Standards and Technology (NIST) 226 netlist defined 29 NO\_CHANGE mode 98 Non-Registered Multiplier Power section 396 non-solder-mask defined (NSMD) 372

## 0

**OBUF 162** OBUFT 163 operating frequency ranges 41 optimization 29 ordering information XC1700D 20 output buffer (OBUF) 162 output clock precision timing parameters 44 output DDR 204 output DDR with 3-state control 206 output drive strength 167 output files BitGen 408 name, PROMGen 416 overwriting 413 PROMGen 414 output power/ground pairs 169 overview of user guide 7

#### Ρ

package specifications 351 packages flip-chip 362 thermal considerations 363 packets 293 data 294 headers 294 pads 371 defined 29 parallel termination 368 partitioning 29 PC20-84 specification 420 PCB layout considerations 365

## 

PCF files BitGen 408 PCI defined 29 PCI33 3 184 PCI66\_3 184 **PCIX 184** PCMCIA defined 29 persist option 246 phase shifting 67,78 pin-locking 29 pinout diagrams 311 pinout information 302 pins 245 chip enable 25 CS 25 dedicated 245, 303 device 25 DIN 25 DONE 25 DOUT/BUSY 26 dual-function 246 HDC 27 INIT 27 LDC 28 power 248 **PROGRAM 30** types 302 WRITE 32 pin-to-pin timing model 37 PIP defined 30 placement 30 placement constraints 128 port addressing scheme 103 port signals 71 power estimator 392 results 398 power pins 248 preamble defined 30 primitives LVDS 217, 222 PRM files 414 PROGRAM pin 30 programmable interconnect point (PIP) 30 programmable logic device (PLD) 30 PROMGen -b option 415 -c option 415 -d option 415 description 413, 414 examples 417 flow diagram 413 -help option 416 input files 414 -l option 416 -n option 416 -o option 416 options 415

output file name 416 output files 414 -p option 416 -r option 417 -s option 417 supported families 413 -u option 417 -x option 417 **PROMs** bit swapping 415 data sheet 419 files, description 414 formats 416 loading files 417 multiple files 417 package specifications 419 sizes 417

## R

rawbits file 409 RBT files 408, 409 read operations dynamic 124 static 124 READ\_FIRST mode 98 readback 296 capture 296 defined 30 enabling in software 297 IEEE 1532 flow 298 regular flow 297 verification 296 with Boundary Scan 297 readback data 30 Registered Multiplier Power section 397 registers configuration 287 resources available 7 routability guidelines 370 routing amount 394 challenges 370 defined 30 examples 373 strategy 371 RST 71

## S

schematic defined 31 SelectI/O-Ultra single-ended resources 157 SelectMAP interface defined 31 SelectRAM defined 31 distributed 96 series termination 368 shift registers cascadable 123 fully synchronous 129 operation 123 static length 130 signals bidirectional 165 simulation defined 31 simultaneous switching output (SSO) 169 single-ended I/O standards 158 single-ended SelectI/O-Ultra resources 157 Slave SelectMAP mode 246 Slave Serial mode 246 slew rate 167 slices 143 defined 31 SO20 specification 421 solder balls 370 solder-mask defined (SMD) 372 specifications PC20-84 420 PROM packages 419 SO20 421 VQ44 422 SRAM defined 31 SRL16 123 SRLC16 123 SSTL2\_I 182 SSTL2\_II 183 SSTL3\_I 181 SSTL3 II 181 standard bitstream 293 start-up sequence 295 STARTUP\_WAIT attribute 71 static length shift registers 130 static read operations 124 static timing defined 31 status register (STAT) 290 submicron defined 31 Sum of Products (SOP) 143 sync word defined 31 synchronous defined 31 synchronous DRAM 211 synthesis defined 31

#### T

TBUF defined 32 termination techniques 168 terminations parallel 368 series 368

thermal considerations 363 thermal management 364 timing 32 Timing Analyzer 11 timing driven defined 32 timing models 11 block SelectRAM 21 CLB / slice 12 DCM 41 embedded multiplier 24 **IOB 27** pin-to-pin 37 timing parameters block SelectRAM 21 DCM 42 embedded multiplier 24 general slice 13 global clock input to output 38 global clock setup and hold 39 input clock tolerances 43 IOB 3-state 34 IOB input 28 IOB output 31 miscellaneous DCM 44 output clock precision 44 slice distributed RAM 16 slice SRL 19 TMULT 24 transmission line effects 168, 367 TRCE 11 Triple Data Encryption Algorithm (TDEA) 226 Triple DES 226 typographical conventions 8

## U

UART defined 32 USB defined 32

## V

VBATT 229 VCC decoupling 365 VCCO 168 verification using ChipScope ILA 300 VHDL and Verilog templates 62, 89, 106, 116, 120, 131, 139, 145, 155, 211 Virtex-II DCI 188 DES 226 LUTs 123 multiplexers 133 package specifications 351 pinout diagrams 311 pinouts 302 power estimator 392

slices 143 VME defined 32 VQ44 specification 422 VREF 168

## W

wide-input multiplexers 137 WRITE pin 32 WRITE\_FIRST mode 97

## X

XC18V00 Series PROMs 419 XChecker cable 32

