## CYNSE70064 Network Search Engine

## CONTENTS

1.0 FEATURES ..... 9
2.0 FUNCTIONAL OVERVIEW ..... 9
3.0 PRODUCT SUMMARY ..... 10
3.1 Logic Block Diagram ..... 10
4.0 FUNCTIONAL DESCRIPTION ..... 10
4.1 CMD Bus and DQ Bus ..... 10
4.2 Database Entry (Data Array and Mask Array) ..... 10
4.3 Arbitration Logic ..... 10
4.4 Pipeline and SRAM Control ..... 11
4.5 Full Logic ..... 11
5.0 SIGNAL DESCRIPTIONS ..... 11
6.0 CLOCKS ..... 13
7.0 REGISTERS ..... 13
7.1 Comparand Registers ..... 13
7.2 Mask Registers ..... 14
7.3 Search Successful Registers (SSR[0:7]) ..... 14
7.4 Command Register ..... 14
7.5 Information Register ..... 15
7.6 Read Burst Address Register ..... 16
7.7 Write Burst Address Register Description ..... 16
7.8 NFA Register ..... 16
8.0 NSE ARCHITECTURE AND OPERATION OVERVIEW ..... 17
9.0 DATA AND MASK ADDRESSING ..... 18
10.0 COMMANDS ..... 18
10.1 Command Codes ..... 18
10.2 Commands and Command Parameters ..... 19
10.3 Read Command ..... 19
10.4 Write Command ..... 22
10.5 Search Command ..... 24
10.6 68-bit Search on Tables Configured as $\times 68$ using a Single CYNSE70064 Device ..... 24
10.7 68-bit Search on Tables Configured as $\times 68$ Using up to Eight CYNSE70064 Devices ..... 26
10.8 68-bit Search on Tables Configured as $\times 68$ Using up to 31 CYNSE70064 Devices ..... 33
10.9 136-bit Search on Tables Configured as $\times 136$ Using a Single CYNSE70064 Device ..... 48
10.10 136-bit Search on Tables Configured as $\times 136$ Using up to Eight CYNSE70064 Devices ..... 50
10.11 136-bit Search on Tables Configured as $\times 136$ Using up to 31 CYNSE70064 Devices ..... 56
10.12 272-bit Search on Tables Configured as û272 Using a Single CYNSE70064 Device ..... 71
10.13 272-bit Search on Tables x272-configured Using up to Eight CYNSE70064 Devices ..... 73
10.14 272-bit Search on Tables Configured as $\times 272$ Using up to 31 CYNSE70064 Devices ..... 78
10.15 Mixed-Sized Searches on Tables Configured with Different Widths Using an CYNSE70064 Device ..... 93
10.16 LRAM and LDEV Description ..... 95
10.17 Learn Command ..... 95

## CONTENTS (continued)

11.0 DEPTH-CASCADING ..... 99
11.1 Depth-Cascading up to Eight Devices (One Block) ..... 99
11.2 Depth-Cascading up to 31 Devices (Four Blocks) ..... 100
11.3 Depth-Cascading for a FULL Signal ..... 100
12.0 SRAM ADDRESSING ..... 101
12.1 Generating an SRAM BUS Address ..... 102
12.2 SRAM PIO Access ..... 102
12.3 SRAM Read with a Table of One Device ..... 102
12.4 SRAM Read with a Table of up to Eight Devices ..... 103
12.5 SRAM Read with a Table of up to 31 Devices ..... 106
12.6 SRAM Write with a Table of One Device ..... 109
12.7 SRAM Write with a Table of up to Eight Devices ..... 110
12.8 SRAM Write with Table(s) of up to 31 Devices ..... 113
13.0 APPLICATION ..... 116
14.0 JTAG (1149.1) TESTING ..... 117
15.0 ELECTRICAL SPECIFICATIONS ..... 118
16.0 AC TIMING WAVE FORMS ..... 119
17.0 PINOUT DESCRIPTIONS AND PACKAGE DIAGRAMS ..... 122
18.0 ORDERING INFORMATION ..... 122
19.0 PACKAGE DIAGRAMS ..... 123

## LIST OF FIGURES

Figure 6-1. CYNSE70064 Clocks (CLK2X and PHS_L) ..... 13
Figure 7-1. Comparand-Register Selection During Search and Learn Instructions ..... 13
Figure 7-2. Addressing the Global Masks Register Array ..... 14
Figure 8-1. CYNSE70064 Database WIDTH Configuration ..... 17
Figure 8-2. Multiwidth Database Configurations Example ..... 18
Figure 9-1. Addressing of the CYNSE70064 Data and Mask Arrays ..... 18
Figure 10-1. Single-Location Read Cycle Timing ..... 20
Figure 10-2. Burst Read of the Data and Mask Arrays (BLEN = 4) ..... 21
Figure 10-3. Single Write Cycle Timing ..... 22
Figure 10-4. Burst Write of the Data and Mask Arrays (BLEN = 4) ..... 23
Figure 10-5. Timing Diagram for 68 -bit Search in x68 Table (One Device) ..... 25
Figure 10-6. Timing Diagram for 68 -bit Search in x68 Table (One Device) ..... 25
Figure 10-7. $\times 68$ Table with One Device ..... 26
Figure 10-8. Hardware Diagram for a Table With Eight Devices ..... 28
Figure 10-9. Timing Diagram for 68-bit Search Device Number 0 ..... 29
Figure 10-10. Timing Diagram for 68 -bit Search Device Number 1 ..... 30
Figure 10-11. Timing Diagram for 68-bit Search Device Number 7 (Last Device) ..... 31
Figure 10-12. x68 Table with Eight Devices ..... 32
Figure 10-13. Hardware Diagram for a Table with 31 Devices ..... 34
Figure 10-14. Hardware Diagram for a Block of up to Eight Devices ..... 35
Figure 10-15. Timing Diagram for Each Device In Block Number 0 (Miss on Each Device) ..... 36
Figure 10-16. Timing Diagram for Each Device Above the Winning Device in Block Number 1 ..... 37
Figure 10-17. Timing Diagram for Globally Winning Device in Block Number 1 ..... 38
Figure 10-18. Timing Diagram for Devices Below the Winning Device in Block Number 1 ..... 39
Figure 10-19. Timing Diagram for Devices Above the Winning Device in Block Number 2 ..... 40
Figure 10-20. Timing Diagram for Globally Winning Device in Block Number 2 ..... 41
Figure 10-21. Timing Diagram for Devices Below the Winning Device in Block Number 2 ..... 42
Figure 10-22. Timing Diagram for Devices Above the Winning Device in Block Number 3 ..... 43
Figure 10-23. Timing Diagram for Globally Winning Device in Block Number 3 ..... 44
Figure 10-24. Timing Diagram for Devices Below the Winning Device in Block Number 3 (Except the Last Device [Device 30]) ..... 45
Figure 10-25. Timing Diagram for Device Number 6 in Block Number 3 (Device 30 in Depth-Cascaded Table) ..... 46
Figure 10-26. $\times 68$ Table with 31 Devices ..... 47
Figure 10-27. Timing Diagram for 136-bit Search (One Device) ..... 48
Figure 10-28. Hardware Diagram for a Table with One Device ..... 48
Figure 10-29. $\times 136$ Table with One Device ..... 49
Figure 10-30. Hardware Diagram for a Table with Eight Devices ..... 51
Figure 10-31. Timing Diagram for 136 -bit Search Device Number 0 ..... 52
Figure 10-32. Timing Diagram for 136-bit Search Device Number 1 ..... 53
Figure 10-33. Timing Diagram for 136-bit Search Device Number 7 (Last Device) ..... 54
Figure 10-34. $\times 136$ Table with Eight Devices ..... 55
Figure 10-35. Hardware Diagram for a Table with 31 Devices ..... 57
Figure 10-36. Hardware Diagram for a Block of Up to Eight Devices ..... 58
Figure 10-37. Timing Diagram for Each Device in Block Number 0 (Miss on Each Device) ..... 59
Figure 10-38. Timing Diagram for Each Device Above the Winning Device in Block Number 1 ..... 60
Figure 10-39. Timing Diagram for Globally Winning Device in Block Number 1 ..... 61
Figure 10-40. Timing Diagram for Devices Below the Winning Device in Block Number 1 ..... 62

## CYNSE70064

## LIST OF FIGURES (continued)

Figure 10-41. Timing Diagram for Devices Above the Winning Device in Block Number 2 ..... 63
Figure 10-42. Timing Diagram for Globally Winning Device in Block Number 2 ..... 64
Figure 10-43. Timing Diagram for Devices Below the Winning Device in Block Number 2 ..... 65
Figure 10-44. Timing Diagram for Devices Above the Winning Device in Block Number 3 ..... 66
Figure 10-45. Timing Diagram for Globally Winning Device in Block Number 3 ..... 67
Figure 10-46. Timing Diagram for Devices Below the Winning Device in Block Number 3 Except Device 30 (the Last Device) ..... 68
Figure 10-47. Timing Diagram for Device Number 6 in Block Number 3 (Device 30 in Depth-Cascaded Table) ..... 69
Figure 10-48. $\times 136$ Table with 31 Devices ..... 70
Figure 10-49. Timing Diagram for 272-bit Search (One Device) ..... 71
Figure 10-50. Hardware Diagram for a Table with One Device ..... 71
Figure 10-51. $\times 272$ Table with One Device ..... 72
Figure 10-52. Hardware Diagram for a Table with Eight Devices ..... 74
Figure 10-53. Timing Diagram for 272-bit Search Device Number 0 ..... 75
Figure 10-54. Timing Diagram for 272-bit Search Device Number 1 ..... 76
Figure 10-55. Timing Diagram for 272-bit Search Device Number 7 (Last Device) ..... 77
Figure 10-56. $\times 272$ Table with Eight Devices ..... 78
Figure 10-57. Hardware Diagram for a Table with 31 Devices ..... 80
Figure 10-58. Hardware Diagram for A Block of up to Eight Devices ..... 81
Figure 10-59. Timing Diagram for Each Device in Block Number 0 (Miss on Each Device) ..... 82
Figure 10-60. Timing Diagram for Each Device Above the Winning Device in Block Number 1 ..... 83
Figure 10-61. Timing Diagram for Globally Winning Device in Block Number 1 ..... 84
Figure 10-62. Timing Diagram for Devices Below the Winning Device in Block Number 1 ..... 85
Figure 10-63. Timing Diagram for Devices Above the Winning Device in Block Number 2 ..... 86
Figure 10-64. Timing Diagram for Globally Winning Device in Block Number 2 ..... 87
Figure 10-65. Timing Diagram for Devices Below the Winning Device in Block Number 2 ..... 88
Figure 10-66. Timing Diagram for Devices Above the Winning Device in Block Number 3 ..... 89
Figure 10-67. Timing Diagram for Globally Winning Device in Block Number 3 ..... 90
Figure 10-68. Timing Diagram for Devices Below the Winning Device in Block Number 3 Except Device 30 (the Last Device) ..... 91
Figure 10-69. Timing Diagram of the Last Device in Block Number 3 (Device 30 in the Table) ..... 92
Figure 10-70. ×272 Table with 31 Devices ..... 93
Figure 10-71. Timing Diagram for Mixed Search (One Device) ..... 94
Figure 10-72. Multiwidth Configurations Example ..... 94
Figure 10-73. Timing Diagram of Learn ( $\mathrm{TLSZ}=00$ ) ..... 96
Figure 10-74. Timing Diagram of Learn (Except on the Last Device [TLSZ = 01]) ..... 97
Figure 10-75. Timing Diagram of Learn on Device Number 7 (TLSZ = 01) ..... 98
Figure 11-1. Depth-Cascading to Form a Single Block ..... 99
Figure 11-2. Depth-Cascading Four Blocks ..... 100
Figure 11-3. Full Generation in a Cascaded Table ..... 101
Figure 12-1. SRAM Read Access ( $T L S Z=00$, HLAT $=000$, LRAM $=1$, LDEV = 1 ) ..... 103
Figure 12-2. Table of a Block of Eight Devices ..... 104
Figure 12-3. SRAM Read Through Device Number 0 in a Block of Eight Devices ..... 105
Figure 12-4. SRAM Read Timing for Device Number 7 in a Block of Eight Devices ..... 106
Figure 12-5. Table of 31 Devices Made of Four Blocks ..... 107
Figure 12-6. SRAM Read Through Device Number 0 in a Bank of 31 Devices (Device Number 0 Timing) ..... 108

## LIST OF FIGURES (continued)

Figure 12-7. SRAM Read Through Device Number 0 in Bank of 31 Devices(Device Number 30 Timing)109
Figure 12-8. SRAM Write Access (TLSZ = 00, HLAT = 000, LRAM = 1, LDEV = 1) ..... 110
Figure 12-9. Table of a Block of Eight Devices ..... 111
Figure 12-10. SRAM Write Through Device Number 0 in a Block of Eight Devices ..... 112
Figure 12-11. SRAM Write Timing for Device Number 7 in Block of Eight Devices ..... 113
Figure 12-12. Table of 31 Devices (Four Blocks) ..... 114
Figure 12-13. SRAM Write Through Device Number 0 in Bank of 31 Devices (Device 0 Timing) ..... 115
Figure 12-14. SRAM Write Through Device Number 0 in Bank of 31 CYNSE70064 Devices (Device Number 30 Timing) ..... 116
Figure 13-1. Sample Switch/Router Using the CYNSE70064 Device ..... 117
Figure 16-1. Input Wave Form for CYNSE70064 ..... 119
Figure 16-2. Output Load for CYNSE70064 ..... 120
Figure 16-3. 2.5 I/O Output Load Equivalent for CYNSE70064 ..... 120
Figure 16-4. AC Timing Wave Forms with CLK2X ..... 121
Figure 17-1. Pinout Diagram ..... 122
Figure 19-1. Package ..... 123

## LIST OF TABLES

Table 5-1. CYNSE70064 Signal Description ..... 11
Table 7-1. Register Overview ..... 13
Table 7-2. Search Successful Register Description ..... 14
Table 7-3. Command Register Description ..... 14
Table 7-4. Information Register Description ..... 15
Table 7-5. Read Burst Register Description ..... 16
Table 7-6. Write Burst Register Description ..... 16
Table 7-7. NFA Register ..... 16
Table 8-1. Bit Position Match ..... 17
Table 10-1. Command Codes ..... 18
Table 10-2. Command Parameters ..... 19
Table 10-3. Read Command Parameters ..... 19
Table 10-4. Read Address Format for Data Array, Mask Array, or SRAM ..... 20
Table 10-5. Read Address Format for Internal Registers ..... 21
Table 10-6. Read Address Format for Data and Mask Arrays ..... 21
Table 10-7. Write Address Format for Data Array, Mask Array, or SRAM (Single Write) ..... 22
Table 10-8. Write Address Format for Internal Registers ..... 23
Table 10-9. Write Address Format for Data and Mask Array (Burst Write) ..... 24
Table 10-10. The Latency of Search from Instruction to SRAM Access Cycle ..... 26
Table 10-11. Shift of SSF and SSV from SADR ..... 26
Table 10-12. Hit/Miss Assumption ..... 27
Table 10-13. The Latency of Search from Instruction to SRAM Access Cycle ..... 32
Table 10-14. Shift of SSF and SSV from SADR ..... 32
Table 10-15. Hit/Miss Assumptions ..... 33
Table 10-16. The Latency of Search from Instruction to SRAM Access Cycle ..... 47
Table 10-17. Shift of SSF and SSV from SADR ..... 47
Table 10-18. The Latency of Search from Instruction to SRAM Access Cycle ..... 49
Table 10-19. Shift of SSF and SSV from SADR ..... 49
Table 10-20. Hit/Miss Assumption ..... 50
Table 10-21. Search Latency from Instruction to SRAM Access Cycle ..... 55
Table 10-22. Shift of SSF and SSV from SADR ..... 55
Table 10-23. Hit/Miss Assumption ..... 56
Table 10-24. The Latency of Search from Instruction to SRAM Access Cycle ..... 70
Table 10-25. Shift of SSF and SSV from SADR ..... 70
Table 10-26. The Latency of Search from C and D Cycles to SRAM Access Cycle ..... 72
Table 10-27. Shift of SSF and SSV from SADR ..... 72
Table 10-28. Hit/Miss Assumption ..... 73
Table 10-29. The Latency of Search from C and D cycles to SRAM Access Cycle ..... 78
Table 10-30. Shift of SSF and SSV from SADR ..... 78
Table 10-31. Hit/Miss Assumption ..... 79
Table 10-32. The Latency of Search from C and D cycles to SRAM Access Cycle ..... 93
Table 10-33. Shift of SSF and SSV from SADR ..... 93
Table 10-34. The Latency of SRAM Write Cycle from Second Cycle of Learn Instruction ..... 98
Table 12-1. SRAM Bus Address ..... 102
Table 14-1. Supported Operations ..... 117
Table 14-2. TAP Device ID Register ..... 117
Table 15-1. DC Electrical Characteristics for CYNSE70064 ..... 118
Table 15-2. Operating Conditions for CYNSE70064 ..... 118

## LIST OF TABLES (continued)

Table 16-1. AC Timing Parameters with CLK2X ..... 119
Table 16-2. 2.5V AC Table for Test Condition of CYNSE70064 ..... 119
Table 18-1. Ordering Information ..... 122

## $1.0 \quad$ Features

- 64K 34-bit entries in a single device
- 32K entries in 68 -bit mode, 16 K entries in 136-bit mode, 8 K entries in 272 -bit mode
- 66 million transactions per second in 68 - and 136-bit configurations
- 33 million transactions in 34- and 272-bit configurations
- Searches any subfield in a single cycle
- Synchronous pipelined operation
- Up to 31 search engines can be cascaded
- When cascaded, the database entries can range up to 1984K 34-bit entries
- Multiple width tables in a single database bank
- Glueless interface to industry-standard SRAMs and/or SSRAMs
- Simple hardware instruction interface
- IEEE 1149.1 test access port
- 1.8 V core voltage supply
- 2.5/3.3V I/O voltage supply
- 272-pin BGA package.


### 2.0 Functional Overview

Cypress Semiconductor Corporation's (Cypress's) CYNSE70064 network search engine (NSE) incorporates patent-pending Associative Processing Technology ${ }^{\text {TM }}$ (APT) and is designed to be a high-performance, pipelined, synchronous, 32K-entry NSE. The CYNSE70064 database entry size can be 68 bits, 136 bits, or 272 bits. In the 68 -bit entry mode, the size of the database is 32 K entries. In the 136 -bit mode, the size of the database is 16 K entries, and in the 272 -bit mode, the size of the database is 8 K entries. The CYNSE70064 is configurable to support multiple databases with different entry sizes. The 34-bit entry table can be implemented using the global mask registers (GMRs) building-database size of 64 K entries with a single device.
The NSE can sustain 66 million transactions per second when the database is programmed or configured as 68 or 136 bits. When the database is programmed to have an entry size of 34 or 272 bits, the NSE will perform at 33 million transactions per second. The CYNSE70064 device can be used to accelerate network protocols such as longest-prefix match (CIDR), ARP, MPLS, and other layer 2, 3, and 4 protocols.
This high-speed, high-capacity NSE can be deployed in a variety of networking and communications applications. The performance and features of the CYNSE70064 make it attractive in applications such as Enterprise LAN switches and routers and broadband switching and/or routing equipment supporting multiple data rates at OC-48 and beyond. The NSE is designed to be scalable in order to support network database sizes to 1984 K entries specifically for environments that require large network policy databases.

### 3.0 Product Summary

### 3.1 Logic Block Diagram



### 4.0 Functional Description

The following subsections contain command (CMD) and DQ bus (command and databus), database entry, arbitration logic, pipeline and SRAM control, and full logic descriptions.

### 4.1 CMD Bus and DQ Bus

CMD[8:0] carries the CMD and its associated parameter. $D Q[67: 0]$ is used for data transfer to and from the database entries, which comprise a data and a mask field that are organized as data and mask arrays. The DQ bus carries the Search data (of the data and mask arrays and internal registers) during the Search command as well as the address and data during Read and/or Write operations. The DQ bus can also carry the address information for the flow-through accesses to the external SRAMs and/or SSRAMs.

### 4.2 Database Entry (Data Array and Mask Array)

Each database entry comprises a data and a mask field. The resultant value of the entry is " 1 ," " 0 ," or " $X$ (don't care)," depending on the value in the data and mask bits. The on-chip priority encoder selects the first matching entry in the database that is nearest to location 0 .

### 4.3 Arbitration Logic

When multiple search engines are cascaded to create large databases, the data being searched is presented to all search engines simultaneously in the cascaded system. If multiple matches occur within the cascaded devices, arbitration logic on the search engines will enable the winning device (with a matching entry that is closest to address 0 of the cascaded database) to drive the SRAM bus.

### 4.4 Pipeline and SRAM Control

Pipeline latency is added to give enough time to a cascaded system's arbitration logic to determine the device that will drive the index of the matching entry on the SRAM bus. Pipeline logic adds latency to both the SRAM access cycles and the SSF and SSV signals to align them to the host ASIC receiving the associated data.

### 4.5 Full Logic

Bit[0] in each of the 68 -bit entries has a special purpose for the Learn command ( $0=$ empty, $1=$ full). When all the data entries have bit[0] = 1, the database asserts the FULL flag, indicating that all the search engines in the depth-cascaded array are full.

### 5.0 Signal Descriptions

Table 5-1 lists and describes all CYNSE70064 signals.
Table 5-1. CYNSE70064 Signal Description

| Symbol | Type ${ }^{[1]}$ | Description |
| :---: | :---: | :---: |
| Clocks and Reset |  |  |
| CLK2X | I | Master Clock. CYNSE70064 samples all the data and control pins on the positive edge of CLK2X. All signals are driven out of the device on the rising edge of CLK2X (when PHS_L is LOW). |
| PHS_L | I | Phase. This signal runs at half the frequency of CLK2X and generates an internal CLK ${ }^{[2]}$ from CLK2X. See Section 6.0, "Clocks" on page 13. |
| TEST | I | Test Input (For Cypress Semiconductor Use Only). This signal should be connected to ground. |
| RST_L | I | Reset. Driving RST_L LOW initializes the device to a known state. |
| CMD and DQ Bus |  |  |
| CMD[8:0] | I | CMD Bus. [1:0] specifies the command and [8:2] contains the CMD parameters. The descriptions of individual commands explains the details of the parameters. The encoding of commands based on the [1:0] field are: <br> 00: PIO Read <br> 01: PIO Write <br> 10: Search <br> 11: Learn. |
| CMDV | I | CMD Valid. This signal qualifies the CMD bus: <br> 0 : No command <br> 1: Command. |
| DQ[67:0] | I/O | Address/Data Bus. This signal carries the Read and Write address and data during register, data, and mask array operations. It carries the compare data during Search operations. It also carries the SRAM address during SRAM PIO accesses. |
| $\mathrm{ACK}^{[3]}$ | T | Read Acknowledge. This signal indicates that valid data is available on the DQ bus during register, data, and mask array Read operations, or that the data is available on the SRAM data bus during SRAM Read operations. |
| $E \mathrm{CT}{ }^{[3]}$ | T | End of Transfer. This signal indicates the end of burst transfer to the data or mask array during Read or Write burst operations. |
| SSF | T | Search Successful Flag. When asserted, this signal indicates that the device is the global winner in a Search operation. |
| SSV | T | Search Successful Flag Valid. When asserted, this signal qualifies the SSF signal. |
| SRAM Interface |  |  |
| SADR[21:0] | T | SRAM Address. This bus contains address lines to access off-chip SRAMs that contain associative data. See Table 12-1 for the details of the generated SRAM address. In a database of multiple CYNSE70064s, each corresponding bit of SADR from all cascaded devices must be connected. |
| CE_L | T | SRAM Chip Enable. This is the chip-enable control for external SRAMs. In a database of multiple CYNSE70064s, CE_L of all cascaded devices must be connected. This signal is then driven by only one of the devices. |

Table 5-1. CYNSE70064 Signal Description (continued)

| Symbol | Type ${ }^{[1]}$ | Description |
| :---: | :---: | :---: |
| WE_L | T | SRAM Write Enable. This is the Write-enable control for external SRAMs. In a database of multiple CYNSE70064s, WE_L of all cascaded devices must be connected together. This signal is then driven by only one of the devices. |
| OE_L | T | SRAM Output Enable. This is the output-enable control for external SRAMs. Only the last device drives this signal (with the LRAM bit set). |
| ALE_L | T | Address Latch Enable. When this signal is LOW, the addresses are valid on the SRAM address bus. In a database of multiple CYNSE70064s, the ALE_L of all cascaded devices must be connected. This signal is then driven by only one of the devices. |
| Cascade Interface |  |  |
| LHI[6:0] | I | Local Hit In. These pins depth-cascade the device to form a larger table. One signal of this bus is connected to the LHO[1] or LHO[0] of each of the upstream devices in a block. All unused LHI pins are connected to a logic 0. (For more information, see Section 11.0, "Depth-Cascading" on page 99.) |
| LHO[1:0] | 0 | Local Hit Out. LHO[1] and LHO[0] are the same logical signal. Either the LHO[1] or the LHO[0] is connected to one input of the LHI bus of up to four downstream devices in a block of up to eight. (For more information see Section 11.0, "Depth-Cascading" on page 99.) |
| BHI[2:0] | I | Block Hit In. Inputs from the previous block BHO[2:0] are tied to BHI[2:0] of the current device. In a four-block system, the last block can contain only seven devices because the identification code 11111 is used for broadcast access. |
| BHO[2:0] | 0 | Block Hit Out. These outputs from the last device in a block are connected to the BHI[2:0] inputs of the devices in the downstream blocks. |
| FULI[6:0] | I | Full In. Each signal in this bus is connected to FULO[0] or FULO[1] of an upstream device to generate the FULL flag for the depth-cascaded block. |
| FULO[1:0] | 0 | Full Out. FULO[1] and FULO[0] are the same logical signal. One of these two signals must be connected to the FULI of up to four downstream devices in a depth-cascaded table. Bit [0] in the data array indicates whether the entry is full (1) or empty ( 0 ). This signal is asserted if all bits in the data array are ones. (Refer to Section 11.0, "Depth-Cascading" on page 99, for information on how to generate the FULL flag.) |
| FULL | 0 | Full Flag. When asserted, this signal indicates that the table of multiple depth-cascaded devices is full. |
| Device Identification |  |  |
| ID[4:0] | I | Device Identification. The binary-encoded device identification for a depth-cascaded system starts at 00000 and goes up to 11110.11111 is reserved for a special broadcast address that selects all cascaded search engines in the system. On a broadcast Read-only, the device with the LDEV bit set to 1 responds. |
| Supplies |  |  |
| $V_{\text {DD }}$ | n/a | Chip Core Supply. 1.8V. |
| $\mathrm{V}_{\text {DDQ }}$ | n/a | Chip I/O supply. 2.5 V or 3.3 V . |
| Test Access Port |  |  |
| TDI | I | Test access port's test data in. |
| TCK | I | Test access port's test clock. |
| TDO | T | Test access port's test data out. |
| TMS | I | Test access port's Test Mode Select. |
| TRST_L | I | Test access port's Reset. |

## Notes:

1. I = Input only, I/O = Input or Output, O = Output only, T = three-state output.
2. CLK" is an internal clock signal. Any reference to "CLK cycles" means one cycle of CLK.
3. ACK and EOT require a weak external pulldown such as $47 \mathrm{~K} \Omega$ or $100 \mathrm{~K} \Omega$.

CYNSE70064

### 6.0 Clocks

CYNSE70064 receives the CLK2X and PHS_L signals. It uses the PHS_L signal to divide CLK2X and generate an internal CLK, as shown in Figure 6-1. The CYNSE70064 uses CLK2X and CLK for internal operations.


Figure 6-1. CYNSE70064 Clocks (CLK2X and PHS_L)

### 7.0 Registers

All registers in the CYNSE70064 are 68 bits wide. The CYNSE70064 contains 16 pairs of comparand storage registers, eight pairs of GMRs, eight search successful index registers and one each of CMD, information, burst Read, burst Write, and next-free address registers. Table $7-1$ provides an overview of all the CYNSE70064 registers. The registers are ordered in ascending address order. Each register group is then described in the following subsections.

Table 7-1. Register Overview

| Address | Abbreviation | Type | Name |
| :---: | :---: | :---: | :--- |
| $0-31$ | COMP0-31 | R | Sixteen pairs of comparand registers that store comparands from the <br> DQ bus for learning later. |
| $32-47$ | MASKS | RW | Eight global mask register pairs. |
| $48-55$ | SSR0-7 | R | Eight search successful index registers. |
| 56 | COMMAND | RW | Command register. |
| 57 | INFO | R | Information register. |
| 58 | RBURREG | RW | Burst Read register. |
| 59 | WBURREG | RW | Burst Write register. |
| 60 | NFA | R | Next-free address register. |
| $61-63$ | - | - | Reserved. |

### 7.1 Comparand Registers

The device contains 3268 -bit comparand registers (16 pairs) dynamically selected in every Search operation to store the comparand presented on the DQ bus. The Learn command will later use these registers when executed. The CYNSE70064 stores the Search command's cycle A comparand in the even-numbered register and the cycle B comparand in the odd-numbered register, as shown in Figure 7-1.


Figure 7-1. Comparand-Register Selection During Search and Learn Instructions

### 7.2 Mask Registers

The device contains 16 68-bit global mask registers (eight pairs) dynamically selected in every Search operation to select the Search subfield. The addressing of these registers is explained in Figure 7-2. The three-bit GMR Index supplied on the command (CMD) bus can apply eight pairs of global masks during the Search and Write operations, as shown below. Note. In 68-bit Search and Write operations, the host ASIC must program both the even and odd mask registers with the same values.

| Index | 68 | 68 |
| :---: | :---: | :---: |
| 0 | 0 | 1 |
| 1 | 2 | 3 |
| 2 | 4 | 5 |
| 3 | 6 | 7 |
| 4 | 8 | 9 |
| 5 | 10 | 11 |
| 6 | 12 | 13 |
| 7 | 14 | 15 |

Search and Write Command Global Mask Selection
Figure 7-2. Addressing the Global Masks Register Array
Each mask bit in the GMRs is used during Search and Write operations. In Search operations, setting the mask bit to 1 enables compares; setting the mask bit to 0 disables compares (forced match) at the corresponding bit position. In Write operations to the data or mask array, setting the mask bit to 1 enables Writes; setting the mask bit to 0 disables Writes at the corresponding bit position.

### 7.3 Search Successful Registers (SSR[0:7])

The device contains eight search successful registers (SSRs) to hold the index of the location where a successful Search occurred. The format of each register is described in Table 7-2. The Search command specifies which SSR stores the index of a specific Search command in cycle B of the Search instruction. Subsequently, the host ASIC can use this register to access that data array, mask array, or external SRAM using the index as part of the indirect access address (see Table 10-4 and Table 10-7). The device with a valid bit set performs a Read or Write operation. All other devices suppress the operation.

Table 7-2. Search Successful Register Description

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :--- |
| INDEX | $[14: 0]$ | $X$ | Index. This is the address of the 68-bit entry where a successful Search occurs. <br> The device updates this field only when the Search is successful. If a hit occurs <br> in a 136-bit entry-size quadrant, the LSB is 0. If a hit occurs in a 272-bit entry-size <br> quadrant, the two LSBs are 00. This index updates if the device is either a local <br> or global winner in a Search operation. |
| - | $[30: 15]$ | 0 | Reserved. |
| VALID | $[31]$ | 0 | Valid. During Search operation in a depth-cascaded configuration, the device <br> that is a global winner in a match sets this bit to 1. This bit updates only when <br> the device is a global winner in a Search operation. |
| - | $[67: 32]$ | 0 | Reserved. |

### 7.4 Command Register

Table 7-3 describes the command register fields.
Table 7-3. Command Register Description

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :--- |
| SRST | $[0]$ | 0 | Software Reset. If 1, this bit resets the device with the same effect as a hardware reset. Internally, <br> it generates a reset pulse lasting for eight CLK cycles. This bit automatically resets to a 0 after <br> the reset has completed. |
| DEVE | $[1]$ | 0 | Device Enable. If 0, it keeps the SRAM bus (SADR, WE_L, CE_L, OE_L, and ALE_L), SSF, and <br> SSV signals in three-state condition and forces the cascade interface output signals LHO[1:0] <br> and BHO[2:0] to 0. It also keeps the DQ bus in input mode. The purpose of this bit is to make <br> sure that there are no bus contentions when the devices power up in the system. |

Table 7-3. Command Register Description (continued)

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :---: |
| TLSZ | [3:2] | 01 | Table Size. The host ASIC must program this field to configure the chips into a table of a certain size. This field affects the pipeline latency of the Search and Learn operations as well as the Read and Write accesses to the SRAM (SADR[21:0], CE_L, OE_L, WE_L, ALE_L, SSV, SSF, and ACK). Once programmed, the Search latency stays constant. <br> Latency in number of CLK cycles <br> 00: One device 4 <br> 01: Up to eight devices 5 <br> 10: Up to 31 devices 6 <br> 11: Reserved. |
| HLAT | [6:4] | 000 | Latency of Hit Signals. This field further adds latency to the SSF and SSV signals during Search, and ACK signal during SRAM Read access by the following number of CLK cycles. |
| LDEV | [7] | 0 | Last Device in the Cascade. When set, this is the last device in the depth-cascaded table and is the default driver for the SSF and SSV signals. In the event of a Search failure, the device with this bit set drives the hit signals as follows: $\mathrm{SSF}=0, \mathrm{SSV}=1$. <br> During nonSearch cycles, the device with this bit set drives the signals as follows: $\mathrm{SSF}=0$, SSV $=0$. |
| LRAM | [8] | 0 | Last Device on the SRAM Bus. When set, this device is the last device on the SRAM bus in the depth-cascaded table and is the default driver for the SADR, CE_L, WE_L, and ALE_L signals. In cycles where no CYNSE70064 device in a depth-cascaded table drives these signals, this devices drives the signals as follows: SADR = 22'h3FFFFF, CE_L = 1, WE_L = 1, and ALE_L $=1$. OE_L is always driven by the device for which this bit is set. |
| CFG | [16:9] | 00000000 | Database Configuration. The device is divided internally into four partitions of $8 \mathrm{~K} \times 68$, each of which can be configured as $8 \mathrm{~K} \times 68,4 \mathrm{~K} \times 136$, or $2 \mathrm{~K} \times 272$, as follows. $00: 8 \mathrm{~K} \times 68$ <br> 01: $4 \mathrm{~K} \times 136$ <br> 10: $2 \mathrm{~K} \times 272$ <br> 11: Reserved <br> Bits [10:9] apply to configuring the first partition in the address space. <br> Bits [12:11] apply to configuring the second partition in the address space. <br> Bits [14:13] apply to configuring the third partition in the address space. <br> Bits [16:15] apply to configuring the fourth partition in the address space. |
|  | [67:17] | 0 | Reserved. |

### 7.5 Information Register

Table 7-4 describes the information register fields.
Table 7-4. Information Register Description

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :--- |
| Revision | $[3: 0]$ | $000^{[4]}$ | Revision Number. This is the current device revision number. Numbers start <br> at one and increment by one for each revision of the device. |
| Implementation | $[6: 4]$ | 000 or 001 | This is the CYNSE70064 implementation number. |
| Reserved | $[7]$ | 0 | Reserved. |
| Device ID | $[11: 8]$ | 0001 or 0010 | This is the device identification number. |
| Device ID | $[12]$ | 0 or 1 | Reserved. |
| Device ID | $[15: 13]$ | 000 | These are the three MSBs of the device identification number. |
| MFID | $[31: 16]$ | $1101 \_1100 \_0111 \_1111$ | Manufacturer ID. This field is the same as the manufacturer identification <br> number and continuation bits in the TAP controller. |
| Reserved | $[67: 32]$ |  | Reserved. |

## Note:

4. This field may change in future versions.

### 7.6 Read Burst Address Register

Table $7-5$ shows the Read burst address register (RBURREG) fields which must be programmed before a burst Read.
Table 7-5. Read Burst Register Description

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :--- |
| ADR | $[14: 0]$ | 0 | Address. This is the starting address of the data or mask array during a burst <br> Read operation. It automatically increments by one for each successive <br> Read of the data or mask array. Once the operation is complete, the contents <br> of this field must be reinitialized for the next operation. |
| BLEN | $[27: 19]$ | 0 | Reserved. |
|  | Length of Burst Access. The device provides the capability to read from 4-511 <br> locations in a single burst. The BLEN decrements automatically. Once the <br> operation is complete, the contents of this field must be reinitialized for the <br> next operation. |  |  |
|  | $[67: 28]$ |  | Reserved. |

### 7.7 Write Burst Address Register Description

Table 7-6 describes the Write burst address register (WBURREG) fields which must be programmed before a burst Write.
Table 7-6. Write Burst Register Description

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :--- |
| ADR | $[14: 0]$ | 0 | Address. This is the starting address of the data or mask array during a burst Write <br> operation. It automatically increments by one for each successive Write of the data or <br> mask array. Once the operation is complete, the contents of this field must be reini- <br> tialized for the next operation. |
| BLEN | $[27: 19]$ | 0 | Reserved. <br> Length of Burst Access. The device provides the capability to Write from 4-511 locations <br> in a single burst. The BLEN decrements automatically. Once the operation is complete, <br> the contents of this field must be reinitialized for the next operation. |
|  | $[67: 28]$ |  | Reserved. |

### 7.8 NFA Register

Bit [0] of each 68-bit data entry is specially designated for use in the operation of the Learn command. For 68-bit-configured quadrants, this bit indicates whether a location is full (bit set to 1 ) or empty (bit set to 0 ). Every Write and/or Learn command loads the address of the first 68 -bit location that contains a 0 in the entry's bit[0]. This is stored in the NFA register (see Table 7-7). If all the bits[0] in a device are set to 1, the CYNSE70064 asserts FULO[1:0] to 1.
For 136-bit-configured quadrants, the LSB of the NFA register is always set to 0 . The host ASIC must set both bit[0] and bit[68] in a 136-bit word to either 0 or 1 to indicate full or empty status. Both bit[0] and bit[68] must be set to either 0 or 1, (that is, the 10 or 01 settings are invalid).

Table 7-7. NFA Register

| Address | $\mathbf{6 7 - 1 5}$ | $\mathbf{1 4 - 0}$ |
| :---: | :---: | :---: |
| 60 | Reserved | Index |

### 8.0 NSE Architecture and Operation Overview

The CYNSE 70064 consists of $32 \mathrm{~K} \times 68$-bit storage cells referred to as data bits. There is a mask cell corresponding to each data cell. Figure $8-1$ shows the three organizations of the device based on the value of the CFG bits in the command register.


Figure 8-1. CYNSE70064 Database WIDTH Configuration
During a Search operation, the Search data bit (S), data array bit (D), mask array bit (M) and the global mask bit (G) are used in the following manner to generate a match at that bit position (see Table 8-1). The entry with a match on every bit position results in a successful Search during a Search operation.

Table 8-1. Bit Position Match

| $\mathbf{G}$ | $\mathbf{M}$ | $\mathbf{D}$ | $\mathbf{S}$ | Match |
| :---: | :---: | :---: | :---: | :---: |
| 0 | $\mathbf{X}$ | $\mathbf{X}$ | $\mathbf{X}$ | 1 |
| 1 | 0 | $X$ | 0 | 1 |
| 1 | 1 | 0 | 0 | 1 |
| 1 | 1 | 1 | 1 | 0 |
| 1 | 1 | 0 | 1 | 0 |
| 1 | 1 | 1 | 1 |  |

In order for a successful Search within a device to make the device the local winner in the Search operation, all 68-bit positions must generate a match for a 68-bit entry in 68-bit configured quadrants, or all 136 -bit positions must generate a match for two consecutive even and odd 68-bit entries in quadrants configured as 136 bits, or all 272-bit positions must generate a match for four consecutive entries aligned to four entry-page boundaries of 68 -bit entries in quadrants configured as 272 bits.
An arbitration mechanism using a cascade bus determines the global winning device among the local winning devices in a Search cycle. The global winning device drives the SRAM bus, SSV, and the SSF signals. In case of a Search failure, the device(s) with the LDEV and LRAM bits set drives the SRAM bus, SSF, and SSV signals.
The CYNSE70064 device can be configured to contain tables of different widths, even within the same chip. Figure 8-2 shows a sample configuration of different widths.


Figure 8-2. Multiwidth Database Configurations Example

### 9.0 Data and Mask Addressing

Figure 9-1 shows CYNSE70064 data and mask array addressing.


Figure 9-1. Addressing of the CYNSE70064 Data and Mask Arrays

### 10.0 Commands

A master device such as an ASIC controller issues commands to the CYNSE70064 device using the command valid (CMDV) signal and the CMD bus. The following subsections describe the operation of the commands.

### 10.1 Command Codes

The CYNSE70064 implements four basic commands, shown in Table 10-1. The command code must be presented to CMD[1:0] while keeping the CMDV signal HIGH for two CLK2X cycles (designated as cycles A and B). The controller ASIC must align the instructions using the PHS_L signal. The CMD[8:2] field passes the parameters of the command in cycles A and B.

Table 10-1. Command Codes

| Command Code | Command | Description |
| :---: | :---: | :--- |
| 00 | Read | Reads one of the following: data array, mask array, device registers, or external SRAM. |
| 01 | Write | Writes one of the following: data array, mask array, device registers, or external SRAM. |
| 10 | Search | Searches the data array for a desired pattern using the specified register from the GMR <br> array and local mask associated with each data cell. |
| 11 | Learn | The device has internal storage for up to 16 comparands that it can learn. The device <br> controller can insert these entries at the next-free address (as specified by the NFA <br> register) using the Learn instruction. |

CYNSE70064

### 10.2 Commands and Command Parameters

Table 10-2 lists the CMD bus fields that contain the CYNSE70064 command parameters and their respective cycles. Each command is described separately in the subsections that follow.

Table 10-2. Command Parameters

| CMD | CYC | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Read | A | SADR[21] | SADR[20] | X | 0 | 0 | 0 | $\begin{gathered} 0=\text { Single } \\ 1=\text { Burst } \end{gathered}$ | 0 | 0 |
|  | B | 0 | 0 | 0 | 0 | 0 | 0 | $\begin{gathered} 0=\text { Single } \\ 1=\text { Burst } \end{gathered}$ | 0 | 0 |
| Write | A | SADR[21] | SADR[20] | X | GMR Index [2:0] |  |  | $\begin{gathered} 0=\text { Single } \\ 1=\text { Burst } \end{gathered}$ | 0 | 1 |
|  | B | 0 | 0 | 0 | GMR Index [2:0] |  |  | $\begin{gathered} \hline 0=\text { Single } \\ 1=\text { Burst } \end{gathered}$ | 0 | 1 |
| Search | A | SADR[21] | SADR[20] | X | GMR Index 2:0] |  |  | ```68-bit or 136-bit: 0 272-bit: 1 in first cycle 0 in second cycle``` | 1 | 0 |
|  | B | SSRI[2:0] |  |  | Comparand Register Index |  |  |  | 1 | 0 |
| Learn ${ }^{[5]}$ | A | SADR[21] | SADR[20] | X | Comparand Register Index |  |  |  | 1 | 1 |
|  | B | 0 | 0 | Mode $0: 68$-bit $1: 136$-bit | Comparand Register Index |  |  |  | 1 | 1 |

### 10.3 Read Command

The Read can be a single Read of a data array, a mask array, an SRAM, or a register location (CMD[2] = 0). It can be a burst Read of the data (CMD[2] = 1) or mask array locations using an internal auto-incrementing address register (RBURADR). A description of each type is provided in Table 10-3. A single-location Read operation lasts six cycles, as shown in Figure 10-1. The burst Read adds two cycles for each successive Read. The SADR[21:20] bits supplied in the Read instruction cycle A drives SADR[21:20] signals during the Read of an SRAM location.

Table 10-3. Read Command Parameters

| CMD Parameter CMD[2] | Read Command | Description |
| :---: | :---: | :--- |
| 0 | Single Read | Reads a single location of the data array, mask array, external SRAM, or <br> device registers. All access information is applied on the DQ bus. |
| 1 | Burst Read | Reads a block of locations from the data array, or mask array as a burst. <br> The internal register (RBURADR) specifies the starting address and the <br> length of the data transfer from the data or mask array, and it auto-incre- <br> ments the address for each access. All other access information is <br> applied on the DQ bus. Note. The device registers and external SRAM <br> can only be read in single-Read mode. |

## Note:

5. The 272-bit-configured devices or 272-bit-configured quadrants within devices do not support the Learn instruction.


Figure 10-1. Single-Location Read Cycle Timing
The single Read operation takes six clock cycles, in the following sequence.

- Cycle 1: The host ASIC applies the Read instruction on the CMD[1:0] (CMD[2] = 0) using CMDV $=1$ and the DQ bus supplies the address, as shown in Table 10-4 and Table 10-5. The host ASIC selects the CYNSE70064 for which ID[4:0] matches the DQ[25:21] lines. If the DQ[25:21] = 11111, the host ASIC selects the CYNSE70064 with the LDEV bit set. The host ASIC also supplies SADR[21:20] on CMD[8:7] in cycle A of the Read instruction if the Read is directed to the external SRAM.
- Cycle 2: The host ASIC floats DQ[67:0] to three-state condition.
- Cycle 3: The host ASIC keeps DQ[67:0] in three-state condition.
- Cycle 4: The selected device starts to drive the DQ[67:0] bus, and drives the ACK signal from Z to LOW.
- Cycle 5: The selected device drives the Read data from the addressed location on the DQ[67:0] bus, and drives the ACK signal HIGH.
- Cycle 6: The selected device floats the DQ[67:0] to three-state condition and drives the ACK signal LOW.

At the termination of cycle 6, the selected device releases the ACK line to three-state condition. The Read instruction is complete, and a new operation can begin. Note that the latency of the SRAM Read will be different than the one described above (see Subsection 12.2, "SRAM PIO Access" on page 102). Table 10-4 lists and describes the format of the Read address for a data array, mask array, or SRAM.

Table 10-4. Read Address Format for Data Array, Mask Array, or SRAM

| DQ[67:30] | DQ[29] | DQ[28:26] | DQ[25:21] | DQ[20:19] | DQ[18:15] | DQ[14:0] |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Reserved | 0: Direct <br> 1: Indirect | SSRI (applicable if $D Q[29]$ is indirect) | ID | $\begin{aligned} & \text { 00: Data } \\ & \text { Array } \end{aligned}$ | Reserved | If DQ[29] is 0 , this field carries the address of the data array location. If DQ[29] is 1 , the SSRI specified on DQ[28:26] is used to generate the address of the data array location: \{SSR[14:2], SSR[1] \| DQ[1], SSR[0] | DQ[0]\}. ${ }^{[6]}$ |
| Reserved | $\begin{gathered} \text { 0: Direct } \\ \text { 1: Indirect } \end{gathered}$ | SSRI (applicable if $D Q[29]$ is indirect) | ID | 01: Mask Array | Reserved | If $\mathrm{DQ}[29]$ is 0 , this field carries the address of the mask array location. If DQ[29] is 1 , the SSRI specified on DQ[28:26] is used to generate the address of the mask array location: \{SSR[14:2], SSR[1] \| DQ[1], SSR[0] | DQ[0]\}. ${ }^{[6]}$ |
| Reserved | $\begin{array}{\|c\|} \hline \text { 0: Direct } \\ \text { 1: Indirect } \end{array}$ | SSRI (applicable if $D Q[29]$ is indirect) | ID | 10: <br> External SRAM | Reserved | If DQ[29] is 0 , this field carries the address of the SRAM location. If DQ[29] is 1 , the SSRI specified on DQ[28:26] is used to generate the address of the SRAM location: \{SSR[14:2], SSR[1] \| DQ[1], SSR[0] | DQ[0]\} ${ }^{[6]}$ |

## Note:

6. "|" stands for logical OR operation. "\{\}" stands for concatenation operator.

Table 10-5 describes the Read address format for the internal registers. Figure 10-2 illustrates the timing diagram for the burst Read of the data or mask array.

Table 10-5. Read Address Format for Internal Registers

| DQ[67:26] | DQ[25:21] | DQ[20:19] | DQ[18:6] | DQ[5:0] |
| :---: | :---: | :---: | :---: | :---: |
| Reserved | ID | 11: Register | Reserved | Register Address |



Figure 10-2. Burst Read of the Data and Mask Arrays (BLEN = 4)
The Read operation lasts $4+2 n$ CLK cycles (where $n$ is the number of accesses in the burst specified by the BLEN field of the RBURREG) in the sequence shown below. This operation assumes that the host ASIC has programmed the RBURREG with the starting address (ADR) and the length of the transfer (BLEN) before initiating the burst Read command.

- Cycle 1: The host ASIC applies the Read instruction on CMD[1:0] (CMD[2] = 1) using CMDV $=1$ and the address supplied on the DQ bus, as shown in Table 10-6. The host ASIC selects the CYNSE70064 where ID[4:0] matches the DQ[25:21] lines. If the DQ[25:21] = 11111, the host ASIC selects the CYNSE70064 with the LDEV bit set.
- Cycle 2: The host ASIC floats DQ[67:0] to the three-state condition.
- Cycle 3: The host ASIC keeps DQ[67:0] in the three-state condition.
- Cycle 4: The selected device starts to drive the DQ[67:0] bus and drives ACK and EOT from $Z$ to LOW.
- Cycle 5: The selected device drives the Read data from the addressed location on the $D Q[67: 0]$ bus, and drives the $A C K$ signal HIGH.
Cycles 4 and 5 repeat for each additional access until all the accesses specified in the burst length (BLEN) field of RBURREG are complete. On the last transfer, the CYNSE70064 drives the EOT signal HIGH.
- Cycle (4+2n): The selected device drives the DQ[67:0] to the three-state condition, and drives the ACK and EOT signals LOW. At the termination of cycle $(4+2 n)$, the selected device floats the ACK line to the three-state condition. The burst Read instruction is complete, and a new operation can begin. Table 10-6 describes the Read address format for data and mask arrays for burst Read operations.

Table 10-6. Read Address Format for Data and Mask Arrays

| DQ[67:26] | DQ[25:21] | DQ[20:19] | DQ[18:15] | DQ[14:0] |
| :---: | :---: | :---: | :---: | :--- |
| Reserved | ID | 00: Data Array | Reserved | Do not care. These 15 bits come from the internal <br> register (RBURADR) which increments for each <br> access. |
| Reserved | ID | 01: Mask Array | Reserved | Do not care. These 15 bits come from the internal <br> register (RBURADR) which increments for each <br> access. |

### 10.4 Write Command

The Write can be a single Write of a data array, mask array, register, or external SRAM location (CMD[2] = 0). It can be a burst Write (CMD[2] = 1) using an internal auto-incrementing address register (WBURADR) of the data or mask array locations. A single-location Write is a 3-cycle operation, as shown in Figure 10-3. The burst Write adds one extra cycle for each successive location Write.


Figure 10-3. Single Write Cycle Timing
The following is the Write operation sequence, and Table 10-7 shows the Write address format for the data array, the mask array, or single-Write SRAM. Table 10-8 shows the Write address format for the internal registers.

- Cycle 1A: The host ASIC applies the Write instruction to the CMD[1:0] (CMD[2] = 0), using CMDV = 1 and the address supplied on the DQ bus. The host ASIC also supplies the GMR Index to mask the Write to the data or mask array location on CMD[5:3]. For SRAM Writes, the host ASIC must supply the SADR[21:20] on CMD[8:7].
- Cycle 1B:The host ASIC continues to apply the Write instruction to the CMD[1:0] (CMD[2] = 0), using CMDV = 1 and the address supplied on the DQ bus. The host ASIC continues to supply the GMR Index to mask the Write to the data or mask array locations in CMD[5:3]. The host ASIC selects the device where ID[4:0] matches the DQ[25:21] lines, or it selects all the devices when DQ[25:21] = 11111.
- Cycle 2: The host ASIC drives the DQ[67:0] with the data to be written to the data array, mask array, or register location of the selected device.
- Cycle 3: Idle cycle.

At the termination of cycle 3, another operation can begin.
Note. The latency of the SRAM Write will be different than the one described above (see Subsection 12.2, "SRAM PIO Access" on page 102).

Table 10-7. Write Address Format for Data Array, Mask Array, or SRAM (Single Write)

| $\begin{gathered} \text { DQ } \\ {[67: 30]} \end{gathered}$ | DQ[29] | DQ[28:26] | $\begin{array}{\|c\|} \hline \text { DQ } \\ {[25: 21]} \end{array}$ | DQ[20:19] | $\begin{gathered} \hline \text { DQ } \\ {[18: 15]} \end{gathered}$ | DQ[14:0] |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| Reserved | $\begin{array}{\|c\|c\|} \hline \text { 0: Direct } \\ \text { 1: Indirect } \end{array}$ | SSR (appli- cable if DQ[29] is indirect) | ID | 00: Data Array | Reserved | If $D Q[29]$ is 0 , this field carries the address of the data array location. <br> If DQ[29] is 1 , the SSR specified on DQ[28:26] is used to generate the address of data array location: \{SSR[14:2], SSR[1] \| DQ[1], SSR[0] | DQ[0]\}. [7] |
| Reserved | $\begin{array}{\|c\|} \hline 0: \text { Direct } \\ \text { 1: Indirect } \end{array}$ | $\begin{aligned} & \text { SSR (appli- } \\ & \text { cable if DQ[29] } \\ & \text { is indirect) } \end{aligned}$ | ID | 01: Mask Array | Reserved | If $D Q[29]$ is 0 , this field carries the address of the mask array location. <br> If DQ[29] is 1 , the SSR specified on DQ[28:26] is used to generate the address of the mask array location: \{SSR[14:2], SSR[1] \| DQ[1], SSR[0] | DQ[0]\}. ${ }^{[7]}$ |
| Reserved | $\begin{array}{\|c\|} \hline \text { 0: Direct } \\ \text { 1: Indirect } \end{array}$ | SSR (appli- cable if DQ[29] is indirect) | ID | 10: External SRAM | Reserved | If $D Q[29]$ is 0 , this field carries the address of the SRAM location. <br> If DQ[29] is 1 , the SSR specified on DQ[28:26] is used to generate the address of SRAM location: \{SSR[14:2], SSR[1] \| DQ[1], SSR[0] | DQ[0]\}.[7] |

## Note:

7. " |" stands for logical OR operation. "\}\}" stands for concatenation operator.

Table 10-8. Write Address Format for Internal Registers

| DQ[67:26] | DQ[25:21] | DQ[20:19] | DQ[18:6] | DQ[5:0] |
| :---: | :---: | :---: | :---: | :---: |
| Reserved | ID | 11: Register | Reserved | Register address |

Figure 10-4 shows the timing diagram of a burst Write operation of the data or mask array.


Figure 10-4. Burst Write of the Data and Mask Arrays (BLEN = 4)
The burst Write operation lasts for $(\mathrm{n}+2)$ CLK cycles. n signifies the number of accesses in the burst as specified in the BLEN field of the WBURREG register. The following is the block Write operation sequence. This operation assumes that the host ASIC has programmed the WBURREG with the starting address (ADR) and the length of transfer (BLEN) before initiating a burst Write command.

- Cycle 1A: The host ASIC applies the Write instruction to the CMD[1:0] (CMD[2] = 1 ), using CMDV $=1$ and the address supplied on the DQ bus, as shown in Table 10-9. The host ASIC also supplies the GMR Index to mask the Write to the data or mask array locations in CMD[5:3].
- Cycle 1B: The host ASIC continues to apply the Write instruction on the CMD[1:0] (CMD[2] = 1 ), using CMDV = 1 and the address supplied on the DQ bus. The host ASIC continues to supply the GMR Index to mask the Write to the data or mask array locations in CMD[5:3]. The host ASIC selects the device for which ID[4:0] matches the DQ[25:21] lines. It selects all the devices when DQ[25:21] = 11111.
- Cycle 2: The host ASIC drives the DQ[67:0] with the data to be written to the data or mask array location of the selected device. The CYNSE70064 writes the data from the DQ[67:0] bus only to the subfield that has the corresponding mask bit set to 1 in the GMR specified by the index CMD[5:3] supplied in cycle 1.
- Cycles $\mathbf{3}$ to $\mathbf{n + 1}$ : The host ASIC drives the DQ[67:0] with the data to be written to the next data or mask array location (addressed by the auto-increment ADR field of the WBURREG register) of the selected device.
The CYNSE70064 writes the data on the DQ[67:0] bus only to the subfield that has the corresponding mask bit set to 1 in the GMR specified by the index CMD[5:3] supplied in cycle 1. The CYNSE70064 drives the EOT signal LOW from cycle 3 to cycle n ; the CYNSE70064 drives the EOT signal HIGH in cycle $\mathrm{n}+1$ ( n is specified in the BLEN field of the WBURREG).
- Cycle n+2: TheCYNSE70064 drives the EOT signal LOW.

At the termination of cycle $n+2$, the CYNSE70064 floats the EOT signal to a three-state operation, and a new instruction can begin.

Table 10-9. Write Address Format for Data and Mask Array (Burst Write)

| DQ[67:26] | DQ[25:21] | DQ[20:19] | DQ[18:15] | DQ[14:0] |
| :---: | :---: | :---: | :---: | :--- |
| Reserved | ID | 00: Data array | Reserved | Do not care. These 15 bits come from the internal register <br> (WBURADR), which increments with each access. |
| Reserved | ID | 01: Mask array | Reserved | Do not care. These 15 bits come from the internal register <br> (WBURADR), which increments with each access. |

### 10.5 Search Command

This subsection describes the following:

- 68-bit Search on tables configured as $\times 68$ using one device
- 68 -bit Search on tables configured as $\times 68$ using up to eight devices
- 68-bit Search on tables configured as $\times 68$ using up to 31 devices
- 136-bit Search on tables configured as $\times 136$ using one device
- 136-bit Search on tables configured as $\times 136$ using up to eight devices
- 136-bit Search on tables configured as $\times 136$ using up to 31 devices
- 272-bit Search on tables configured as $\times 272$ using one device
- 272-bit Search on tables configured as $\times 272$ using up to eight devices
- 272-bit Search on tables configured as $\times 272$ using up to 31 devices
- Mixed-size searches on tables configured with different widths using an CYNSE70064.


### 10.6 68-bit Search on Tables Configured as $\times 68$ using a Single CYNSE70064 Device

Figure 10-5 shows the timing diagram for a Search command in the 68-bit-configured table ( $\mathrm{CFG}=00000000$ ) consisting of a single device for one set of parameters: $T L S Z=00, H L A T=000, L R A M=1$, and $L D E V=1$. The hardware diagram for this Search subsystem is shown in Figure 10-6.

$C F G=00000000, H L A T=000, T L S Z=00, L R A M=1, L D E V=1$.
Figure 10-5. Timing Diagram for 68-bit Search in x68 Table (One Device)


Figure 10-6. Timing Diagram for 68-bit Search in x68 Table (One Device)
The following is the sequence of operation for a single 68-bit Search command (also refer to Command and Command Parameters, Subsection 10.2 on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair for use in this Search operation. CMD[8:7] signals must be driven with the same bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68-bit data to be compared. The CMD[2] signal must be driven to logic 0 .
- Cycle B: The host ASIC continues to drive the CMDV HIGH and to apply Search command (10) on CMD[1:0]. CMD[5:2] must be driven by the index of the comparand register pair for storing the 136-bit word presented on the DQ bus during cycles $A$ and B. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and the hit flag (see page 14 for information on SSR[0:7]). The DQ[67:0] continues to carry the 68-bit data to be compared.
Note. For 68-bit searches, the host ASIC must supply the same 68-bit data on DQ[67:0] during both cycles A and B. The even and odd pair of GMRs selected for the compare must be programmed with the same value.

The logical 68 -bit Search operation is shown in Figure 10-7. The entire table consisting of 68 -bit entries is compared to a 68 -bit word K (presented on the DQ bus in both cycles A and B of the command) using the GMR and the local mask bits. The effective GMR is the 68-bit word specified by the identical value in both even and odd GMR pairs selected by the GMR Index in the command's cycle A. The 68-bit word K (presented on the DQ bus in both cycles A and B of the command) is also stored in both even and odd comparand register pairs selected by the Comparand Register Index in the command's cycle B. In a $\times 68$ configuration, only the even comparand register can be subsequently used by the Learn command. The word K (presented on the DQ bus in both cycles $A$ and $B$ of the command) is compared with each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101).


Figure 10-7. $\times 68$ Table with One Device
The Search command is a pipelined operation and executes a Search at half the rate of the frequency of CLK2X for 68-bit searches in $\times 68$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 68-bit Search command cycle (two CLK2X cycles) is shown in Table 10-10.

Table 10-10. The Latency of Search from Instruction to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $32 \mathrm{~K} \times 68$ bits | 4 |
| $1-8(T L S Z=01)$ | $256 \mathrm{~K} \times 68$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $992 \mathrm{~K} \times 68$ bits | 6 |

The latency of a Search from command to SRAM access cycle is 4 for a single device in the table and TLSZ $=00$. In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-11.

Table 10-11. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.7 68-bit Search on Tables Configured as $\times 68$ Using up to Eight CYNSE70064 Devices

The hardware diagram of the Search subsystem of eight devices is shown in Figure 10-8. The following are the parameters programmed into the eight devices.

- First seven devices (devices $0-6$ ): $C F G=00000000, T L S Z=01, H L A T=010, L R A M=0$, and $L D E V=0$.
- Eighth device (device 7): $\mathrm{CFG}=00000000, \mathrm{TLSZ}=01, \mathrm{HLAT}=010, \mathrm{LRAM}=1$, and LDEV $=1$.

Note. All eight devices must be programmed with the same values for TLSZ and HLAT. Only the last device in the table (device number 7 in this case) must be programmed with LRAM $=1$ and LDEV = 1. All other upstream devices (devices 0 through 6 in this case) must be programmed with LRAM $=0$ and LDEV $=0$.
Figure $10-9$ shows the timing diagram for a Search command in the 68-bit-configured table of eight devices for device number 0. Figure 10-10 shows the timing diagram for a Search command in the 68-bit-configured table of eight devices for device number 1. Figure $10-11$ shows the timing diagram for a Search command in the 68 -bit-configured table of eight devices for device number 7 (the last device in this specific table). For these timing diagrams four 68 -bit searches are performed sequentially. Hit/Miss assumptions were made as shown below in Table 10-12.

Table 10-12. Hit/Miss Assumption

| Search Number | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ | $\mathbf{4}$ |
| :---: | :---: | :---: | :---: | :---: |
| Device 0 | Hit | Miss | Hit | Miss |
| Device 1 | Miss | Hit | Hit | Miss |
| Devices 2-6 | Miss | Miss | Miss | Miss |
| Device 7 | Miss | Miss | Hit | Hit |



Figure 10-8. Hardware Diagram for a Table With Eight Devices
$C F G=00000000, H L A T=010, T L S Z=01, L R A M=0, L D E V=0$.
Note: |(LHI[6:0]) stands for the boolean 'OR' of the entire bus LHI[6:0]. Note: Each bit in LHO[1:0] is the same logical signal.

|  | 1 | 1 |
| :--- | :--- | :--- |
| Search1 | Search3 |  |
| (This | (This |  |
| device | device |  |
| is the | is the |  |
| global | global |  |
| winner.) | winner.) |  |
| Search2 |  | Search4 |
| (Miss | (Miss |  |
| on this | on this |  |
| device.) | device.) |  |

Figure 10-9. Timing Diagram for 68-bit Search Device Number 0


Figure 10-10. Timing Diagram for 68-bit Search Device Number 1


Figure 10-11. Timing Diagram for 68-bit Search Device Number 7 (Last Device)
The following is the sequence of operation for a single 68-bit Search command (also refer to "Command and Command Parameters," Subsection 10.2 on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) to CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair for use in this Search operation. CMD[8:7] signals must be driven with the same bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68-bit data to be compared. The CMD[2] signal must be driven to logic 0 .
- Cycle B: The host ASIC continues to drive the CMDV HIGH and to apply Search command (10) on CMD[1:0]. CMD[5:2] must be driven by the index of the comparand register pair for storing the 136-bit word presented on the DQ bus during cycles A and B. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and hit flag (see page 14 for a description of SSR[0:7]). The DQ[67:0] continues to carry the 68-bit data to be compared.

Note. For 68-bit searches, the host ASIC must supply the same 68-bit data on DQ[67:0] during both cycles A and B, and the even and odd pairs of GMRs selected for the comparison must be programmed with the same value.
The logical 68-bit Search operation is shown in Figure 10-12. The entire table with eight devices of 68 -bit entries is compared to a 68 -bit word $K$ (presented on the DQ bus in both cycles $A$ and $B$ of the command) using the GMR and the local mask bits. The effective GMR is the 68-bit word specified by the identical value in both even and odd GMR pairs in each of the eight devices and selected by the GMR Index in the command's cycle A. The 68-bit word K (presented on the DQ bus in both cycles A and B of the command) is also stored in both even and odd comparand register pairs (selected by the Comparand Register Index in command cycle $B$ ) in each of the eight devices. In the $\times 68$ configuration, only the even comparand register can subsequently be used by the Learn command in one of the devices (only the first non-full device). The word K (presented on the DQ bus in both cycles $A$ and $B$ of the command) is compared with each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101). The global winning device will drive the bus in a specific cycle. On a global miss cycle the device with LRAM = 1 (default driving device for the SRAM bus) and LDEV $=1$ (default driving device for SSF and SSV signals) will be the default driver for such missed cycles.


Figure 10-12. x68 Table with Eight Devices
The Search command is a pipelined operation and executes a Search at half the rate of the frequency of CLK2X for 68-bit searches in $\times 68$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 68-bit Search command cycle (two CLK2X cycles) is shown in Table 10-13.

Table 10-13. The Latency of Search from Instruction to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(T L S Z=00)$ | $32 \mathrm{~K} \times 68$ bits | 4 |
| $1-8(T L S Z=01)$ | $256 \mathrm{~K} \times 68$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $992 \mathrm{~K} \times 68$ bits | 6 |

The latency of the Search from command to SRAM access cycle is 5 for up to eight devices in the table (TLSZ = 01). SSV and SSF also shift further to the right for different values of HLAT, as specified in Table 10-14.

Table 10-14. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

## $10.8 \quad 68$-bit Search on Tables Configured as $\times 68$ Using up to 31 CYNSE70064 Devices

The hardware diagram of the Search subsystem of 31 devices is shown in Figure 10-13. Each of the four blocks in the diagram represents eight CYNSE70064 devices (except the last, which has seven devices). The diagram for a block of eight devices is shown in Figure 10-14. The following are the parameters programmed into the 31 devices.

- First thirty devices (devices 0-29): $C F G=00000000, T L S Z=10, H L A T=001, L R A M=0$, and LDEV $=0$.
- Thirty-first device (device 30): CFG $=00000000, T L S Z=10, H L A T=001$, LRAM $=1$, and LDEV $=1$.

Note. All 31 devices must be programmed with the same values for TLSZ and HLAT. Only the last device in the table must be programmed with LRAM = 1 and LDEV = 1 (device number 30 in this case). All other upstream devices must be programmed with LRAM $=0$ and LDEV $=0$ (devices 0 through 29 in this case).
The timing diagrams referred to in this paragraph reference the Hit/Miss assumptions defined in Table 10-15. For the purpose of illustrating the timings, it is further assumed that the there is only one device with a matching entry in each of the blocks. Figure $10-15$ shows the timing diagram for a Search command in the 68-bit-configured table of 31 devices for each of the eight devices in block 0 . Figure $10-16$ shows a timing diagram for a Search command in the 68 -bit-configured table of 31 devices for the all the devices in block number 1 (above the winning device in that block). Figure 10-17 shows the timing diagram for the globally winning device (defined as the final winner within its own and all blocks) in block number 1. Figure 10-18 shows the timing diagram for all the devices below the globally winning device in block number 1. Figure 10-19, Figure 10-20, and Figure 10-21 show the timing diagrams of the devices above the globally winning device, the globally winning device, and the devices below the globally winning device, respectively, for block number 2. Figure 10-22, Figure 10-23, Figure 10-24, and Figure 10-25 show the timing diagrams of the devices above globally winning device, the globally winning device, and the devices below the globally winning device except the last device (device 30), respectively, for block number 3.

The 68-bit Search operation is pipelined and executes as follows. Four cycles from the Search command, each of the devices knows the outcome internal to it for that operation. In the fifth cycle after the Search command, the devices in a block arbitrate for a winner amongst them (a "block" being defined as less than or equal to eight devices resolving the winner within them using the LHI[6:0] and LHO[1:0] signalling mechanism). In the sixth cycle after the Search command, the blocks (of devices) resolve the winning block through the $\mathrm{BHI}[2: 0]$ and $\mathrm{BHO}[2: 0]$ signalling mechanism. The winning device within the winning block is the global winning device for a Search operation.

Table 10-15. Hit/Miss Assumptions

| Search Number | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ | $\mathbf{4}$ |
| :---: | :---: | :---: | :---: | :---: |
| Block 0 | Miss | Miss | Miss | Miss |
| Block 1 | Miss | Miss | Hit | Miss |
| Block 2 | Miss | Hit | Hit | Miss |
| Block 3 | Hit | Hit | Miss | Miss |



Figure 10-13. Hardware Diagram for a Table with 31 Devices


Figure 10-14. Hardware Diagram for a Block of up to Eight Devices


SADR[21:0] $\quad$ z


Figure 10-15. Timing Diagram for Each Device In Block Number 0 (Miss on Each Device)
$F G=00000000, H L A T=001, T L S Z=10, L R A M=0, L D E V=0$.
ote: |(BHI[2:0)] stands for the boolean 'OR' of the entire bus BHI[2:0]. ote: |(LHI[6:0]) stands for the boolean 'OR' for the entire bus LHI[6:0]. ote: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal. ote: Each bit in LHO[1:0] is the same logical signal.

|  |  |
| :--- | :--- | :--- |
| Search1 | Search3 |
| (Miss | (Miss |
| on | on |
| this | this |
| device.) | device.) |

Figure 10-16. Timing Diagram for Each Device Above the Winning Device in Block Number 1


Figure 10-17. Timing Diagram for Globally Winning Device in Block Number 1


SADR[21:0] $\qquad$


Figure 10-18. Timing Diagram for Devices Below the Winning Device in Block Number 1
$C F G=00000000, \mathrm{HLAT}=001, \mathrm{TLSZ}=10, \operatorname{LRAM}=0, L D E V=0$.
Note: |(BHI[2:0]) stands for the boolean 'OR' of the entire bus BHI[2:0].
Note: |(LHI[6:0]) stands for the boolean 'OR' for the entire bus LHI[6:0].
Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.


Figure 10-19. Timing Diagram for Devices Above the Winning Device in Block Number 2
$C F G=00000000, H L A T=001, T L S Z=10, L R A M=0, L D E V=0$.
Note: |(BHI[2:0]) stands for the boolean 'OR' of the entire bus BHI[2:0].
Note: |(LHI[6:0]) stands for the boolean 'OR' for the entire bus LHI[6:0].
Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.

| Search2 | Search4 <br> (Global <br> (Miss |
| :--- | :--- |
| winner.) | on this <br> device.) |
|  |  |

Figure 10-20. Timing Diagram for Globally Winning Device in Block Number 2


Figure 10-21. Timing Diagram for Devices Below the Winning Device in Block Number 2


Figure 10-22. Timing Diagram for Devices Above the Winning Device in Block Number 3


Figure 10-23. Timing Diagram for Globally Winning Device in Block Number 3


Figure 10-24. Timing Diagram for Devices Below the Winning Device in Block Number 3
(Except the Last Device [Device 30])


Figure 10-25. Timing Diagram for Device Number 6 in Block Number 3 (Device 30 in Depth-Cascaded Table)
The following is the sequence of operation for a single 68-bit Search command (also refer to the "Command and Command Parameters," Subsection 10.2 on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair for use in this Search operation. CMD[8:7] signals must be driven with the same bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68-bit data to be compared. The CMD[2] signal must be driven to a logic 0 .
- Cycle B: The host ASIC continues to drive the CMDV HIGH and applies Search command (10) on CMD[1:0]. CMD[5:2] must be driven by the index of the comparand register pair for storing the 136-bit word presented on the DQ bus during cycles $A$ and B. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and the hit flag (see page 14 for the description of SSR[0:7]). The DQ[67:0] continues to carry the 68-bit data to be compared.
Note. For 68-bit searches, the host ASIC must supply the same 68-bit data on DQ[67:0] during both cycles A and B and the even and odd pair of global mask registers selected for the compare must be programmed with the same value.

CYNSE70064

The logical 68-bit Search operation is shown in Figure 10-26. The entire table ( 31 devices of 68 -bit entries) is compared to a 68 -bit word K (presented on the DQ bus in both cycles A and B of the command) using the GMR and the local mask bits. The effective GMR is the 68 -bit word specified by the identical value in both even and odd GMR pairs in each of the eight devices and selected by the GMR Index in the command's cycle A. The 68-bit word K (presented on the DQ bus in both cycles A and B of the command) is also stored in both even and odd comparand register pairs in each of the eight devices and selected by the Comparand Register Index in command's cycle B. In the $\times 68$ configuration, the even comparand register can be subsequently used by the Learn command only in the first non-full device. The word K (presented on the DQ bus in both cycles A and B of the command) is compared with each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101). The global winning device will drive the bus in a specific cycle. On global miss cycles the device with LRAM $=1$ and LDEV $=1$ will be the default driver for such missed cycles.


Figure 10-26. $\times 68$ Table with 31 Devices
The Search command is a pipelined operation and executes a Search at half the rate of the frequency of CLK2X for 68-bit searches in $\times 68$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 68-bit Search command cycle (two CLK2X cycles) is shown in Table 10-16.

Table 10-16. The Latency of Search from Instruction to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $32 \mathrm{~K} \times 68$ bits | 4 |
| $1-8(\mathrm{TLSZ}=01)$ | $256 \mathrm{~K} \times 68$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $996 \mathrm{~K} \times 68$ bits | 6 |

For up to 31 devices in the table (TLSZ = 10), Search latency from command to SRAM access cycle is 6 . In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-17.

Table 10-17. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.9 136-bit Search on Tables Configured as $\times 136$ Using a Single CYNSE70064 Device

Figure 10-27 shows the timing diagram for a Search command in the 136-bit-configured table (CFG = 01010101) consisting of a single device for one set of parameters: $\mathrm{TLSZ}=00, \mathrm{HLAT}=001$, LRAM $=1$, and $\mathrm{LDEV}=1$. The hardware diagram for this Search subsystem is shown in Figure 10-28.


Figure 10-27. Timing Diagram for 136-bit Search (One Device)


Figure 10-28. Hardware Diagram for a Table with One Device
The following is the operation sequence for a single 136-bit Search command (also refer to "Command and Command Parameters," Subsection 10.2 on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) to CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair for use in this Search operation. CMD[8:7] signals must be driven with the same bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68-bit data ([135:68]) to be compared against all even locations. The CMD[2] signal must be driven to logic 0 .
- Cycle B: The host ASIC continues to drive the CMDV HIGH and applies the command code of Search command (10) on CMD[1:0]. CMD[5:2] must be driven by the index of the comparand register pair for storing the 136-bit word presented on the DQ bus during cycles $A$ and $B$. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and hit flag (see page 14 for the description of SSR[0:7]). The DQ[67:0] is driven with 68-bit data ([67:0]), compared to all odd locations.

Note. For 136-bit searches, the host ASIC must supply two distinct 68-bit data words on $D Q[67: 0]$ during cycles $A$ and $B$. The even-numbered GMR of the pair specified by the GMR Index is used for masking the word in cycle A. The odd-numbered GMR of the pair specified by the GMR Index is used for masking the word in cycle B.
The logical 136-bit Search operation is shown in Figure 10-29. The entire table of 136-bit entries is compared to a 136-bit word $K$ (presented on the DQ bus in cycles $A$ and $B$ of the command) using the GMR and the local mask bits. The GMR is the 136-bit word specified by the even and odd global mask pair selected by the GMR Index in the command's cycle A. The 136-bit word K (presented on the DQ bus in cycles A and B of the command) is also stored in both even and odd comparand register pairs selected by the Comparand Register Index in the command's cycle B. The two comparand registers can subsequently be used by the Learn command with the even comparand register stored in an even location, and the odd comparand register stored in an adjacent odd location. The word $K$ (presented on the DQ bus in cycles A and B of the command) is compared with each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101). Note. The matching address is always going to an even address for a 136-bit Search.


Figure 10-29. $\times 136$ Table with One Device
The Search command is a pipelined operation that executes searches at half the rate of the frequency of CLK2X for 136-bit searches in $\times 136$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 136-bit Search command cycle (two CLK2X cycles) is shown in Table 10-18.

Table 10-18. The Latency of Search from Instruction to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $16 \mathrm{~K} \times 136$ bits | 4 |
| $1-8(\mathrm{TLSZ}=01)$ | $128 \mathrm{~K} \times 136$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $496 \mathrm{~K} \times 136$ bits | 6 |

For a single device in the table with $T L S Z=00$, the latency of the Search from command to SRAM access cycle is 4 . In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-19.

Table 10-19. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.10 136-bit Search on Tables Configured as $\times 136$ Using up to Eight CYNSE70064 Devices

The hardware diagram of the Search subsystem of eight devices is shown in Figure 10-30. The following are parameters programmed into the eight devices.

- First seven devices (devices $0-6$ ): $C F G=01010101, T L S Z=01, H L A T=010, L R A M=0$, and $L D E V=0$.
- Eighth device (device 7): CFG $=01010101, \mathrm{TLSZ}=01, \mathrm{HLAT}=010, \mathrm{LRAM}=1$, and LDEV $=1$.

Note. All eight devices must be programmed with the same value of TLSZ and HLAT. Only the last device in the table must be programmed with LRAM = 1 and LDEV = 1 (device number 7 in this case). All other upstream devices must be programmed with LRAM $=0$ and LDEV $=0$ (devices 0 through 6 in this case).
Figure $10-31$ shows the timing diagram for a Search command in the 136 -bit-configured table of eight devices for device 0 . Figure $10-32$ shows the timing diagram for a Search command in the 136-bit-configured table consisting of eight devices for device number 1. Figure $10-33$ shows the timing diagram for a Search command in the 136 -bit configured table consisting of eight devices for device number 7 (the last device in this specific table). For these timing diagrams, four 136-bit searches are performed sequentially, and the following Hit/Miss assumptions were made (see Table 10-20).

Table 10-20. Hit/Miss Assumption

| Search Number | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ | $\mathbf{4}$ |
| :---: | :---: | :---: | :---: | :---: |
| Device 0 | Hit | Miss | Hit | Miss |
| Device 1 | Miss | Hit | Hit | Miss |
| Devices 2-6 | Miss | Miss | Miss | Miss |
| Device 7 | Miss | Miss | Hit | Hit |



Figure 10-30. Hardware Diagram for a Table with Eight Devices

$F G=01010101, H L A T=010, T L S Z=01, L R A M=0, L D E V=0$.
ote: Each bit in LHO[1:0] is the same logical signal.

Figure 10-31. Timing Diagram for 136-bit Search Device Number 0


Figure 10-32. Timing Diagram for 136-bit Search Device Number 1
$C F G=01010101, H L A T=010, T L S Z=01, L R A M=1, L D E V=1$.
Note: |(LHI[6:0]) stands for the boolean 'OR' of the entire bus LHI[6:0].
Note: Each bit in LHO[1:0] is the same logical signal.


Figure 10-33. Timing Diagram for 136-bit Search Device Number 7 (Last Device)
The following is the sequence of operation for a single 136-bit Search command (also see Subsection 10.2, "Commands and Command Parameters" on page 19).

- Cycle A: The host ASIC drives CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair for use in this Search operation. CMD [8:7] signals must be driven with the same bits that will be driven by this device on SADR[21:20] if it has a hit. DQ[67:0] must be driven with the 68-bit data ([135:68]) in order to be compared against all even locations. The CMD[2] signal must be driven to a logic 0 .
- Cycle B: The host ASIC continues to drive CMDV HIGH and to apply the command code for Search command (10) on CMD[1:0]. CMD[5:2] must be driven by the index of the comparand register pair for storing the 136-bit word presented on the DQ bus during cycles A and B. CMD[8:6] signals must be driven with the SSR index that will be used for storing the address of the matching entry and the hit flag (see page 14 for the description of SSR[0:7]). The $\operatorname{DQ}[67: 0]$ is driven with 68-bit data ([67:0]) compared against all odd locations.

CYNSE70064

The logical 136-bit Search operation is shown in Figure 10-34. The entire table (eight devices of 136 -bit entries) is compared to a 136 -bit word $K$ (presented on the DQ bus in cycles $A$ and $B$ of the command) using the GMR and local mask bits. The GMR is the 136 -bit word specified by the even and odd global mask pair selected by the GMR Index in the command's cycle A.
The 136 -bit word $K$ (presented on the DQ bus in cycles A and B of the command) is also stored in the even and odd comparand registers specified by the Comparand Register Index in the command's cycle B. In $\times 136$ configurations, the even and odd comparand registers can subsequently be used by the Learn command in only one of the devices (the first non-full device). The word K (presented on the DQ bus in cycles A and B of the command) is compared to each entry in the table starting at location 0 . The first matching entry's location, address $L$, is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101). The global winning device will drive the bus in a specific cycle. On global miss cycles the device with LRAM $=1$ (the default driving device for the SRAM bus) and LDEV $=1$ (the default driving device for SSF and SSV signals) will be the default driver for such missed cycles. Note. During 136-bit searches of 136 -bit-configured tables, the Search hit will always be at an even address.


Figure 10-34. $\times 136$ Table with Eight Devices
The Search command is a pipelined operation and executes a Search at half the rate of the frequency of CLK2X for 136-bit searches in $\times 136$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 136-bit Search command cycle (two CLK2X cycles) is shown in Table 10-21.

Table 10-21. Search Latency from Instruction to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $16 \mathrm{~K} \times 136$ bits | 4 |
| $1-8(\mathrm{TLSZ}=01)$ | $128 \mathrm{~K} \times 136$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $496 \mathrm{~K} \times 136$ bits | 6 |

For one to eight devices in the table and $T L S Z=01$, the latency of a Search from command to SRAM access cycle is 5 . In addition, SSV and SSF shift further to the right for different values of HLAT as specified in Table 10-22.

Table 10-22. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.11 136-bit Search on Tables Configured as $\times 136$ Using up to 31 CYNSE70064 Devices

The hardware diagram of the Search subsystem of 31 devices is shown in Figure 10-35. Each of the four blocks in the diagram represents a block of eight CYNSE70064 devices (except the last, which has seven devices). The diagram for a block of eight devices is shown in Figure 10-36. Following are the parameters programmed into the 31 devices.

- First thirty devices (devices $0-29$ ): $C F G=01010101, T L S Z=10, H L A T=001, L R A M=0$, and LDEV $=0$.
- Thirty-first device (device 30): CFG = 01010101, TLSZ = 10, HLAT = 001, LRAM = 1, and LDEV = 1 .

Note. All 31 devices must be programmed with the same value of TLSZ and HLAT. Only the last device in the table must be programmed with LRAM = 1 and LDEV = 1 (device number 30 in this case). All other upstream devices must be programmed with LRAM $=0$ and LDEV $=0$ (devices 0 through 29 in this case).
The timing diagrams referred to in this paragraph reference the Hit/Miss assumptions defined in Table 10-23. For the purpose of illustrating timings, it is further assumed that the there is only one device with a matching entry in each of the blocks. Figure 10-37 shows the timing diagram for a Search command in the 136-bit-configured table ( 31 devices) for each of the eight devices in block number 0. Figure 10-38 shows the timing diagram for Search command in the 68 -bit-configured table ( 31 devices) for all the devices in block number 1 above the winning device in that block. Figure 10-39 shows the timing diagram for the globally winning device (the final winner within its own block and all blocks) in block number 1 . Figure 10-40 shows the timing diagram for all the devices below the globally winning device in block number 1. Figure 10-41, Figure 10-42, and Figure 10-43 respectively show the timing diagrams of the devices above globally winning device, the globally winning device and devices below the globally winning device for block number 2. Figure 10-44, Figure 10-45, Figure 10-46, and Figure 10-47 respectively show the timing diagrams of the devices above the globally winning device, the globally winning device, and devices below the globally winning device except the last device (device 30), and the last device (device 30) for block number 3.

The 136-bit Search operation is pipelined and executes as follows. Four cycles from the Search command, each of the devices knows the outcome internal to it for that operation. In the fifth cycle after the Search command, the devices in a block (being less than or equal to eight devices resolving the winner within them using the LHI[6:0] and LHO[1:0] signalling mechanism) arbitrate for a winner amongst them. In the sixth cycle after the Search command, the blocks (of devices) resolve the winning block through the $\mathrm{BHI}[2: 0]$ and $\mathrm{BHO}[2: 0]$ signalling mechanism. The winning device in the winning block is the global winning device for a Search operation.

Table 10-23. Hit/Miss Assumption

| Search Number | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ | $\mathbf{4}$ |
| :---: | :---: | :---: | :---: | :---: |
| Block 0 | Miss | Miss | Miss | Miss |
| Block 1 | Miss | Miss | Hit | Miss |
| Block 2 | Miss | Hit | Hit | Miss |
| Block 3 | Hit | Hit | Miss | Miss |



Figure 10-35. Hardware Diagram for a Table with 31 Devices


Figure 10-36. Hardware Diagram for a Block of Up to Eight Devices




Figure 10-37. Timing Diagram for Each Device in Block Number 0 (Miss on Each Device)


Figure 10-38. Timing Diagram for Each Device Above the Winning Device in Block Number 1
$C F G=01010101, H L A T=001, T L S Z=10, L R A M=0, L D E V=0$.
Note: |(BHI[2:0]) stands for the boolean 'OR' of the entire bus BHI[2:0].
Note: |(LHI[6:0]) stands for the boolean ‘OR’ for the entire bus LHI[6:0].
Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.

| Search2 | Search4 <br> (Miss |
| :--- | :--- |
| (Miss |  |
| on this | on this |
| device) | device.) |

Figure 10-39. Timing Diagram for Globally Winning Device in Block Number 1


Figure 10-40. Timing Diagram for Devices Below the Winning Device in Block Number 1


Figure 10-41. Timing Diagram for Devices Above the Winning Device in Block Number 2


Figure 10-42. Timing Diagram for Globally Winning Device in Block Number 2


Figure 10-43. Timing Diagram for Devices Below the Winning Device in Block Number 2


Figure 10-44. Timing Diagram for Devices Above the Winning Device in Block Number 3


Figure 10-45. Timing Diagram for Globally Winning Device in Block Number 3


Figure 10-46. Timing Diagram for Devices Below the Winning Device in Block Number 3 Except Device 30 (the Last Device)


Figure 10-47. Timing Diagram for Device Number 6 in Block Number 3 (Device 30 in Depth-Cascaded Table)
The following is the sequence of operation for a single 136-bit Search command (also refer to "Command and Command Parameters," Subsection 10.2 on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair for use in this Search operation. CMD[8:7] signals must be driven with the bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68-bit data ([135:68]) in order to be compared against all even locations. The CMD[2] signal must be driven to logic 0 .
- Cycle B: The host ASIC continues to drive the CMDV HIGH and to apply Search command code (10) on CMD[1:0]. CMD[5:2] must be driven by the index of the comparand register pair for storing the 136-bit word presented on the DQ bus during cycles A and B. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and the hit flag (see page 14 for the description of SSR[0:7]). The DQ[67:0] is driven with 68-bit data ([67:0])to be compared against all odd locations.
The logical 136-bit Search operation is as shown in the following Figure 10-48. The entire table of 31 devices (consisting of 136-bit entries) is compared against a 136 -bit word $K$ that is presented on the DQ bus in cycles $A$ and $B$ of the command using the GMR and local mask bits. The GMR is the 136 -bit word specified by the even and odd global mask pair selected by the GMR Index in the command's cycle A.

CYNSE70064

The 136 -bit word $K$ that is presented on the $D Q$ bus in cycles $A$ and $B$ of the command is also stored in the even and odd comparand registers specified by the Comparand Register Index in the command's cycle B. In $\times 136$ configurations, the even and odd comparand registers can subsequently be used by the Learn command in only the first non-full device. Note. The Learn command is supported for only one of the blocks consisting of up to eight devices in a depth-cascaded table of more than one block. The word K that is presented on the DQ bus in cycles $A$ and $B$ of the command is compared with each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see Section 12.0, "SRAM Addressing" on page 101). The global winning device will drive the bus in a specific cycle. On global miss cycles the device with LRAM $=1$ (the default driving device for the SRAM bus) and LDEV = 1 (the default driving device for SSF and SSV signals) will be the default driver for such missed cycles. Note. During 136 -bit searches of 136-bit-configured tables, the Search hit will always be at an even address.


Figure 10-48. $\times 136$ Table with 31 Devices
The Search command is a pipelined operation. It executes a Search at half the rate of the frequency of CLK2X for 136-bit searches in $\times 136$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 136-bit Search command cycle (two CLK2X cycles) is shown in Table 10-24.

Table 10-24. The Latency of Search from Instruction to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $16 \mathrm{~K} \times 136$ bits | 4 |
| $1-8(\mathrm{TLSZ}=01)$ | $128 \mathrm{~K} \times 136$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $496 \mathrm{~K} \times 136$ bits | 6 |

The latency of a Search from command to the SRAM access cycle is 6 for $1-31$ devices in the table and where TLSZ $=10$. In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-25.

Table 10-25. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.12 272-bit Search on Tables Configured as û272 Using a Single CYNSE70064 Device

Figure 10-49 shows the timing diagram for a Search command in the 272-bit-configured table (CFG = 10101010) consisting of a single device for one set of parameters: $T L S Z=00, H L A T=001$, LRAM $=1$, and $L D E V=1$. The hardware diagram for this Search subsystem is shown in Figure 10-50.
cycle cyclecycle cycle cyclecycle cycle cycle cycle cycle


Figure 10-49. Timing Diagram for 272-bit Search (One Device)


Figure 10-50. Hardware Diagram for a Table with One Device
The following is the sequence of operation for a single 136-bit Search command (also refer to Subsection 10.2, "Commands and Command Parameters" on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair used for bits [271:136] of the data being searched. DQ[67:0] must be driven with the 68 -bit data ([271:204]) to be compared to all locations 0 in the four 68 -bits-word page. The CMD[2] signal must be driven to logic 1. Note. CMD[2] $=1$ signals that the Search is a $\times 272$-bit Search. CMD[8:3] in this cycle is ignored.
- Cycle B: The host ASIC continues to drive the CMDV HIGH and continues to apply the command code of Search command (10) on CMD[1:0]. The DQ[67:0] is driven with the 68-bit data ([204:136]) to be compared to all locations 1 in the four 68-bits-word page.
- Cycle C: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair used for bits [135:0] of the data being searched. CMD[8:7] signals must be driven with the bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68 -bit data ([135:68]) to be compared to all locations 2 in the four 68 -bits-word page. The CMD[2] signal must be driven to logic 0 .
- Cycle D: The host ASIC continues to drive the CMDV HIGH and applies Search command code (10) on CMD[1:0]. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and the hit flag (see page 14 for the description of SSR[0:7]). The DQ[67:0] is driven with the 68-bit data ([67:0]) to be compared to all locations 3 in the four 68-bits-word page. CMD[5:2] is ignored because the Learn instruction is not supported for x 272 tables.
Note. For 272-bit searches, the host ASIC must supply four distinct 68-bit data words on DQ[67:0] during cycles A, B, C, and D. The GMR index in cycle A selects a pair of GMRs that apply to DQ data in cycles $A$ and $B$. The GMR index in cycle $C$ selects a pair of GMRs that apply to DQ data in cycles C and D.
The logical 272-bit Search operation is shown in Figure 10-51. The entire table of 272-bit entries is compared to a 272-bit word $K$ that is presented on the DQ bus in cycles A, B, C, and D of the command using the GMR and local mask bits. The GMR is the 272 -bit word specified by the two pairs of GMRs selected by the GMR Indexes in the command's cycles A and C. The 272-bit word K that is presented on the DQ bus in cycles $\mathrm{A}, \mathrm{B}, \mathrm{C}$ and D of the command is compared with each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on SADR[21:0] lines (see "SRAM Addressing" on page 101). Note. The matching address is always going to be location 0 in a four-entry page for a 272-bit Search (two LSBs of the matching index will be 00).


Figure 10-51. $\times 272$ Table with One Device
The Search command is a pipelined operation and executes at one-fourth the rate of the frequency of CLK2X for 272-bit searches in $\times 272$-configured tables. The latency of SADR, CE_L, ALE L, WE L, SSV, and SSF from the 272 -bit Search command (measured in CLK cycles) from the CLK2X cycle that contains the C and D cycles is shown in Table 10-26.

Table 10-26. The Latency of Search from C and D Cycles to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(T L S Z=00)$ | $8 \mathrm{~K} \times 272$ bits | 4 |
| $1-8(T L S Z=01)$ | $64 \mathrm{~K} \times 272$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $248 \mathrm{~K} \times 272$ bits | 6 |

The latency of a Search from command to SRAM access cycle is 4 for only a single device in the table and TLSZ = 00. In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-27.

Table 10-27. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |

CYNSE70064

Table 10-27. Shift of SSF and SSV from SADR (continued)

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.13 272-bit Search on Tables x272-configured Using up to Eight CYNSE70064 Devices

The hardware diagram of the Search subsystem of eight devices is shown in Figure 10-52. The following are the parameters programmed in the eight devices.

- First seven devices (devices $0-6$ ): $C F G=10101010, T L S Z=01, H L A T=000, L R A M=0$, and $L D E V=0$.
- Eighth device (device 7): CFG = 10101010, TLSZ $=01$, HLAT $=000$, LRAM $=1$, and LDEV $=1$.

Note. All eight devices must be programmed with the same value of TLSZ and HLAT. Only the last device in the table must be programmed with LRAM = 1 and LDEV = 1 (device number 7 in this case). All other upstream devices must be programmed with LRAM $=0$ and LDEV $=0$ (devices 0 through 6 in this case).
Figure 10-53 shows the timing diagram for a Search command in the 272-bit-configured table of eight devices for device number 0 . Figure $10-54$ shows the timing diagram for a Search command in the 272 -bit-configured table of eight devices for device number 1 . Figure $10-55$ shows the timing diagram for a Search command in the 272-bit-configured table of eight devices for device number 7 (the last device in this specific table). For these timing diagrams three 272-bit searches are performed sequentially. The following Hit/Miss assumptions were made as shown in Table 10-28.

Table 10-28. Hit/Miss Assumption

| Search Number | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ |
| :---: | :---: | :---: | :---: |
| Device 0 | Hit | Miss | Miss |
| Device 1 | Miss | Hit | Miss |
| Devices 2-6 | Miss | Miss | Miss |
| Device 7 | Miss | Miss | Miss |



Figure 10-52. Hardware Diagram for a Table with Eight Devices


Figure 10-53. Timing Diagram for 272-bit Search Device Number 0


Figure 10-54. Timing Diagram for 272-bit Search Device Number 1


Figure 10-55. Timing Diagram for 272-bit Search Device Number 7 (Last Device)
The following is the sequence of operation for a single 272-bit Search command (also see "Commands and Command Parameters" on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair used for bits [271:136] of the data being searched in this operation. DQ[67:0] must be driven with the 68-bit data ([271:204]) to be compared against all locations 0 in the four-word 68 -bit page. The $C M D[2]$ signal must be driven to logic 1 . Note. $C M D[2]=1$ signals that the Search is a $\times 272$ bit Search. CMD[8:3] in this cycle is ignored.
- Cycle B: The host ASIC continues to drive the CMDV HIGH and applies Search command code (10) on CMD[1:0]. The DQ[67:0] is driven with the 68-bit data ([203:136]) to be compared against all locations 1 in the four 68-bits-word page.
- Cycle C: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair used for bits [135:0] of the data being searched. CMD[8:7] signals must be driven with the bits that will be driven on SADR[21:20] by this device if it has a hit. DQ[67:0] must be driven with the 68-bit data ([135:68]) to be compared against all locations 2 in the four 68-bits-word page. The CMD[2] signal must be driven to logic 0 .
- Cycle D: The host ASIC continues to drive the CMDV HIGH and applies Search command code (10) on CMD[1:0]. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and the hit flag (see page 14 for the description of SSR[0:7]). The DQ[67:0] is driven with the 68-bit data ([67:0]) to be compared to all locations 3 in the four 68-bits-word page. CMD[5:2] is ignored because the Learn instruction is not supported for $\times 272$ tables.
Note. For 272-bit searches, the host ASIC must supply four distinct 68-bit data words on DQ[67:0] during cycles A, B, C, and D. The GMR index in cycle A selects a pair of GMRs in each of the eight devices that apply to DQ data in cycles A and B. The GMR index in cycle C selects a pair of GMRs in each of the eight devices that apply to DQ data in cycles C and D.

The logical 272-bit Search operation is shown in Figure 10-56. The entire table of 272-bit entries is compared to a 272 -bit word $K$ that is presented on the DQ bus in cycles $A, B, C$, and $D$ of the command using the GMR and the local mask bits. The GMR is the 272-bit word specified by the two pairs of GMRs selected by the GMR Indexes in the command's cycles A and C in each of the eight devices. The 272 -bit word K that is presented on the DQ bus in cycles $\mathrm{A}, \mathrm{B}, \mathrm{C}$, and D of the command is compared to each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101). Note. The matching address is always going to be a location 0 in a four-entry page for 272-bit Search (two LSBs of the matching index will be 00).


Figure 10-56. $\times 272$ Table with Eight Devices
The Search command is a pipelined operation and executes Search at one fourth the rate of the frequency of CLK2X for 272-bit searches in x272-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 272-bit Search command (measured in CLK cycles) from the CLK2X cycle that contains the C and D cycles is shown in Table 10-29.

Table 10-29. The Latency of Search from C and D cycles to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $8 \mathrm{~K} \times 272$ bits | 4 |
| $1-8(\mathrm{TLSZ}=01)$ | $64 \mathrm{~K} \times 272$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $248 \mathrm{~K} \times 272$ bits | 6 |

The latency of Search from command to SRAM access cycle is 5 for only a single device in the table and TLSZ = 01. In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-30.

Table 10-30. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.14 272-bit Search on Tables Configured as $\times 272$ Using up to 31 CYNSE70064 Devices

The hardware diagram of the Search subsystem of 31 devices is shown in Figure 10-57. Each of the four blocks in the diagram represents a block of eight CYNSE70064 devices, except the last which has seven devices. The diagram for a block of eight devices is shown in Figure 10-58. The following are the parameters programmed into the 31 devices.

- First thirty devices (devices $0-29$ ): $C F G=10101010, T L S Z=10, H L A T=000, L R A M=0$, and $L D E V=0$.
- Thirty-first device (device 30 ): $\mathrm{CFG}=10101010, \mathrm{TLSZ}=10, \mathrm{HLAT}=000$, LRAM $=1$, and LDEV $=1$.

Note. All 31 devices must be programmed with the same value of TLSZ and HLAT. Only the last device in the table must be programmed with LRAM $=1$ and LDEV $=1$ (device number 30 in this case). All other upstream devices must be programmed with LRAM $=0$ and LDEV $=0$ (devices 0 through 29 in this case).

The timing diagrams referred to in this paragraph reference the Hit/Miss assumptions defined in Table 10-31. For the purpose of illustrating the timings, it is further assumed that there is only one device with the matching entry in each block. Figure 10-59 shows the timing diagram for a Search command in the 272-bit-configured table consisting of 31 devices for each of the eight devices in block number 0. Figure $10-60$ shows the timing diagram for a Search command in the 272-bit-configured table of 31 devices for all devices above the winning device in block number 1 . Figure 10-61 shows the timing diagram for the globally winning device (the final winner within its own and all blocks) in block number 1. Figure 10-62 shows the timing diagram for all the devices below the globally winning device in block number 1. Figure 10-63, Figure 10-64, and Figure 10-65, respectively, show the timing diagrams of the devices above the globally winning device, the globally winning device, and the devices below the globally winning device for block number 2. Figure 10-66, Figure 10-67, Figure 10-68, and Figure 10-69, respectively, show the timing diagrams of the device above the globally winning device, the globally winning device, the devices below the globally winning device (except device 30), and last device (device 30) for block number 3.
The 272-bit Search operation is pipelined and executes as follows. Four cycles from the last cycle of the Search command each of the devices knows the outcome internal to it for that operation. In the fifth cycle from the Search command, the devices in a block (which is less than or equal to eight devices resolving the winner within them using an $\mathrm{LHI}[6: 0$ ] and $\mathrm{LHO}[1: 0$ ] signalling mechanism) arbitrate for a winner. In the sixth cycle after the Search command, the blocks of devices resolve the winning block through a $\mathrm{BHI}[2: 0]$ and $\mathrm{BHO}[2: 0]$ signalling mechanism. The winning device within the winning block is the global winning device for the Search operation.

Table 10-31. Hit/Miss Assumption

| Search Number | $\mathbf{1}$ | $\mathbf{2}$ | $\mathbf{3}$ |
| :---: | :---: | :---: | :---: |
| Block 0 | Miss | Miss | Miss |
| Block 1 | Miss | Miss | Hit |
| Block 2 | Miss | Hit | Hit |
| Block 3 | Hit | Hit | Miss |



Figure 10-57. Hardware Diagram for a Table with 31 Devices


Figure 10-58. Hardware Diagram for A Block of up to Eight Devices


Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.
Figure 10-59. Timing Diagram for Each Device in Block Number 0 (Miss on Each Device)


Figure 10-60. Timing Diagram for Each Device Above the Winning Device in Block Number 1


Figure 10-61. Timing Diagram for Globally Winning Device in Block Number 1


BHO[2:0] 0

SADR[21:0] $\quad$ z
CE_L
ALE_L $\quad \mathrm{z}$



SSV $\qquad$ z $\quad: \quad 1 \quad 1 \quad 1: \quad 1$

SSF $z$
$=000, ~ T L S Z ~=~ 10, ~ L R A M ~=~$
0, LDEV $=0$.
Note: |(BHI[2:0]) stands for the boolean 'OR' of the entire bus BHI[2:0].
Note: |(LHI[6:0]) stands for the boolean ‘OR’ for the entire bus LHI[6:0].
Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.

Search2
(Miss on this device.)

Figure 10-62. Timing Diagram for Devices Below the Winning Device in Block Number 1


Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.
Figure 10-63. Timing Diagram for Devices Above the Winning Device in Block Number 2


Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.
Figure 10-64. Timing Diagram for Globally Winning Device in Block Number 2


Figure 10-65. Timing Diagram for Devices Below the Winning Device in Block Number 2


Figure 10-66. Timing Diagram for Devices Above the Winning Device in Block Number 3


BHO[2:0] $\qquad$

$C F G=10101010$, HLAT $=000, T L S Z=10, L R A M=0, L D E V=0$.
Note: |(BHI[2:0]) stands for the boolean 'OR' of the entire bus BHI[2:0].
Note: |(LHI[6:0]) stands for the boolean ‘OR’ for the entire bus LHI[6:0].
Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.
Search2
(Hit but not
Figure 10-67. Timing Diagram for Globally Winning Device in Block Number 3

$C F G=10101010, H L A T=000, T L S Z=10, L R A M=0, L D E V=0$.
Note: |(BHI[2:0]) stands for the boolean 'OR' of the entire bus BHI[2:0]
Note: |(LHI[6:0]) stands for the boolean ‘OR’ for the entire bus LHI[6:0].
Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.

Figure 10-68. Timing Diagram for Devices Below the Winning Device in Block Number 3 Except Device 30 (the Last Device)


Note: Each bit in $\mathrm{BHO}[2: 0]$ is the same logical signal.
Note: Each bit in LHO[1:0] is the same logical signal.
Figure 10-69. Timing Diagram of the Last Device in Block Number 3 (Device 30 in the Table)
The following is the sequence of operation for a single 272-bit Search command (also refer to Subsection 10.2, "Commands and Command Parameters" on page 19).

- Cycle A: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair used for bits [271:136] of the data being searched. DQ[67:0] must be driven with the 68-bit data ([271:204])to be compared to all locations 0 in the four 68 -bits-word page. The CMD[2] signal must be driven to logic 1. Note. CMD[2] = 1 signals that the Search is a $\times 272$-bit Search. CMD[8:7] is ignored in this cycle.
- Cycle B: The host ASIC continues to drive the CMDV HIGH and applies Search command (10) on CMD[1:0]. The DQ[67:0] is driven with the 68-bit data ([203:136]) to be compared to all locations 1 in the four 68 -bits-word page.
- Cycle C: The host ASIC drives the CMDV HIGH and applies Search command code (10) on CMD[1:0] signals. CMD[5:3] signals must be driven with the index to the GMR pair used for the bits [135:0] of the data being searched. CMD[8:7] signals must be driven with the bits that will be driven by this device on SADR[21:20] if it has a hit. DQ[67:0] must be driven with the 68 -bit data ([135:68]) to be compared to all locations 2 in the four 68-bits-word page. The CMD[2] signal must be driven to logic 0 .
- Cycle D: The host ASIC continues to drive the CMDV HIGH and continues to apply Search command code (10) on CMD[1:0]. CMD[8:6] signals must be driven with the index of the SSR that will be used for storing the address of the matching entry and the hit flag (see page 14 for a description of SSR[0:7]). The $\operatorname{DQ}[67: 0]$ is driven with the 68 -bit data ([67:0]) to be compared to all locations 3 in the four 68-bits-word page. CMD[5:2] is ignored because the Learn instruction is not supported for x272 tables.
Note. For 272-bit searches, the host ASIC must supply four distinct 68-bit data words on DQ[67:0] during cycles A, B, C, and D. The GMR Index in cycle A selects a pair of GMRs in each of the 31 devices that apply to DQ data in cycles A and B. The GMR Index in cycle C selects a pair of GMRs in each of the 31 devices that apply to DQ data in cycles C and D.

CYNSE70064

The logical 272-bit Search operation is as shown in Figure 10-70. The entire table of 272-bit entries is compared to a 272-bit word $K$ that is presented on the DQ bus in cycles $A, B, C$, and $D$ of the command using the GMR and local mask bits. The GMR is the 272-bit word specified by the two pairs of GMRs selected by the GMR Indexes in the command's cycles A and C in each of the 31 devices. The 272-bit word $K$ that is presented on the DQ bus in cycles $A, B, C$, and $D$ of the command is compared to each entry in the table starting at location 0 . The first matching entry's location address $L$ is the winning address that is driven as part of the SRAM address on the SADR[21:0] lines (see "SRAM Addressing" on page 101). Note. The matching address is always going to be location 0 in a four-entry page for 272-bit Search (two LSBs of the matching index will be 00).


The Search command is a pipelined operation and executes a Search at one-fourth the rate of the frequency of CLK2X for 272-bit searches in $\times 272$-configured tables. The latency of SADR, CE_L, ALE_L, WE_L, SSV, and SSF from the 272-bit Search command (measured in CLK cycles) from the CLK2X cycle that contains the C and D cycles is shown in Table 10-32.

Table 10-32. The Latency of Search from C and D cycles to SRAM Access Cycle

| Number of Devices | Max Table Size | Latency in CLK Cycles |
| :---: | :---: | :---: |
| $1(\mathrm{TLSZ}=00)$ | $8 \mathrm{~K} \times 272$ bits | 4 |
| $1-8(\mathrm{TLSZ}=01)$ | $64 \mathrm{~K} \times 272$ bits | 5 |
| $1-31(\mathrm{TLSZ}=10)$ | $248 \mathrm{~K} \times 272$ bits | 6 |

The latency of a Search from command to SRAM access cycle is 6 for only a single device in the table and TLSZ = 10. In addition, SSV and SSF shift further to the right for different values of HLAT, as specified in Table 10-33.

Table 10-33. Shift of SSF and SSV from SADR

| HLAT | Number of CLK Cycles |
| :---: | :---: |
| 000 | 0 |
| 001 | 1 |
| 010 | 2 |
| 011 | 3 |
| 100 | 4 |
| 101 | 5 |
| 110 | 6 |
| 111 | 7 |

### 10.15 Mixed-Sized Searches on Tables Configured with Different Widths Using an CYNSE70064 Device

This subsection will cover mixed searches ( $\times 68, \times 136$, and $\times 272$ ) with tables of different widths $(\times 68, \times 136, \times 272$ ). The sample operation shown is for a single device with CFG $=10010000$ containing three tables of $\times 68, \times 136$, and $\times 272$ widths. The operation can be generalized to a block of eight to 31 devices using four blocks; the timing and the pipeline operation is the same as described previously for fixed searches on a table of one-width-size.
Figure 10-71 shows three sequential searches: first, a 68-bit Search on the table configured as $\times 68$, then a 136-bit Search on a table configured as $\times 136$, and finally a 272 -bit Search on the table configured as $\times 272$ bits that each results in a hit. Note. The
$D Q[67: 66]$ will be 00 in each of the two $A$ and $B$ cycles of the $\times 68$-bit Search (Search1). DQ[67:66] is 01 in each of the $A$ and $B$ cycles of the $\times 136$-bit Search (Search2). DQ[67:66] is 10 in each of the A, B, C, and D cycles of the $\times 272$-bit Search (Search3). By having table designation bits, the CYNSE70064 enables the creation of many tables in a bank of search engines of different widths.
Figure $10-72$ shows the sample table. Two bits in each 68 -bit entry will need to designated as the table number bits. One example choice can be the 00 values for the table configured as $\times 68,01$ values for tables configured as $\times 136$, and 10 values for tables configured as $\times 272$. For the above explanation, it is further assumed that bits [67:66] for each entry will be designed as such table designation bits.


Figure 10-72. Multiwidth Configurations Example

### 10.16 LRAM and LDEV Description

When search engines are cascaded using multiple CYNSE70064s, the SADR, CE_L, and WE_L (three-state signals) are all tied together. In order to eliminate external pull-ups and pull-downs, one device in a bank is designated as the default driver. For non-Search or non-Learn cycles (see Subsection 10.17, "Learn Command" on page 95) or Search cycles with a global miss, the SADR, CE_L, and WE_L signals are driven by the device with the LRAM bit set. It is important that only one device in a bank of search engines that are cascaded have this bit set. Failure to do so will cause contention on SADR, CE_L, and WE_L and can potentially cause damage to the device(s).
Similarly, when search engines using multiple CYNSE70064s are cascaded, SSF and SSV (also three-state signals) are tied together. In order to eliminate external pull-up and pull downs, one device in a bank is designated as the default driver. For nonSearch cycles or Search cycles with a global miss the SSF and SSV signals are driven by the device with the LDEV bit set. It is important that only one device in a bank of search engines that are cascaded together have this bit set. Failure to do so will cause contention on SSV and SSF and can potentially cause damage to the device(s).

### 10.17 Learn Command

Bit[0] of each 68-bit data location specifies whether an entry in the database is occupied. If all the entries in a device are occupied, the device asserts FULO signal to inform the downstream devices that it is full. The result of this communication between depth-cascaded devices determines the global FULL signal for the entire table. The FULL signal in the last device determines the fullness of the depth-cascaded table.
The device contains 16 pairs of internal, 68-bit-wide comparand registers that store the comparands as the device executes searches. On a miss by the Search signalled to ASIC through the SSV and SSF signals (SSV = 1, SSF = 0), the host ASIC can apply the Learn command to learn the entry from a comparand register to the next-free location (see Subsection 7.8, "NFA Register" on page 16). The NFA updates to the next-free location following each Write or Learn command.
In a depth-cascaded table, only a single device will learn the entry through the application of a Learn instruction. The determination of which device is going to learn is based on the FULI and FULO signalling between the devices. The first non-full device learns the entry by storing the contents of the specified comparand registers to the location(s) pointed to by NFA.
In a $\times 68$-configured table the Learn command writes a single 68 -bit location. In a $\times 136$-configured table the Learn command writes the next even and odd 68 -bit locations. In 136 -bit mode, bit[0] of the even and odd 68 -bit locations is 0 , which indicates that they are cascaded empty, or 1, which indicates that they are occupied.
The global FULL signal indicates to the table controller (the host ASIC) that all entries within a block are occupied and that no more entries can be learned. The CYNSE70064 updates the signal after each Write or Learn command to a data array. The Learn command generates a Write cycle to the external SRAM, also using the NFA register as part of the SRAM address (see Section 12.0, "SRAM Addressing" on page 101).
The Learn command is supported on a single block containing up to eight devices if the table is configured either as a $\times 68$ or $\mathrm{a} \times 136$. The Learn command is not supported for $\times 272$-configured tables.
Learn is a pipelined operation and lasts for two CLK cycles, as shown in Figure 10-73 where TLSZ = 00, and Figure 10-74 and Figure $10-75$ where TLSZ $=01$. Figure $10-74$ and Figure $10-75$ assume that the device performing the Learn operation is not the last device in the table and has its LRAM bit set to 0 . Note. The OE_L for the device with the LRAM bit set goes HIGH for two cycles for each Learn (one during the SRAM Write cycle, and one the cycle before). The latency of the SRAM Write cycle from the second cycle of the instruction is shown in Table 10-34.


Figure 10-73. Timing Diagram of Learn $(T L S Z=00)$


Figure 10-74. Timing Diagram of Learn (Except on the Last Device [TLSZ = 01])


Figure 10-75. Timing Diagram of Learn on Device Number 7 (TLSZ = 01)

Table 10-34. The Latency of SRAM Write Cycle from Second Cycle of Learn Instruction

| Number of Devices | Latency in CLK Cycles |
| :---: | :---: |
| $1(T L S Z=00)$ | 4 |
| $1-8(T L S Z=01)$ | 5 |
| $1-31(T L S Z=10)$ | 6 |

The Learn operation lasts two CLK cycles. The sequence of operation is as follows.

- Cycle 1A: The host ASIC applies the Learn instruction on CMD[1:0] using CMDV = 1. The CMD[5:2] field specifies the index of the comparand register pair that will be written in the data array in the 136-bit-configured table. For a Learn in a 68 -bit-configured table, the even-numbered comparand specified by this index will be written. CMD[8:7] carries the bits that will be driven on SADR[21:20] in the SRAM Write cycle.
- Cycle 1B: The host ASIC continues to drive the CMDV to 1, the CMD[1:0] to 11, and the CMD[5:2] with the comparand pair index. CMD[6] must be set to 0 if the Learn is being performed on a 68-bit-configured table, and to 1 if the Learn is being performed on a 136-bit-configured table.
- Cycle 2: The host ASIC drives the CMDV to 0.

At the end of cycle 2, a new instruction can begin. The latency of the SRAM Write is the same as the Search to the SRAM Read cycle. It is measured from the second cycle of the Learn instruction.

CYNSE70064

### 11.0 Depth-Cascading

The Search engine application can depth-cascade the devices to various table sizes of different widths ( 68 bits, 136 bits, or 272 bits). The devices perform all the necessary arbitration to decide which device will drive the SRAM bus. The latency of the searches increases as the table size increases; the Search rate remains constant.

### 11.1 Depth-Cascading up to Eight Devices (One Block)

Figure $11-1$ shows how up to eight devices can be cascaded to form $256 \mathrm{~K} \times 68,128 \mathrm{~K} \times 136$, or $64 \mathrm{~K} \times 272$ tables. It also shows the interconnection between the devices for depth-cascading. Each Search engine asserts the LHO[1] and LHO[0] signals to inform downstream devices of its result. The LHI[6:0] signals for a device are connected to LHO signals of the upstream devices. The host ASIC must program the TLSZ to 01 for each of up to eight devices in a block. Only a single device drives the SRAM bus in any single cycle.


Figure 11-1. Depth-Cascading to Form a Single Block

### 11.2 Depth-Cascading up to 31 Devices (Four Blocks)

Figure 11-2 shows how to cascade up to four blocks. Each block contains up to eight CYNSE70064 devices except the last, and the interconnection within each was shown in the previous subsection with the cascading of up to eight devices in a block. Note. The interconnection between blocks for depth-cascading is important. For each Search, a block asserts BHO[2], BHO[1], and $\mathrm{BHO}[0]$. The $\mathrm{BHO}[2: 0]$ signals for a block are the signals taken only from the last device in the block. For all other devices within that block, these signals stay open and floating. The host ASIC must program the table size (TLSZ) field to 10 in each of the devices for cascading up to 31 devices (in up to four blocks).


Figure 11-2. Depth-Cascading Four Blocks

### 11.3 Depth-Cascading for a FULL Signal

Bit[0] of each of the 68 -bit entries is designated as a special bit ( $1=$ occupied; $0=$ empty). For each Learn or PIO Write to the data array, each device asserts FULO[1] and FULO[0] if it does not have any empty locations within it (see Figure 11-3). Each device combines the FULO signals from the devices above it with its own full status to generate a FULL signal that gives the full status of the table up to the device asserting the FULL signal. Figure 11-3 shows the hardware connection diagram for generating the FULL signal that goes back to the ASIC. In a depth-cascaded block of up to eight devices, the FULL signal from the last device should be fed back to the ASIC controller to indicate the fullness of the table. The FULL signal of the other devices should be left open. Note. The Learn instruction is supported for only up to eight devices, whereas FULL cascading is allowed only for one block in tables containing more than eight devices. In tables for which a Learn instruction is not going to be used, the bit[0] of each 68 -bit entry should always be set to 1 .


Figure 11-3. Full Generation in a Cascaded Table

### 12.0 SRAM Addressing

Table 12-1 describes the commands used to generate addresses on the SRAM address bus. The index [14:0] field contains the address of a 68-bit entry that results in a hit in 68-bit-configured partition. It is the address of the 68-bit entry that lies at the 136-bit page, and the 272-bit page boundaries in 136-bit- and 272-bit-configured quadrants, respectively.
Section 7.0, "Registers" on page 13 of this specification, describes the NFA and SSR registers. ADR[14:0] contains the address supplied on the DQ bus during PIO access to the CYNSE70064. Command bits 8 , and 7 \{CMD[8:7]\} are passed from the command to the SRAM address bus. See Section 10.0, "Commands" on page 18, for more information. ID[4:0] is the ID of the device driving the SRAM bus (see Section 17.0, "Pinout Descriptions and Package Diagrams" on page 122, for more information).

### 12.1 Generating an SRAM BUS Address

Table 12-1. SRAM Bus Address

| Command | SRAM Operation | $\mathbf{2 1}$ | $\mathbf{2 0}$ | $[19: 15]$ | [14:0] |
| :---: | :---: | :---: | :---: | :---: | :---: |
| Search | Read | C8 | C7 | ID[4:0] | Index[14:0] |
| Learn | Write | C8 | C7 | ID[4:0] | NFA[14:0] |
| PIO Read | Read | C8 | C7 | ID[4:0] | ADR14:0] |
| PIO Write | Write | C8 | C7 | ID[4:0] | ADR[14:0] |
| Indirect Access | Write/Read | C8 | C7 | ID[4:0] | SSR[14:0] |

### 12.2 SRAM PIO Access

The remainder of Section 12.0 describes SRAM Read and SRAM Write operations.
SRAM Read enables Read access to the off-chip SRAM containing associative data. The latency from the issuance of the Read instruction to the address appearing on the SRAM bus is the same as the latency of the Search instruction and will be depend on the value programmed for the TLSZ parameter in the device configuration register. The latency of the ACK from the Read instruction is the same as the latency of the Search instruction to the SRAM address plus the HLAT programmed in the configuration register. Note. SRAM Read is a blocking operation-no new instruction can begin until the ACK is returned by the selected device performing the access.
SRAM Write enables Write access to the off-chip SRAM containing associative data. The latency from the second cycle of the Write instruction to the address appearing on the SRAM bus is the same as the latency of the Search instruction and will depend on the TLSZ value parameter programmed in the device configuration register. Note. SRAM Write is a pipelined operation-new instruction can begin right after the previous command has ended.

### 12.3 SRAM Read with a Table of One Device

SRAM Read enables Read access to the off-chip SRAM containing associative data. The latency from the issuance of the Read instruction to the address appearing on the SRAM bus is the same as the latency of the Search instruction and will depend on the TLSZ value parameter programmed in the device configuration register. The latency of the ACK from the Read instruction is the same as the latency of the Search instruction to the SRAM address plus the HLAT programmed in the configuration register. The following explains the SRAM Read operation in a table with only one device that has the following parameters: $\mathrm{TLSZ}=00$, HLAT $=000$, LRAM $=1$, and LDEV $=1$. Figure $12-1$ shows the associated timing diagram. For the following description, the selected device refers to the only device in the table because it is the only device to be accessed.

- Cycle 1A: The host ASIC applies the Read instruction on the CMD[1:0] using CMDV = 1. The DQ bus supplies the address, with DQ[20:19] set to 10 , to select the SRAM address. The host ASIC selects the device for which the ID[4:0] matches the DQ[25:21] lines. During this cycle, the host ASIC also supplies SADR[21:20] on CMD[8:7].
- Cycle 1B: The host ASIC continues to apply the Read instruction on the CMD[1:0] using CMDV =1. The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address.
- Cycle 2: The host ASIC floats DQ[67:0] to a three-state condition.
- Cycle 3: The host ASIC keeps DQ[67:0] in a three-state condition.
- Cycle 4: The selected device starts to drive DQ[67:0] and drives ACK from High-Z to LOW.
- Cycle 5: The selected device drives the Read address on SADR[21:0]; it also drives ACK HIGH, CE_L LOW, and ALE_L LOW.
- Cycle 6: The selected device drives CE_L HIGH, ALE_L HIGH, the SADR bus, the DQ bus in a three-state condition, and ACK LOW.
At the end of cycle 6, the selected device floats ACK in a three-state condition, and a new command can begin.

$T L S Z=00, H L A T=000, \operatorname{LRAM}=1, L D E V=1$
DQ driven by CYNSE70064
Figure 12-1. SRAM Read Access $(T L S Z=00, H L A T=000, L R A M=1, L D E V=1)$


### 12.4 SRAM Read with a Table of up to Eight Devices

The following explains the SRAM Read operation completed through a table of up to eight devices using the following parameters: TLSZ $=01$. Figure $12-2$ diagrams a block of eight devices. The following assumes that SRAM access is successfully achieved through CYNSE70064 device number 0. Figure 12-3 and Figure 12-4 show timing diagrams for device number 0 and device number 7, respectively.

- Cycle 1A: The host ASIC applies the Read instruction on CMD[1:0] using CMDV $=1$. The DQ bus supplies the address, with DQ[20:19] set to 10 , to select the SRAM address. The host ASIC selects the device for which ID[4:0] matches the DQ[25:21] lines. During this cycle the host ASIC also supplies SADR[21:20] on CMD[8:7].
- Cycle 1B: The host ASIC continues to apply the Read instruction on CMD[1:0] using CMDV $=1$. The DQ bus supplies the address, with DQ[20:19] set to 10 to select the SRAM address.
- Cycle 2: The host ASIC floats DQ[67:0] to a three-state condition.
- Cycle 3: The host ASIC keeps $D Q[67: 0]$ in a three-state condition.
- Cycle 4: The selected device starts to drive DQ[67:0].
- Cycle 5: The selected device continues to drive DQ[67:0] and drives ACK from High-Z to LOW.
- Cycle 6: The selected device drives the Read address on SADR[21:0]. It also drives ACK HIGH, CE_L LOW, WE_L HIGH, and ALE_L LOW.
- Cycle 7: The selected device drives CE_L, ALE_L, WE_L, and DQ bus in a three-state condition. It continues to drive ACK LOW. At the end of cycle 7, the selected device floats ACK in three-state condition and a new command can begin.


Figure 12-2. Table of a Block of Eight Devices


Figure 12-3. SRAM Read Through Device Number 0 in a Block of Eight Devices

$T L S Z=01, H L A T=000, L R A M=1, L D E V=1$.
Figure 12-4. SRAM Read Timing for Device Number 7 in a Block of Eight Devices

### 12.5 SRAM Read with a Table of up to 31 Devices

The following explains the SRAM Read operation accomplished through a table of up to 31 devices, using the following parameters: $\mathrm{TLSZ}=10$. The diagram of such a table is shown in Figure 12-5. The following assumes that SRAM access is being accomplished through CYNSE70064 device number 0, that device number 0 is the selected device. Figure 12-6 and Figure 12-7 show the timing diagrams for device number 0 and device number 30 , respectively.

- Cycle 1A: The host ASIC applies the Read instruction to CMD[1:0] using CMDV $=1$. The DQ bus supplies the address, with DQ[20:19] set to 10 , to select the SRAM address. The host ASIC selects the device for which the ID[4:0] matches the DQ[25:21] lines. During this cycle, the host ASIC also supplies SADR[21:20] on CMD[8:7].
- Cycle 1B: The host ASIC continues to apply the Read instruction to CMD[1:0] using CMDV = 1. The DQ bus supplies the address, with DQ[20:19] set to 10, to select the SRAM address.
- Cycle 2: The host ASIC floats DQ[67:0] to a three-state condition.
- Cycle 3: The host ASIC keeps DQ[67:0] in a three-state condition.
- Cycle 4: The selected device starts to drive DQ[67:0].
- Cycles 5 to 6: The selected device continues to drive DQ[67:0].
- Cycle 7: The selected device continues to drive DQ[67:0] and drives an SRAM Read cycle.
- Cycle 8: The selected device drives ACL from Z to LOW.
- Cycle 9: The selected device drives ACK to HIGH.
- Cycle 10: The selected device drives ACK from HIGH to LOW.

At the end of cycle 10, the selected device floats ACL in a three-state condition.


Figure 12-5. Table of 31 Devices Made of Four Blocks


Figure 12-6. SRAM Read Through Device Number 0 in a Bank of 31 Devices (Device Number 0 Timing)


TLSZ $=10$, HLAT $=010, \operatorname{LRAM}=1, L D E V=1$
Figure 12-7. SRAM Read Through Device Number 0 in Bank of 31 Devices (Device Number 30 Timing)

### 12.6 SRAM Write with a Table of One Device

SRAM Write enables Write access to the off-chip SRAM that contains associative data. The latency from the second cycle of the Write instruction to the address appearing on the SRAM bus is the same as the latency of the Search instruction, and will depend on the TLSZ value parameter programmed in the device configuration register. The following explains the SRAM Write operation accomplished through a table of only one device with the following parameters: TLSZ = 00, HLAT = 000, LRAM = 1, and LDEV $=1$. Figure $12-8$ shows the timing diagram. For the following description the selected device refers to the only device in the table as it is the only device that will be accessed.

- Cycle 1A: The host ASIC applies the Write instruction on the CMD[1:0], using CMDV $=1$. The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address. The host ASIC selects the device for which the ID[4:0] matches the DQ[25:21] lines. The host ASIC also supplies SADR[21:20] on CMD[8:7] in this cycle. Note. CMD[2] must be set to 0 for SRAM Write as burst Writes into the SRAM are not supported.
- Cycle 1B: The host ASIC continues to apply the Write instruction on the CMD[1:0], using CMDV = 1 . The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address. Note that CMD[2] must be set to 0 for SRAM Write as burst Writes into the SRAM are not supported.
- Cycle 2: The host ASIC continues to drive DQ[67:0]. The data in this cycle is not used by the CYNSE70064.
- Cycle 3: The host ASIC continues to drive DQ[67:0]. The data in this cycle is not used by the CYNSE70064.

At the end of cycle 3, a new command can begin. The Write is a pipelined operation, however the Write cycle appears at the SRAM bus with the same latency as the latency of Search instruction as measured from the second cycle of the Write command.

$T L S Z=00, H L A T=000, L R A M=1, L D E V=1$.
Figure 12-8. SRAM Write Access (TLSZ = 00, HLAT = 000, LRAM = 1, LDEV = 1)

### 12.7 SRAM Write with a Table of up to Eight Devices

The following explains the SRAM Write operation done through a table(s) of up to eight devices with the following parameters (TLSZ $=01$ ). The diagram of such a table is shown in Figure 12-9. The following assumes that SRAM access is done through CYNSE70064 device number 0. Figure $12-10$ and Figure $12-11$ show the timing diagram for device number 0 and device number 7 , respectively.

- Cycle 1A: The host ASIC applies the Write instruction on the CMD[1:0] using CMDV = 1 . The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address. The host ASIC selects the device for which the ID[4:0] matches the $\mathrm{DQ}[25: 21]$ lines. The host ASIC also supplies SADR[21:20] on CMD[8:7] in this cycle. Note. CMD[2] must be set to 0 for SRAM Write, as burst Writes into the SRAM are not supported.
- Cycle 1B: The host ASIC continues to apply the Write instruction on the CMD[1:0] using CMDV =1. The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address. Note. CMD[2] must be set to 0 for SRAM Write, as burst Writes into the SRAM are not supported.
- Cycle 2: The host ASIC continues to drive DQ[67:0]. The data in this cycle is not used by the CYNSE70064.
- Cycle 3: The host ASIC continues to drive DQ[67:0]. The data in this cycle is not used by the CYNSE70064.

At the end of cycle 3, a new command can begin. The Write is a pipelined operation, but the Write cycle appears at the SRAM bus with the same latency as that of a Search instruction as measured from the second cycle of the Write command.


Figure 12-9. Table of a Block of Eight Devices


Figure 12-10. SRAM Write Through Device Number 0 in a Block of Eight Devices


Figure 12-11. SRAM Write Timing for Device Number 7 in Block of Eight Devices

### 12.8 SRAM Write with Table(s) of up to 31 Devices

The following explains the SRAM Write operation done through a table(s) of up to 31 devices with the following parameters (TLSZ = 10). The diagram of such table(s) is shown in Figure 12-12. The following assumes that SRAM access is done through CYNSE70064 device number 0-device 0 is the selected device. Figure 12-13 and Figure 12-14 show the timing diagram for device number 0 and device number 30, respectively.

- Cycle 1A: The host ASIC applies the Write instruction on the CMD[1:0] using CMDV = 1 . The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address. The host ASIC selects the device for which the ID[4:0] matches the DQ[25:21] lines. The host ASIC also supplies SADR[21:20] on CMD[8:7] in this cycle. Note. CMD[2] must be set to 0 for SRAM Write, as burst Writes into the SRAM are not supported.
- Cycle 1B: The host ASIC continues to apply the Write instruction on the CMD[1:0] using CMDV $=1$. The DQ bus supplies the address with DQ[20:19] set to 10 to select the SRAM address. Note. CMD[2] must be set to 0 for SRAM Write, as burst Writes into the SRAM are not supported.
- Cycle 2: The host ASIC continues to drive DQ[67:0]. The data in this cycle is not used by the CYNSE70064.
- Cycle 3: The host ASIC continues to drive DQ[67:0]. The data in this cycle is not used by the CYNSE70064.

At the end of cycle 3, a new command can begin. The Write is a pipelined operation, but the Write cycle appears at the SRAM bus with the same latency as that of a Search instruction, as measured from the second cycle of the Write command.


Figure 12-12. Table of 31 Devices (Four Blocks)


Figure 12-13. SRAM Write Through Device Number 0 in Bank of 31 Devices (Device 0 Timing)


Figure 12-14. SRAM Write Through Device Number 0 in Bank of 31 CYNSE70064 Devices (Device Number 30 Timing)

### 13.0 Application

Figure 13-1 shows how a Search engine subsystem can be formed using a host ASIC and an CYNSE70064 bank. It also shows how this Search engine subsystem is integrated in a switch or router. The CYNSE70064 can access synchronous and asynchronous SRAMs by allowing the host ASIC to set the same HLAT parameter in all search engines within a bank of search engines.


Figure 13-1. Sample Switch/Router Using the CYNSE70064 Device

### 14.0 JTAG (1149.1) Testing

The CYNSE70064 supports the Test Access Port and Boundary Scan Architecture as specified in the IEEE JTAG standard number 1149.1. The pin interface to the chip consists of five signals with the standard definitions: TCK, TMS, TDI, TDO, and TRST_L. Table 14-1 describes the operations that the test access port controller supports, and Table 14-2 describes the TAP Device ID Register. Note. To disable JTAG functionality, connect the TCK, TMS and TDI pins to $\mathrm{V}_{\text {DDQ }}$ through a pull-up, and TRST_L to ground through a pull-down.

Table 14-1. Supported Operations

| Instruction | Type | Description |
| :---: | :---: | :--- |
| SAMPLE/PRELOAD | Mandatory | Sample/Preload. This operation loads the values of signals going to and from I/O <br> pins into the boundary scan shift register to provide a snapshot of the normal <br> functional operation. |
| EXTEST | Mandatory | External Test. This operation uses boundary scan values shifted in from TAP to <br> test connectivity external to the device. |
| INTEST | Optional | Internal Test. This operation allows slow-speed functional testing of the device <br> using the boundary scan register to provide I/O values. |

Table 14-2. TAP Device ID Register

| Field | Range | Initial Value | Description |
| :---: | :---: | :---: | :--- |
| Revision | $[31: 28]$ | 0001 | Revision Number. This is the current device revision number. Numbers <br> start from 1 and increment by 1 for each revision of the device. |
| Part Number | $[27: 12]$ | 0000000000000001 | This is the part number for the device. |
| MFID | $[11: 1]$ | $000 \_1101 \_1100$ | Manufacturer ID. This field is the same as the manufacturer ID used in the <br> TAP controller. |
| LSB | $[0]$ | 1 | Least significant bit. |

CYNSE70064

### 15.0 Electrical Specifications

This section describes the electrical specifications, capacitance, operating conditions, DC characteristics, and AC timing parameters for the CYNSE70064, as shown in Table 15-1 and Table 15-2.

Table 15-1. DC Electrical Characteristics for CYNSE70064

| $\begin{array}{\|c} \text { Parame- } \\ \text { ter } \end{array}$ | Description | Test Conditions | Min. | Max. | Unit |
| :---: | :---: | :---: | :---: | :---: | :---: |
| ILI | Input leakage current | $\mathrm{V}_{\text {DDQ }}=\mathrm{V}_{\text {DDQ }} \mathrm{Max}, \mathrm{V}_{\text {IN }}=0$ to $\mathrm{V}_{\text {DDQ }}$ Max | -10 | 10 | $\mu \mathrm{A}$ |
| Lo | Output leakage current | $\mathrm{V}_{\mathrm{DDQ}}=\mathrm{V}_{\text {DDQ }} \mathrm{Max}, \mathrm{V}_{\mathrm{IN}}=0$ to $\mathrm{V}_{\text {DDQ }}$ Max | -10 | 10 | $\mu \mathrm{A}$ |
| $\mathrm{V}_{\text {IL }}$ | Input LOW voltage ( $\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}$ ) |  | -0.3 | 0.8 | V |
| $\mathrm{V}_{\mathrm{IH}}$ | Input HIGH voltage ( $\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}$ ) |  | 2.0 | $\mathrm{V}_{\mathrm{DDQ}}+0.3$ | V |
| $\mathrm{V}_{\text {OL }}$ | Output LOW voltage ( $\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}$ ) | $\mathrm{V}_{\text {DDQ }}=\mathrm{V}_{\text {DDQ }} \mathrm{Min}, \mathrm{I}_{\text {OL }}=8 \mathrm{~mA}$ |  | 0.4 | V |
| $\mathrm{V}_{\mathrm{OH}}$ | Output HIGH voltage ( $\left.\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}\right)$ | $\mathrm{V}_{\mathrm{DDQ}}=\mathrm{V}_{\mathrm{DDQ}} \mathrm{Min}, \mathrm{I}_{\mathrm{OH}}=8 \mathrm{~mA}$ | 2.4 |  | V |
| $\mathrm{V}_{\text {IL }}$ | Input LOW voltage ( $\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}$ ) |  | -0.3 | 0.7 | V |
| $\mathrm{V}_{\mathrm{IH}}$ | Input HIGH voltage ( $\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}$ ) |  | 1.7 | $\mathrm{V}_{\mathrm{DDQ}}+0.3$ | V |
| $\mathrm{V}_{\text {OL }}$ | Output LOW voltage ( $\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}$ ) | $\mathrm{V}_{\mathrm{DDQ}}=\mathrm{V}_{\text {DDQ }} \mathrm{Min}, \mathrm{I}_{\mathrm{OL}}=8 \mathrm{~mA}$ |  | 0.4 | V |
| $\mathrm{V}_{\mathrm{OH}}$ | Output HIGH voltage ( $\left.\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}\right)$ | $\mathrm{V}_{\mathrm{DDQ}}=\mathrm{V}_{\mathrm{DDQ}} \mathrm{Min}, \mathrm{I}_{\mathrm{OH}}=8 \mathrm{~mA}$ | 2.0 |  | V |
| $\mathrm{I}_{\text {D2 } 2}$ | 3.3 V supply current at $\mathrm{V}_{\mathrm{DD}} \mathrm{Max}$ | 66 MHz Search rate, l ${ }_{\text {OUT }}=0 \mathrm{~mA}$ |  | 200 | mA |
| $\mathrm{I}_{\text {D2 } 2}$ | 3.3 V supply current at $\mathrm{V}_{\mathrm{DD}} \mathrm{Max}$ | 50 MHz Search rate, lout $=0 \mathrm{~mA}$ |  | 150 | mA |
| $\mathrm{I}_{\text {DD2 }}$ | 2.5 V supply current at $\mathrm{V}_{\mathrm{DD}} \mathrm{Max}$ | $66 \mathrm{MHz} \mathrm{Search} \mathrm{rate} \mathrm{lout}=,0 \mathrm{~mA}$ |  | 160 | mA |
| $\mathrm{I}_{\text {D2 } 2}$ | 2.5 V supply current at $\mathrm{V}_{\mathrm{DD}} \mathrm{Max}$ | $50 \mathrm{MHz} \mathrm{Search} \mathrm{rate} \mathrm{lout}=,0 \mathrm{~mA}$ |  | 120 | mA |
| IDDI | 1.8 V supply current at $\mathrm{V}_{\mathrm{DD}} \mathrm{Max}$ | 66 MHz Search rate |  | 2300 | mA |
| ldD | 1.8 V supply current at $\mathrm{V}_{\text {DD }} \mathrm{Max}$ | 50 MHz Search rate |  | 1800 | mA |


| Symbol | Parameter | Max | Unit |
| :---: | :---: | :---: | :---: |
| $\mathrm{C}_{\mathrm{IN}}$ | Input capacitance | 6 | $\mathrm{pF}^{[8]}$ |
| $\mathrm{C}_{\text {OUT }}$ | Output capacitance | 6 | $\mathrm{pF}^{[9]}$ |

Table 15-2. Operating Conditions for CYNSE70064

| Symbol | Parameter | Min (3.3V) | Max (3.3V) | Min (2.5V) | Max (2.5V) | Unit |
| :---: | :--- | :---: | :---: | :---: | :---: | :---: |
| $\mathrm{V}_{\mathrm{DDQ}}$ | Operating voltage for IO | 3.135 | 3.465 | 2.4 | 2.6 | V |
| $\mathrm{~V}_{\mathrm{DD}}$ | Operating supply voltage | 1.7 | 1.9 | 1.7 | 1.9 | V |
| $\mathrm{~V}_{\mathrm{IH}}$ | Input HIGH voltage $^{[10]}$ | 2.0 | $\mathrm{~V}_{\mathrm{DDQ}}+0.3$ | 1.7 | $\mathrm{~V}_{\mathrm{DDQ}}+0.3$ | V |
| $\mathrm{~V}_{\mathrm{IL}}$ | Input LOW voltage $^{[17]}$ | -0.3 | 0.8 | -0.3 | 0.7 | V |
| $\mathrm{t}_{\mathrm{A}}$ | Ambient operating temperature | 0 | 70 | 0 | 70 | ${ }^{\circ} \mathrm{C}$ |
|  | Supply voltage tolerance | $-5 \%$ | $+5 \%$ | $-5 \%$ | $+5 \%$ |  |

## Notes:

8. $f=1 \mathrm{MHz}, \mathrm{V}_{\mathrm{IN}}=0 \mathrm{~V}$.
9. $f=1 \mathrm{MHz}, \mathrm{V}_{\text {OUT }}=0 \mathrm{~V}$
10. Maximum allowable applies to overshoot only ( $\mathrm{V}_{\mathrm{DDQ}}$ is 2.5 V supply)
11. Minimum allowable applies to undershoot only.

CYNSE70064

### 16.0 AC Timing Wave Forms

Table 16-1 shows the AC timing parameters for the CYNSE70064 device; Table 16-2 shows the same parameters but for 2.5 V .
Table 16-1. AC Timing Parameters with CLK2X

| Parameter | Description | CYNSE70064-050 |  | CYNSE70064-066 |  | Unit |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: |
|  |  | Min. | Max. | Min. | Max. |  |
| $\mathrm{f}_{\text {CLOCK }}$ | CLK2X frequency. |  | 100 |  | 133 | MHz |
| $\mathrm{t}_{\text {CLK }}$ | CLK2X period. | 10 |  | 7.5 |  | ns |
| $\mathrm{t}_{\text {CKHI }}$ | CLK2X HIGH pulse. ${ }^{[12]}$ | 4.0 |  | 3.0 |  | ns |
| $\mathrm{t}_{\text {CKLO }}$ | CLK2X LOW pulse. ${ }^{[12]}$ | 4.0 |  | 3.0 |  | ns |
| $\mathrm{t}_{\text {ISCH }}$ | Input setup time to CLK2X rising edge. ${ }^{[12]}$ | 2.5 |  | 2.5 |  | ns |
| $\mathrm{t}_{\mathrm{IHCH}}$ | Input hold time to CLK2X rising edge. ${ }^{[12]}$ | 0.6 |  | 0.6 |  | ns |
| $\mathrm{t}_{\mathrm{ICSCH}}$ | Cascaded input setup time to CLK2X rising edge. ${ }^{[12]}$ | 4.2 |  | 4.2 |  | ns |
| $\mathrm{t}_{\text {ICHCH }}$ | Cascaded input hold time to CLK2X rising edge. ${ }^{112]}$ | 0.6 |  | 0.6 |  | ns |
| $\mathrm{t}_{\text {CKHOV }}$ | Rising edge of CLK2X to LHO, FULO, BHO, FULL valid. ${ }^{[13]}$ |  | 9.5 |  | 8.5 | ns |
| $\mathrm{t}_{\text {CKHDV }}$ | Rising edge of CLK2X to DQ valid. ${ }^{[13]}$ |  | 10.0 |  | 9.0 | ns |
| $\mathrm{t}_{\text {CKHDZ }}$ | Rising edge of CLK2X to DQ High-Z. ${ }^{[14]}$ | 1.2 | 9.5 | 1.2 | 8.5 | ns |
| $\mathrm{t}_{\text {CKHSV }}$ | Rising edge of CLK2X to SRAM bus valid. ${ }^{[13]}$ |  | 10.0 |  | 9.0 | ns |
| $\mathrm{t}_{\text {CKHSHZ }}$ | Rising edge of CLK2X to SRAM bus high-Z. ${ }^{[14]}$ |  | 7.0 |  | 6.5 | ns |
| $\mathrm{t}_{\text {CKHSLZ }}$ | Rising edge of CLK2X to SRAM bus LOW-Z. ${ }^{[14]}$ | 7.5 |  | 7.0 |  | ns |

Table 16-2. 2.5V AC Table for Test Condition of CYNSE70064

| Conditions | Results |
| :--- | :--- |
| Input pulse levels $\left(\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}\right)$ | GND to 3.0V |
| Input pulse levels $\left(\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}\right)$ | GND to 2.5 V |
| Input rise and fall times measured at 0.3 V and $2.7 \mathrm{~V}\left(\mathrm{~V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}\right)$ | $\leq 2$ ns see (Figure 16-1) |
| Input rise and fall times measured at 0.25 V and $2.25 \mathrm{~V}\left(\mathrm{~V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}\right)$ | $\leq 2$ ns see (Figure 16-1) |
| Input timing reference levels $\left(\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}\right)$ | 1.5 V |
| Input timing reference levels $\left(\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}\right)$ | 1.25 |
| Output reference levels $\left(\mathrm{V}_{\mathrm{DDQ}}=3.3 \mathrm{~V}\right)$ | 1.5 V |
| Output reference levels $\left(\mathrm{V}_{\mathrm{DDQ}}=2.5 \mathrm{~V}\right)$ | 1.25 V |
| Output load | See Figure 16-2 and Figure 16-3 |

## Notes:

12. Values are based on $50 \%$ signal levels.
13. Based on an AC load of CL = 30pF (see Figure 16-1, Figure 16-2, and Figure 16-3).
14. These parameters are sampled but not $100 \%$ tested, and are based on an AC load of 5 pF .


Figure 16-1. Input Wave Form for CYNSE70064


Figure 16-2. Output Load for CYNSE70064


Figure 16-3. 2.5 I/O Output Load Equivalent for CYNSE70064
Figure $16-4$ shows timing waveform diagrams.

[^0]

Figure 16-4. AC Timing Wave Forms with CLK2X

### 17.0 Pinout Descriptions and Package Diagrams

In the following figures and tables the CYNSE70064 device pinout descriptions and package diagrams are shown. Figure 17-1 shows the pinout diagram, Table lists descriptions for the pinout diagram, and Figure 19-1 illustrates the package from various views.


Figure 17-1. Pinout Diagram

### 18.0 Ordering Information

Table 18-1 provides ordering information.
Table 18-1. Ordering Information

| Part Number | Description | Frequency | Temperature Range |
| :---: | :---: | :---: | :---: |
| CYNSE70064-050 | Search engine | 50 MHz | Commercial |
| CYNSE70064-066 | Search engine | 66 MHz | Commercial |
| CYNSE70064-083 | Search engine | 83 MHz | Commercial |

### 19.0 Package Diagrams

272-Lead Ball Grid Array ( $27 \times 27 \times 2.33 \mathrm{~mm}$ ) BG272


Figure 19-1. Package
Associative Processing Technology ${ }^{\text {TM }}$ (APT), CYNSE70032, CYNSE70064, and CYNSE70128, are trademarks of Cypress Semiconductor. All product and company names mentioned in this document are the trademarks of their respective holders.

| Document Title: CYNSE70064 Network Search Engine |  |  |  |  |
| :---: | :---: | :---: | :---: | :--- |
| Document Number: 38-02041 |  |  |  |  |
| REV. | ECN NO. | Issue <br> Date | Orig. of <br> Change | Description of Change |
| $* *$ | 111438 | $02 / 21 / 02$ | AFX | New Data Sheet |


[^0]:    Notes:
    15. Output loading is specified with $C_{L}=5 \mathrm{pF}$ as in Figure 16-3. Transition is measured at $\pm 200 \mathrm{mV}$ from steady-state voltage.
    16. The load used for $\mathrm{V}_{\mathrm{OH}}, \mathrm{V}_{\mathrm{OL}}$ testing is shown in Figure 16-3.

