现在的位置: 首页 > 综合 > 正文

Intel Sandy Bridge

2013年08月19日 ⁄ 综合 ⁄ 共 1806字 ⁄ 字号 评论关闭

Intel Sandy Bridge

Configuration

Intel i5-2400 (Sandy Bridge), 3.1 GHz, 32 nm.

  • L1 Data cache = 32 KB. 64 B/line, 8-WAY. (Write-Allocate?), 2 * 16 Bytes read ports + 16 Bytes store port.
  • L1 Instruction cache = 32 KB. 8-WAY. 64 B/line
  • mOp Cache: 1.5k instructions, 8-WAY, 6 MOP / line
  • 4 decoders, 16 bytes per cycle
  • L2 cache = 256 KB. 64 B/line, 8-WAY
  • L3 cache = 6 MB. 64 B/line, 12-WAY
  • Load Buffers = 64
  • Store Buffers = 36
  • Line fill buffers (LFB) = 10
  • RS = 54 items
  • PRF Integer = 160 registers
  • PRF Float = 144 registers
  • ROB = 168 items
  • RSB (return stack buffer) = 16 items ?

4 KB pages mode (64-bit Windows)

  • Data TLB L1 size = 64 items. Miss penalty = 7. 4- WAY
  • Instruction TLB L1 size = 64 items per thread.
  • TLB L2 size = 512 items. . 4- WAY
  • PDE cache = 32 items?

Size

Latency

Description

32 K

4 | 5

TLB + L1

256 K

12

+7 (L2)

2 M

36

+17 (L3) +7 (L1 TLB miss)

6 M

46

+10 (L2 TLB miss)

64 M

46 + 65 ns

+ 65 ns (RAM)

...

58 + 65 ns

+ 12 (PDE cache miss)

  • L1 Latency = 4 cycles is for direct pointers mode. The [R1 + R2*4] Load has 5 cycles latency.
  • L3 Latency for cores 2 and 3 is 1-cycle smaller than L3 Latency for cores 1 and 4.

2 MB pages mode (64-bit Windows, 64-bit soft)

  • Data TLB L1 size = 32 items.
  • Data TLB L2 size = 512 items.
  • Instruction TLB L1 = ? items

1 GB mode

  • Data TLB L1 size = 4 items.
  • Data TLB L2 size = 512 items.
  • Instruction TLB L1 = ? items

MISC

Branch misprediction penalty = 14 cycles.

  • 64-bytes range cross penalty = 5 cycles
  • 4096-bytes range cross penalty = 24 cycles
  • L1 B/W (Parallel Random Read) = 0.51 cycles per one access
  • L2->L1 B/W (Parallel Random Read) = 2.5 cycles per cache line
  • L2->L1 B/W (Read, 64 bytes step) = 2.1 cycles per cache line
  • L2 Write (Write, 64 bytes step) = 6.70 cycles per write (cache line)
  • L3->L1 B/W (Parallel Random Read) = 4.85 cycles per cache line
  • L3->L1 B/W (Read, 64 bytes step) = 5.05 cycles per cache line
  • L3 Write (Write, 64 bytes step) = 9.00 cycles per write (cache line)
  • RAM Read B/W (Parallel Random Read) = 10 ns / cache line = 6400 MB/s
  • RAM Read B/W (Read, 8 Bytes step) = 13700 MB/s
  • RAM Read B/W (Read, 64 Bytes step) = 15200 MB/s
  • RAM Read B/W (Read, 64 Bytes step - pointer chasing) = 11700 MB/s
  • RAM Write B/W (Write, 4-64 Bytes step) = 9200 MB/s

 

原文地址: http://www.7-cpu.com/cpu/SandyBridge.html

抱歉!评论已关闭.