background image

Vol. 3B 18-55

PERFORMANCE MONITORING

18.9.4.2   Load Latency Performance Monitoring Facility

The load latency facility in Intel microarchitecture code name Sandy Bridge is similar to that in prior microarchitec-
ture. It provides software a means to characterize the average load latency to different levels of cache/memory 
hierarchy. This facility requires processor supporting enhanced PEBS record format in the PEBS buffer, see 
Table 18-23 and Section 18.9.4.1. This field measures the load latency from load's first dispatch of till final data 
writeback from the memory subsystem. The latency is reported for retired demand load operations and in core 
cycles (it accounts for re-dispatches).
To use this feature software must assure:

One of the IA32_PERFEVTSELx MSR is programmed to specify the event unit 

MEM_TRANS_RETIRED, and the 

LATENCY_ABOVE_THRESHOLD event mask must be specified (IA32_PerfEvtSelX[15:0] = 1CDH). The corre-
sponding counter IA32_PMCx will accumulate event counts for architecturally visible loads which exceed the 

Table 18-32.  PEBS Performance Events for Intel® Microarchitecture Code Name Sandy Bridge

Event Name

Event Select

Sub-event 

UMask

INST_RETIRED

C0H

PREC_DIST

01H

1

NOTES:

1. Only available on IA32_PMC1.

UOPS_RETIRED

C2H

All

01H

Retire_Slots

02H

BR_INST_RETIRED

C4H

Conditional

01H

Near_Call

02H

All_branches

04H

Near_Return

08H

Near_Taken

20H

BR_MISP_RETIRED

C5H

Conditional

01H

Near_Call

02H

All_branches

04H

Not_Taken

10H

Taken

20H

MEM_UOPS_RETIRED

D0H

STLB_MISS_LOADS

11H

STLB_MISS_STORE

12H

LOCK_LOADS

21H

SPLIT_LOADS

41H

SPLIT_STORES

42H

ALL_LOADS

81H

ALL_STORES

82H

MEM_LOAD_UOPS_RETIRED

D1H

L1_Hit

01H

L2_Hit

02H

L3_Hit

04H

Hit_LFB

40H

MEM_LOAD_UOPS_LLC_HIT_RETIRED

D2H

XSNP_Miss

01H

XSNP_Hit

02H

XSNP_Hitm

04H

XSNP_None

08H