WSM_EP_SP_EVENTS(3CPC) CPU Performance Counters Library Functions
NAME
wsm_ep_sp_events - processor model specific performance counter events
DESCRIPTION
This manual page describes events specific to the following Intel CPU
models and is derived from Intel's perfmon data. For more information,
please consult the Intel Software Developer's Manual or Intel's perfmon
website.
CPU models described by this document:
+o Family 0x6, Model 0x25 The following events are supported:
arith.cycles_div_busy Cycles the divider is busy
arith.div Divide Operations executed
arith.mul Multiply operations executed
baclear.bad_target BACLEAR asserted with bad target address
baclear.clear BACLEAR asserted, regardless of cause
baclear_force_iq Instruction queue forced BACLEAR
bpu_clears.early Early Branch Prediciton Unit clears
bpu_clears.late Late Branch Prediction Unit clears
bpu_missed_call_ret Branch prediction unit missed call or return
br_inst_decoded Branch instructions decoded
br_inst_exec.any Branch instructions executed
br_inst_exec.cond Conditional branch instructions executed
br_inst_exec.direct Unconditional branches executed
br_inst_exec.direct_near_call Unconditional call branches executed
br_inst_exec.indirect_near_call Indirect call branches executed
br_inst_exec.indirect_non_call Indirect non call branches executed
br_inst_exec.near_calls Call branches executed
br_inst_exec.non_calls All non call branches executed
br_inst_exec.return_near Indirect return branches executed
br_inst_exec.taken Taken branches executed
br_inst_retired.all_branches Retired branch instructions (Precise Event)
br_inst_retired.conditional Retired conditional branch instructions (Precise Event)
br_inst_retired.near_call Retired near call instructions (Precise Event)
br_misp_exec.any Mispredicted branches executed
br_misp_exec.cond Mispredicted conditional branches executed
br_misp_exec.direct Mispredicted unconditional branches executed
br_misp_exec.direct_near_call Mispredicted non call branches executed
br_misp_exec.indirect_near_call Mispredicted indirect call branches executed
br_misp_exec.indirect_non_call Mispredicted indirect non call branches executed
br_misp_exec.near_calls Mispredicted call branches executed
br_misp_exec.non_calls Mispredicted non call branches executed
br_misp_exec.return_near Mispredicted return branches executed
br_misp_exec.taken Mispredicted taken branches executed
br_misp_retired.all_branches Mispredicted retired branch instructions (Precise Event)
br_misp_retired.conditional Mispredicted conditional retired branches (Precise Event)
br_misp_retired.near_call Mispredicted near retired calls (Precise Event)
cache_lock_cycles.l1d Cycles L1D locked
cache_lock_cycles.l1d_l2 Cycles L1D and L2 locked
cpu_clk_unhalted.ref_p Reference base clock (133 Mhz) cycles when thread is not halted
(programmable counter)
cpu_clk_unhalted.thread_p Cycles when thread is not halted (programmable counter)
cpu_clk_unhalted.total_cycles Total CPU cycles
dtlb_load_misses.any DTLB load misses
dtlb_load_misses.pde_miss DTLB load miss caused by low part of address
dtlb_load_misses.stlb_hit DTLB second level hit
dtlb_load_misses.walk_completed DTLB load miss page walks complete
dtlb_load_misses.walk_cycles DTLB load miss page walk cycles
dtlb_misses.any DTLB misses
dtlb_misses.large_walk_completed DTLB miss large page walks
dtlb_misses.stlb_hit DTLB first level misses but second level hit
dtlb_misses.walk_completed DTLB miss page walks
dtlb_misses.walk_cycles DTLB miss page walk cycles
ept.walk_cycles Extended Page Table walk cycles
es_reg_renames ES segment renames
fp_assist.all X87 Floating point assists (Precise Event)
fp_assist.input X87 Floating poiint assists for invalid input value (Precise
Event)
fp_assist.output X87 Floating point assists for invalid output value (Precise
Event)
fp_comp_ops_exe.mmx MMX Uops
fp_comp_ops_exe.sse_double_precision SSE* FP double precision Uops
fp_comp_ops_exe.sse_fp SSE and SSE2 FP Uops
fp_comp_ops_exe.sse_fp_packed SSE FP packed Uops
fp_comp_ops_exe.sse_fp_scalar SSE FP scalar Uops
fp_comp_ops_exe.sse_single_precision SSE* FP single precision Uops
fp_comp_ops_exe.sse2_integer SSE2 integer Uops
fp_comp_ops_exe.x87 Computational floating-point operations executed
fp_mmx_trans.any All Floating Point to and from MMX transitions
fp_mmx_trans.to_fp Transitions from MMX to Floating Point instructions
fp_mmx_trans.to_mmx Transitions from Floating Point to MMX instructions
ild_stall.any Any Instruction Length Decoder stall cycles
ild_stall.iq_full Instruction Queue full stall cycles
ild_stall.lcp Length Change Prefix stall cycles
ild_stall.mru Stall cycles due to BPU MRU bypass
ild_stall.regen Regen stall cycles
inst_decoded.dec0 Instructions that must be decoded by decoder 0
inst_queue_write_cycles Cycles instructions are written to the instruction queue
inst_queue_writes Instructions written to instruction queue.
inst_retired.any_p Instructions retired (Programmable counter and Precise Event)
inst_retired.mmx Retired MMX instructions (Precise Event)
inst_retired.total_cycles Total cycles (Precise Event)
inst_retired.x87 Retired floating-point operations (Precise Event)
io_transactions I/O transactions
itlb_flush ITLB flushes
itlb_miss_retired Retired instructions that missed the ITLB (Precise Event)
itlb_misses.any ITLB miss
itlb_misses.walk_completed ITLB miss page walks
itlb_misses.walk_cycles ITLB miss page walk cycles
l1d.m_evict L1D cache lines replaced in M state
l1d.m_repl L1D cache lines allocated in the M state
l1d.m_snoop_evict L1D snoop eviction of cache lines in M state
l1d.repl L1 data cache lines allocated
l1d_cache_prefetch_lock_fb_hit L1D prefetch load lock accepted in fill buffer
l1d_prefetch.miss L1D hardware prefetch misses
l1d_prefetch.requests L1D hardware prefetch requests
l1d_prefetch.triggers L1D hardware prefetch requests triggered
l1d_wb_l2.e_state L1 writebacks to L2 in E state
l1d_wb_l2.i_state L1 writebacks to L2 in I state (misses)
l1d_wb_l2.m_state L1 writebacks to L2 in M state
l1d_wb_l2.mesi All L1 writebacks to L2
l1d_wb_l2.s_state L1 writebacks to L2 in S state
l1i.cycles_stalled L1I instruction fetch stall cycles
l1i.hits L1I instruction fetch hits
l1i.misses L1I instruction fetch misses
l1i.reads L1I Instruction fetches
l2_data_rqsts.any All L2 data requests
l2_data_rqsts.demand.e_state L2 data demand loads in E state
l2_data_rqsts.demand.i_state L2 data demand loads in I state (misses)
l2_data_rqsts.demand.m_state L2 data demand loads in M state
l2_data_rqsts.demand.mesi L2 data demand requests
l2_data_rqsts.demand.s_state L2 data demand loads in S state
l2_data_rqsts.prefetch.e_state L2 data prefetches in E state
l2_data_rqsts.prefetch.i_state L2 data prefetches in the I state (misses)
l2_data_rqsts.prefetch.m_state L2 data prefetches in M state
l2_data_rqsts.prefetch.mesi All L2 data prefetches
l2_data_rqsts.prefetch.s_state L2 data prefetches in the S state
l2_lines_in.any L2 lines alloacated
l2_lines_in.e_state L2 lines allocated in the E state
l2_lines_in.s_state L2 lines allocated in the S state
l2_lines_out.any L2 lines evicted
l2_lines_out.demand_clean L2 lines evicted by a demand request
l2_lines_out.demand_dirty L2 modified lines evicted by a demand request
l2_lines_out.prefetch_clean L2 lines evicted by a prefetch request
l2_lines_out.prefetch_dirty L2 modified lines evicted by a prefetch request
l2_rqsts.ifetch_hit L2 instruction fetch hits
l2_rqsts.ifetch_miss L2 instruction fetch misses
l2_rqsts.ifetches L2 instruction fetches
l2_rqsts.ld_hit L2 load hits
l2_rqsts.ld_miss L2 load misses
l2_rqsts.loads L2 requests
l2_rqsts.miss All L2 misses
l2_rqsts.prefetch_hit L2 prefetch hits
l2_rqsts.prefetch_miss L2 prefetch misses
l2_rqsts.prefetches All L2 prefetches
l2_rqsts.references All L2 requests
l2_rqsts.rfo_hit L2 RFO hits
l2_rqsts.rfo_miss L2 RFO misses
l2_rqsts.rfos L2 RFO requests
l2_transactions.any All L2 transactions
l2_transactions.fill L2 fill transactions
l2_transactions.ifetch L2 instruction fetch transactions
l2_transactions.l1d_wb L1D writeback to L2 transactions
l2_transactions.load L2 Load transactions
l2_transactions.prefetch L2 prefetch transactions
l2_transactions.rfo L2 RFO transactions
l2_transactions.wb L2 writeback to LLC transactions
l2_write.lock.e_state L2 demand lock RFOs in E state
l2_write.lock.hit All demand L2 lock RFOs that hit the cache
l2_write.lock.i_state L2 demand lock RFOs in I state (misses)
l2_write.lock.m_state L2 demand lock RFOs in M state
l2_write.lock.mesi All demand L2 lock RFOs
l2_write.lock.s_state L2 demand lock RFOs in S state
l2_write.rfo.hit All L2 demand store RFOs that hit the cache
l2_write.rfo.i_state L2 demand store RFOs in I state (misses)
l2_write.rfo.m_state L2 demand store RFOs in M state
l2_write.rfo.mesi All L2 demand store RFOs
l2_write.rfo.s_state L2 demand store RFOs in S state
large_itlb.hit Large ITLB hit
load_block.overlap_store Loads that partially overlap an earlier store
load_dispatch.any All loads dispatched
load_dispatch.mob Loads dispatched from the MOB
load_dispatch.rs Loads dispatched that bypass the MOB
load_dispatch.rs_delayed Loads dispatched from stage 305
load_hit_pre Load operations conflicting with software prefetches
longest_lat_cache.miss Longest latency cache miss
longest_lat_cache.reference Longest latency cache reference
lsd.active Cycles when uops were delivered by the LSD
lsd.inactive Cycles no uops were delivered by the LSD
lsd_overflow Loops that can't stream from the instruction queue
machine_clears.cycles Cycles machine clear asserted
machine_clears.mem_order Execution pipeline restart due to Memory ordering conflicts
machine_clears.smc Self-Modifying Code detected
macro_insts.decoded Instructions decoded
macro_insts.fusions_decoded Macro-fused instructions decoded
mem_inst_retired.loads Instructions retired which contains a load (Precise Event)
mem_inst_retired.stores Instructions retired which contains a store (Precise Event)
mem_load_retired.dtlb_miss Retired loads that miss the DTLB (Precise Event)
mem_load_retired.hit_lfb Retired loads that miss L1D and hit an previously allocated LFB
(Precise Event)
mem_load_retired.l1d_hit Retired loads that hit the L1 data cache (Precise Event)
mem_load_retired.l2_hit Retired loads that hit the L2 cache (Precise Event)
mem_load_retired.llc_miss Retired loads that miss the LLC cache (Precise Event)
mem_load_retired.llc_unshared_hit Retired loads that hit valid versions in the LLC cache (Precise
Event)
mem_load_retired.other_core_l2_hit_hitm Retired loads that hit sibling core's L2 in modified or
unmodified states (Precise Event)
mem_store_retired.dtlb_miss Retired stores that miss the DTLB (Precise Event)
mem_uncore_retired.local_dram Load instructions retired with a data source of local DRAM or
locally homed remote hitm (Precise Event)
mem_uncore_retired.other_core_l2_hitm Load instructions retired that HIT modified data in sibling
core (Precise Event)
mem_uncore_retired.remote_cache_local_home_hit Load instructions retired remote cache HIT data source (Precise
Event)
mem_uncore_retired.remote_dram Load instructions retired remote DRAM and remote home-remote
cache HITM (Precise Event)
mem_uncore_retired.uncacheable Load instructions retired IO (Precise Event)
offcore_requests.any All offcore requests
offcore_requests.any.read Offcore read requests
offcore_requests.any.rfo Offcore RFO requests
offcore_requests.demand.read_code Offcore demand code read requests
offcore_requests.demand.read_data Offcore demand data read requests
offcore_requests.demand.rfo Offcore demand RFO requests
offcore_requests.l1d_writeback Offcore L1 data cache writebacks
offcore_requests.uncached_mem Offcore uncached memory accesses
offcore_requests_outstanding.any.read Outstanding offcore reads
offcore_requests_outstanding.any.read_not_empty Cycles offcore reads busy
offcore_requests_outstanding.demand.read_code Outstanding offcore demand code reads
offcore_requests_outstanding.demand.read_code_not_empty Cycles offcore demand code read busy
offcore_requests_outstanding.demand.read_data Outstanding offcore demand data reads
offcore_requests_outstanding.demand.read_data_not_empty Cycles offcore demand data read busy
offcore_requests_outstanding.demand.rfo Outstanding offcore demand RFOs
offcore_requests_outstanding.demand.rfo_not_empty Cycles offcore demand RFOs busy
offcore_requests_sq_full Offcore requests blocked due to Super Queue full
partial_address_alias False dependencies due to partial address aliasing
rat_stalls.any All RAT stall cycles
rat_stalls.flags Flag stall cycles
rat_stalls.registers Partial register stall cycles
rat_stalls.rob_read_port ROB read port stalls cycles
rat_stalls.scoreboard Scoreboard stall cycles
resource_stalls.any Resource related stall cycles
resource_stalls.fpcw FPU control word write stall cycles
resource_stalls.load Load buffer stall cycles
resource_stalls.mxcsr MXCSR rename stall cycles
resource_stalls.other Other Resource related stall cycles
resource_stalls.rob_full ROB full stall cycles
resource_stalls.rs_full Reservation Station full stall cycles
resource_stalls.store Store buffer stall cycles
sb_drain.any All Store buffer stall cycles
seg_rename_stalls Segment rename stall cycles
simd_int_128.pack 128 bit SIMD integer pack operations
simd_int_128.packed_arith 128 bit SIMD integer arithmetic operations
simd_int_128.packed_logical 128 bit SIMD integer logical operations
simd_int_128.packed_mpy 128 bit SIMD integer multiply operations
simd_int_128.packed_shift 128 bit SIMD integer shift operations
simd_int_128.shuffle_move 128 bit SIMD integer shuffle/move operations
simd_int_128.unpack 128 bit SIMD integer unpack operations
simd_int_64.pack SIMD integer 64 bit pack operations
simd_int_64.packed_arith SIMD integer 64 bit arithmetic operations
simd_int_64.packed_logical SIMD integer 64 bit logical operations
simd_int_64.packed_mpy SIMD integer 64 bit packed multiply operations
simd_int_64.packed_shift SIMD integer 64 bit shift operations
simd_int_64.shuffle_move SIMD integer 64 bit shuffle/move operations
simd_int_64.unpack SIMD integer 64 bit unpack operations
snoop_response.hit Thread responded HIT to snoop
snoop_response.hite Thread responded HITE to snoop
snoop_response.hitm Thread responded HITM to snoop
snoopq_requests.code Snoop code requests
snoopq_requests.data Snoop data requests
snoopq_requests.invalidate Snoop invalidate requests
snoopq_requests_outstanding.code Outstanding snoop code requests
snoopq_requests_outstanding.code_not_empty Cycles snoop code requests queued
snoopq_requests_outstanding.data Outstanding snoop data requests
snoopq_requests_outstanding.data_not_empty Cycles snoop data requests queued
snoopq_requests_outstanding.invalidate Outstanding snoop invalidate requests
snoopq_requests_outstanding.invalidate_not_empty Cycles snoop invalidate requests queued
sq_full_stall_cycles Super Queue full stall cycles
sq_misc.lru_hints Super Queue LRU hints sent to LLC
sq_misc.split_lock Super Queue lock splits across a cache line
ssex_uops_retired.packed_double SIMD Packed-Double Uops retired (Precise Event)
ssex_uops_retired.packed_single SIMD Packed-Single Uops retired (Precise Event)
ssex_uops_retired.scalar_double SIMD Scalar-Double Uops retired (Precise Event)
ssex_uops_retired.scalar_single SIMD Scalar-Single Uops retired (Precise Event)
ssex_uops_retired.vector_integer SIMD Vector Integer Uops retired (Precise Event)
store_blocks.at_ret Loads delayed with at-Retirement block code
store_blocks.l1d_block Cacheable loads delayed with L1D block code
two_uop_insts_decoded Two Uop instructions decoded
uop_unfusion Uop unfusions due to FP exceptions
uops_decoded.esp_folding Stack pointer instructions decoded
uops_decoded.esp_sync Stack pointer sync operations
uops_decoded.ms_cycles_active Uops decoded by Microcode Sequencer
uops_decoded.stall_cycles Cycles no Uops are decoded
uops_executed.core_active_cycles Cycles Uops executed on any port (core count)
uops_executed.core_active_cycles_no_port5 Cycles Uops executed on ports 0-4 (core count)
uops_executed.core_stall_count Uops executed on any port (core count)
uops_executed.core_stall_count_no_port5 Uops executed on ports 0-4 (core count)
uops_executed.core_stall_cycles Cycles no Uops issued on any port (core count)
uops_executed.core_stall_cycles_no_port5 Cycles no Uops issued on ports 0-4 (core count)
uops_executed.port0 Uops executed on port 0
uops_executed.port015 Uops issued on ports 0, 1 or 5
uops_executed.port015_stall_cycles Cycles no Uops issued on ports 0, 1 or 5
uops_executed.port1 Uops executed on port 1
uops_executed.port2_core Uops executed on port 2 (core count)
uops_executed.port234_core Uops issued on ports 2, 3 or 4
uops_executed.port3_core Uops executed on port 3 (core count)
uops_executed.port4_core Uops executed on port 4 (core count)
uops_executed.port5 Uops executed on port 5
uops_issued.any Uops issued
uops_issued.core_stall_cycles Cycles no Uops were issued on any thread
uops_issued.cycles_all_threads Cycles Uops were issued on either thread
uops_issued.fused Fused Uops issued
uops_issued.stall_cycles Cycles no Uops were issued
uops_retired.active_cycles Cycles Uops are being retired
uops_retired.any Uops retired (Precise Event)
uops_retired.macro_fused Macro-fused Uops retired (Precise Event)
uops_retired.retire_slots Retirement slots used (Precise Event)
uops_retired.stall_cycles Cycles Uops are not retiring (Precise Event)
uops_retired.total_cycles Total cycles using precise uop retired event (Precise Event)
inst_retired.total_cycles_ps Total cycles (Precise Event)
SEE ALSO
cpc(3CPC) https://download.01.org/perfmon/index/illumos June 18, 2018 illumos