Abstract

Counters are fundamental to monitoring: how many requests were processed, how many CPU-seconds consumed, how many bytes sent over a network. Very likely you are already monitoring your applications and operating systems via the hundreds or thousands of counters they expose.

But did you know that modern CPUs have counters right inside the silicon circuitry? They count things like number of cache hits, or RAM accesses, or the number of instructions processed. These can give an amazingly detailed view of how the system is performing.

Historically it was impossible to access CPU hardware counters on cloud servers, as they were hidden by the hypervisor. But in the last few years, CPU manufacturers and cloud providers have added interfaces to bridge this gap

This talk looks at:

What are CPU hardware counters?
What kinds of CPU counters are most useful?
Using hardware counters as metrics, and via profiles.
Specific tools: Prometheus Node Exporter, CAdvisor, Linux Perf, PerfGo
Real-world examples where CPU hardware counters enable performance improvements.
We will focus on the Linux operating system.

By the end of the session you should:

Know what data is available via CPU hardware counters.
Be inspired to install some tools to collect and visualise the data.
Understand the limitations on shared cloud infrastructure.

Who is this session for?

Those interested in system performance.
Software engineers or observability architects.
Someone ideally familiar with modern CPU architectures - pipelining, multi-level cache, branch prediction.

Interview:

What is your session about, and why is it important for senior software developers?

It's about performance-tracking hardware built into every server CPU - how to access that information and how to use it to tune systems for efficiency.

Why is it critical for software leaders to focus on this topic right now, as we head into 2026?

By tuning system performance, leaders can deliver a snappier experience to their users, reduce cloud costs, and reduce carbon footprint.

What are the common challenges developers and architects face in this area?

It's difficult to get started, as there are many steps to follow. Then there are thousands of counters to choose from, so some understanding is needed to focus your investigation.

What's one thing you hope attendees will implement immediately after your talk?

Try it out - find a suitable machine, fetch some hardware counters, and see what they tell you about your systems.

What makes QCon stand out as a conference for senior software professionals?

Talks are delivered by engineers who have built and operated production systems.

Speaker

Bryan Boreham

Distinguished Engineer @Grafana Labs, Member of the Prometheus Team, Expert in Distributed Systems and Computer Performance

Bryan Boreham is a Distinguished Engineer at Grafana Labs, working on highly scalable storage for metrics, logs, traces and profiles. A contributor to many Open Source projects since 1988, Bryan is a member of the Prometheus team and previous maintainer of CNCF Cortex and CNI projects. Earlier, he led interest rate trading systems at Barclays Investment Bank, where performance engineering was business-critical.

His work spans distributed systems, deep performance analysis, and understanding how software really behaves under load.

Understanding and Tuning System Performance with CPU Hardware Counters

Abstract

Interview:

What is your session about, and why is it important for senior software developers?

Why is it critical for software leaders to focus on this topic right now, as we head into 2026?

What are the common challenges developers and architects face in this area?

What's one thing you hope attendees will implement immediately after your talk?

What makes QCon stand out as a conference for senior software professionals?

Speaker

Bryan Boreham

Find Bryan Boreham at:

Speaker

Bryan Boreham

Date

Location

Track

Topics

Share

From the same track

A Deterministic Simulation Testing (DST) Journey: From WASM in Go to State Machines in Rust

Fixing the AI Infra Scale Problem by Stuffing 1M Sandboxes in a Single Server

Use<’lifetimes> For<’what>

Unconference: Native Languages

Follow QCon

Contact

Menu

Conferences around the World