Practically all modern software is operating as a distributed system, even if it might not look like it at first glance. API calls and TCP/IP requests obviously cross process boundaries and use network connections, but so does accessing storage, policy enforcement, or logging within a cloud-based context...and all those operations can fail, often in interesting ways.
In this track, we'll look at how to tackle these distributed system problems. How do we connect components without adding extra points of failure or impacting scalability? What about failure? If a system grinds to a halt, we'll know - but what about partial/grey failures or intermittent performance degradation? We'll look at using observability to keep on top of our systems while, ideally, not breaking the bank in logging costs.
From this track
Timeouts, Retries and Idempotency In Distributed Systems
Wednesday Apr 9 / 10:35AM BST
The definition of insanity is doing the same thing over and over again” - this quote attributed to Einstein warns us of the danger of magical thinking, hoping that trying something just one more time will achieve success when before we failed. But is this really insanity?

Sam Newman
Microservice, Cloud, CI/CD Expert, Author of "Building Microservices" and "Monolith to Microservices", 20+ Years Experience as a Developer
From Confusion to Clarity: Advanced Observability Strategies for Media Workflows at Netflix
Wednesday Apr 9 / 11:45AM BST
Managing media workflows at the Netflix scale is both thrilling and daunting. With millions of workflow executions across hundreds of types and over 500 million CPU hours consumed quarterly, costs can skyrocket, and encoding issues can disrupt the streaming experience.

Sujana Sooreddy
Software Engineer @Netflix - Building High Scale Observability Solutions

Naveen Mareddy
Staff Engineer @Netflix, 20+ years in Software Engineering, Creator of MediaInfra Meetup, Speaker, Mentor
Scaling API Independence: Mocking, Contract Testing & Observability in Large Microservices Environments
Wednesday Apr 9 / 01:35PM BST
Microservices promise faster deployments and team autonomy. In reality, engineers are often blocked waiting for APIs, dealing with broken sandboxes, or wrangling test environments.

Tom Akehurst
CTO and Co-Founder @WireMock, 20+ Years Building Enterprise Systems
From Dashboard Soup to Observability Lasagna: Building Better Layers
Wednesday Apr 9 / 02:45PM BST
Let's be honest - observability can suck. Ever feel like you're swimming in dashboard soup? You know the feeling: tons of single-use dashboards, building new ones during every incident only to lose them in the chaos, and spending ages creating visualizations that no one ever looks at again.

Martha Lambert
Product Engineer @incident.io, Building Reliable and Observable Systems
Platforms for Secure API Connectivity With Architecture as Code
Wednesday Apr 9 / 03:55PM BST
As microservices and complex platforms become the standard, ensuring secure connectivity while maintaining a smooth developer experience is a significant challenge. Traditional security models often introduce friction, slowing down innovation and deployment.

James Gough
Distinguished Engineer, API Platform Lead Architect @Morgan Stanley, Co-Author of Optimizing Java