Enchant Your AI and APIs with eBPF Magic 🪄

Summary

Disclaimer: This summary has been generated by AI. It is experimental, and feedback is welcomed. Please reach out to info@qconlondon.com with any comments or concerns.

The presentation explores the capabilities of eBPF (Extended Berkeley Packet Filter) in enhancing AI and API functionalities. The session focuses on integrating eBPF within Kubernetes to improve the observability and control over AI applications without altering the existing code.

  • Introduction to eBPF: eBPF is a technology embedded in the Linux kernel, enabling applications to run without modification while allowing developers to add features like encryption and traffic pattern changes transparently.
  • Application in AI: The talk demonstrates how to use eBPF to manipulate AI application behaviors dynamically, such as live changes to prompts or models to enhance safety and efficiency.
  • Infrastructure Setup: Steps are outlined for building an AI gateway within Kubernetes that intercepts and modifies traffic transparently, using a proxy and eBPF programs for monitoring and controlling AI agent interactions.
  • Technical Aspects: eBPF programs are attached at various levels of the Linux kernel, like network cards and sockets, to control and observe traffic without impacting the underlying code execution.
  • Security and Observability: eBPF provides robust tools for enforcing security constraints and observing application behavior across different system areas, ensuring secure and efficient operations.

Conclusion: The session concludes with the potential of eBPF in transforming application and infrastructure management by seamlessly integrating advanced features and maintaining control over AI interactions within Kubernetes environments.

This is the end of the AI-generated content.


Abstract

It is a common occurrence to see applications thrown over the fence, landing somewhere in production without a second thought about their lifecycle or how they may need maintaining in the future to connect to more efficient API endpoints. Some of these applications are quickly thrown up to provide a temporary solution, but we all know that there is nothing more permanent than temporary. But day two, we may need to enable encryption, modify their API calls/behaviours, and understand and observe what these applications are doing.

In this talk we will utilise the power of Open Source, eBPF  and some of the powers of Kubernetes, to demonstrate that we can now add features to existing application behaviour both *live* and without making any changes to the application itself. From transparently adding encryption to an existing application without breaking its traffic patterns, to changing an AI application by live changing prompts or models providing safety or efficiency in application behaviour and finally providing understanding into what these applications are doing. 


Speaker

Dan Finneran

Principal Community Advocate at Isovalent @Cisco

Dan Finneran is a Principal Community Advocate at Isovalent at Cisco. His journey to today has included bare-metal, jails, zones, vms and containers where he is currently enjoying the fast-paced ride in the cloud native space. He also created & maintains a popular Open-Source load-balancer for Kubernetes and contributes to upstream Kubernetes. He’s also been fortunate to present at events ranging from the British computing society, HPE Technical solutions summit to DockerCon and KubeCon amongst others.

Read more

Date

Monday Mar 16 / 03:55PM GMT ( 50 minutes )

Location

Whittle (3rd Fl.)

Topics

AI ebpf Kubernetes

Share

From the same track

Session architecture

From Fan-Out to Fast: Sub-100ms API Design in Distributed Systems

Monday Mar 16 / 10:35AM GMT

A “simple” API request rarely stays simple. In distributed systems, one call quickly turns into fan-out across gateways, services, caches, and databases — and your p99 becomes the sum of every hop and every flaky dependency.

Speaker image - Saranya Vedagiri

Saranya Vedagiri

Senior Staff Engineer @eBay

Session Platform Engineering

APIs for Agents: Rethinking API Programs in the MCP Era

Monday Mar 16 / 01:35PM GMT

As API programs mature, a familiar gap emerges: some teams operate with strong standards, reusable platforms, and clear governance,  while others rely on informal guidance and best-effort consistency.

Speaker image - Jim Gough

Jim Gough

Distinguished Engineer, API Platform Lead Architect @Morgan Stanley, Co-Author of Optimizing Java

Speaker image - Andreea Niculcea

Andreea Niculcea

Vice President @Morgan Stanley

Session architecture

Managing Asynchronous APIs at Scale

Monday Mar 16 / 05:05PM GMT

When event-driven architectures are small, teams can reason about events through word-of-mouth. They know who publishes what, who consumes it, and how messages flow through the system. Teams manage their own infrastructure or raise tickets to request changes.

Speaker image - Ian Cooper

Ian Cooper

Senior Principal Engineer @Just Eat Takeaway

Session Observability

Uncorking Queueing Bottlenecks with OpenTelemetry

Monday Mar 16 / 11:45AM GMT

Queues are the backbone of scalable, asynchronous systems, but they can easily create a tangled web of complexity. When things slow down, the bottleneck could be anywhere, from producer lag to consumer exhaustion, and standard metrics often fail to show the full picture.

Speaker image - Julian Wreford

Julian Wreford

Team Lead of Operability Team @Gearset, Software Engineer Turned Accidental SRE

Speaker image - Oli Lane

Oli Lane

Engineering Team Lead @Gearset, Focusing on Engineering Culture, Observability, and Platform Reliability

Session

Unconference: Connecting Systems

Monday Mar 16 / 02:45PM GMT